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1, POLITICAL ASPECTS 
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I 
POLITICAL ASPECTS 


Colonialism is the establishment and mainte- 
nance, for an extended time, of rule over an alien 
people that is separate from and subordinate to 
the ruling power. It is no longer closely associated 
with the term “colonization,” which involves the 
settlement abroad of people from a mother country, 
as in the case of the ancient Greek colonies or the 
Americas. Colonialism has now come to be identi- 
fied with rule over peoples of different race inhab- 
iting lands separated by salt water from the im- 
perial center; more particularly, it signifies direct 
political control by European states or states settled 
by Europeans, as the United States or Australia, 
over peoples of other races, notably over Asians 
and Africans, To this category should be added 
Japan’s rule over her dependent territories, lost 
after World War 11. 

Some further features of the “colonial situation” 
are: domination of an alien minority, asserting 
racial and cultural superiority, over a materially 
inferior native majority; contact between a ma- 
chine-oriented civilization with Christian origins, 
a powerful economy, and a rapid rhythm of life 
and a non-Christian civilization that lacks machines 
and is marked by a backward economy and a slow 
rhythm of life; and the imposition of the first civili- 
zation upon the second (Balandier 1951, p. 75). 

The Belgians attempted in the earlier years of 
the United Nations to broaden the concept of colo- 


nialism to include all ethnically distinct minorities 
discriminated against in their home countries. 
Contending that such minorities were often in 
greater need of UN attention than the people in 
overseas dependencies, the Belgian thesis proposed 
their acceptance as nonself-governing peoples 
under Chapter xı of the Charter. This interpreta- 
tion was generally rejected in the UN and by the 
colonial and former colonial peoples themselves. 
The most serious shortcomings of the narrower 
interpretation are that it excludes the Asian and 
other non-Russian peoples in the Soviet Union, 
alleged to be dominated by the Great Russians, and 
the Africans and Asians of South Africa, barred 
from the main stream of the country’s life by the 
apartheid doctrine. South Africa lacks the geo- 
graphic separation of colonies from the imperial 
center but can be at least partially brought within 
the colonial rubric because the dominant group is 
white European whereas the ruled are of different 
race and color. The Asian peoples of the Soviet 
Union are usually placed outside the traditional 
colonial category even though Western observers 
often accuse the U.S.S.R. of being the colonial 
power par excellence. 

Definitions of colonialism couched in terms of 
value and emotion take quite a different form. This 
is most true of those left-wing analysts who can 
find nothing but evil in colonialism. Thus the “Great 
Soviet Encyclopaedia” of 1953 speaks of coloniza- 
tion as the military or economic enslavement of 
any dependent country and sees it as accompanied 
by bestial exploitation and extermination of the 
indigenous population. The more leftward-inclined 
Asian and African leaders frequently denounce 
colonialism in similar terms. 
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Historical evolution of modern colonialism 


Modern colonialism started with the fifteenth- 
century voyages of the Portuguese along the west 
coast of Africa, which in 1498 brought Vasco da 
Gama to India. The Portuguese and Spaniards were 
the first to establish their dominions overseas and 
clung to them long after their imperialist drive 
had lost its forward thrust. The Americas were 
wholly taken over as European domains, the Dutch 
and British began to stake out their claims in India 
and the Indies, and France had won and lost more 
than one empire by 1815. The first blows for anti- 
colonialism were struck by the American Revolu- 
tion and the subsequent liberation of most of Latin 
America. 

Although Europe’s imperial expansion and growth 
in power continued during the first three-quarters 
of the nineteenth century, the circumstances of 
the times tended to discourage the extension of 
colonial holdings. Britain’s command of the seas 
and its industrial head start gave it a virtual mo- 
nopoly of access to the world overseas, making 
unnecessary the kind of exclusive control that colo- 
nialism offered. The abandonment of mercantilism 
and the swing to laissez-faire and free trade made 
colonies less attractive than they had been before. 
Bentham had pleaded that Britain and France 
should rid themselves of their dependencies; Turgot 
saw colonies falling from the tree like ripe fruit; 
and Disraeli, assuming the colonies would soon be 
independent, regarded them in 1852 as “millstones 
around our neck.” 

China was opened to the penetration of the West 
but was not subjected to colonial rule. Only in 
India did the British more or less consistently ex- 
pand their colonial sway, and France took over 
Algeria and made its first encroachments in Indo- 
china. In Britain it was even seriously proposed, 
not long before the start of the scramble for Africa, 
that there should be a withdrawal from African 
holdings. 

A very different climate of opinion and range of 
action prevailed in the last quarter of the century. 
The restraints on colonialism were swept away in 
the new imperialist flood that speedily completed 
the partition of the world between the imperial 
powers. Africa was almost totally divided into 
European dependencies. In other areas as well, new 
colonies were carved out or old ones consolidated 
and extended, as in southeast Asia, where the 
Dutch, French, and British greatly expanded the 
scope and intensity of their rule in the Indies, 
Indochina, Malaya, and Burma. Changing power 
relationships brought a redivision of territories in 


the Spanish-American War, in the Boer War, and 
after World War I with the transfer of German and 
Turkish holdings, 

To assess the causes of the change in the last 
decades of the nineteenth century would involve 
the whole range of theories of imperialism, but 
certain elements particularly related to colonialism 
may be singled out. Such men as Jules Ferry, 
Joseph Chamberlain, and Cecil Rhodes justified the 
revival of colonialism in terms of the needs of the 
new industrial system and by the demands of a 
Darwinian struggle between nations and races. The 
entry of France, Germany, and Italy, followed by 
Japan and the United States, into the imperial 
rivalry, not to mention Russia’s expansive mood, 
seemed to substantiate Lenin’s dictum that only 
colonial possession gave a complete guarantee 
against the risks of competitive struggle. The new 
wave of protectionism and governmental interven- 
tion at home restored validity to the assertion of 
direct political control overseas. Such control 
seemed particularly justified in tropical Africa, 
where it was arguable that only the assumption 
of full responsibility by a Western government 
could establish the conditions under which modern 
enterprise could function. This position found pow- 
erful support in the prevalent theories holding that 
certain races, notably the Teutonic or Anglo-Saxon, 
had a peculiar genius for government. 

The transition away from colonialism. Western 
imperialism reached its highest point before World 
War I, although several decades went by before 
World War 1 brought a full rejection of colonial- 
ism. The Spanish-American War marked the begin- 
ning and the end of any large-scale American 
involvement with colonialism, and the Boer War 
crystallized the hostility of many, in Britain and 
elsewhere, to imperialism. The years preceding 
World War 1 were the last in which a complacent 
colonialism could flourish as a part of what seemed 
the natural order of things. Liberal and socialist 
attacks on colonialism were growing, although the 
belief in white supremacy lingered on. The adop- 
tion of the mandates system in the Versailles peace 
settlement was one significant expression of the 
doubts that were beginning to undermine coloni- 
alism. The only significant additions to the colonial 
domains between the two world wars were short- 
lived: Mussolini’s anachronistic seizure of Ethiopia 
and Japan’s drive on China and, later, southeast 
Asia. 

All the forces opposed to colonialism and sap- 
ping its vitality grew in strength in the interwar 
years. The success of the Russian Revolution 
brought into being a world-wide network of agita- 


tion against imperialism, and nationalist activities 
and organizations were multiplying in the depend- 
ent territories themselves. In the imperial centers 
the will to maintain empire steadily declined with 
the spread of ideas hostile to racialism and colonial 
domination. World War n greatly hastened the 
process through the Japanese displacement of the 
colonial powers in southeast Asia, the further 
weakening of those powers at home, the intensifi- 
cation of anti-imperialist opinion throughout the 
world, and the atmosphere of change that per- 
meated many of the colonies. 

After 1945 the flood tide of anticolonialism 
swept away the colonial system with a speed and 
thoroughness that matched colonialism’s advance 
at the close of the nineteenth century. The posses- 
sion of colonies, so long a matter of pride and 
prestige, now became a sin to be expiated only, 
if at all, by the granting of immediate independ- 
ence. The League of Nations’ indifference to the 
problem was replaced by the profound involvement 
of the United Nations in the process of decoloni- 
zation. 


Attitudes toward colonialism 


Attitudes toward colonialism have varied greatly 
from time to time and from place to place. Most 
frequently, colonialism has been accepted as merely 
one manifestation of the ever-present truth that 
the strong dominate the weak. Although the mis- 
sionary element has rarely been wholly absent, the 
usual presumption has been that every colony does 
or ought to exist for the benefit of the mother 
country. 

The justifications of colonial rule cover a wide 
range, often resting upon the right of the con- 
queror, perhaps bolstered by a claim of racial 
superiority. Where the interests of the dependent 
peoples are taken into account, it is held that an 
extended period of guardianship is necessary to 
enable them to “stand by themselves under the 
strenuous conditions of the modern world.” Here 
the mission civilisatrice and the “white man’s bur- 
den” come into play. Some French spokesmen for 
colonialism acclaim it as the universal instrument 
for the spread of civilization, pointing to them- 
selves and many of their neighbors as products of 
Roman colonization. 

The defense of colonialism is likely to adopt 
some variant of the criterion laid down by John 
Stuart Mill, who, in the case of peoples not yet 
ready for representative government, defended 
alien rule on the ground that the colonial mode of 
government was as legitimate as any other if it 
was the one which in the existing state of civili- 
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zation of the subject people most facilitated their 
transition to a higher stage of improvement. Lord 
Lugard (1922) introduced another element in pro- 
posing that the colonial powers were under a dual 
mandate obligating them to secure the advance- 
ment of their dependent territories and to develop 
them in the interest of the world at large. 

The assumptions on which such defenses of 
colonialism rest have been increasingly subject to 
challenge in recent decades. The more moderate 
present-day approach tentatively accepts colonial 
rule if the authorities devote themselves to pre- 
paring their wards for independence, but growing 
skepticism as to the trustworthiness of the colonial 
powers has led to the insistence that they accept 
international supervision in so doing. The UN 
Charter looked to independence or self-government 
for all dependent peoples, tightened control over 
the trust territories surviving from the mandates 
system, and brought all nonself-governing terri- 
tories into the international public domain. 

The more radical approach denounces the impo- 
sition of alien rule as always evil under all circum- 
stances. This starting point eliminates all contro- 
versy as to whether one colonial system or policy 
is better than another by blanket condemnation of 
all, leaving immediate independence as the only 
way out. Building on the anticolonial resolution of 
the 1955 Bandung Conference, the UN General 
Assembly in its 1960 Declaration on the Granting 
of Independence gave this position international 
recognition. This declaration denounced the alien 
subjugation of peoples as a denial of human rights 
and an impediment to peace, proclaimed the right 
of all peoples to self-determination without condi- 
tions or reservations, and repudiated the doctrine 
of tutelage by asserting that inadequacy of politi- 
cal, economic, social, or educational preparedness 
shall never serve as a pretext for delaying inde- 
pendence. Asian and African opinion has con- 
stantly been moving toward this radical position, 
pillorying colonialism as the source of most of the 
world’s troubles and proclaiming that the higher 
law of anticolonialism renders all remnants of the 
era of European colonialism illegitimate and open 
to attack. 

The anticolonialists project such doctrines into 
the future through the use of the concept of neo- 
colonialism, which accuses the imperialists, among 
whom the Americans figure prominently, of regard- 
ing the independence that the colonial peoples have 
wrung from them as only the occasion to adopt 
more subtle tactics of domination and exploitation. 
Overt colonial rule is thus replaced by economic 
and other forms of control, including the provision 
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of aid, and the nominally free countries are Bal- 
kanized and manipulated in the imperial interest. 

The colonial and former colonial powers see 
what has been happening in recent decades in a 
very different light. They reject the charge of being 
oppressors and exploiters and point to their accom- 
plishments in advancing their dependent peoples 
in every sphere, including the granting of inde- 
pendence to hundreds of millions since 1945, How- 
ever, they differ greatly in the way in which they 
have envisaged their colonial mission. The position 
of four of them—Britain, France, Belgium, and 
Portugal—may be briefly sketched to indicate the 
wide range of variation. 


Varying colonial policies 

Great Britain. With the exception of the United 
States, whose colonial holdings were far smaller, 
Great Britain could adapt itself more easily to the 
new dispensation than any of the other colonial 
powers. The entire British policy of regarding colo- 
nies not as integral parts of the mother country 
but as countries with their own distinctive ways 
of life facilitated autonomous development. The 
colonial peoples were given an increasing share 
in the governing councils, public service, and judi- 
ciary and thus were started on what came to be 
a standard cycle culminating in self-government 
and then independence. This was a cycle through 
which the older dominions had passed and which 
was tested again in India, Ceylon, and elsewhere 
in the interwar decades. World War 1 brought 
both a heightening of the belief that colonial rule 
imposed responsibility for the well-being of de- 
pendent peoples and an acceptance of the need to 
move speedily to end colonialism. India’s inde- 
pendence, in 1947, started a process of decoloni- 
zation that dismantled the British empire in Asia 
and most of Africa and the West Indies; and al- 
though difficulties have cropped up where there is 
a substantial amount of white settlement or race 
mixture, as in the Rhodesias, Kenya, and British 
Guiana, the British have been able to transform 
most colonies into independent states within the 
framework of long-established policies that were 
already in operation. 

France. France, on the other hand, was forced 
to undertake a basic reversal in direction. In con- 
trast to the British, the French inclined always to 
a policy of cultural, economic, and political assim- 
ilation. It was characteristic of French policy that 
the 1944 Brazzaville conference of leading colonial 
officials decreed that France’s work of civilization 
in her colonies excluded any idea of autonomy or 
of evolution outside the French empire, “even in 


the distant future.” French aid for the colonies was 
greatly increased after World War n, and many 
reforms were introduced, but the bonds linking the 
colonies to France were not relaxed until the eve 
of independence. The Indochinese and Algerian 
wars demonstrated French reluctance to accept 
colonialism’s end. Even in 1958, when Guinea 
opted for independence by voting “no” in the ref- 
erendum on de Gaulle’s constitution, Guinea was 
treated as an outlaw. Yet in the succeeding months 
de Gaulle reconciled himself and the French people 
to African independence on terms of intimate col- 
laboration between France and the newly freed 
countries—terms often so intimate as to lead to 
charges that a French neocolonialism had been 
instituted, rendering independence nominal. 

Belgium. Belgian rule over the Congo, which 
came to an abrupt end on July 1, 1960, was an 
unusual combination of elements. The Belgians 
concentrated power in Brussels, as did the French 
in Paris, but they did not follow France in asso- 
ciating Africans with them in the imperial center 
nor Britain in drawing the Congolese into the local 
administration and governing councils. The great 
triumvirate—the Belgian government, the giant 
corporations, and the church—made tremendous 
strides in economic development, and to a lesser 
extent in welfare and education, laying the foun- 
dation for what would have been a solid structure 
if uninterrupted decades of colonial rule had 
stretched ahead. The Belgian philosophy of coloni- 
alism explicitly excluded the creation of an elite 
on the French or British model until mass educa- 
tion would have spread widely and a middle class 
come into being. The haste with which Belgium 
moved to sever its formal ties with the Congo 
following the riots in Leopoldville in January 1959 
gave no opportunity to bridge the immense gap 
between its patronizing paternalism and the respon- 
sibilities suddenly assumed by the Congolese, who 
were left with a government lacking trained Afri- 
can leaders and officials and an army lacking 
African officers. 

Portugal. The Portuguese offer a fourth variant 
of colonialism, ruling over an empire shorn of Goa 
but still reaching from Macao to Mozambique and 
Angola. Oldest of the Western colonial powers, 
Portugal continues to protest vigorously that she 
has no nonself-governing peoples but only equal 
provinces of a single indivisible realm. The Portu- 
guese boast that they are free of racial prejudice, 
but their African colonies are marked by the cleav- 
age between the few thousands of “civilized” or 
“assimilated” Africans and the millions of “uncivi- 
lized.” The 1961 rising in Angola led the Portuguese 


government to announce a number of reforms, 
including the abolition of the regime do indigenato 
and the establishment of a single status for all 
within the Portuguese domain, but the overwhelm- 
ing majority of the Africans remain illiterate and 
touched by little more of modernity than pressure 
to work as laborers for the Europeans. Further- 
more, embittered competition has inevitably broken 
out between the advancing African elements and 
the thousands of Portuguese peasants and workers 
officially encouraged to emigrate to Portuguese 
Africa with the double purpose of relieving home 
poverty and establishing the Portuguese presence 
so firmly as to make it unchallengeable. The heart 
of the problem is that Portugal is itself only a 
partially developed country, having lived for many 
years under a dictatorship and being unable to 
overcome its own poverty and mass illiteracy. Its 
ability to secure the advancement of millions of 
people overseas is obviously questionable. 


The literature of colonialism 


The literature dealing with colonialism is wide- 
ranging and diverse and reflects the changing 
nature of the colonial problem. For the most part 
it consists of studies of particular dependencies or 
groups of dependencies, but a substantial body of 
literature dealing with general aspects of colonial- 
ism has also been built up. Several studies have 
undertaken to compare the colonial policies of the 
powers in terms of the goals that have been set, the 
success with which these goals have been reached, 
and the administrative and other machineries that 
have been employed. At both ends of the spectrum 
the motives lying behind the acquisition of depend- 
encies and the evaluation of the forces leading to 
the present surge of decolonization open challeng- 
ing vistas to inquiry. Among the themes that have 
recurred regularly in the examination of coloni- 
alism are the relative values of direct and indirect 
rule, centralization and decentralization, varying 
types of economic policy, the acceptability and ef- 
fects of white settlement, pressures of different 
kinds to aid in recruiting a labor force, and the 
scope and nature of the educational system. 

In the interwar decades there appeared in the 
literature of colonialism the relatively new theme 
of international control over the colonial powers, 
but since 1945 this has been superseded by the 
processes and problems of decolonization and the 
means of securing economic and political develop- 
ment. The transition through the last stages of 
colonial rule to independence has been studied in 
a number of instances but still offers an unusually 
rich field of inquiry. Africa, achieving independ- 
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ence almost overnight, has come in for unprece- 
dented attention. Now that colonialism is virtually 
at an end, it becomes possible to explore in depth 
and in detail what type of colonial system has pro- 
duced the best results, but before this question can 
be meaningfully explored it is necessary to deter- 
mine the scale of values by which colonialism in 
its various guises is to be measured, 


The era of colonialism is far too close to us for 
any definitive and objective assessment of it to be 
possible. A few salient points may, however, be 
tentatively put forward. 

(1) Colonialism imposed alien and authoritar- 
ian regimes on subordinate societies. These regimes 
tended to train a few of their subjects in bureau- 
cratic management and required passive acquies- 
cence from the remainder. 

(2) Although for long periods passive acquies- 
cence was indeed largely attained, as colonialism 
advanced it also stimulated nationalist agitation 
and organization and came to be more and more 
passionately detested, particularly by those among 
the colonial people who came into closest contact 
with the European superiors. 

(3) The anticolonial forces have derived their 
inspiration and ideas primarily from the teachings 
of the colonial powers themselves, have for the 
most part adopted Western forms of organization 
and action, and have been led by men intimately 
acquainted with the West. 

(4) For good or ill, colonialism has been the 
primary channel through which the ideas and tech- 
niques, the spiritual and material forces of the 
West, have impinged upon the rest of mankind. 


RUPERT EMERSON 


[See also IMPERIALISM; MODERNIZATION; NATION; 
NATIONALISM. ] 
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LLI 
ECONOMIC ASPECTS 


This article will deal with the economic theories 
propounded and the practices adopted at different 
times during the four and a half centuries of 
European colonization and with the effects these 
had on the colonies. It will examine what differ- 
ence the end of formal empires in the mid-twen- 
tieth century made to the economic relations be- 
tween metropolitan powers and former colonies. 

The primary fact about colonialism as a his- 
torical phenomenon is that precast theories seldom 
dictated or even strongly influenced practice. At 
every stage the economic relationships between 
colonies and metropolis were determined pragmat- 
ically, according to current European practices and 
needs and conditions in the colonies. Theories were 
secondary, designed to justify or attack existing 
practice, Even in the case of Adam Smith’s Wealth 
of Nations (1776), when a new approach seems 
to have had demonstrable effects, it is clear that 
changed colonial policies resulted more from al- 
tered circumstances than from an intellectual ac- 
ceptance of his arguments. 


Economic theories of colonialism. Theories of 
colonialism may be placed into four periods of 
time: prior to 1660; from 1660 to 1776; from 1776 
to 1870; and from 1870 onward. Such divisions 
are arbitrary, but they provide a primitive frame- 
work for analysis. 

Before 1660. During the two centuries before 
1660 colonial economic theories were of minimal 
importance, The expansion of colonial territories 
proceeded experimentally, reflecting a wide diver- 
sity of aims: crusading zeal against Islam, the 
missionary impulse, geographical curiosity, the de- 
sire for bullion and for luxury trades in the East, 
and land hunger. None of these objectives required 
theoretical justification—though men like Barto- 
lomé de Las Casas might denounce particular as- 
pects of Spanish native policy in America—and 
the economic systems imposed on American settle- 
ments and Eastern trading bases were derived from 
the simple premise that each colonizing country 
was entitled to any advantages its new possessions 
offered. The northern European countries that 
came to colonization later—England, France, and 
Holland—had more time to theorize before acting; 
but no one produced a general theory of colonial- 
ism, and contemporary attitudes must be deduced 
from passing references. Most French and English 
commentators, such as Bodin, Antoine de Mont- 
chrétien, Marc Lescarbot, Bacon, and Richard Hak- 
luyt, mixed economic with other rewards of coloni- 
zation indiscriminately: the need for a route to 
the East via a northwest passage, the utility of 
American bases in the war with Spain, the interests 
of transatlantic fishermen, the possibility of find- 
ing gold and silver, the support a maritime empire 
would give the navy, the value of outlets for un- 
employed people, the possibility of new markets 
and sources of raw materials, and the Christian 
mission to convert the heathen. Most of these were 
commonplaces of Iberian writers: there was no 
attempt to select or to provide a rationale of colo- 
nialism. 

The mercantilist era. During the period after 
about 1660, however, colonial theories became 
more sophisticated. This was the classical age of 
mercantilist thought, though it must be remem- 
bered that the multiplicity of arguments and prac- 
tices it embraced were first dubbed a “system” by 
Adam Smith in 1776 [see ECONOMIC THOUGHT]. 
The essence of mercantilism was that it projected 
current metropolitan preoccupations into the colo- 
nies and assumed that dependencies existed solely 
to serve these particular interests. There was still 
some diffusion of aims, but the primary considera- 
tions were now economic advantage and the value 


of colonial trade for supporting an artificially large 
merchant marine. The economic possibilities of 
colonies were categorized, As producers of raw 
materials they served their owners by freeing them 
from dependence on European supplies, which 
might be cut off during war and for which monop- 
oly prices were often charged. Colonial products 
could, moreover, be paid for in exported manu- 
factures, saving foreign exchange, and could be 
re-exported to Europe to help the balance of trade. 
Conversely, colonies provided uniquely favorable 
markets for European exports, since they were 
monopolies, and thus helped to maintain employ- 
ment in metropolitan industries. Since they were 
subordinate, they could be prevented from building 
competing industries, and their economies could 
be made entirely complementary. These arguments, 
based on observation of established practices, were 
the staple of pamphleteers and statesmen from the 
second quarter of the seventeenth into the early 
nineteenth century: of Richelieu, Colbert, and Vau- 
ban in France; of Child, Petty, Davenant, Defoe, 
Arthur Young, and many others in Britain. Though 
never brought together into a coherent academic 
“system,” they formed a well understood and only 
occasionally criticized corpus of concepts, conven- 
iently summarized as the pacte colonial. 

Adam Smith: free trade. Attacks on these prin- 
ciples became significant only in the mid-eighteenth 
century, led by the French physiocrats and Ency- 
clopedists; the first because they disapproved of 
overemphasis on industrial production for the 
colonies, the second because they disliked all mo- 
nopoly [see Economic THOUGHT]. But it was Adam 
Smith (1776), normally taken to be the first aca- 
demic economist, who first attacked mercantilism 
root and branch and provided an alternative theory 
of colonial economics. He did this by applying his 
theory of the division of labor to colonial produc- 
tion and trade. The value to Europe of colonies in 
America (he largely ignored possessions elsewhere) 
was merely that they provided new articles for 
international trade and extended the market for 
European manufactures, Such advantages were in- 
dependent of any colonial system and were dimin- 
ished to the extent that any state tried to monop- 
olize its colonial trade. Monopoly raised the cost 
to consumers both in America and Europe; dis- 
couraged foreign capital from colonial investment; 
raised the profits of metropolitan capital and so 
reduced each country's competitiveness; and made 
the metropolis dangerously dependent on colonies 
of uncertain loyalty. In addition, the higher profits 
allegedly made by European merchants on the 
monopoly trade of the colonies had to be set against 
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the costs of imperial government and defense, 
which were paid by the general taxpayer. Hence 
a better colonial system would be one in which 
there was no monopoly and in which common 
costs were shared between colonies and metropolis. 
Better still, all colonies should be liberated, for 
once their trade was open to the world, the prin- 
ciple of the division of labor could be fully applied, 
and Europe would no longer bear the unrewarding 
cost of imperial organization. 

Smith was rightly pessimistic about Europe's 
willingness to admit the truth of these facts. For 
half a century his arguments convinced only the 
minority, and no colony was liberated voluntarily, 
for there were always enough traditionalists to 
argue that the benefits of colonialism were real and 
to run against colonial monopoly and therefore 
purely economic rewards. But the trend of eco- 
nomic theory, as expounded in England by men 
like David Ricardo and the Benthamites, continued 
to run against colonial monopoly and therefore 
against colonies, for the two were still assumed 
to be inseparable. In any case the question had 
become almost academic by the 1830s, for once 
the United States, Spanish America, and Brazil 
were independent the British alone retained a sub- 
stantial empire, compensating by gains in India 
for what they had lost in America. But in the 
1820s, the British, as they moved toward interna- 
tional free trade, were also losing interest in their 
colonies, which were increasingly criticized as fields 
of government expenditure unrequited by economic 
advantages. India and trading bases in the East 
were generally accepted, because they were eco- 
nomically self-supporting and were an increasingly 
valuable market for British manufactures; but 
settlement colonies were falling into disrepute. 

Modern period: capital outlet. In the century 
after 1830 two distinct theories were developed to 
justify or rationalize colonies as economic phenom- 
ena under the new conditions. The first theory ap- 
plied only to colonies of white settlement in North 
America, South Africa, and Australasia and con- 
cerned only the British. Against those who de- 
nounced such possessions, E. G. Wakefield, in a 
series of publications starting with his Letter From 
Sydney in 1829 and culminating in his Art of 
Colonization in 1849 argued that suitable colonies 
were valuable to the parent state, even under free 
trade conditions, provided they were correctly or- 
ganized. Denying the precepts of Ricardo, he ar- 
gued that an industrialized state could generate 
surplus capital which it was unprofitable to invest 
at home either in agriculture (because of the 
declining profitability of marginal lands) or in 
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industry (because foreign markets did not expand 
fast enough to absorb its products). Adopting 
Smith’s theory of “new equivalents,” he held that 
this capital would be more profitably employed if 
it were exported to places where fertile land was 
in good supply. Provided ample labor was made 
available by emigration, capital would be more 
productive in these new lands and at the same time 
would expand the market for British manufactures 
and increase the supply of cheap foodstuffs and 
industrial raw materials. Colonies would provide 
these advantages better than independent states 
because colonies could be forced to be free-trading, 
and the new settlements would remain primary 
producers for the indefinite future. 

Wakefield's theory and the detailed prescriptions 
he laid down for new settlements were never fully 
tested; but his basic argument carried conviction 
and constituted the main justification for settle- 
ment colonies under free trade conditions. But 
since only Great Britain possessed important settle- 
ment colonies in the nineteenth century, Wake- 
field’s theory had limited applicability. The vast 
majority of new European colonies in the period 
after 1830 were tropical territories in Africa, Asia, 
and the Pacific that did not fit his formula. By 
the last quarter of the century new theories were 
necessary to justify these acquisitions to the mass 
electorates of Europe. 

Modern period: market outlet. Thus the second 
main theory of colonialism in the modern period 
was specially adapted to the facts of tropical de- 
pendencies. Significantly, no leading economist 
dared challenge Adam Smith by formulating a 
comprehensive doctrine; the new “neomercantilist” 
theories were evolved by politicians and business- 
men rather than academic economists. Theory fol- 
lowed the fact of new colonies and had to justify 
them. As propounded and widely disseminated by 
such statesmen as Jules Ferry of France and 
Joseph Chamberlain of Britain, the new argument 
was that tropical colonies were essential as markets 
(débouchés) for the surplus products of European 
industry as it expanded under the influence of 
renewed domestic protectionism—necessary safety 
valves for industrial capitalism—without which 
Europe faced social chaos and perhaps revolution. 
As a secondary point it was held that industrial 
states needed guaranteed sources of cheap indus- 
trial raw materials and food and that colonies en- 
sured that any one country could not be made to 
pay monopoly prices for them. In addition, some 
argued that Europe had surplus capital that was 
best invested in tropical plantations, mines, and 
communications. Thus, in general, such colonies 


provided the solution to most of the economic and 
social problems of industrial Europe under condi- 
tions of protectionism. 

Twentieth-century criticism. By the early twen- 
tieth century these arguments were widely accepted, 
even to the extent that many believed tropical 
colonies actually did provide these projected advan- 
tages and that they had been deliberately acquired 
for these functions. Seldom were any of these as- 
sumptions true: the irony was that critics of the 
new colonization accepted them and founded their 
new anticolonialist arguments on these alleged 
facts. Critics had been vocal throughout the later 
nineteenth century, mostly on financial or humani- 
tarian grounds. After about 1900 they divided into 
two groups: those who believed the profits of tropi- 
cal empire to be real but deplored them; and those 
who believed the rewards to be illusory. The first 
school was led by European Marxists who regarded 
the export of capital to the tropics as evidence that 
Europe was entering a new phase of “monopoly 
capitalism” when it was no longer profitable to 
invest in protected domestic markets because they 
were dominated by a few great trusts. Europe had 
to export surplus capital or allow capitalism itself 
to stagnate. Ultimately the nations would compete 
for the limited supply of colonies, and the resulting 
wars for imperial redivision would inaugurate the 
socialist revolution and the end of capitalism. This 
was the basic argument propounded by V. I. Lenin 
in his influentia] pamphlet Imperialism, the Highest 
Stage of Capitalism (1916). Others, like Rosa Lux- 
emburg, produced variants on the same theme. 

The rival school of critics was led by J. A. Hobson, 
whose Imperialism: A Study (1902) stated the case 
against the profitability of tropical colonization. 
Hobson accepted the propagandists’ argument that 
colonies were the product of surplus capital seeking 
fields for investment but held that this surplus 
existed only because the social and economic sys- 
tems of Europe denied the masses sufficient con- 
sumer capacity to justify increased investment in 
home industry. Hobson was wrong in thinking that 
the bulk of the exported capital had gone to the 
new dependencies: in fact most was going to the 
old settlement colonies, India and the independent 
American countries. But he rightly pointed out 
that most of the new colonies were too poor to 
provide valuable markets for manufactured ex- 
ports and held that the cost of defense and admin- 
istration, coupled with the moral degradation re- 
sulting from most of Europe's activities in Africa 
and Asia, outweighed any advantages such colonies 
might provide. His conclusion was that Europe’s 
best interest was to invest at home; to stick to free 


trade; and to place undeveloped tropical territories, 
whose raw materials were needed by the West, 
under international supervision. He thus brought 
the debate over colonization back to the position 
taken by Adam Smith. During the remaining half 
century of European colonialism no one satisfac- 
torily demonstrated that tropical empire was either 
necessary or profitable to the Western powers. 

The “practice” of colonialism. The practice of 
colonialism (taking this to mean the devices actu- 
ally employed to provide economic advantages to 
the owners of colonial dependencies) can be de- 
scribed as having evolved through three stages of 
unequal length: from the foundation of the first 
colonies to the 1830s; from that time to near the 
end of the century; and from about 1890 to the 
period of decolonization after 1945. In each period 
imperial arrangements closely followed contem- 
porary practices in the parent states and owed very 
little to theory. 

Before 1830: the mercantilist period. During 
the first period—the era of mercantilism—the pat- 
tern was set by Spain and Portugal as the pioneers 
of overseas colonization. They in turn instinctively 
applied the protectionist and monopolistic practices 
current in sixteenth-century Europe to their new 
possessions. Later-starting imperial states adopted 
most, though not ail, of these techniques, so that 
by the later seventeenth century there was a “sys- 
tem” largely common to all empires, varying mainly 
according to whether colonies were in America or 
the East. There were practices common to all: 
foreign ships were entirely excluded from colonial 
ports; virtually all colonial exports and imports 
were routed through the ports of the metropolis; 
and specified manufactures or processing of raw 
materials were banned in the dependencies. Spain 
and Portugal went further than others by insisting, 
until late in the eighteenth century, that American 
trade should be carried only in annual convoys and 
be restricted to a single metropolitan port. Spain 
also banned most intercolonial trade in America 
and restricted the Pacific trade from Mexico to the 
Philippines. By these means it was hoped that each 
metropolis would have a monopoly of colonial bul- 
lion and raw materials and a guaranteed colonial 
market for manufactures; that metropolitan mer- 
chants would be ensured a middleman profit on 
trade passing through the parent state; and that 
the colonies would be preserved as primary pro- 
ducers, ideal markets for industrial Europe. In 
addition, production of the most wanted colonial 
products was encouraged by complex systems of 
bounties and preferential tariffs in the home mar- 
kets, Trade with Africa and the East was dealt 
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with rather differently. Until after 1640 Portugal 
excluded even her own nationals from dealing in 
the more valuable Eastern commodities, leaving a 
monopoly to the crown. England, France, and 
Holland also imposed monopolies on their Eastern 
trade but granted them to privileged chartered 
companies with full administrative powers. 

The main imperial “profit” from these mercan- 
tilist practices arose from commercial monopoly, 
This is practically impossible to estimate quanti- 
tatively but almost certainly existed, at least to the 
extent that colonists had to pay a higher price for 
their imports and received lower returns for their 
exports than under a free trading system. This 
metropolitan profit varied inversely with the eco- 
nomic capacity of the parent state, favoring eco- 
nomically uncompetitive countries like Spain, who 
would otherwise have had little share in the trade 
of their own colonies, more than a country like 
Britain, which, by the eighteenth century, pos- 
sessed the greater economic potential. For Britain 
it has been estimated that in 1773 the gross “profit” 
of mercantile controls in North America lay be- 
tween $2.5 and $7 million, ignoring the ancillary 
costs involved in empire. But in addition to such 
commercial profits, Spain and Portugal made sub- 
stantial fiscal profits from their American colonies 
by transferring surplus revenue to the metropolitan 
exchequer. Portugal was receiving thereby some 
£900,000 annually in the mid-eighteenth century. 
No other colonial power used this device. The 
French subsidized their colonies. The British con- 
ceded the principle of fiscal autonomy until after 
1763 and then attempted to impose taxes on the 
colonies, not to make a profit but to offset the cost 
of colonial defense and administration. This at- 
tempt failed, although it had considerable impor- 
tance in the course of events leading to American 
independence. 

Nineteenth-century trend to free trade, The 
first and longest era of colonialism ended effectively 
during the first half of the nineteenth century— 
more because the colonies that had made it mean- 
ingful were now independent than because colonial 
theory had changed. The British, who possessed 
the only large empire, gradually adopted free-trade 
practices at home and extended them to the colo- 
nies, British colonies were virtually open to the 
world by 1830 and by 1860 the last vestiges of 
shipping controls and preferences on colonial prod- 
ucts had gone. Other colonial powers slowly fol- 
lowed suit. The Dutch threw open their colonial 
trade after 1815 but created a special monopoly 
of the carriage and sale of certain commodities 
that the Batavian government collected as part of 
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the “culture system” from 1830 to the 1870s. 
France preserved the colonial shipping monopoly 
together with preferences and certain exclusive 
regulations until the 1860s but had abolished them 
by 1870. Spain and Portugal never completely re- 
moved mercantile controls but largely liberalized 
them for their few dependencies, By 1870 the era 
of mercantilism seemed over, in that no imperial 
power then obtained economic advantages from 
its dependencies that were not available to the 
world. 

Resurgence of limited protectionism. The pe- 
riod of colonial free trade was very short-lived. 
The revival of protectionism in most parts of Eu- 
rope in the last quarter of the century led naturally 
to its extension to the new tropical empires, France 
adopted a strongly protectionist domestic tariff in 
1892 and extended it to all her colonies except 
West Africa, the Congo, and the Pacific, which had 
to have separate tariff systems because of interna- 
tional treaties or local economic conditions. Even 
so all colonies gave and received preferences. The 
Russians enclosed their new possessions in central 
Asia and the Far East within their metropolitan 
tariffs and gave bounties and preferences on se- 
lected colonial products. The United States incor- 
porated most new dependencies in the Caribbean 
and Pacific within the metropolitan tariff, leaving 
only the Panama canal zone and Samoa open to 
international trade on the open-door principle. 
Portugal, Spain, and Italy either assimilated their 
colonies to the metropolis or imposed preferences. 
But not all states reverted to mercantile concepts. 
The Dutch abolished their semimonopoly of Indone- 
sia in the 1870s and maintained an open door, al- 
though using quotas on imports in the 1930s. Ger- 
many before 1914 and Belgium also preserved the 
open door with moderate tariffs in their colonies. 
Britain resisted demands from pressure groups at 
home and from some settlement colonies and did 
not drop free trade until 1932, though the self- 
governing colonies indulged in protection and, after 
1899, gave Britain limited preferences. Some pref- 
erences were given to colonial products in the 
British market during and after World War 1. 
Britain reverted to protection in 1932, and the 
Ottawa agreements of that year led to a preferential 
system throughout the empire, coupled with quotas 
on some products and financial control by means 
of the sterling area. 

During the last phase of European colonialism, 
therefore, most colonial powers adopted some form 
of preferential system: hence the term “neomer- 
cantilist” to describe the new pattern. Yet it never 
approximated in its severity the mercantilism of 


the first period. No power excluded foreign ships 
or goods from its colonies, forced colonial trade to 
pass through metropolitan ports, forbade colonial 
manufactures that might compete with its own or, 
with the sole exception of Holland between 1830 
and 1877, transferred colonial revenues to the 
metropolitan treasury. 

Significance of colonial trade. Moreover, few 
colonial powers obtained anything approaching a 
monopoly of the overseas trade of their depend- 
encies or found their chief market or source of 
imports in their empires. Russia and the United 
States took most of the exports and supplied most 
of the imports of their possessions; but for both 
colonial trade was of marginal importance. Britain's 
share in her empire’s overseas trade declined 
steadily from about 49 per cent in the decade after 
1854 to 35 per cent in 1929-1933, rising only 
slightly thereafter as a result of imperial prefer- 
ences. But the colonies’ share in British total over- 
seas trade rose from about 28 per cent in the 
1850s to 51 per cent between 1950 and 1954. 
France continued to furnish a high proportion of 
her colonies’ imports—always more than half (ex- 
cept during the world wars )—but the proportion of 
colonial exports to France declined fairly steadily. 
On the other hand the colonies’ share of France's 
total overseas trade was always relatively small: 
about 10 per cent until 1914, then rising to 28 
per cent in 1934. Belgian territories in central 
Africa took 51 per cent of their imports from 
Belgium in 1928 and 46 per cent in 1938. They 
sent 70.6 per cent of their exports in 1928 and 
80.7 per cent in 1938 to Belgium. But the Congo 
played a very small part in Belgian overseas trade, 
supplying 2.4 per cent and 6.6 per cent of Belgian 
imports in 1928 and 1938 respectively and taking 
less than 2 per cent of Belgian exports in each 
year. The trade of the Netherlands Indies was not 
monopolized by Holland and played a minor part 
in Dutch overseas trade. In 1928 only 5.3 per cent 
of Dutch imports came from Indonesia, 7.2 per 
cent in 1938. In the former year 8.9 per cent of 
Dutch exports went to Indonesia and 9.5 per cent 
in 1938. Indonesian exports to Holland declined 
from 78.3 per cent in 1850 (during the period of 
the “culture system”) to 15.3 per cent in 1930; and 
Indonesian imports from Holland fell from 41.9 
per cent to 16.8 per cent in the same period. 

Although not comprehensive, these figures point 
to a firm conclusion about modern colonialism on 
its commercial side. Most colonial powers took a 
large proportion of the overseas trade of their 
colonies; but with the sole exception of Britain, 
whose empire was far larger and provided uniquely 
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favorable markets and varied sources of raw ma- 
terials, the colonies were of small importance to 
the world trade of their owners. Artificial factors, 
such as tariffs, preferences, and currency systems, 
probably affected the pattern of colonial trade to 
some extent, but never as much as mercantile con- 
trols had done before 1830. Since, moreover, there 
was never a total ban on foreign trade or shipping, 
and therefore no monopoly, it cannot be said that 
“neomercantilism” exploited dependencies; in fact, 
bounties, preferences, and quotas on colonial prod- 
ucts probably favored the colonial producer more 
than colonial tariffs helped the metropolitan pro- 
ducer. The old pacte colonial was indeed revived, 
but it was no longer exclusive and was reasonably 
two-sided in its benefits. 

Advantage of colonial investments. There re- 
mains the question of metropolitan advantage 
from capital investment in the colonies. Did politi- 
cal controls, especially of non-European labor, 
create especially favorable conditions for European 
capital in the dependencies, providing a “super- 
profit” for European “finance-capitalists”? The 
question is too complex to unravel briefly; but two 
points are reasonably certain. First, the great bulk 
of European investment in colonies was in govern- 
ment bonds or in fixed interest debentures in public 
utility companies. On both, the interest paid was 
only very slightly higher than that from comparable 
stocks in Europe or America, so that colonies were 
not forced to provide artificially high returns to 
metropolitan investors. Second, the return on some 
risk stocks—equities—was often high, and prob- 
ably higher than could be obtained at most times 
on industrial equities in Europe or the United 
States. But this was due less to the fact of colonial 
subordination than to the intense world demand 
for certain products possessed by some tropical 
colonies, for example, metals and rubber; and the 
return on capital invested in such ventures was 
not evidently different from that invested in inde- 
pendent countries of similar economic type that 
lacked the capacity to develop their own natural 
endowments, notably those in the Middle East and 
Latin America. In short, the special profitability 
of capital investment attracted to the tropical 
colonies reflected the inherent advantages of the 
strong world demand at particular times for their 
particular types of product, the scarcity of capital 
there, and the strong bargaining position of West- 
ern countries when dealing with politically or 
economically weak societies. Colonial investments 
did not depend on formal imperial control to pro- 
vide excess profits. 

This fact is the key to understanding the eco- 
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nomic relations between the industrialized Western 
powers and the new ex-colonial countries in the 
period after decolonization in the 1960s. The phe- 
nomenon denounced by Marxists and by nation- 
alists in the new states as “neocolonialism” was the 
continued economic dependence of the ex-colonies 
on their previous masters or on other Western 
powers. This could take the form of special com- 
mercial relations, such as exist between France 
and several ex-dependencies in west Africa and 
elsewhere; between Britain and many members of 
the sterling area; and between the United States 
and the Philippines or Puerto Rico. Alternatively, 
“neocolonialism” might consist in the “exploitation” 
of these “developing” societies through their re- 
liance on foreign capital, which tends to place 
segments of their economies under the control of 
foreign companies or governments. In either case 
“neocolonialism” implies that the advanced coun- 
tries are continuing to interfere in the economic 
life of onetime colonies as if they had not been 
liberated. 

It remains debatable how much truth there is 
in these allegations; but in fact “neocolonialism” 
is as imprecise a concept as colonialism. Facts are 
in any case more important than accusations. Most 
developing countries, like most earlier colonies, are 
primary economies, and depend heavily on more 
advanced states for markets, imported manufac- 
tures, capital, technical skills, and opportunities to 
train their own nationals, If they prefer to avoid 
contacts of these kinds in order to retain their en- 
tire freedom of action, the choice is now open to 
them, strengthened by the possibility of obtaining 
the same amenities from Russia or China as “non- 
imperialist” states. In practice, however, the bal- 
ance of advantage almost certainly lies with the 
receiving countries. In the 1960s they have been 
given much capital in the form of “aid”—i.e., 
grants or capital on noncommercial terms—and 
many have benefited substantially from preferential 
markets for their products in France, the Common 
Market, Britain, or the United States. [See FOREIGN 
AID, article on ECONOMIC ASPECTS; INTERNATIONAL 
INTEGRATION, article on ECONOMIC UNIONS; INTER- 
NATIONAL TRADE CONTROLS, article on TARIFFS AND 
PROTECTIONISM.] If “neocolonialism” exists, it is 
the inevitable product of an inherent imbalance 
between the advanced and the developing econ- 
omies, irrespective of the political factors involved. 
This one-sided relationship will disappear only 
when the new states reach the position already 
achieved by Japan and become as powerful eco- 
nomically as the ex-colonial powers on whom they 
continue to depend. Evidence provided by eastern 
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Europe after 1945 does not suggest that the eco- 
nomic status of nominally independent countries 
associated with socialist Russia differs substantially 
from that alleged to exist between the ex-colonial 
territories of Africa and the East with their one- 
time masters. 

D. K. FIELDHOUSE 
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COLOR VISION 
See under VISION. 


COMBINATIONS AND PERMUTATIONS 
See PROBABILITY, article on FORMAL PROBA- 
BILITY. 


COMMISSIONS, GOVERNMENT 


The term “commission” covers several usages: 
(1) the document, signed and sealed, that evi- 
dences an official appointment, such as a captain’s 
commission; (2) the term of office of an appointee, 
as in the phrase appended to a notary’s acknowl- 
edgment, “My commission expires on . . . (date)”; 
(3) the title of the heads of various departments or 
divisions of national, state, or municipal govern- 
ments, for example, commissioner of patents or 
county commissioner; (4) the name of an official 
agency or institution headed by a collegial body of 
commissioners. This article is confined to the last 
of these usages, excluding unofficial bodies (often 
self-constituted) that call themselves commissions 
and that may indeed perform some more or less 
analogous functions in the larger political system 
of society. Even so understood, the terminology is 
arbitrary: an agency called a commission may be 
otherwise indistinguishable from something else- 
where called a board, a council, a committee, a 
tribunal, or a court. Nuances of emphasis that in 
certain cases would support distinctions between 
these terms—resting on the source of the powers 
(presidents commission ), the larger or higher body 
from which the membership is drawn (judicial 


committee of the privy council), or the nature of 
the mandate given (board of examiners )—break 
down in the indiscriminate usage the words have 
received, The distinction attempted in the Ency- 
clopaedia of the Social Sciences (1930-1935), be- 
tween commissions as regulatory bodies (Dickinson 
1931) and boards as administrative (Fairlie 1930) 
or advisory (Macmahon 1930), has not caught on. 
The one common feature of all such bodies that is 
pertinent here is the plural headship of the institu- 
tion, in contrast with agencies having a single 
chief. This feature is usually, if not inevitably, 
accompanied by a certain degree of autonomy 
within the surrounding governmental structure. 

Government commissions, formally constituted, 
appear to be rare among the institutions of rela- 
tively primitive societies. But they occur frequently 
in the governments of advanced countries, whether 
the regimes are federal or unitary, presidential or 
parliamentary, one-, two-, or multiparty, or demo- 
cratic in the Eastern or the Western sense. In 
Britain, official bodies formally commissioned to 
carry out various governmental tasks, and em- 
powered in so doing to adjudge rights and impose 
penalties, date back at least to the time of Henry 
vm. During the sixteenth and seventeenth cen- 
turies, some of these (for example, the Court of 
High Commission) were seen as encroachments 
on the common law courts and gave “government 
by commission” a bad name as a threat to liberties 
—a reputation that lingered there in conservative 
legal quarters as late as Lord Hewart’s attack on 
administrative justice in The New Despotism (1929) 
and in the United States in the American Bar As- 
sociation’s anti-/New Deal campaign that culmi- 
nated in the Administrative Procedure Act of 1946. 
Meanwhile, during the nineteenth century in Brit- 
ain, royal commissions of inquiry became favored 
instruments leading to major advances in social 
legislation, such as the Factory Acts. In the United 
States, administrative and regulatory commissions 
were created sporadically in state and local govern- 
ments as early as the 1830s as a means of giving 
added competence, vigor, or impartiality to various 
governmental functions. They multiplied in num- 
ber during the post-Civil War decades and gained 
in prestige when Congress copied the model in 
establishing the Interstate Commerce Commission 
in 1887. After Galveston, Texas, experimented 
with it in 1901, the “commission plan” enjoyed 
wide popularity as a model form of municipal gov- 
ernment, Reliance on commissions became an 
article of the Progressive faith. 

On the Continent, commissions were formed 
around the monarchical establishments of the lead- 
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ing European powers in the sixteenth and seven- 
teenth centuries. Peter the Great introduced them 
into Russia, and the Soviet Union employs them 
in its budgeting and planning processes. 


Types and functions of commissions 


Two preliminary distinctions, if not too sharply 
drawn, are useful in sorting out types of commis- 
sions and the various functions they may perform. 
One is temporal, between bodies convened ad hoc 
to deal with a specific situation and those perma- 
nently established to handle a class of business 
arising regularly or intermittently. Thus, a bound- 
ary commission expires when its determination has 
been made; a truce commission is finished when a 
treaty of peace has been concluded; a claims com- 
mission is temporary if the claims it is set up to 
hear relate to a single episode, like the Philippines 
War Damage Commission, and permanent if the 
claims are of a class that will recur indefinitely. 
Very different considerations may enter into the 
appointments made to temporary commissions 
from those applicable to replacements in an on- 
going body. For the same reason, structure, powers, 
and procedure may also vary considerably. 

The second kind of distinction turns on the overt 
purposes commissions may be established to serve: 
(1) an arbitral commission to reach a decision 
binding on the parties to a dispute, such as the 
Electoral Commission that settled the outcome of 
the Hayes—Tilden election in 1876; (2) a commis- 
sion of inquiry to investigate, report findings, and 
perhaps pass judgment on the responsibility of in- 
dividuals for an event—often a disaster or a scan- 
dal—that has transpired, such as the Warren Com- 
mission following the assassination of President 
Kennedy in 1963; (3) a ceremonial commission 
to mark an occasion of moment, such as the George 
Washington Bicentennial Commission, in 1932; 
(4) a study commission to make plans or recom- 
mendations for future public policy, such as the 
two Hoover Commissions on Organization of the 
Executive Branch of the Government, in 1949 and 
1955; (5) an operating commission to conduct a 
branch of public enterprise, such as the Atomic 
Energy Commission, or to administer a subsidy 
system, such as the Federal Maritime Commission; 
(6) an advisory commission to maintain liaison 
between a governmental agency and some of its 
clientele publics, such as the Advisory Commission 
on Intergovernmental Relations; (7) a regulatory 
commission to supervise some field of public or 
private activity, such as the Civil Service Commis- 
sion or the Federal Communications Commission. 

These categories are suggestive rather than ex- 
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haustive. Clearly, most of the tasks they comprise 
could be—and in apparently similar circumstances 
often are—performed by single-headed agencies or 
by a single official with a suitable staff. A single 
judge can arbitrate a dispute or conduct an inquiry, 
as Justice Spence of the Supreme Court of Canada 
did in 1966 into the sex-and-security scandal 
charges brought by the Liberal government against 
members of the previous Diefenbaker government. 
A cabinet minister can organize a study, recom- 
mend policies, oversee a departmental operation, 
and regulate an industry: the secretary of the 
treasury, for example, does all these things every 
year. 

To understand the prevalence of commissions, 
therefore, it is helpful to employ a third sort of 
classification that looks at their latent functions 
and, hence, the ulterior purposes they may serve. 
For example, a collegial body may lend added dig- 
nity, authority, or apparent impartiality to official 
action and thus tend to legitimize both the action 
itself and the regime that established the commis- 
sion. A hard decision from such a source may be 
easier for its subject to accept, and in that case 
the beneficial effects accrue to all concerned. Simi- 
larly, the award of a tribunal or a rule adopted by 
a commission—the product of deliberation—may 
be more acceptable than the decree of a single of- 
ficial that smacks of whim or patronage. The effect 
of a united front may indeed be deemed so impor- 
tant as to warrant a caucus procedure, binding all 
the members of a commission to give an outward 
expression of unanimity to the decision of a major- 
ity. Alternatively, a commission that announces 
divided decisions, and publishes majority and dis- 
senting opinions, may nevertheless legitimize its 
work by its evident reliance on rationality to justify 
its actions. 

Again, when an unpleasant duty is to be done, 
for which no one involved wants personal respon- 
sibility, the task may be given to a commission. To 
take an extreme example: when a man is sen- 
tenced to be shot, the means usually used is a firing 
squad and not a single executioner so that no one 
may know whose bullet did the deed. More gen- 
erally, a collegial body, meeting in camera and 
acting collectively and impersonally, shields the 
responsibility of the members in anonymity. Here 
the benefits accrue to the members rather than to 
the regime. 

A commission is indispensable if its chief func- 
tion is representational, to assure a voice in delib- 
erations to each of several parties with diverse 
interests. International bodies, domestic intergov- 


ernmental agencies (established, for example, un- 
der an interstate compact), and tripartite tribunals 
to handle labor-management disputes fall into 
this category. Ex officio memberships, as on an 
interagency commission, are a mark of the repre- 
sentational function. 

Commissions have other latent functions, notably, 
bargaining, public education, delay, patronage, ap- 
peasement, frustration, and rubber-stamping. A 
commission composed of representatives of con- 
tending economic, sectional, or ethnic interests, 
with or without additional members as “impartial” 
mediators or referees, may provide the means of 
working out internally the compromises essential 
in reaching a decision acceptable to all the inter- 
ests. An interdepartmental or intergovernmental 
commission, or a mixed commission of officials and 
private parties, may function similarly as a forum 
for bargaining. In all these situations, successful 
bargaining contains conflicts that otherwise would 
result in stalemate or would spill over into a wider 
political arena. Where acquiescence in a decision 
rests basically on agreement between recognized 
representatives, for instance, in settling strikes, 
fixing prices, or allocating markets, the bargaining 
function reinforces the legitimacy of the commis- 
sion as an institution. 

A further constructive function is educational. 
Commission studies, reports, and recommenda- 
tions, clarifying issues and marshalling evidence, 
are important elements in the political process of 
building public support for many policy changes. 
In Britain this historic role of royal commissions 
and of parliamentary select committees has lately 
been more often performed by departmentally ap- 
pointed committees. In the United States it is a 
role of temporary commissions, regulatory com- 
missions, and congressional committees. 

The delaying function scarcely needs elabora- 
tion. Unsuccessful bargaining leads to a deadlock 
that it may be inconvenient to acknowledge. In- 
decision in the face of uncertainties over the costs 
and benefits of alternative policies may be equally 
awkward to confess. Controversies that cut across 
party lines endanger the stability of coalitions and 
jeopardize other objectives. So when an intractable 
problem evidently admits of no agreed or clear 
solution at present, tacit or explicit agreement may 
nevertheless (or therefore) be reached that the 
issue need not be settled now. Creating a study 
commission to report later may be a face-saving 
form of inaction that postpones the timetable for 
decision. The Wickersham Commission on Law 
Enforcement, established during the Hoover ad- 


ministration to temporize with the prohibition is- 
sue, was a Classic instance. The tactics of delay 
may boomerang, of course, as they did in that case. 

The patronage function is equally plain. Multi- 
member commissions afford a multiplicity of ap- 
pointments. Some of these, because of their promi- 
nence or the policy stakes that hang on them, will 
be filled according to tests of merit and fitness. 
But the more numerous (and hence obscure) the 
places or the more honorific the assigned duties, 
the more likely it is that they will be used to satisfy 
patronage claims. It would be a mistake to suppose 
that all the patronage accrues necessarily to the 
formal appointing authority. Commission appoint- 
ments may reflect demands, or serve to co-opt 
support, from quarters within the political system 
that are apparently unrelated to a commission's 
overt scope of activities. In the United States, for 
instance, ex-congressmen frequently find shelter 
in federal commissions. 

Finally, a commission may serve to pacify some 
clamorous group for whom nothing more will ac- 
tually be done: a study project will keep the mem- 
bers occupied and symbolize their concerns. This, 
as it turned out, proved to be the main function of 
the Kestnbaum Commission on Intergovernmental 
Relations in 1954-1955, some of whose sponsors 
and members appeared to desire a return to the 
Articles of Confederation. A policy proposal may 
also be given to a study commission hostile to its 
main thrust as a means of killing it or to a tame 
group for a foreordained approval. 


The study of commissions 

The universe of commissions, doing business 
under that name or another, is too heterogeneous 
for systematic study as a single category, apart 
from the initial difficulties of definition and col- 
lection in compiling an inventory. Research in the 
field, accordingly, has generally taken one or an- 
other of at least three broad paths, not altogether 
mutually exclusive: historical, topical, and com- 
parative, all dealing with selected fragments. 

Case studies have dealt with commissions in- 
dividually, or in groups, or with significant episodes 
in the life of single agencies. Their emphasis may 
be on the agency as an institution or, more broadly, 
on policy development in its field. Such studies 
range in bulk from Sharfman’s bland four-volume 
account, The Interstate Commerce Commission 
(1931-1937 )—which, though the work of an econ- 
omist, is devoid of economic analysis—to articles 
and pamphlets, for example, the Inter-university 
Case Program’s publication, The Public Advisory 
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Board and the Tariff Study (1956). Well-done case 
studies have the merits of realism and depth and 
the disadvantages of using history as a basis for 
theoretical generalizations. Henderson’s (1924) 
still-classic examination of the disillusioning first 
decade of the Federal Trade Commission's experi- 
ence illustrates the value of this type of work. 

Topical studies in Britain and the United States 
during the 1920s and 1930s were dominated by the 
concerns of public law—the scope of delegated 
powers of commissions in dealing with individuals 
and business firms and the extent of judicial review 
of administrative decisions. These concerns en- 
tailed a concentration on regulatory authorities and 
made more or less irrelevant the question of single- 
headed or multimember agencies. In Britain the 
Report of the Committee on Ministers’ Powers 
of 1932 defended existing trends toward delegation 
of power against Lord Hewart’s attack. In the 
United States the controversy lasted longer and 
produced a voluminous legal literature on evolving 
judicial doctrines of due process and on the combi- 
nations of powers that make a regulatory agency 
at once prosecutor-and-judge and legislator-and- 
judge. 

The U.S. Attorney General's Committee on Ad- 
ministrative Procedure, headed by Dean Acheson 
and supported by staff work under the direction 
of Walter Gellhorn, issued a report (1941) that 
was a landmark: it covered all federal agencies 
having regulatory powers and examined in detail 
the procedures each employed in exercising them 
—thereby getting into matters of internal organiza- 
tion and permitting systematic comparisons. World 
War u postponed action on the report, while it 
multiplied the number of regulatory agencies and 
the activities to be regulated. The Administrative 
Procedure Act of 1946 effected a compromise of 
the contending political forces. Treating rule-mak- 
ing and adjudicatory powers separately, it pre- 
scribed procedural safeguards for each but in lan- 
guage sufficiently general and with saving clauses 
sufficiently broad as not to prevent any substantive 
action previously authorized. Perhaps its most no- 
table feature was the requirement, in formal ad- 
judication, of the use of independent hearing 
examiners. Its symbolic success is attested by the 
fact that the whole topic has ever since remained 
largely in the province of specialists in administra- 
tive law, while the practice of regulation employs 
more lawyers and more red tape. 

The related topic of investigatory powers, involv- 
ing the compulsory appearance of witnesses and 
the production of records and reports, is in Britain 
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mainly an aspect of administrative or ministerial 
action, since royal commissions ordinarily are lim- 
ited to voluntary testimony and the courts are 
bound by statutory delegations. In the United States 
this topic, important as it is in the work of regu- 
latory and enforcement agencies, including com- 
missions, has aroused widespread political and 
scholarly interest chiefly in connection with con- 
gressional investigating committees, which are be- 
yond the scope of this article. [See LEGISLATION, 
article on LEGISLATURES.] 

The report of the U.S. President’s Committee on 
Administrative Management (1937), although its 
attack on the “headless fourth branch” proved abor- 
tive, ushered in a fresh range of more rewarding 
issues for study: the internal structure and workings 
of commissions; and the relations of commissions, 
and in particular the ten principal regulatory com- 
missions, to the president, to the Congress and its 
committees, to cognate executive departments, and 
to organized clientele groups. The U.S. Supreme 
Court’s decision in Humphrey's Executor v. United 
States, 295 U.S. 602 (1935), sustaining a statu- 
tory limit on the presidents power to remove a 
member of the Federal Trade Commission, had 
already raised the issue of commission independ- 
ence, but the focus of research thereafter shifted to, 
and has remained on, commission behavior rather 
than judicial doctrine about the separation of 
powers. Once the rule-making, adjudicatory, and 
enforcement powers of the commissions were held 
constitutionally valid, it was futile to argue whether 
these agencies were in the executive branch or 
somewhere else. 

Independence has several facets. Presidents rare- 
ly attempt to influence the outcome of particular 
commission decisions; this would seem improper 
when private rights are being adjudicated. But 
presidents freely use their appointing power to 
bring about changes in prevailing commission view- 
points. The scope for such changes depends on the 
frequency of vacancies, which varies considerably 
even though the terms of office are staggered and 
outrun the president’s own. The practice of reap- 
pointing incumbents is equally uncertain; Presi- 
dent Eisenhower in his first term reappointed none. 
The Board of Governors of the Federal Reserve Sys- 
tem had only 3 chairmen in the 30 years following 
its reorganization in 1935; and Chairman William 
M. Martin, a Truman appointee, survived into the 
Johnson administration. The Federal Trade Com- 
mission, by contrast, had 41 different chairmen in 
the first 50 years of its existence; only 4 lasted as 
long as 2 years. 

The other side of the coin of independence is 


political isolation. Without administration support 
it is seldom possible for a commission to get needed 
changes in its organic statute or larger appropria- 
tions. It is difficult also to arrange top-level inter- 
agency coordinating mechanisms for policy pur- 
poses across a broad economic front, such as 
transportation, water resources, or finance, when 
an independent commission is one of the agencies 
with jurisdiction in the premises. This was less of 
a problem in the pre-World War 1 days, when the 
segments of the economy subject to commission 
regulation seemed relatively self-contained. 

One ameliorating step followed a recommenda- 
tion of the first Hoover Commission and was justi- 
fied at the time chiefly on grounds of administra- 
tive efficiency. A series of reorganization plans in 
the 1940s and 1950s, affecting all the commissions 
except the ICC and the Federal Reserve System 
(where practice already conformed), concentrated 
in the chairman's office authority over all admini- 
strative affairs of a commission and authorized the 
president to designate the chairman, This is a 
milder form of a remedy previously urged and 
partially applied in the fields of shipping and civil 
aeronautics: segregating, as far as practicable, the 
adjudicatory aspects of a commission’s job from 
its other functions, namely, promotion, subsidy, 
and operations, and assigning a regular executive 
department to handle the operations. 

A special case is the Civil Service Commission, 
which combines an immense administrative and 
record-keeping task with a reviewing function in 
connection with personnel grievances. Its clientele 
consists of federal civilian employees and their 
unions. Since the Hoover Commission’s reports it 
has moved from a tradition of regulation to a posi- 
tion as champion of employee interests, while its 
chairman, wearing a second hat, has been drawn 
into the the White House staff as the president's 
adviser on personnel matters. Here the trend to- 
ward increasing judicialization, so characteristic of 
many regulatory commissions, has been reversed. 

Commission relations with clientele interests 
raise the issue of independence in another aspect, 
relevant to both regulatory and advisory bodies. A 
degree of intimacy breeds expertise and ease of 
communication on the part of the commission and 
confidence and voluntary compliance on the part 
of the clientele. Carried too far, it jeopardizes a 
commission's reputation for impartiality and invites 
conflicts of interest, if not outright corruption. 
Short of that, it fosters a parochial view that ex- 
cludes interests which are affected but not repre- 
sented in the immediate arena, as when banks and 
bank supervisors stand together against, say, sav- 


ings and loan associations. How much intimacy is 
enough? In the Progressive era, when the commis- 
sion movement arose, the expressed ideal was an 
impartiality that forbade all but formal and public 
contacts. A later and more realistic judgment 
would usually tolerate or welcome considerably 
closer relationships. Where the line is to be drawn 
depends importantly upon the degree to which a 
commission is engaged in adjudication or must 
rely on persuasion to accomplish its tasks. But the 
disillusioned note the tendency of intimacy to grow 
as time passes. This is the basis for Bernstein's 
probably oversimplified concept (1955) of a “life- 
cycle of regulatory commissions,” according to 
which the missionary zeal that attends the birth of 
a commission is dissipated as the clientele interests 
infiltrate their supposed governor and ultimately 
make it their captive. [See LoBBYING,] 

One other theoretical formulation may prove 
fruitful in further research on the politics of eco- 
nomic policy, including the policies entrusted to 
commissions. This is Lowi’s three-way categoriza- 
tion (1964) of domestic economic policies as either 
“distributive” (for example, the disposal of the 
public domain), or “regulatory,” or “redistributive” 
(for example, a graduated income tax), and his 
association of each of these categories with its 
characteristic type of politics—the politics of pa- 
tronage, the politics of interest-group conflict, and 
the politics of class conflict. It may be that the 
categories are not so distinct as he supposes, but 
they are suggestive of ways of ordering more pre- 
cise insights into the workings of commissions as 
well as of other governmental agencies. 

Finally, a word is in order about the comparative 
approach to the study of commissions, It is one thing 
—and a thing often done—to collect and describe 
seriatim a number of instances more or less simi- 
lar. It is something else again to devise analytical 
categories that will yield illuminating comparisons 
across a broad array of cases. Wheare’s revealing 
study (1955) of committees in British government 
is a systematic and methodologically sophisticated 
effort of this sort. It identifies seven “characters” 
—chairman, secretary, official, expert, layman, 
party member, and interested party—and examines 
each in the context of six types of committees— 
committees to advise, to inquire, to negotiate, to 
legislate, to administer, and to scrutinize and con- 
trol. Some of his committees are commissions, and 
his categories are applicable generally. 


Harvey C, MANSFIELD 


[See also ADMINISTRATION; ADMINISTRATIVE LAW; 
PRESIDENTIAL GOVERNMENT. ] 
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COMMODITY AGREEMENTS, 
INTERNATIONAL 


International commodity agreements (ICA’s) 
are essentially multilateral instrumentalities of 
governmental control that support the interna- 
tional price of individual primary commodities, 
especially through such arrangements as export 
quotas or assured access to markets. Accordingly 
international commodity agreements are to be dis- 
tinguished from commodity study groups, which 
are entirely lacking in operational responsibility; 
from international cartels of a nongovernmental 
character; and from the Combined Food Board 
(1942-1945) or the International Materials Con- 
ference (1951-1953), which involved international 
allocative machinery for a considerable number of 
primary commodities during periods of war-induced 
shortage. The proposed definition also excludes the 
following “near” forms of international undertak- 
ings: (1) bilateral bulk-purchase agreements; (2) 
multilateral control arrangements governing the 
market outlets for manufactured products, such as 
the International Cotton Textile Agreement negoti- 
ated in 1961; (3) arrangements for sectoral inte- 
gration, in the pattern of the European Coal and 
Steel Community or the Common Agricultural 
Policy of the European Economic Community; (4) 
plans for a commodity-reserve currency; (5) pro- 
posals for international food reserves; and (6) 
measures for the reduction of tariffs or nontariff 
restrictions against the international movement of 
goods or services. International commodity agree- 
ments, in their modern form, may be dated from 
the Brussels Sugar Convention (1902), under 
whose terms the major contemporary exporters of 
beet sugar undertook to support the international 
market by abandoning national systems of export 
subsidies. The most important agreement of the 
1920s was the Stevenson Rubber Scheme, imple- 
mented by the British and Dutch governments on 
behalf of their respective colonial territories in 
Malaya and the Netherlands East Indies. This 
scheme, which resulted in a sharp but short-lived 
price increase (Whittlesey 1931), was frankly re- 


strictionist in character, and experience under it 
was the major reason for certain safeguards in- 
corporated in chapter 5 of the Havana Charter for 
an international trade organization (United Na- 
tions 1947). 

Since the end of World War m, agreements have 
been successfully negotiated for wheat, sugar, tin, 
coffee, and olive oil. The International Wheat 
Agreements (IWA) of 1949 and 1953, and the 
postwar International Sugar Agreements (ISA) are 
prototypes, respectively, of two forms of commodity 
agreements—the multilateral contract and the vari- 
able export quota. Floor and ceiling prices were 
established for sugar and enforced essentially by 
regulating the permissible exports of member coun- 
tries; the sugar agreement provided, further, that 
stocks in exporter hands neither exceed nor fall 
short ‘of stated percentages of export quotas. An 
entirely different instrumentality was used for 
wheat. Importers agreed to accept specified quan- 
tities if the price fell to the minimum level stated 
in the agreement, and exporters agreed to provide 
specified quantities to member countries whenever 
the price was at the contract ceiling. At prices be- 
tween the floor and the ceiling, the wheat agree- 
ment was to be essentially inoperative. In the tin 
agreement (ITA), successively higher price thresh- 
olds were specified, at which a buffer-stock agency 
(a) had to purchase, (b) might purchase, (c) 
could not purchase or sell without specific author- 
ization, (d) might sell, and (e) was required to 
sell. The agreement also provided for imposition of 
export controls after buffer accumulations ex- 
ceeded specified amounts. The major sanction in- 
volved in the agreement for coffee, negotiated at a 
prolonged conference in 1962, was the certificate 
of origin, to be required of importing countries as 
a means of limiting their takings from any ex- 
porters electing to “go it alone.” 

Historically, U. S. policy with respect to interna- 
tional commodity agreements has been character- 
ized by a certain degree of ambivalence. Avoiding 
agreements on industrial raw materials subject to 
wide fluctuations in demand, it has, until very re- 
cently, participated only in agreements in which 
the United States has predominantly a producer 
interest, notably the International Wheat Agree- 
ment. Even in the case of sugar (of which the 
United States remains a net importer), it has acted 
more in a producer than in a consumer capacity; 
too large a differential between domestic prices 
and those prevailing abroad would embarrass the 
continuation of the domestic sugar-control system. 
From time to time the United States has toyed with 
the idea of an agreement for lead and zinc, as a 
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means of terminating an existing system of uni- 
laterally imposed import quotas, which was the 
source of considerable irritation in trade relations 
with Mexico, Peru, Australia, and Canada. 

Prerequisites for negotiation. Empirically, if 
not in theory, the following would seem to be 
among the primary conditions to be met if an inter- 
national commodity conference is to materialize 
into an agreement: 

(1) Inelastic demand. Clearly, if close substi- 
tutes are available, supporting the market price of 
any individual commodity is certain to have im- 
mediately and sharply adverse effects. The exist- 
ence of synthetic rubber explains the complete lack 
of any postwar agreement for the natural product; 
restrictive agreements for individual oilseeds are 
ruled out by the existence of a considerable list of 
alternative seeds, as well as by competition from 
butter; but sugar has lent itself to a continuous 
succession of agreements since 1937. 

(2) Reasonably stable market shares. Since 
export quotas generally divide up markets propor- 
tionately to the national shares prevailing in some 
base period, difficulties arise whenever there are 
either abrupt or longer-term shifts in the propor- 
tions contributed by various producing countries. 
The progressive displacement of exports of raw cot- 
ton from the United States by those originating else- 
where, compounded by the development of syn- 
thetic fibers, precluded the successful negotiation 
of an international cotton agreement in the post- 
war period, and the rising volume of exports from 
African countries seriously complicated the negoti- 
ation of the International Coffee Agreement of 
1962, 

(3) Distress price levels. As is best illustrated 
by the breakdown in negotiations for a revised 
sugar agreement in 1951 and for a cocoa agree- 
ment in 1963, exporting countries are not pre- 
pared to accept the necessary compromises unless 
prevailing prices are extremely low. 

(4) Mixed producer—consumer interest. The 
longest-lived agreements currently in effect involve 
commodities concerning which the major indus- 
trial countries have rather mixed motives, Thus 
the United Kingdom is interested, as an importing 
country, in relatively low prices for sugar; but as 
champion of the Commonwealth countries in the 
West Indies as well as Oceania, the United Kingdom 
does not wish world sugar prices to fall to disas- 
trous levels. In pre-Castro days, the United States 
sought a higher price for Cuba’s sugar shipments 
to non-U.S, destinations than did even the Cubans, 
who were rather more impressed with the desira- 
bility of maintaining export volume. Even the new 


agreement for coffee reflects some mutuality of 
producer and consumer interest within the major 
importing countries: no domestic sources of supply 
exist, but the temperate zone industrial nations are 
broadly concerned for the well-being of less de- 
veloped countries in tropical regions of Latin 
America and Africa, which supply most of the 
world’s coffee exports. 

Obsolete principles versus realities. Prominent 
in the chapter of the Havana Charter dealing with 
intergovernmental commodity-control agreements 
were provisions purporting to benefit the consumer, 
especially via (a) equal representation for import- 
ing and exporting countries; (b) participation by 
all countries “substantially interested” in the par- 
ticular commodity; (c) the checkreins of publicity 
in the form of an annual report; and (d) the as- 
surance of increasing market outlets for supplies 
originating in the regions of most efficient pro- 
duction. 

A wide gulf separates the principles underlying 
these provisions from the hard realities of the 
agreements actually negotiated in the postwar 
period. The U.S.S.R. continues to vote in the Inter- 
national Sugar Agreement and the International 
Wheat Agreement as an exporting country, al- 
though the dynamics of international trade are 
such that it has recently become a heavy net im- 
porter of both. Under present circumstances, the 
United States, while not itself a member of the 
ITA, in effect imposes a ceiling on international 
tin prices by regulating the rate at which tin dis- 
posals are made out of that nation’s strategic stock- 
piles. Similarly in tue case of wheat, the interna- 
tional market has been governed less by the [WA 
than by oligopolistic pricing practices on the part 
of the Canadian Wheat Board and the U.S. Com- 
modity Credit Corporation. Membership of a host 
of nations in present-day international commodity 
agreements may merely complicate the processes 
of administration and decision making, while in at 
least one instance—the decision of the United 
Kingdom not to affiliate with the IWA of 1953— 
the absence of a major wheat-importing country 
may have had a salutary effect in moderating the 
exercise of oligopolistic power. 

The Organization of Petroleum Exporting Coun- 
tries (OPEC), established in 1960, is a special case. 
It violates, thus far without challenge, the Havana 
Charter provisions requiring consumer representa- 
tion. It utilizes a process of collective bargaining— 
not with importing countries, but with producing 
and marketing companies largely controlled by 
citizens of the advanced industrial countries, 
notably the United States, the United Kingdom, the 
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Netherlands, and France. Perhaps the time is be- 
coming ripe for a truly international undertaking 
concerning petroleum. An internal system of pro- 
rationing in the United States, on behalf of domes- 
tic producing groups, has already, and inevitably, 
given rise to a system of import controls, and a 
strong case can be made for having import quotas 
enforced by a multilateral, rather than a unilateral, 
instrumentality. Germany, Italy, and Japan, for 
example, have very little direct control over petro- 
leum supplies but are major consumers and im- 
porters. The further fact that the petroleum- 
exporting countries include relatively wealthy 
members of the less developed world, while poorer 
members are heavily dependent on petroleum im- 
ports, also argues for a degree of restraint in the 
exercise of bargaining power by the OPEC. 

Economic effects. The international commodity 
agreements suffer from the various limitations that 
characterize all efforts to support artificially the 
market position of individual commodities. In par- 
ticular, price targets tend to be set too high, long- 
run elasticities in demand as well as in supply 
tend to be underestimated, and cost structures tend 
to be built up so that any favorable effects on pro- 
ducer income are at best transitory. Longevity in 
the agreements, accordingly, is not necessarily a 
virtue and in the case of sugar has been achieved 
only by making inoperative the key provisions gov- 
erning export quotas during periods (especially of 
high prices) when agreement on market shares has 
proved impossible. 

It has been argued that stabilization of the price 
paid for only a portion of world export sales tends, 
broadly, to destabilize the price of the remainder 
(Johnson 1950), The general case for this theo- 
retical position has not, however, been definitively 
proved. An important consideration is the inelas- 
ticity of demand in the stabilized portion of the 
market relative to that in the unstabilized sector. 
Thus, the assurance of adequate supplies of sugar 
to the United States and of wheat to the United 
Kingdom, under successive international agree- 
ments or national control programs, has tended, on 
balance, to be stabilizing. 

Moreover, while it is generally true that prices in 
national markets tend to fluctuate less widely than 
those of commodities sold without protection in 
residual “free” markets, it by no means follows that 
free market prices as a group are necessarily less 
stable than prices of primary commodities subject 
to world market conditions (i.e., primary com- 
modities for which prices throughout the noncom- 
munist world tend to differ chiefly by transporta- 


tion costs together with nominal tariffs). Chiefly 
for reasons of supply inelasticity, compounded in 
the case of cocoa by demand inelasticity and in 
the case of natural rubber and wool by rather wide 
cyclical fluctuations in demand, cocoa, natural rub- 
ber, and wool have historically tended to be among 
those commodities experiencing the widest fluctua- 
tions in market price. While sterling-area pro- 
ducers are dominant exporters of all three, and the 
foreign-exchange reserves of the sterling area ac- 
cordingly tend to fluctuate in keeping with their 
current market strength or weakness, none has 
been governed by an agreement in the postwar 
period. Moreover, the fact that the price swings 
experienced by these commodities have by and 
large been reversible has led the major exporting 
countries to introduce various devices—from na- 
tional marketing boards, through variable export 
taxes to prepayment of producers’ income taxes— 
that do have the effect of “stabilizing” producer in- 
come from year to year (although not the nation’s 
total foreign-exchange earnings). This approach 
represents an adjustment toward living with in- 
stability (Nurkse 1958). 

Controls over the market price of individual com- 
modities have undesirable side effects, politically 
as well as economically. The severity of export 
quotas imposed under the tin agreement from De- 
cember 1957 through September 1960 appears to 
have had a long-term effect on productive capacity; 
when export restrictions on tin were relaxed, out- 
put failed to revive in step with a strong recovery 
in consumption, and, accordingly, this commodity 
provides a classic example of the irreversible sup- 
ply curve. One possible lesson to be learned from 
the Cuban experience under Fidel Castro is that 
there is a relationship, subtle and indirect rather 
than overt, between economic forms of market con- 
trol and a degree of political tyranny. Such a phi- 
losophy was shared by the supporters of the Anti- 
Corn-Law League in nineteenth-century England, 
who built their case on a presumed linkage be- 
tween freer trade and world peace. 

Current pressures favoring agreements. Serious 
disadvantages notwithstanding, the number of in- 
ternational commodity agreements has been tend- 
ing to proliferate, and there are good reasons for 
expecting that trend to continue. For one thing, 
the United States has moved, by a succession of 
moderate steps, from a position of doctrinaire op- 
position to these agreements to one where official 
policy has become that of willingness, in the words 
of President Kennedy, “to cooperate in serious, 
case-by-case examinations of commodity market 
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problems.” Such agreements tend to be strongly 
favored by the less developed countries as a means 
of “stabilizing” (i.e., raising) the foreign exchange 
earned by their major exports. In Europe, interna- 
tional market-sharing arrangements have enjoyed 
the active support of French authorities for more 
than a decade. The German Federal Republic, as 
the major importer of agricultural commodities 
within the European Economic Community, favors 
agreements as instrumentalities for maintaining a 
place for overseas suppliers within the Common 
Market. On similar grounds, an agreement for 
grains has also received some scholarly support 
(Coppock 1963). Moreoyer, the United Kingdom, 
which has until recently relied on a policy of cheap 
food imports together with a program of direct pay- 
ments to its domestic agricultural producers, has 
begun to develop a series of agreements with 
major overseas suppliers of grain and meat, in 
order to reduce the budgetary burden resulting 
from a combination of direct payments, unrestrict- 
ed domestic production, and unlimited imports. 
International commodity agreements have con- 
siderable appeal to all external suppliers favorably 
impressed with the short-term advantages, and pre- 
pared to overlook the longer-run disadvantages, of 
having market outlets assured on a quota basis. 
Alternatives, Various efforts have been made 
to invent mechanisms other than international 
commodity agreements for transferring purchasing 
power to less developed countries whose earnings 
have been either cyclically or chronically depressed. 
Certain of these alternatives, such as proposals for 
a commodity-reserve currency (United Nations 
1964a), would serve the ends of foreign aid and 
international monetary “reform” at the expense of 
undermining the role of the price system as the 
major instrument of economic management in 
(relatively) free-enterprise societies, Others, oper- 
ating through overt financial transfers ( United Na- 
tions 1964b; Swerling 1964), have the major 
advantage of leaving the price system largely un- 
affected as the allocator of economic resources. 
Finally, the International Monetary Fund—al- 
though with considerable reluctance and after some 
delay (Fleming & Lovasy 1960)—has acted to 
make one extra tranche (i.e. one-fourth of the 
nation’s IMF quota) available for compensating 
shortfalls in export proceeds of less developed 
countries, when those shortfalls arise for reasons 
not within the control of the country experiencing 
the balance-of-payments difficulties (International 
Monetary Fund 1963). Such an approach has the 
important virtue of taking into account fluctuations 


f 
in export volume {rather than responding exclu- 
sively to variations in commodity prices. 
\ o Boris C. SWERLING 
\ 
[See also CARTELS AND‘TRADE ASSOCIATIONS; INTERNA- 
TIONAL TRADE; INTERNATIONAL TRADE. CONTROLS.] 
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COMMONS, JOHN R. 


As an adventurous economic historian and theo- 
rist, John Rogers Commons (1862-1945) was a 
leader in the development of a number of vital 
areas of investigation, especially industrial rela- 
tions and monetary economics, His academic tal- 
ents, along with a sanguine temperament and a 
gift for economic statesmanship, made him a major 
architect of economic reform in the period of 
American history that covers Theodore Roosevelt's 
Square Deal, Woodrow Wilson’s New Freedom, and 
Franklin D. Roosevelts New Deal. 

Commons was born in Hollandsburg, Ohio, in 
1862; his parents were active abolitionists. His reli- 
gious Calvinist mother, hoping that he would be- 
come a minister, sent him, in 1882, to her alma 
mater, Oberlin College. He paid part of his expenses 
by working as a printer, at which time he became 
interested in trade-unionism and Henry George’s 
single-tax movement. 

After receiving a B.A. in 1888, he attended The 
Johns Hopkins University for two years of graduate 
study. There he was attracted to the two leading 
movements for the reform of the dominant classical 
economics. His teacher, Richard T. Ely, was a 
prominent figure in one, namely, the infusion into 
classical economics of the viewpoint of the German 
historical school, which emphasized the use of his- 
tory, statistics, comparative economic development, 
jurisprudence, and ethics. The objective was to 
develop a sound economics for social guidance. 
The other movement was the marginalist, or utility, 
approach, in which some of Commons’ friends 
among the graduate students took a prominent 
part. This school he viewed as providing a more 


logically unified theory of price determination than 
did classical economics. Thus armed with the latest 
tools of inquiry and an overwhelming interest in 
practically all the controversial social and economic 
issues of the day, in 1890 he began his academic 
career. Before the decade was over he had taught 
at four schools—Wesleyan, Oberlin, Indiana, and 
Syracuse. The abolition of his chair of sociology at 
Syracuse in 1899 temporarily ended his academic 
career. In 1904, Ely, who had moved to the Univer- 
sity of Wisconsin, brought his protégé there. The 
period at Wisconsin proved to be Commons’ most 
creative one. 

Commons soon made a reputation as the lead- 
ing student of American labor problems. With the 
collaboration of disciples, he published the pioneer- 
ing A Documentary History of American Industrial 
Society (1910-1911) and History of Labour in the 
United States (1918-1935). His theory of the 
origin and development of the American trade- 
union movement as a response to the scope of 
product markets, which he presented most clearly 
in “American Shoemakers, 1648-1895: A Sketch 
of Industrial Evolution” (1909), is still the most 
widely accepted one. The trade-union movement, 
he held, arose out of the efforts of individual work- 
men to protect their job rights against cheaper 
labor. The effectiveness of unions in improving 
wages and other major terms of employment de- 
pends upon their ability to control and standardize 
conditions within the product market areas in the 
industry in which they operate. The market area 
for some goods and most services is local; for others 
the region is important; for most goods the market 
area has become the entire nation, and in such 
cases success of the union depends on its ability 
to organize the whole industry. 

Commons had extraordinary success as a policy 
maker. He helped prepare most of the legislation 
that made Wisconsin the laboratory in social and 
economic reform for other states and the federal 
government. This included, notably, legislation for 
civil service, public utility and railroad regulation, 
workmen's accident compensation, and unemploy- 
ment insurance. He not only helped to draft the 
bill creating the Wisconsin Industrial Commission 
to supervise the state’s labor laws but also was a 
member of the first commission, from 1911 to 
1913. He did similar work for the federal govern- 
ment in 1901 as a member of the Industrial Com- 
mission, which made the first comprehensive in- 
quiry into the American economy, and in 1914- 
1915 as a member of President Woodrow Wilson's 
Industrial Relations Commission. The major de- 
signers of the epoch-making Social Security Act of 
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1935 were his students. He also played a leading 
role in the development of the notion of having the 
federal authority manage the monetary system, to 
maintain, as he put it in the famous Strong Bill of 
1928, “the stability of commerce, industry, agricul- 
ture, and employment, . . . and a more stable pur- 
chasing power of the dollar .. .” (H.R. 11806). 

Commons wrote three major treatises in eco- 
nomic theory. The first, The Distribution of Wealth 
(1893), was an ingenious attempt to combine 
marginalist economics with the emphasis of the 
German historical school on the fundamental im- 
portance of jurisprudence for achieving sound 
economic growth and equitable distribution of 
income. In the other two, Legal Foundations of 
Capitalism (1924) and Institutional Economics 
(1934a), which should be read with his early 
essay “A Sociological View of Sovereignty” (1899— 
1900), he sought to demonstrate the importance 
for economic theory of collective action in all its 
varieties. These included not only the state but also 
a host of voluntary associations, such as the cor- 
poration and the trade union; in fact, collective 
action conceptually embraced all institutions, since 
Commons defined an institution as “collective ac- 
tion in control of individual action.” 

From the standpoint of the theory of collective 
action, he developed, as the basic economic unit of 
investigation, “the transaction,” which included 
both social and individual action. He borrowed 
from Thorstein Veblen the concept of transactions 
performed by going concerns, but he broadened it 
so that it applied to any form of desirable organized 
economic activity, containing conflicting but recip- 
rocal interests. The principles of the going concern 
are the network of “working rules,” or customs. To 
Commons, these controls are “laws” broadly con- 
ceived; they are the outgrowth of experience and 
make possible the orderly growth of individual 
action. Some type of judge ultimately determines 
the appropriateness of the working rule. Thus, he 
viewed the Supreme Court of the United States as 
“the supreme faculty of political economy” for the 
nation. 

A major pivot in Commons’ analysis was the role 
of the administrative process. The germ of this 
idea dates back to his Proportional Representation 
(1896). He believed that the sound enactment of 
new working rules takes place through “collective 
bargaining” between the representatives of the 
affected organized group interests. His model was 
the Wisconsin Industrial Commission with its ad- 
visory committees. Such quasi-judicial commis- 
sions, he maintained, were a fourth branch of 
government and had as their ideal “reasonable regu- 
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lation through constructive investigation.” With 
the world-wide growth of collective action and the 
consequent problems of adjusting conflicts be- 
tween organized groups, or “blocs,” Commons’ in- 
stitutional theory has been found increasingly rele- 
vant, and his economic statesmanship continues to 
be admired. 

JosEPH DORFMAN 


[For the historical context of Commons’ work, see Eco- 
NOMIC THOUGHT, article on THE INSTITUTIONAL 
SCHOOL. See also ELY; SELIGMAN, C. G.] 
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COMMUNICATION 


This article is a generalized description of the 
communication process as a multichannel system. 
For articles on specific systems of communication, 
see the articles listed under ArT as well as Com- 
MUNICATION, ANIMAL; LANGUAGE; KINESICS; PAR- 
APSYCHOLOGY; and PERCEPTION, article on SPEECH 
PERCEPTION. Approaches to the study of commu- 
nication are discussed in ATTITUDES, article on 
ATTITUDE CHANGE; COGNITIVE THEORY; COMMU- 
NICATION, MASS; COMMUNICATION, POLITICAL; 
Lincuistics; and PROPAGANDA. 


The communicative process involves areas of 
human behavior that have long been of interest to 
the publicist, the humanist, and the politician. 
Social scientists have generally accepted the com- 
municative process as a given and have concerned 
themselves primarily with its success or failure, its 
effects, or its improvement. More recently, struck 
by the barriers to the flow of information between 
societies, between teachers and students, between 
the politician and the electorate, between parents 
and children, between husbands and wives, and 
between the professional and his clientele, a num- 
ber of disciplines have focused both thought and 
research effort upon the mechanisms of informa- 
tion transmission. By and large, increased interest 
has not been accompanied by a departure from 
earlier preoccupation with normative evaluation of 
the process or by a shift from earlier preconcep- 
tions about its central dynamic. 

Two relatively unquestioned fallacies have served 
as deterrents to the development of theories about 
communication and to the organization of research 
to test such theories. The first of these has been 
the dominating dyadic frame into which commu- 
nication research has been forced. Communication 
has been studied as a process identified by the pas- 
sage of information through the transmission of 
more or less meaningful symbols from one indi- 
vidual to another, from one group or representative 
of a group to another group or representative. Thus, 
the ideal model for the communicative process is 
based on the dyad: a knowledgeable monadic 
father or teacher who emits knowledge-carrying 


symbols that enter into the head of a less knowl- 
edgeable or nonknowledgeable monadic child. Such 
a conception is deceptively familiar, and it has the 
absolute support of common sense. This model has 
strong underpinnings provided by our attitude 
toward teaching and learning, superordination and 
subordination, wisdom and ignorance. It is reified 
by the writing and reading process and by the 
derivations of literacy represented by books, news- 
papers, motion pictures, and radio and television. 
The dyadic model is part of cultural tradition. As 
such it has impeded the systematic investigation 
of the larger system of which the dyadic situation 
is little more than incidental. The dyadic view 
forces research into Ptolemaic structure. Commu- 
nication behavior is masked by definitions of com- 
munication as “that process whereby encapsulated 
particles of meaning are transmitted between indi- 
vidual organisms by means of specialized sending 
and receiving devices.” 

Literate man’s anthropocentric and ethnocentric 
predilection to locate the avenue of transmission 
in the aural-audio channel or in derivative chan- 
nels concerned with the transportation of the writ- 
ten word or with the vocalized written word has 
been equally misleading. Enamored of the almost 
limitless capacity of language to store cues to 
accumulated experience, some scholars have con- 
fused language, culture, and social interaction. In 
societies that stress vocal performance and have 
confidence in the reliability of lexicalized informa- 
tion, a storehouse of social experience is available, 
in verbal form, to the investigator. It is, however, 
methodologically dangerous to extrapolate from 
this special situation to statements about mankind 
as essentially a vocally communicating species. 
Review of carefully drawn ethnographic reports 
reveals that societies vary widely in their rules of 
evidence, in the extent to which, for example, 
tangibility takes preference over visibility or audi- 
bility or some combination of these. It is both naive 
and ethnocentric to define communication in terms 
based upon the self-view of talk-oriented societies. 

Human society, wherever examined and of what- 
ever order of development, has language and is 
dependent upon it. We have been unable to detect 
any comparable development among other animals. 
Yet, at the moment we can only theorize about the 
importance of this distinction between nonhuman 
and human interactional behavior. We have no 
data whatsoever that can give us security about 
when it is that the men in any society talk or 
listen and when they do not. More significantly, 
we have only the most poetic conception of what 
portion of experience is shared when man in any 


society vocalizes or listens. And, until we do know 
more about the distinctive, the exclusive, functions 
of speech, comparisons of the relative efficiency 
of human and animal communication must remain 
broadly theoretical and, usually, invidious. How- 
ever, even at this early stage of descriptive investi- 
gation of either animal or human communication, 
one thing stands out clearly: Man’s possession of 
language and the absence (if true) of such a sys- 
tem among his prehominid forebears need not be 
predicated upon or used as proof of the fact that 
man has abandoned, lost, or suppressed the multi- 
sensory reinforcement mechanisms that enable him 
to be as continuously social as the nonhominids. 
The too narrow view of communication behavior 
and the word-centered conception of communica- 
tion has been based upon, and feeds into, an 
unnecessarily restricted view of the relationship 
between biological and social processes. 

Preconditions for communication. Man is an 
occupant of multiple niches, physiographic and 
social. He lives in, has reconstructed, and adapts 
to the most diverse range of environments of any 
of the animals. This environmental diversity com- 
bines with man’s long gestation period, extended 
infancy, lengthy maturation, and low birth rate 
to make him specifically dependent upon sustain- 
ing relationships with his fellows. Having evolved 
over millions of years from socially dependent 
forebears, man has demonstrated a special capacity 
for intricate subdivisions of task and role assign- 
ment and regulation. As in lower animals, his rela- 
tionship to his group mates is more than sequences 
of trial and error, action and reaction. His interac- 
tive behavior is not mechanical but interdepend- 
ently systemic. Because of a complex division of 
labor, man’s performances are seldom isolated 
actions in themselves. The social performance of 
a given member of society is by definition incom- 
plete; task accomplishment is dependent upon 
continuative, coordinate, parallel, or complemen- 
tary individual or subgroup behaviors. These are 
rarely ritualized, mechanical, conjoint perform- 
ances. Flexibility and adaptability demand continu- 
ous and reliable feedback, contribution, and correc- 
tion between the performing membership; and this 
is never apart from the capacities and behavior of 
the performers. 

To support social organization, a society, human 
or animal, must, in the sense of G. H. Mead, be 
composed of “significant symbolizers” (1934). 
Society depends upon predictability and regularity. 
Every human society finds it necessary to kill, 
specially define, regulate, rehabilitate, or isolate 
unreliable or unpredictable symbolizers. Sanity or 
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reliable symbolization (as measured by a particular 
society within its own special thresholds of signifi- 
cant symbolization) must be maintained if that 
society is not to be destroyed by accumulating dis- 
crepancy and misinformation. 

It seems logical, but hardly obvious, that a single 
channel, the auditory—aural or any other, has too 
narrow and too specialized a range and is too easily 
adulterated to be entrusted with social regulation, 
a task so critical to survival. Man must, except in 
the nonexample of “wolf-men,” maintain contact 
with other men throughout life. He must engage 
in communicative activity in the light and in the 
dark, upwind and downwind, in situations of 
dyadic intimacy and at audibility-absorbing dis- 
tance, in situations that require absolute quiet, and 
in situations where competitive wave lengths drown 
out or distort his own oral contributions or recep- 
tion. He must deal with others who are only con- 
fused by his oral productions: to the speaker they 
are sensible utterances; to the listener they are but 
nonsense in his ears. He must be predictable after 
long separations. And, above all, he must not 
destroy the conditions whereby the young can 
incorporate a language that he, as an adult, neither 
understands nor is able to teach. With all of these 
preconditions it should not be surprising that man 
(like his animal forebears) is a regulated multi- 
sensory station in a transmission system, a multi- 
channel interactor. 

Intercorporeal influence. The theoretical struc- 
ture presented here is appropriate to the extra- 
individual province shared by biology, sociology, 
and anthropology. One theoretical point regarding 
sense perceptions, however, is cogent. We will 
assume that animals do not “see,” “hear,” “feel,” 
and so on, as separate operations, even though 
neural terminals may seem to be specifically stim- 
ulated by light, sound, heat, cold, pressure, and 
chemical activity. This is in agreement with Cherry: 
“It is even questionable whether the various ‘senses’ 
are to be regarded as separate, independent detec- 
tors. The human organism is one integrated whole, 
stimulated into response by physical signals; it is 
not to be thought of as a box, carrying various 
independent pairs of terminals labeled ‘ears,’ ‘eyes,’ 
‘nose,’ et cetera” (Cherry 1957, pp. 127-128). 

At first sight this may seem nihilistic. However, 
such a position concerning physiological or psycho- 
logical processes in no way denies the fact that 
the structural properties of vocalic language and 
of nonlexical body motion language may be ab- 
stracted from their behavioral matrices. Structural 
properties must be isolated for description and 
analysis prior to the investigation of the relation- 
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ships of these to each other and to other commu- 
nicative modalities in the communicational system. 

By means of kinesic analysis we have been 
relatively successful in recent years in isolating 
structured and patterned communicative elements 
in visible aspects of human body motion. These 
have been tested and found necessary to the social 
activity of a variety of human situations (Bird- 
whistell 1961; 1962; Scheflen 1965). This encour- 
ages us to postulate that other, and perhaps all, 
directly or indirectly influenceable somatic systems 
within the living body provide potentials for human 
interconnectedness. Interconnectedness is used in 
the sense of the structural aspects of interaction. 
The capacity for interconnectedness is a conse- 
quence of, and the necessary condition for, the 
flow of viable influence within the membership of 
a social group. The capacity for interconnectedness 
provides the social group with the potential for 
adaptability and continuity. It is the regulated flow 
of this influence that has given society a history 
at least as long as bisexuality and viviparous birth. 

Influence is a term used to cover all intercor- 
poreal processes of patterned somatic sensitization 
and response. It is postulated that the relative sta- 
bility of adaptations leads to a patterned intercorpo- 
real influence structure. This constitutes a vital 
reservoir of regular and variable behavior, from 
which the society “selects” for iconic representation 
and coding at the social level. It is not easy to 
think about matters such as these and remain 
disciplined about levels of organization. Custo- 
marily, “the ladder of organization levels” is stated 
in terms that characterize cells that are within a 
corporeal field as interacting with cells within that 
given field to form organs; these organs interact 
within that corporeal field to form a physiological 
system that is somehow integrated to engage, but 
only at the next higher level, with other physiologi- 
cal systems. I would not argue for a moment with 
this model as a source of canons to prevent reduc- 
tionistic “explanation.” However, to take too sim- 
plistic a view of the organizational ladder may 
blind us to relationships, to interdependencies, that 
exist at lower levels of organization. And this can 
easily occur if we somehow impose on our material 
an implicit framework that defines physiological 
processes as a level above biological processes and, 
logically enough, psychological processes as a level 
above physiological process—and then, finally, 
sociological processes as an abstraction of com- 
binations of these psychological processes. 

The speciational grouping, the multicorporeal 
environment, is the natural and necessary field of 
all complex organisms. And this environment is 


not merely, if ever, an organization of organic 
wholes. The division of labor, of whatever com- 
plexity, necessary to maintain a multicorporeal 
environment cannot be explained exhaustively by 
the description of action—reaction sequences be- 
tween total organisms. I am positing that there 
exists a pattern of somatic influence composed of 
ordered partials within the somatic structure of 
the component organisms. These partials emer- 
gently combine to form interdependent patterns 
that are distributed regularly in the multicorporeal 
field. Thus, the viability of the particular organism 
is dependent upon appropriate somatic activity of 
other organisms within the multicorporeal field. 
In the viable social grouping there exists a series 
of biological acts that are necessarily intercorporeal 
if the given organism and, in consequence, the 
multicorporeal field are to survive. Thus, the indi- 
vidual incorporates part of a pattern that requires 
the activity of several organisms to gain comple- 
tion. I am not restricting these generalizations to 
such deceptively apparent intercorporeal operations 
as are involved in the successful or unsuccessful 
delivery of fertile sperm to fertilizable ovum, of 
mother’s milk to neonate, or of body warmth to 
the chilled other. These are but climactic incidents 
in a far more complex and far-reaching structurali- 
zation of intracorporeal partials which in context 
become intercorporeal transactions, extraindividual 
and essentially biological acts. 

Timing and structure. The achievement of 
readiness for, and the accomplishment, completion, 
and abandonment of interdependent action is 
dependent upon intricate and reliable timing oper- 
ations of readiness within both the operative 
individual and the multicorporeal systems. These 
timing operations are integral, if not central, to all 
communicative activities. Timing operations are 
present in all observed interactive systems. They 
are not necessarily of greater or lesser priority in 
a theoretical hierarchy of importance in communi- 
cation. In one sense, timing is an integral function 
of, and evidence for, the existence of communica- 
tive structure. 

In certain species, seasonal timing and coordina- 
tion of readiness seem to be maintained by exter- 
nal contexts of influence. For example, a wide 
range of organisms are coordinated by a complex 
dependence upon light or heat variations as occa- 
sioned by regular earth-solar relationships. Special- 
ized diets that are weather dependent provide 
further controls for coordination. Other more 
omnivorous organisms may be further regulated 
from within at one level by nutritional require- 
ments and at another level by the oestrous cycle. 


As ethology continues its productive investigation 
of animal interactive behavior, we should get in- 
creased perspective upon the range of regulatory 
systems, 

There is no need to postulate that man is more, 
or less, subject to such timing systems than are 
other animals. Nor do we need to assume that in 
man such operations have been assigned to, or 
surrendered to, culture. There is no need to be 
caught with false paradoxes of nature and nurture. 
At the moment, all of our evidence here is sugges- 
tive and circumstantial, and any generalization 
must be tentative. It is literally true that we still 
do not know how the membership of any ani- 
mal grouping monitors its interactional patterning, 
how the component relationships remain suffi- 
ciently stable to permit continuity (membership 
replacement), or how the membership organizes 
and accumulates reaction to change permitting 
adaptation in a changing milieu. However, it is 
clear even from our limited data that intercorporeal 
timing is, for mammals at least, more than a 
mechanical response of a genetically precalibrated 
system in a physiographically regular environment. 
There is every evidence that complex organisms 
remain influenceable, that they incorporate de- 
pendent partials throughout life. This is the bio- 
logical context in which human communication 
becomes comprehensible. And it is not a context 
abandoned in the evolutionary leap from prehom- 
inid to man. It remains active beyond the lives of 
individual men. 

It has been pointed out that (a) complex or- 
ganisms are organically dependent upon other 
organisms; (b) each organism has patterned par- 
tials whose activation or suppression is necessary 
to its survival and to its emergent contribution to 
biological continuity; and (c) these partials gain 
completion by regularity calibration at a series of 
levels, Only extended research can isolate and 
describe the mechanisms by means of which these 
partials are incited, triggered, released, suppressed, 
and modified by information flowing in the inter- 
corporeal interstices. 

Up to this point we have been discussing the 
relationship between incorporated partials of influ- 
ence and intercorporeal interconnectedness. There 
is a temptation to relegate all activities of this 
nature to a special area of investigation, perhaps 
to be described as “biological or intercorporeal 
communication.” Such a move inevitably would 
serve to reinstate earlier preconceptions, which see 
such events of interconnectedness as distinct from 
and both phylogenetically and ontogenetically 
prior to social communication. The latter, by such 
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definition, would be assigned concern with the 
stimulation and regulation of interpersonal activity 
in the form of social partials. It would seem inad- 
visable to give up the methodological advantage 
gained by the recognition of structured partials by 
arbitrarily dividing behavior into that to be termed 
biological as contrasted with a remainder assigned 
to society. It would seem more economical to 
search the interactional situation for communica- 
tive activity through all apparent avenues and 
channels of influence and to recognize that, at least 
in this framework of behavior, biological and social 
events coexist. At the least, if we are concerned with 
problems of the regulation of change and dynamic 
continuity, they are practically inseparable. There 
is no need to distinguish between biological and 
social communication. Communication can serve 
to cover all such events. 

Multichannel communication. It is not profit- 
able to use the term “precommunication” for inter- 
corporeal connectedness and influence partials or 
to describe them as the “raw materials of com- 
munication.” Such phrasing is more appropriately 
applied to organic, physiological, and psychological 
processes. Communication is the dynamic aspect 
of interconnection and is only indirectly related to 
these individuative processes. As soon as the mem- 
bership of a grouping is subject to and dependent 
upon intermembral influence and mechanisms are 
operative to reduce the likelihood of the individual 
organism being comparably influenced by members 
of alien groups and species, it can be said that we 
have a society and that communication is taking 
place. A decipherable social code exists whenever 
a pattern of behavior presented in regularly struc- 
tured contexts of behavior will serve regularly to 
trigger, regulate, or modify the behavior of an in- 
fluenced membership. The minimal meaningful 
variation of behavior (of whatever size and carried 
along whatever channel) within the pattern may 
be termed a cue. The cue is always both an event 
in itself and an aspect of structure; that is, it 
always both carries identity and instructs the par- 
ticipant in locating the relevant contexts of its oc- 
currence. Parenthetically, a cue need not be a 
direct derivation from the behavioral complex of 
the incorporated partial. Indeed, we know that in 
language the discriminating cues are arbitrary and 
bear no discoverable generic relationship to the 
activity that they signal. Kinesic cues also have this 
same quality of being arbitrary. In body motion as 
in spoken language, the cue is iconic for the partial. 
We may make the working assumption that the 
other channels operate in the same manner. How- 
ever, only reliable data about the other modalities 
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will permit us to conclude safely that this arbitrary 
cue-and-experience association is characteristic of 
all communicative activity. 

As soon as a group develops even the most rudi- 
mentary system of triggering cues, communication 
becomes continuous. These cues, of whatever shape 
or degree of conspicuousness, constitute a finite 
segment or structure of segments. Communication 
is a continuous process made up of these discon- 
tinuous and related elements, The cue signals some 
kind of change in the environment. However, the 
absence of a specific and discernible cue does not 
create a situation of nonsociety, of noncommunica- 
tion. In the absence of a cue a “steady” state can 
be said to be signaled; the ordered activity within 
the state is in itself cueful. 

The concept of the steady state in its contrast 
with activating cues can be fatally obscurantist as 
an idea if it prevents recognition of the fact that 
it is only a heuristic simplification of, or an obser- 
vational slice from, the very complex ordering of 
interpersonal and group relationships present in all 
societies, For example, the membership of a vivip- 
arous and bisexual grouping is not, by definition, 
homogeneous. Every observation of group behavior, 
either animal or human, leads us to the conclusion 
that intermembral cues cannot appear in isolation, 
are never things-in-themselves. They are always 
cross-referenced. Cues of whatever shape or dura- 
tion are always modified by the signature of the 
sender and, usually, by the identification of the 
addressees. Thus, even the most context-specific or 
event-specific cue is bound to complex modification 
cues, which serve to signal the reliability of the cue 
by transmitting information about the context, the 
sender, and the intended receiver of the signal, “To 
whom it may concern” messages undoubtedly occur 
within a social grouping, but they are probably 
sufficiently rare as to constitute special cases. At 
the same time, while certain situations of urban 
life may permit a period of relative anonymity in 
the message sender, these are certainly of short 
duration and are usually, as in the case of mob 
action, socially defined as a serious threat to the 
society. 

By the mid 1950s it had become obvious from 
observation of even the most ritualistic inter- 
actional sequences, that is, within the most explicit 
of contexts, that signature and address was a com- 
plex communicational procedure. Preliminary in- 
spection and analysis showed that much of the 
material loosely assembled under the rubrics of 
“social identity” and “social status and role” was of 
communicative consequence, These systematic ob- 
servations, when coupled with some of the insights 


supplied by Goffman (1956) and given depth of 
biological perspective by ethological data, dissi- 
pated any confidence that sign—symbol dichotomies 
were useful in communication theory. There do not 
seem to be instances of phenomena so restricted 
or so isolated as to be designated as signs, Elabo- 
rate, signaled contextual redundancy is required 
for sign reliability. This insight serves several pur- 
poses. First, it relieves communication theory of 
the problem of dealing with isolated units of mean- 
ing, which are unrelated to the communicative 
stream; that is, the cue, when seen as effective in 
a variety of contexts, always signals the presence 
of a code to which the cue belongs. Second, it 
clarifies the necessarily continuous nature of the 
communicative process. And, finally, supported by 
an increasing body of evidence indicating the 
ultimate inseparability of linguistics and kinesics 
in communicative systems, it makes manifest the 
inadequacy of any theory of communication based 
on monochannel message transmission. 


Inspection of any extensive body of interactional 
data offers proof that humans vocalize but a very 
small percentage of their interactional time. Fur- 
thermore, such vocalizations do not character- 
istically take place in a situation devoid of tactile 
contact, shielded against olfactory and gustatory 
sensation, shaded to obscure visual experience or 
in a situation of such anesthetic force as to pre- 
vent proprioceptive and automatic feedback. It is 
not necessary to depreciate the role of language or 
even of words and sentences in the communicative 
process. However, from the point of view of the 
behavioral scientist concerned with communica- 
tion, language is an infracommunicational system. 
I am convinced that neither language nor com- 
munication can be either studied or understood so 
long as we assume that either subsumes the other. 
A monochannel analysis of communication must 
ignore or deny too much evidence to gain support 
unless the definition of communication is limited 
to the wholly aware, completely purposive trans- 
mission of commonly held, explicit, and denotative 
verbal information between interactants. 


Ray L. BIRDWHISTELL 


[Directly related are the entries CoMMUNICATION, ANI- 
MAL; KīINESICS; LANGUAGE, article on LANGUAGE 
AND CULTURE; SEMANTICS AND SEMIOTICS. Other 
relevant material may be found in the biographies 
of Sapir; WuorF.] 
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I 
SOCIAL AND PSYCHOLOGICAL ANALYSIS 


It is not easy to mark the boundaries of the sub- 
ject of animal communication. Some quite good 
reasons might be offered for viewing the subject 
as encompassing all phenomena of life, for the very 
notion of biological adaptation can be said to in- 
volve an animal population’s or, more to the point, 
a gene pool’s communication with its environment. 
Even when biological adaptation is considered only 
at the behavioral level, this broad view may seem 
increasingly appropriate as we come to appreciate 
heretofore unsuspected ways by which animals 
communicate with their environments—such as 
the echo-ranging systems of the dolphins. But how- 
ever impressive the echo-ranging systems of dol- 
phins, bats, eared seals, and so on, are, they are 
no more (nor less) marvelous than, say, the olfac- 
tory systems of carnivores or the visual system of 
higher primates. An active wresting of adaptively 
pertinent information, rather than a mere reacting 
to environmental changes, is a characteristic fea- 
ture of all higher modes of adaptation. 

Thus, there is usually little advantage to be 
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gained from viewing animal communication as ex- 
tending across all of an animal population’s inter- 
actions with every aspect of its environment, and 
it is common to restrict most discussions of the 
subject to what may be called social communica- 
tion. This is a valuable restriction, for with the 
exploitation of social modes of adaptation, an addi- 
tional range of environmental information becomes 
significant, namely, that arising from the social 
environment. The problems associated with these 
adaptations are quite special because the reciprocity 
of influence and concomitant fluidity in social in- 
teractions is usually much greater than in the 
interactions between animals and their nonsocial 
environments. This is not to say that the processes 
and mechanisms of social interaction are neces- 
sarily different in kind from those involved in inter- 
action with nonsocial environments but only to 
point out that pressures related to social factors 
have been involved in their evolution (Wynne- 
Edwards 1963). It is because of this that it is 
especially reasonable to distinguish social commu- 
nication as a special part of the more general prob- 
lem of the communication of animals with their 
environments. 

However, in making this distinction we must be 
careful not to make social communication con- 
ceptually more special than it really is. Most often 
social communication is considered a subordinate 
part of social behavior. This is clearly wrong, for, 
basically, social communication and social behavior 
are identical. At bottom, can anything more be said 
of social behavior than that it is the partial, and 
mostly reciprocal, determination of one or more 
animals’ actions by other animals? And, reversing 
the coin, can the occurrence and reception of com- 
municative signals be something more than just 
this social behavior? There are probably many rea- 
sons why social communication has been mistakenly 
thought to be a subject set apart from the rest of 
social behavior. No doubt some are related to the 
apparent ease with which we seem to distinguish 
man’s linguistic actions from the rest of his social 
interactions. But man’s linguistic activities consti- 
tute an exceedingly specialized form of animal 
communication, and it is well to bear constantly 
in mind that it is extremely risky to apply general- 
izations and ideas having their basis in what we 
feel to be the properties of linguistic communica- 
tion to nonlinguistic communication. (Caution in 
proceeding in the reverse direction is at least 
equally warranted.) Actually, nonlinguistic modes 
of communication are exploited as much by hu- 
mans as by any other social fauna, and if we were 
to attend only to these, the inclination to separate 
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communication from social interaction would surely 
diminish. 

The distinction between social communication 
and social behavior is also promoted by the feeling, 
again prompted by a presumed analogy to human 
linguistic actions, that successful communication 
involves the transmission of information or some 
other commodity. That is, it is thought that in com- 
munication something is literally made common to 
both signaler and recipients that would otherwise 
remain the private possession of the signaler. This 
notion is also linked with the idea that communi- 
cation occurs only when one animal, the signaler, 
performs some sort of action and by this action 
generates the signal whose reception by others con- 
stitutes the information transmission. One of the 
difficulties with this formulation of animal com- 
munication is that there are many situations in 
which an animal may affect the conduct of others 
in its group, not through any actions of its skeletal 
musculature, but simply by sensory attributes of its 
body over which it in no sense can be said to have 
control, such as the coloring of its feathers or of its 
beak. Even when the communicative events derive 
from particular behavioral acts, the applicability of 
the notion of information transmission is open to 
question, although to whatever extent it is ap- 
plicable it is just as much so to all of social be- 
havior. In animal communication the information 
that might be said to be transmitted is precisely the 
social significance of any attribute, behavioral or 
otherwise, of the participants in any social inter- 
action. 


Signaling systems 

Nonetheless, specializations in social modes of 
adaptation do occur very widely, even in species not 
routinely living in groups, in which specific actions 
by one participant appear to function primarily to 
produce patterns of stimulation that characteris- 
tically evoke regular changes in the behavior of 
others. Such conventionalization of the means of 
integrating animal social behavior can well be re- 
garded as signaling systems, always appreciating 
that we are only referring to especially systematic, 
regularized episodes of social interaction. Social 
specializations of this sort, as with any biological 
specialization, are the more pronounced the more 
directly they contribute to the integrity of a species, 
and thus, animal signaling systems appear most 
often in the government of social episodes relating 
to biologically critical areas, such as reproduction 
and infant care, species and group integration, 
spacing and population control, predator defense, 
cooperative foraging, etc. 


There are many questions about any animal sig- 
naling system that require answers if we are to 
develop an understanding of these parts of biology. 
To date none have been more than partially an- 
swered for any single species, and as the answers 
can certainly be expected to be interrelated not only 
with each other but also with the immense num- 
bers of other questions remaining to be answered 
about any species’ mode of adaptation, we can view 
our current state of knowledge only with despair. 
But however woefully lacking in answers, we will 
discuss these questions in three very much over- 
lapping groups, relating in turn to the structures, 
functions, and mechanisms of animal signaling 
systems. 

Structural aspects. The most basic structural 
question concerns the physical nature of the sig- 
nals. For any species this depends on the nature of 
the afferent and efferent adaptations and on the 
demands of the habitat. In principle every available 
avenue of exteroception is potentially exploitable 
for mediating social intercourse, and in fact, all are 
exploited by one animal form or another. There are, 
of course, restrictions imposed by sensitivities of 
the receptor systems and by glandular and motor 
capabilities for generating signals commensurate 
with these sensitivities. For instance, exposure of 
colored surfaces could have no value for intra- 
specific signaling in color-blind species. But many 
other factors (carefully reviewed by Marler 19614) 
may participate in the selection of signaling modes 
that are perhaps less apparent—as for instance, 
the time and distance over which social interactions 
occur, the sources of signal distortion in the signal- 
ing medium, and the exposure to predation associ- 
ated with signaling actions. Because signaling sys- 
tems are intrinsically social phenomena, their 
adaptive exploitation requires the characteristics of 
signal emission, transmission, and reception to be 
well matched, to insure the effective mediation of 
social episodes. 

This brings us to questions concerned with the 
nature of the particular signals, where similar de- 
grees of complexity must be faced. Only detailed, 
systematic investigation can reveal which features 
of a given signal possess social significance, i.e., 
are critical in determining the results of signal 
reception. Fortunately there have been some very 
notable successes in developing appropriate meth- 
ods for identifying the critical stimulus features 
governing social interactions, such as are described 
by Tinbergen (1951; 1953a). Ideally, the most 
effective procedure exposes animals to models in- 
corporating stimulus variations of the features of 
the naturally occurring signals and measures the 


effects of such variations on the conduct of recipi- 
ents. This has been practicable in a number of 
instances where the natural signal involves only a 
single sensory mode and remains relatively stable 
for long durations, such as the chemical signals 
that have been isolated as parts of the social adap- 
tations of various insects (Wilson 1963). Even 
with auditory signals, which are usually much 
briefer and less steady, advances in recording and 
reproducing techniques have made it possible to 
employ a close approximation to the model-manipu- 
lation approach by selective filtering of recordings 
of the original signals in order to determine the 
behavioral effects (Busnel 1963; Falls 1963); and 
it can be expected that before long the very close 
parallels to the model-manipulation procedures that 
have been so productively used in the analysis and 
synthesis of human speech sounds (e.g., Liberman 
1957) will begin to be used with equal effect by 
animal behaviorists concerned with acoustical sig- 
naling. 

Unfortunately, the full power of the model- 
manipulation method can perhaps never be fully 
brought to bear on the problem of identifying the 
significant features of many kinds of animal sig- 
nals. This is because, especially in social episodes 
involving close proximity of the participants, the 
signals may involve several sensory modalities 
simultaneously. The difficulties are further intensi- 
fied when the features of the signals are produced 
by many different concurrent motor adjustments, 
as in the signals of some carnivores and terrestrial 
primates (Hinde & Rowell 1962; DeVore 1965). 

When the signals are shifting patterns of multi- 
modal stimulus combinations, the determination 
of the contribution made by each aspect to the sig- 
nificance of the total complex may never be com- 
plete. Nevertheless, the ingenuity of investigators 
and the prospects for exploitable technological de- 
velopments in manipulating complex stimulus 
events in the field or laboratory, if not limitless, are 
at least sufficiently unpredictable to permit the 
hope that much progress can be made in analyzing 
the constituents of even the most complex and fluid 
signals, for we know that signal complexity can- 
not exceed the socially effective resolution capaci- 
ties of its recipients. We appear to be further aided 
in meeting this problem of signal complexity by 
the seeming absence of gestalt properties in their 
reception (Hess 1962). That is, the social effects 
of parts of a signal complex may interact in simply 
an additive fashion, so that the effects of subsets 
of the total complex may be studied separately. 
This possibility is well illustrated in studies by 
Miller and his associates (1959) in which they 
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compared the stimulus values of (1) fearful mon- 
keys, (2) life-size still photographs of fearful 
monkeys, and (3) life-size still photographs of just 
the facial regions of fearful monkeys in evoking 
conditioned avoidance responses in other monkeys. 
They found that all three different stimulus com- 
plexes were effective, which suggests that such 
conditioning procedures, together with advanced 
techniques of stimulus manipulation, might very 
well be valuable in analyzing critical features of 
even the most complex social signals. 

Perhaps an even more imposing barrier to the 
exploitation of the model-manipulation method is 
raised by the evidently very important role played 
by context effects in determining the behavioral 
consequences of the signals, especially the more 
elaborate modes of social adaptation. Thus, ap- 
parently, the same signal emitted in different social 
contexts (Schaller 1963) or following different 
prior signaling actions (Altmann 1962) does not 
evoke the same behavioral effects. As a result of 
such difficulties, this extremely valuable technique 
of analyzing the stimuli that control general animal 
behavior may be of limited use in studying many 
kinds of signals operating in social interactions. 

But there is no reason for pessimism. Careful 
field observation by competent (ecologically ori- 
ented) workers aided by good cinematic and audio 
field recording can be expected not only to yield 
precise identifications of the signals occurring in 
conventional behavioral interactions but also to dis- 
close what are their more or less invariant features 
—though perhaps not the particular contribution 
made by each, nor their interactions, in determin- 
ing the behavioral effects of signal reception. These 
prospects are heightened by the relative stability 
of animal signals. In profound contrast to human 
linguistic signals, which change incessantly and 
with great rapidity, nonlinguistic signals of man 
and other animals tend to be steady and to remain 
so for long periods of time. And since even the fluc- 
tuating auditory displays of songbirds (Thorpe 
1961a) and dolphins (Dreher & Evans 1964) tend 
to occur as repetitive trains, we can hope to attain 
rather close specifications of these signals. 

Not only must extended observation of free- 
ranging animals be relied on to furnish much of 
the information needed for determining the par- 
ticular physical characteristics; it is also required 
to provide additional structural data essential to 
understanding the functioning of any signaling 
system, We need to know the features of as many 
of the signals as possible, how variable each is, and 
to what degree the different classes of signals inter- 
grade. We also need to know their distributions of 
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occurrence in an animal population, in terms of 
their grading by age, sex, and social status and in 
terms of the presence and ranges of “dialectical” 
differences. 

Another very important question is whether sig- 
nals or their constituent features occur in different 
simultaneous or sequential combinations, That is, 
do these systems possess syntactical properties? So 
far only very few combinatorial phenomena have 
been found that are loosely analogous to the syn- 
tactic operations of intensification in human lan- 
guages. However, the foremost advantage of syn- 
tactic systems is the large number of different 
events provided by a few combinatorial elements, 
and as far as is known, nonlinguistic signaling 
systems involve impressively small numbers of 
different signals. 

Functional aspects. Once some degree of reso- 
lution can be provided for questions concerning the 
structural properties of animal signaling systems, 
combined observation and experimentation in the 
field and laboratory is also indispensable for answer- 
ing the next broad set of questions—those ad- 
dressed to the functioning of the systems. These 
questions occur on at least two levels. The first is 
concerned with the functional determinants of the 
glandular—muscular actions of the participants in 
a particular social episode; the other is concerned 
with the functioning of such episodes on an ecolog- 
ical level, that is, in the contribution that these 
conventionalized behaviors make to the species’ 
over-all mode of adaptation. Questions on both 
levels must be posed in terms relevant to the social 
and ecological settings into which the evolution of 
these signaling processes has fitted them. Although 
it is completely clear that deep understanding of 
functional problems cannot be achieved without 
recourse to experimental methods, the scientific 
value of the uses of such methods is directly pro- 
portional to their degree of congruence with in- 
formation concerning the nature of the ecological 
adaptations of animal populations. 

Before pursuing these functional considerations 
of animal signaling systems any further, we should 
pause briefly to immunize what is to follow against 
some of the conceptual poisons that so easily infect 
discussions of this kind. Fortunately, most of the 
deleterious aspects of the biologist’s notion of func- 
tion have been removed by the recognition of the 
worthlessness of teleological explanations for the 
teleological mechanisms that so pervade the phe- 
nomena of life. But they continue to pose a threat 
when the functioning of communicative acts is con- 
sidered, because of the seductive qualities of think- 
ing about signaling actions in terms of the purposes 


or intentions of the signaler. It is quite easy to 
avoid such thinking by thoroughly appreciating 
that animal signaling actions, such as the “waggle” 
dance performed by workers of some species of 
bees, are purposive in exactly the same way as a 
shark’s kidneys or a sparrow’s hemoglobin. Some 
day, perhaps, a behavioral theory will be developed 
in which the notion of purposive actions appears 
in a way that goes beyond the general idea of bio- 
logical adaptation and that sharply distinguishes, 
say, only some kinds of human actions as pur- 
posive in this special sense. Until then the term 
“purposive” will be a predicate more of features of 
biological populations than of specific actions, and 
we might do better just to use the term “adaptive” 
and get on with the task of discovering the specific 
features of particular signaling, neuromuscular, 
renal, and biochemical systems and how they 
function to confer selective advantage on their 
possessors. 

On the behavioral level two principal questions 
about the functioning of signaling systems confront 
us. The first is concerned with the circumstances 
of an animal's environment (external and internal) 
that determine its signaling actions and can be 
taken as concerning the reference of the signals 
generated by such actions. The second question 
concerns the behavior changes contingent upon 
signal reception, or what may be called the sig- 
nificance of the signal. Because the reference of a 
signal may be only indirectly connected to its sig- 
nificance, describing the functions of signaling 
systems in terms of transferring information or 
conveying meaning may often be quite strained. 
Even when the reference and significance of a sig- 
naling action can be specified, nothing is added on 
the behavioral level of analysis by assertions about 
the information transmitted or the message con- 
veyed, although these may be warranted, and in- 
deed important, considerations on an ecological 
level of analysis. 

The emotional components, In examining the 
events of signal emission from a viewpoint some- 
where between the glandular—muscular and eco- 
logical levels of analysis, one gets, along with Dar- 
win and many others, a strong impression of their 
fundamentally emotional character. That is, signal 
emission is associated with shifts in the levels of 
arousal of emotional or drive states in the animal, 
induced by the environmental changes to which the 
signal refers. This characterization is, of course, 
flagrantly anthropomorphic and becomes increas- 
ingly objectionable as the signaling systems of ani- 
mal forms below the birds and mammals are 
considered. At present there is not much meaning 
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to discussions of the emotional qualities of a 
spider’s courtship behaviors. Nevertheless, to the 
degree that the term “emotional” can be provided 
any technical meaning with respect to human be- 
havior, this anthropomorphistic usage does have 
some measure of justification, because of the emo- 
tional character of much of man’s nonlinguistic 
signals, particularly those emitted by human in- 
fants, and their apparent homology with the sig- 
nals of other animal forms (Andrew 1963; Schaller 
1963). Actually, a more forceful objection to the 
emphasis on the emotional qualities involved in the 
processes of animal signal emission might be that 
they are quite unexceptional in this regard. If the 
bulk of all animal behavior, man’s included, were 
organized and activated by emotional processes, 
then accounting for the apparent exceptions would 
present the most challenging problems. Perhaps the 
most notable of such exceptions would be those 


forms of human interaction, mediated by linguistic > 


signaling systems, that appear to be specifically 
protected, though often only flimsily, against emo- 
tional influences by traditional conventions (Hebb & 
Thompson 1954). But even when linguistic sig- 
nals do not refer to emotional processes (though 
they very often do), much of the variation in their 
linguistically nonsignificant features is subject to 
emotional determination. 

Not only do animal signaling actions appear to 
be associated with emotional processes in the 
emitter; their significance, that is, their effect on 
their receivers, is largely emotional as well. Char- 
acteristically, the consequences of signal reception 
involve changes in the emotional status of the re- 
ceiver, frequently leading to its emission of signal- 
ing actions in turn, which evoke further emotional 
changes in the initial signaler, and so on, until 
sometimes quite elaborate interaction episodes, 
such as the courtship patterns of many birds, are 
sequentially unfolded in this way. Obviously, there 
is more that transpires in such episodes than just 
exchanges of signals, but the important point is 
that the actions of the participants are steered by 
the emotional changes arising from signal reception. 

Mechanisms. The strength of these restrictions 
is unknown, but it has an important bearing on the 
kinds of answers that might be proposed to ques- 
tions about the mechanisms governing signal emis- 
sion, the behavior consequences of signal reception, 
and the development of these mechanisms in 
species and their members. The specific features of 
these mechanisms have not been established for 
any species, but gratifying progress has been made 
in determining the general nature of neurochemical 
components of the mechanisms and their dy- 


namics, particularly for some avian species (as, for 
instance, in the work of Lehrman 1962; Schein & 
Hale 1959; Holst & Saint Paul 1962). 

This work suggests that whatever the various 
mechanisms may be, we should be prepared to ex- 
pect them to be no different from the mechanisms 
governing other, nonsocial behavior. Similarly, we 
can expect the development of their components 
to involve interactions of various sorts between 
genetic and experiential factors (Marler 1963). 
And we should not be surprised to find, especially 
for the more advanced forms, that different devel- 
opmental histories underlie the mechanisms gov- 
erning the emission of different signals or the 
responses to them in a given species. For instance, 
it could very well be that for a particular signal 
hereditary factors are more penetrant in the de- 
velopment of the mechanisms governing its emis- 
sion than the behavioral results of its reception, or 
vice versa. Because of this we may find that cer- 
tain signaling mechanisms in some animals are 
highly plastic, not only in terms of their references, 
but also in the physical form of the signals they 
produce. This is already well established for phona- 
tory signaling of certain species of “talking” birds 
(Ginsburg 1960; Grosslight et al. 1962), but more 
surprisingly, the techniques of operant condition- 
ing have also been used to manipulate vocalization 
of some nonhuman mammals, such as the domestic 
dog (Salzinger & Waller 1962), the house cat 
(Molliver 1963), the bottlenose dolphin (Turner 
1964 ), and the chimpanzee (Hayes & Hayes 1950). 

However, it is far from clear that any of the 
above cases represents a departure from the emo- 
tional character of these signaling actions. The 
degree to which the activation of these signaling 
mechanisms may be divorced from emotional proc- 
esses is a particularly exciting question, for the 
answers may yield a better understanding of the 
basis of this independence in human speech 
(Geschwind 1964). 

JARVIS BASTIAN 


[Other relevant material may be found in EMOTION; 
ETHOLOGY; IMPRINTING; ĪNSTINCT; KINESICS.] 
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I 
COMMUNICATION MODELS AND SIGNALING BEHAVIOR 


To understand the ways whereby animals com- 
municate with each other requires the cooperative 
attack of a wide variety of disciplines, ranging from 
bioacoustics and biochemistry through anatomy to 
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sensory physiology and neurophysiology, and from 
comparative psychology and zoology to anthro- 
pology and linguistics. The diverse lines of research 
that have converged on the study of animal com- 
munication—which, in turn, seems so far to have 
constituted the principal axis of synthesis in the 
entire field of animal behavior—may together be 
subsumed under the label zoosemiotics. This word 
was coined (Sebeok 1963a, p. 465) to emphasize 
the necessary dependency of this emerging field on 
a science that involves, broadly, the coding of in- 
formation in cybernetic control processes and the 
consequences that are imposed by this categoriza- 
tion where living animals function as input-output 
linking devices in a biological version of the tra- 
ditional information-theory circuit, with a trans- 
coder added. Thus, the science of zoosemiotics lies 
at the intersection of general semiotics, the doctrine 
of signs, and ethology, the biological study of be- 
havior [see ErHOLOGY; SEMANTICS AND SEMIOTICS]. 


Zoosemiotics 


The essential unity of a zoosemiotic event may 
be decomposed, for a field observer's or laboratory 
experimenter’s convenience, into six aspects, and 
the sphere of animal communication studies has in 
practice tended to divide roughly in accordance with 
these dimensions. The hexagonal model which is 
suggested here (Broadhurst 1963, chapters 2, 3) 
entails a communication unit in which a relatively 
small amount of energy or matter in one animal, 
the source, brings about a relatively large redis- 
tribution of energy or matter in another animal, 
the destination, and postulates a channel through 
which the participants are capable of establishing 
and sustaining contact. Every source requires a 
transmitter, which serves, by a process called en- 
coding, to reorganize the messages it produces into 
a form that the channel can accommodate; and a 
receiver is required to reconvert, by a process called 
decoding, the incoming messages into a form that 
can be understood by the destination, The source 
and the destination are therefore said to share, at 
least partially, a code, which may be defined as 
that set of transformation rules whereby messages 
can be converted from one representation to an- 
other. An ordered selection from a conventional 
set of signs is a message. The physical embodi- 
ment of a message is a signal, which is usually 
mixed with noise. (The term “noise” here refers to 
variability at the destination not predictable from 
variability introduced at the source.) Finally, to be 
operative, the message presupposes a context re- 
ferred to and apprehensible by the destination. 

The origin, propagation, and effect of signs can 


be studied by an examination of the manner in 
which an animal encodes a message, how this is 
transmitted in a channel, and how the user decodes 
it. Since any form of physical energy can be ex- 
ploited for purposes of communication, a primary 
task is to specify the sensor or constellation of 
sensors employed among the members of a given 
species or among members of different species. 
Organisms may have sensors for chemicals in so- 
lution or dispersed in air (taste and smell), sensors 
for light (vision), sensors for pressure changes 
(tactile perception or hearing ), or still other sensors 
involving, for example, parts of the electromag- 
netic spectrum besides the visual portion. Many 
animals employ multiple sense organs: thus, com- 
munication among bees involves olfactory, optical, 
acoustic, and other mechanical signals; a herd of 
mule deer achieves social integration by hearing, 
vision, smell, and touch. 

Chemical systems provide the dominant means 
of communication in most animal species ( Wynne- 
Edwards 1962, chapter 6; Wilson & Bossert 1963; 
Bossert & Wilson 1963) and perhaps even in birds. 
Information is transferred by substances referred 
to as pheromones (Wilson 1963). Chemical sig- 
nals may be emitted by an animal's entire body 
cover, the skin, by special scent glands (e.g., as in 
ruminants), and by still other parts of the body. 
They may be received throughout the body (e.g., 
as in some aquatic invertebrates) or by specialized 
structures called chemoreceptors, that is, by the 
distal organs of smell and the proximal organs of 
taste. It is as yet unknown whether any animal can 
modulate the intensity or pulse frequency of 
pheromone emission to formulate new messages. 
Pheromones tend to function as yes—no signals: a 
particular scent either is produced or it is not; once 
emitted, however, the odor is very likely to persist 
and, thus, convey a message after the departure of 
its source from the site. The one great advantage, 
therefore, of chemical signals is their capacity— 
exploited for social integration by terrestrial mam- 
mals especially—to serve as vehicles of communi- 
cation into the future. This function whereby an 
individual can leave messages for another to find 
in his absence or, by a delayed feedback loop, even 
leave messages for himself, is analogous to the 
human use of script (Haldane 1955). Its over- 
whelming evolutionary value is confirmed by the 
fact, for example, that herbivores usually leave 
telltale trails, at the risk of revealing their where- 
abouts to carnivores, while some of the latter broad- 
cast a strong smell advertising their presence, at 
the sacrifice of part of the surprise element in 
hunting. 
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Optical systems (Marler 1961b; Wynne-Edwards 
1962, chapter 2; Tinbergen 1964) presuppose re- 
flected daylight in the case of diurnal species and 
bioluminescence (McElroy & Seliger 1962) in those 
species that dwell in dark but transparent media. 
Patterns of visual activity are highly variable as 
to shape and color, duration, and range of inten- 
sity; they can also be actively displayed by move- 
ments and postures, as in three-spined sticklebacks 
(Tinbergen 1953a); by facial expressions, as in 
the primates (Andrew 1964); or by intermittent 
flashing, as in fireflies. Thus, visual signs are both 
flexible and transient: they can be rapidly switched 
on or off. These capacities allow for precise coding 
of information and may even be exploited to mis- 
direct, as in protective displays involving “eyespots” 
in moths (Blest 1957). Ethologists, concerned with 
the origin and evolution of visual and other forms 
of signaling behavior, have described and pro- 
visionally classified them into three principal cate- 
gories: movements signifying intention, that is, 
those which seem to be preparatory or incomplete 
versions of functional acts (such as choking in 
kittiwakes); autonomic effects (such as piloerec- 
tion in dogs); and so-called displacement move- 
ments, that is, those which appear to be irrelevant 
in the context in which they are delivered (such as 
preening in pigeons during courtship). Ethologists 
think that there has been an evolutionary process 
of increased adaptation to visual signaling behav- 
ior; they call this modification ritualization, 

Tactile systems include rather disparate phe- 
nomena, from all sectors of the animal world. They 
all have in common the requirement that the indi- 
viduals communicating by such means must be in 
physical contact; suckling, copulating, fighting, 
social grooming or mutual preening. Although com- 
munication in this channel is thereby limited to 
relatively short distances, its effective range can be 
increased somewhat by the use of elongated feelers, 
such as antennae, tentacles, barbels, fin rays, or 
the like, and considerably further by, for example, 
lines of silk, as in many spiders (Wynne-Edwards 
1962, chapter 7). Since tactile signals are subject 
to wide variations in duration and intensity, they 
are particularly useful for the transmission of quan- 
titative information, such as distances, The nature 
of topographic discriminations achieved through an 
animal's body surface or through more or less 
sensitive contact receptors is not well understood; 
the possibilities of cutaneous message processing 
are, however, being explored (Geldard 1960), 

Partly because it has an immediate appeal to the 
imagination of men and partly because it was 
stimulated by technological refinements that be- 


came available in the decades after World War 11, 
the study of the mechanical vibrations by which 
some animals communicate constitutes one of the 
most advanced branches of zoosemiotics (Sym- 
posium on Animal Sounds and Communication 
1960; Busnel 1963). Bioacoustical systems may 
operate in the air, as with insects and with birds, 
which as a class are the most vocal of animals. The 
mammals that communicate this way are the bats 
(Griffin 1958), shrews, rodents, deer, seals (Bar- 
tholomew & Collias 1962), the carnivores, espe- 
cially Felidae (Moelk 1944) and Canidae (Hafez 
1962, pp. 442-443), and the monkeys and anthro- 
poid apes (Yerkes & Learned 1925; Altmann 1966). 
Bioacoustical systems for social communication 
and display may also operate underwater (Wynne- 
Edwards 1962, chapter 4; Symposium on Marine 
Bio-acoustics 1964) and have been variously recog- 
nized among Crustacea, aquatic insects, fishes, 
and Cetacea; in recent years whale communication 
has been receiving increased attention (Sebeok 
1963a). Finally, rhythmic changes of density in a 
solid may also form an acoustical system, e.g., the 
quacks of queen honeybees are transmitted directly 
through the hive material (Wenner 1962). Al- 
though both visual and auditory perception occur 
in space and time, vision is pre-eminently the 
spatial sense, just as audition is the temporal. In 
acoustic communication, reaction times are typical- 
ly fast (the speed of response in the avian ear is 
estimated to be about ten times that in the human: 
Thorpe 1963a) and sound signals can be received 
at as great distances as chemical ones. The emis- 
sion of sounds involves a minute output of energy, 
and their transient character makes accurate tim- 
ing possible. If perceived by the proper receiving 
organs, sounds—especially those in the higher fre- 
quencies—can be directional, when it is advantage- 
ous that information about the sender's location 
be broadcast; on the other hand, with the use of 
lower frequencies the sender’s whereabouts may be 
kept concealed. Sound fills the entire space around 
the source and therefore does not require a straight 
line of connection with the receiver: a signal can 
travel around corners and is not usually inter- 
rupted by obstacles. This flexibility in frequencies, 
intensities, and patterns is important in that it 
allows for considerable specific differentiations, as 
well as, within a species, for individual variation 
with many shadings and emphases. 
Echolocation—where the encoder and decoder 
of an acoustic (or, sometimes, electrically coded) 
message is the same living animal in a situation of 
rapid feedback—may be regarded as a special case 
of communication. This phenomenon was first dis- 
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covered in bats (Griffin 1958) but has since been 
found in all sorts of other animals, including 
gyrinid beetles, the South American oilbird, the 
southeast Asian swiftlet, and some seals and por- 
poises (Kellogg 1961). It has also been shown that 
moths of the family Arctiidae emit noisy ultra- 
sonic pulses when they detect insectivorous bats 
pursuing them and that these bursts may function 
to mislead their predators (Dunning & Roeder 
1965). 

It is well known that certain fishes generate elec- 
tric fields. It seems probable that some of the feebler 
impulses are employed for signaling, at least in 
those species (such as a number of mormyrids and 
gymnotids) where the frequencies and patterns of 
discharge are distinct (Wynne-Edwards 1962, chap- 
ter 5). Such electric rhythms have even been com- 
pared to the song displays of birds and ascribed an 
analogous territorial function. While it is suspected 
that thermal sensation, also, is employed as a com- 
munication channel among certain animals, there 
is as yet insufficient evidence to substantiate this. 

Matter, as well as energy, may serve as a mes- 
sage conductor, as in honeybees, ants, and termites, 
where processes that bring food and water also 
transport information vital to the survival of the 
colony. This form of semiotic biosocial facilitation 
of interindividual stimulative relationships is known 
as trophallaxis (Wheeler 1926; Lindauer 1962). 


Context and significance of signals 


Although it is both necessary and proper to dis- 
tinguish the several channels of animal communi- 
cation and to study each in isolation, the redun- 
dancy that prevails among the multiplicity of bands 
in natural systems—an effect sometimes referred 
to as the law of heterogeneous summation—must 
soon become an object of both theoretical and prac- 
tical concern. The over-all code that regulates an 
animal communication network often seems to con- 
sist of a set of subcodes grouped in a hierarchy. 
Fluctuations that occur in these subcodes are de- 
termined by such factors as the kind of information 
to be transmitted, the availability of alternative 
channels, and the distance between source and 
receiver. Thus, in the mountain gorilla, vocaliza- 
tions employed when the individual is hidden from 
view by dense vegetation serve to draw attention 
to the animal emitting them; these sounds notify 
other gorillas of the specific emotional state of the 
performer and alert them to watch for gestures, 
which then communicate further information. Pos- 
tures and gestures, especially facial expressions, 
coordinate behavior within the group when the 
distance between the members decreases; the vis- 


ual subcode is in turn replaced by the tactile sub- 
code when distance is still further diminished, as 
between a female and her small infant (Schaller 
1963). 

In each species the source of a message must 
share a code with its destination. This is the criti- 
cal element of communicative commerce; it con- 
stitutes a particularized version of the universal 
“need to know.” Every emitting organism's selec- 
tion of a message out of its species-consistent code, 
as well as the receiving organism’s apprehension 
of it, proceeds either in accordance with a “closed” 
genetic program, automatically and predictably 
shaping a wholly prefabricated set of responses, or 
with reference to the animal's unique memory store, 
which then directs the way in which the genetically 
precoded portion of the total behavior program is 
acted out. Even in “open” behavior programs cer- 
tain releaser mechanisms can begin to function 
only at predetermined stages of maturation. The 
relationship of learning to instinct in animals in 
general has been reviewed by Thorpe (1963b), and 
the relative influence of inheritance and learning 
in the development of vocalizations by Marler 
(1963), while the question of learning from alien 
species, especially the capacity of certain birds and 
mammals to imitate human speech—the phenome- 
non of mimicry—has been examined by Andrew 
(1962). As one would expect, the higher verte- 
brates tend to be increasingly malleable through 
learning experience—certain domesticated species 
can even be motivated by verbal instruction (ele- 
phants, for example, can be trained to distinguish 
between up to 24 different commands by purely 
auditory perception ). 

The basic assumption of zoosemantics is that in 
the last analysis all animals are social beings, each 
species with a characteristic set of communication 
problems to solve. All organic alliances presuppose 
a measure of communication: protozoa interchange 
signals; an aggregate of cells becomes an organism 
by virtue of the fact that the component cells can 
influence one another. Creatures of the same spe- 
cies must locate and identify each other; moreover, 
they must convey information as to what niche 
they occupy, in territory as well as status, in the 
social hierarchy and also as to their momentary 
mood. Intraspecific and interspecific messages fur- 
thering ends such as these can be coarsely cate- 
gorized in terms of their ecological or functional 
contexts, and different scholars have devised 
roughly comparable, but all more or less subjec- 
tive, schemes to classify the supposed signification 
of the signals they have observed. Thus, it is pos- 
sible to note intraspecific expressions of threat, 
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warning, fear, pain, hunger, and—at least in the 
highest of animals—such feelings as defiance, well- 
being, superiority, elation, excitement, friendliness, 
submission, dejection, and solicitude and interspe- 
cific warning signals, intimidating signals, decoying 
signals, and positive or negative masking signals 
(Wynne-Edwards 1962, pp. 24-25). Frings and 
Frings (1964) have noted species identification in 
aggregational systems (cellular, sessile, mobile, 
social, and interspecific) and dispersal systems 
(ritualized fighting, aggressive displays, and terri- 
torial behavior); social cooperation, involving such 
items of information as alarm signals (subclassi- 
fied as indicators of departure, distress, warning, 
and the like) and food signals; sexual attraction 
and recognition; courtship and mating; and com- 
munication in the parent—young relationship. Col- 
lias (1960, p. 387) relates all acoustic messages to 
five contexts: food gathering, predatism, sexual 
and allied fighting behavior, parent—young inter- 
relations, and aggregations and group movements. 
Armstrong, with due cautioning about difficulties 
and uncertainties, lists the categories required to 
cover the range of context in the auditory com- 
munication of birds (see Table 1). 


Table 1 
Identity Motivation Environment 
Species Sexual need Location 
Sex Need (other than sexual) (a) individuals 
Individuality Escape or alarm (b) objects 
Status Territory 
Predators 


Source: Adapted from Armstrong 1963, p. 6. 


The associative ties between signals and their 
meanings are often arbitrary (as opposed to iconic) : 
thus, tail movements in a dog denote friendship; 
in a cat, hostility; and in a horse, the presence of 
flies. Some signals are “shifters” (Sebeok 1965a), 
whose meaning differs according to the situation: 
thus, the honeybee’s tail-wagging dance has more 
than one denotatum, for it designates either a food 
source or a nesting site; the gesture pattern is 
identical, its pragmatic import depending, not upon 
variation in the form of the expression, but solely 
upon the attendant physical context. The herring 
gull’s head tossing has more than one function: it 
occurs as a precoital display, but this is indistin- 
guishable from the head tossing exhibited by a 
female begging for food (Tinbergen [1953b] 1961, 
p. 112). 

The importance of compiling a complete inven- 
tory of the behavior of every species studied has 
been stressed by, among others, Tinbergen (1951), 


who called collections of raw materials of this sort 
“ethograms.” An ethogram should, of course, in- 
corporate a description of the zoosyntactic proper- 
ties of the codes peculiar to each species. In this 
respect an ethographer plays the role of a crypt- 
analyst, receiving messages not destined for him 
and ignorant of the pertinent transformation rules. 
Although there are perhaps 700,000 animal species 
extant, virtually none of the codes in use is fully 
understood, not even the one regulating the remark- 
able communication system honeybees have devel- 
oped. While the fact that these bees perform 
intricate movements—the famous “dances”—in di- 
recting their hivemates to a source of food supply 
or to new quarters has been widely reported and 
is now a familiar story, it is not so well known that 
these insects transmit information through acoustic 
means as well. It has been amply demonstrated 
that the length of the train of sound emitted during 
the straight run of the dance tells the distance of 
the find (Wenner 1964), but this discovery is bound 
to be only a first step in the comprehensive un- 
folding of the auditory subcode of the bees; re- 
searchers in several laboratories are now investigat- 
ing this hardly anticipated facet of the apiarian 
ethogram. 


Following certain distinctions noted by Morris 
(1946, pp. 219-220), it seems useful to mark three 
possible approaches to zoosemiotics: pure, descrip- 
tive, and applied. 

Pure zoosemiotics is concerned with the elabora- 
tion of theoretical models or, in the broadest sense, 
with the development of a language designed to 
deal scientifically with animal signaling behavior. 

Descriptive zoosemiotics comprehends the study 
of animal communication as a natural and as a 
behavioral science, in its pragmatic, semantic, and 
syntactic aspects, as briefly outlined in this article. 

Finally, applied zoosemiotics aims to deal with 
the exploitation of animal communication systems 
for the benefit of man, Utilitarian applications— 
tasks, in the main, for the future—may be con- 
fidently envisaged in wildlife management (e.g., 
control of the coyote by input of its own or its 
prey’s distress calls); agriculture (e.g., pollination 
of orchard trees by electronically programmed 
bees); and pest control (e.g., the prevention of 
woodpecker damage to wooden utility poles by use 
of repellent chemical and auditory signals), Our 
knowledge of basic zoosemiotic processes may also 
be put to practical uses to supplement existing hu- 
man information-handling devices (in aiding the 
deaf and the blind, in assisting communication 
with man in outer space), and to advance bionics, 


38 COMMUNICATION, ANIMAL: Models and Behavior 


the science of converting living systems into me- 
chanical and electrical analogues. Linguists and 
psycholinguists who are concerned with animal 
communication are interested chiefly in disclosing 
the biological and anthropological origins of human 
communication. They seek answers to particular 
questions such as these: what are the anatomical 
and physiological correlates of verbal behavior and 
what sensory and cognitive specializations are re- 
quired for language perception? what motivates 
the onset and accomplishment of language learning 
in the development of human infants? why do sub- 
human forms lack the capacity to acquire even the 
beginnings of language? how can present evolu- 
tionary theory account for the unique form and be- 
havior of language in man? what is the genetic 
basis for language propensity, man’s species-specific 
biological endowment? Zoosemiotics, although still 
in its infancy, provides the scientist with a simpler 
setting in which to search for solutions than does 
the far more complex sociobiological environment 
that constitutes the framework of man’s commu- 
nicative behavior. 

THOMAS A, SEBEOK 


[See also ErHoLocy; SEMANTICS AND SEMIOTICS; SO- 
CIAL BEHAVIOR, ANIMAL.] 
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I 
THE STUDY OF MASS COMMUNICATION 


Urbanization, industrialization, and moderniza- 
tion have created the societal conditions for the 
development of mass communications. In turn, 
these processes of social change produce societies 
that are highly dependent on mass communica- 
tions. Mass communications comprise the institu- 
tions and techniques by which specialized social 
groups employ technological devices (press, radio, 
films, etc.) to disseminate symbolic content to 
large heterogeneous and widely dispersed audi- 
ences, In other words, mass communications per- 
form essential functions for a society that uses 
complex technology to control the environment. 
These functions of mass communications include 
the transmission of a society’s heritage from one 
generation to another, the collection of information 
for the surveillance of the environment, and the 
correlation of the various parts of the society in 
response to changes in the environment. Social 
science research on mass communications seeks 
an objective understanding of the institutions that 
fashion mass communications and the conse- 
quences of communication and mass persuasion 
for human society. 

The social scientific perspective. In surveying 
the extensive research on mass communications, 
one finds that there are great gaps between the 
orientations of social scientists and those of mass 
media personnel and their critics. First, there is a 
great difference in estimates of the effectiveness 
and potency of the mass media based on the find- 
ings of social scientists as compared with the view- 
points of those directly involved in operating the 
channels of mass communications. Mass media 
personnel, as well as their critics, tend to contend 
that the mass media are all-pervasive influences 
and powerful agents of social change. Thus they 
point to the individual and dramatic impact of 
specific programs and campaigns, such as the 
publication in Germany of President Woodrow 
Wilson’s Fourteen Points early in 1918, or Or- 
son Welles’s dramatization of H. G. Wells’s War 
of the Worlds over the CBS radio network in 1938 
(for which see Cantril 1940). They point to the 
long-term consequences of the mass media in 
fashioning tastes and moral standards and in cre- 
ating images of political leaders. While social scien- 
tists continue to differ in their particular inferences 
and conclusions, in general they tend to view the 
impact of the mass media as circumscribed. They 
see the mass media as limited agents of social 
change and as only one element among others, 
such as technological progress, organizational 
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controls, cultural and ideological forms, and the 
processes of socialization and personality develop- 
ment. 

In part, this gap is due to the different questions 
being asked by mass media personnel and by social 
scientists. Professional practitioners in the mass 
media are seeking specific and pragmatic answers 
to practical communications problems, while re- 
search workers are more concerned with general 
principles and hypotheses. In part, this gap is due 
to the weaknesses and limitations of social science 
research on mass communications, which, because 
of its highly fragmented character, is often not 
cumulative and therefore unable to supply valid 
answers to basic issues. 

Second, the mass media have been subjected to 
uninhibited social criticism by some intellectuals 
and practitioners who see them as contributing to 
the demise of civilization. These critics hold the 
view that the growth of the mass media, in and of 
itself, deteriorates moral and intellectual standards. 
This point of view stands in contrast to the aspira- 
tion of intellectuals at the turn of the century, who 
hoped that with the proliferation of the mass media, 
modern society, however large and complicated, 
could yet fulfill the requirements of the democratic 
process. It was in the mass media that hopeful 
thinkers saw a new opportunity for mass education 
and the elevation of men’s minds. Modern political 
history has undermined such intellectual hopes, 
and in the contemporary world the mass media are 
seen by critics as speeding up the development of 
a mass society and the destruction of individuality. 
But the social scientific point of view must reject 
the notion that the growth of the mass media 
necessarily produces an undifferentiated society 
with a general lack of articulation and an inability 
to make collective decisions. Researchers must see 
the mass media as instruments of social control 
and social change that may have either positive or 
negative consequences, depending upon their or- 
ganization and content. 

Popular images of the pervasive effects of the 
mass media were generated by the use of propa- 
ganda during World War 1, by the growth of mass 
advertising in the United States during the 1920s, 
and by the use of mass techniques of agitation in 
the rise of European totalitarian movements. Thus, 
it is understandable that the first results of empiri- 
cal research were to challenge such perspectives 
and to debunk popular notions. For example, al- 
though the pioneering studies on the impact of the 
movies carried out under the auspices of the Payne 
Foundation (Charters 1933) showed definite and 
discernible consequences of moviegoing for youth 
behavior—often socially undesirable consequences 
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—the over-all conclusions hardly attributed a per- 
vasive influence to the film in shaping youth cul- 
ture. Specific studies on totalitarian’ states con- 
ducted during World War m and thereafter also 
revealed that after the seizure of power by dictators 
mass persuasion became less important as a basis 
of control in these political systems. The image of 
limited communication effect was, in particular, 
reinforced by research on the basis of civilian and 
military morale in Germany and Japan (see, for 
instance, Shils & Janowitz 1948), which showed 
that ideology was of limited significance and that 
hostile propaganda could operate only within spe- 
cific confines, In addition, laboratory studies on 
the impact of the mass media, as well as studies 
using the sample survey technique, also tended to 
produce findings that highlighted the limitations 
of mass effects, especially since these research 
procedures were used mainly to study specific 
messages and short-term effects. 

Nevertheless, students of mass communications 
recognize that available research describes only 
part of a complex process and that the findings 
of specific empirical studies need to be evaluated 
and integrated by means of a more systematic 
frame of reference that takes into consideration 
the fundamental nature of personality and the 
broader process of social change. This frame of 
reference includes, first, the assumption that the 
mass media both reflect the social structure and 
social values of a society and operate as agents of 
social change. Because of the diffuse nature of 
communications processes, the mass media are 
both causes and effects; or, in the language of 
social research, they are both independent and 
dependent variables. Therefore, the full range of 
effects can only be understood by making infer- 
ences about causal processes. Second, the analysis 
of mass communications involves not only a study 
of the continuous process of transmitting symbols 
and their effect on audiences but also the equally 
complex and subtle process by which the audience 
communicates with and influences the communi- 
cator. In fact, this assumption implies that the 
analysis of mass communications is incomplete 
unless this two-way process is included. Third, 
mass communication systems invariably involve 
an interplay with interpersonal communications. 
Again, a comprehensive analysis requires the study 
of how interpersonal communications condition 
the communicator as he produces messages and 
content and, in turn, how interpersonal communi- 
cations negate or increase the impact of mass com- 
munications on audiences. 

As a result of the complexity of the mass com- 


munication process, most research has been ori- 
ented toward probing one or another phase of the 
total process. Harold Lasswell’s phrase, “Who says 
what to whom with what effect?” has been the 
general format in which specific research proceeds: 
The “who” question includes the study of the organ- 
ization and personnel of the mass media; “what” 
refers to the content of the mass media; “to whom” 
points to the structure of the audience and various 
audience characteristics; and the “with what effect” 
aspect has received attention in the studies of mass 
media impact and audience response. Although this 
format was coined over thirty years ago (Lasswell 
1932), it still presents a highly useful approach 
for integrating the large number of diverse ap- 
proaches to the study of the mass media of com- 
munications and their effects. But the study of 
each element must be thought of as a step in 
understanding the total process and especially in 
estimating the long-run consequences of mass 
media. 


Personnel and organizational structure 


The “who” question has been investigated through 
two different but highly interrelated approaches. 
First, who are the people—the managers, directors, 
writers, performers—who produce and transmit 
mass communications? This is the sociology of an 
occupational and professional group. What are the 
social origins, educational backgrounds, career 
lines, and professional organizations of mass media 
personnel? What type of personalities are attracted 
to work in the mass media, and what are their 
self-images and social perspectives? 

Second, since mass communications must inevi- 
tably be produced by large organized collectivities 
rather than by individual persons or small groups, 
what are the decision-making processes in mass 
media enterprises? How are these enterprises struc- 
tured in terms of status, power, and other elements 
of social control? What consequences do the tech- 
nological characteristics of the various media have 
on their internal organization? How does the con- 
trol of the mass media relate to the economic and 
political organization of the society? 

Personnel and professionalization have been the 
least explored aspects of the mass media. However, 
two comprehensive studies by Leo Rosten—one on 
the Washington corps of correspondents (1937) 
and the other on the Hollywood movie colony 
(1941)—reveal several central issues. The Wash- 
ington correspondents represent a case of the highly 
developed but informal type of professionalization, 
where rules and regulations concerning standards 
of performance have evolved and are enforced by 


colleague pressures so as to raise the level of per- 
formance, On the other hand, Hollywood, as a 
movie colony and subsequently as a television 
center, represents an extreme case of the type of 
media establishment that has a high level of social 
and interpersonal tensions; in such communica- 
tions enterprises the demand for spontaneity and 
creativity necessarily outruns human energies. The 
popular stereotype of Hollywood as a frenzied, 
schizoid community staffed by persons with con- 
stant fears of failure and frequent feelings of self- 
hatred is a caricature that is apparently not with- 
out support in fact. 

The limited number of studies of the sociology 
of the creative arts in the mass media, together 
with astute observations of participants who have 
written on the subject, such as James T. Farrell 
(1946), suggest that one major source of discon- 
tent and “alienation” among mass media personnel 
derives from the organizational need to bureauc- 
ratize creative effort. The result is a divorce of 
creative workers from control over and identifica- 
tion with the end products of their work. The term 
“bureaucratization” must be used with reservation, 
for a considerable number of productive activities 
in the mass media have not become rationalized. 
Therefore, sociological observations about aliena- 
tion among the producers of mass aesthetics are 
difficult to translate into precise conclusions. These 
notions apply to the very small numbers of truly 
creative personalities and not to the vast bulk of 
symbol handlers and technicians. Moreover, it 
would appear to be an error to assume that inevi- 
tably the essence of creativeness is lost in organized 
group effort. We need merely to recall the corps of 
assistants who worked with Michelangelo and 
Rubens or the monuments to collective artistic 
creativity such as the Sainte-Chapelle and the 
cathedrals of Chartres and Milan. 

The significant point is not that artistic and 
creative work has been collectivized in the mass 
media for the first time in human experience, but 
that it has been extensively collectivized on a scale 
never before possible. The technological require- 
ments of the mass media and the exaggerated 
pressure to create rapidly under deadlines force a 
high degree of specialization and a detailed review 
of each person’s efforts. In the setting of a massive 
and complex organization, as is to be found in 
many of the mass media, it is not difficult for the 
individual worker, whether artist or not, to lose or 
otherwise abnegate his sense of personal respon- 
sibility for the quality of the work eventually pro- 
duced. Nevertheless, against this response must be 
weighed the pressures of creativity or of profes- 


COMMUNICATION, MASS: Study 43 


sional responsibility to maintain areas of individu- 
ality even in these large-scale organizations. One of 
the reasons that such pressures continue to exist is 
that the pervasive demand for new ideas and new 
content insures a constant and ever increasing 
search for talent. 

It is also important to distinguish between 
genuine creativity and professional responsibility 
among mass media personnel. Social research has 
little to say about the conditions under which 
genuine creativity appears, but it is clear that the 
organization of the mass media has tended to in- 
hibit or at least dampen the development of pro- 
fessional responsibility. It is very difficult to apply 
the concept of professionalism to the mass media 
personnel in a one-party state, while in nations 
with multiple-party systems and relatively auton- 
omous communications institutions, the status of 
mass media personnel is more that of employees 
of a large-scale organization than that of practicing 
professionals. Even in Great Britain, where the 
organization of journalists is highly developed, the 
professional associations are more concerned with 
conditions of work than with professional stand- 
ards. In democratic societies there are no bodies 
for enforcing professional standards among jour- 
nalists or even quasi-public bodies for reviewing 
and evaluating their performance. Professionaliza- 
tion therefore takes the more limited form of an 
emphasis on more adequate educational prepara- 
tion (a topic about which there is little agreement), 
a concern with informal relations among special- 
ists, and the development of devices such as the 
increased reliance on the “by-line” in order to iden- 
tify mass media products with individual producers 
and writers [see JOURNALISM], 

The absence of higher levels of professionaliza- 
tion in the mass media is a result of the structure 
and process of decision making within the mass 
media. Because of the presumed importance of the 
mass media as instruments of social and political 
control, these institutions become fused with the 
basic control structure of any society. In a totali- 
tarian state, this control is comprehensive but not 
without inherent limitations. If the media of such 
political systems are to serve more than merely 
to reaffirm basic societal loyalties, and if they are 
to disseminate information and contribute to col- 
lective problem solving, then some limited degree 
of independence is required. Alex Inkeles, in his 
Public Opinion in Soviet Russia (1950), a research 
study that describes the organization of the Soviet 
mass media, points to such devices as letters to the 
editors and reports of self-criticism as efforts to 
increase the validity and acceptance of mass media 
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content. In some one-party socialist societies there 
have been modifications of central party controls, 
including the development of limited areas of pro- 
fessional responsibility for mass media personnel. 
Often the modification takes the form of creating 
specialized periodicals with limited circulation to 
reach specialized groups without disturbing the 
larger process of mass media control via precen- 
sorship. 

In multiparty states with mixed forms of media 
ownership and control, the historical development 
of the mass media shows a trend toward greater 
freedom from government control. In such states 
there is typically an emphasis on the necessity of 
an independent and competitive mass media sys- 
tem. However, political theorists have come more 
and more to recognize that the removal of govern- 
mental interference does not necessarily, or in fact, 
produce mass media systems that meet all the re- 
quirements of a free society. There have been a 
small number of penetrating studies of the control 
Structure of the mass media in the United States 
and Great Britain. Most of these studies were un- 
dertaken by foundations, universities, and, in a few 
notable cases, governmental agencies. They all con- 
cluded that certain technological, economic, and 
organizational factors may prevent competition 
from supplying an effective basis for high levels of 
mass media performance. One of the most irapor- 
tant of these studies was conducted in the United 
States under the aegis of a quasi-public sponsor, 
the Commission on the Freedom of the Press. It is 
noteworthy that the principal financial supporter 
of this commission was Henry Luce, head of the 
Time-Life publishing corporation. 

The work of the commission included historical 
surveys of the radio, motion picture, and book in- 
dustries, as well as a comprehensive review of the 
role of the government in the mass media process. 
The policy recommendations of the commission 
(Commission on Freedom of the Press 1947) were 
afterward closely paralleled by the findings of the 
British Royal Commission on the Press (Great 
Britain . . . 1947-1948). In short, while govern- 
ment interference was rejected by both reports, 
the view they set forth was that traditional concep- 
tions of competition would not guarantee adequate 
media performance. Instead, it was recommended 
that the mass media accept public responsibility 
for presenting a comprehensive and meaningful 
interpretation of contemporary events and that the 
government would have to take a positive role in 
this process. 

Underlying these recommendations was a series 
of empirical observations that were documented in 
the 1940s and have been repeatedly confirmed by 


subsequent research. For the United States, these 
studies point to a drift of the major media toward 
increasing centralization in their decision-making 
processes, but none has even suggested that com- 
plete monopolistic control is or will be the outcome 
in any of the mass media industries. The evolving 
pattern is rather that which appears to obtain in 
many other areas of mass production, namely, that 
a limited number of very large units dominate a 
wide sector of a particular medium or even a com- 
bination of media. That a degree of competition 
has characterized the relations among these or- 
ganizational giants cannot be denied, This compe- 
tition is to a considerable extent enhanced by the 
fact that the audience can choose between various 
media. 

Moreover, technological changes do not inher- 
ently move in the direction of supporting more and 
more concentration. For example, frequency modu- 
lation (FM) radio has introduced a new network 
of decentralized units; and even in the newspaper 
field in the United States, the trend toward con- 
solidation has leveled off as new reproduction tech- 
niques have been introduced. Equally apparent, 
however, is the fact that the large producers of 
mass communications have often cooperated with 
each other in generally successful efforts to fend 
off attempts by other, supposedly countervailing, 
power groups (such as the government, the 
churches, and other public or private interest or- 
ganizations) to influence decisions regarding the 
structure and content of the mass media. 

In assessing the consequences of this drift to- 
ward power concentration, simple stereotyped con- 
clusions are not warranted. For the United States, 
there is considerable evidence that the larger and 
more all-embracing these industries become, the 
more they come to resemble public institutions and 
the more sensitive they grow to the shifting im- 
peratives of public opinion, public relations, and 
public responsibility. Of course, the mass media 
have developed codes of performance to protect 
themselves from the extreme excesses of public 
pressure. These codes have tended to be negative in 
outlook and to neglect the needs of specialized 
audiences. It has been suggested that in some cir- 
cumstances the fewer the units of mass communi- 
cations the less they are susceptible to the dictates 
of particular outside vested interest groups. Thus 
it is argued, for example, that publishers in a com- 
munity with only one newspaper are relatively 
immune to the pressures of advertisers, inasmuch 
as the latter have no recourse to the threat of tak- 
ing their business elsewhere. 

To make these observations is not to suggest, 
however, that where mass media are operated as 


business enterprises a community of interests with 
other business enterprises fails to operate. Nor is 
it to argue that the mass communicators’ growing 
consciousness of the attitudes of the public means 
that the control of the mass media is inevitably 
becoming more responsible. The meaning of public 
opinion and public responsibility may be read and 
interpreted in different ways. Recourse to the dic- 
tates of public taste and opinion may quite con- 
ceivably mean little more than the misuse of survey 
data to justify existing tastes, rather than stimulate 
new and more enlightened interests. Furthermore, 
an easy acquiescence to the amorphous and often 
ambiguous desires of the audience may simply re- 
inforce those pressures and opportunities for the 
abdication of leadership responsibility for change. 
If in the democratic societies the issues of media 
control focus on professional and organizational 
responsibility, throughout wide sectors of the world 
the basic issue is still the establishment of greater 
autonomy of the press. Raymond Nixon's investi- 
gation of the freedom of the press throughout the 
world indicates that there is a gradual trend toward 
broader freedom. He linked this development to 
higher levels of education and economic develop- 
ment (Nixon & Ward 1961 ). However, there is no 
reason to believe in the inevitability of such a 
trend or to assume that specific variables are at 
work, This issue involves the most fundamental 
and complex processes of political development. 


Content of mass communications 


The symbols and messages of the mass media 
are available for widespread consumption. The 
sheer availability of mass communications content 
has stimulated a considerable body of research into 
the “what” aspect of Harold Lasswell’s four-faceted 
question. Research into mass media content, or 
content analysis, as it has come to be called, has 
been particularly influenced by the recognition that 
such content is amenable to quantitative treatment 
(see especially Lasswell & Leites 1949; Berelson 
1952), As a result, content analysis procedures, 
both quantitative and qualitative, have been ap- 
plied to all types of media content, most frequently 
for descriptive purposes and to a lesser extent as an 
analytical tool for analyzing the communications 
process and its impact. First, content analysis has 
been an effective instrument for describing both 
short-term and long-term trends in media content. 
The range of topics covered by descriptive trend 
studies is indeed broad: for example, such studies 
have traced the decline in prophetic religious 
themes in popular sermons (Hamilton 1942), the 
growth of scientific authority as a basis of child- 
rearing advice in women’s magazines (Mead & 
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Wolfenstein 1955), and the trend in propaganda, 
from World War 1 to World War 1, toward a less 
emotional, less moralistic, and more truthful 
orientation (Kris & Leites 1947). Second, content 
analysis supplies an approach for comparing the 
same material as presented in different media 
within a nation or the contents of the same media 
as between different nations, For example, Asheim 
(1949) found that in converting dramatic fiction 
into movies the result was not to produce “happy” 
ends but rather indeterminate endings. For cross- 
national purposes, school textbooks have supplied a 
convenient device for revealing societal differences 
(Walworth 1938). 

Third, the procedures of content analysis are 
also particularly appropriate for comparing media 
content with some explicit set of standards or ab- 
stract categories. For example, studies have been 
done to determine whether newspapers conform in 
their content to particular standards. In this vein, 
content analysis is also used as a form of propa- 
ganda detection or propaganda analysis. The ob- 
jective may be to identify the use of particular 
propaganda devices, such as simplification, glitter- 
ing generalities, testimonials, and the like. Alter- 
natively, the objective may be to uncover propa- 
ganda strategies by the use of such analytical 
categories as distortion, parallel presentations, or 
patterns of imbalance. Propaganda analysis can 
be based on comparing a suspect source with a 
set of categories derived from a source identified 
as representing a biased or propaganda outlet. 
This particular technique was applied during World 
War n by the Organizations and Propaganda Anal- 
ysis Section of the U.S. Department of Justice to 
compare native fascist publications in the United 
States with Axis media content. Findings based on 
these procedures were admitted in court trials of 
Nazi agents and native fascists and had the conse- 
quence of absolving many suspect newspapers and 
radio stations from charges that were based on 
only a few examples of bias in their presentation 
of news or in their editorial comments. 

Fourth, content analysis offers a relatively pre- 
cise technique for describing the diffusion of sci- 
entific and scholarly knowledge and for observing 
the process of popularization of scientific materials. 
A leading example of this aspect of content anal- 
ysis is the readability test (Flesch 1951); such 
tests have been developed to help editors and pub- 
lishers judge the difficulty of a particular commu- 
nication and to estimate the type and size of 
audiences that can readily understand the message. 

On the basis of the existing body of quantitative 
and qualitative research, several broad generaliza- 
tions may be made about the contents of mass com- 
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munications. First, what is communicated by the 
mass media is a highly selected, nonrepresentative 
sample of all that is available for communication. 
Likewise, the content that is received and con- 
sumed by the potential audience is a highly selected 
sample of all that is communicated. Second, con- 
siderably more communications content is enter- 
taining than informative; there is more of the sort 
that distracts and diverts attention and less of the 
quality that stimulates consideration of the central 
social, economic, and political problems of living. 
However, notwithstanding the demonstrable dif- 
ference between the contents of mass communica- 
tions and the contents of human existence, there 
remains in the mass media a quantity of the sober 
and serious, the educational and informational, so 
considerable that it has served, in the view of some 
experts (see, for instance, Schramm 1954), to con- 
found and confuse rather than to educate and 
inform segments of the mass audience. Third, be- 
cause mass communications are commonly aimed 
at the largest possible audience, most of them are 
simple in form and uncomplicated in content. In 
their desire to be understood by the overwhelming 
majority of their audiences, mass communicators 
have tended to de-emphasize intricate presenta- 
tions, the meanings of which may be unclear and 
may be misinterpreted. However, with the growth 
of mass literacy and with ever larger segments of 
the population receiving college-level education, 
counter trends have developed. As a society enters 
the phase of advanced industrialization, there is 
a trend within the general mass media toward de- 
voting a portion of their content to high-level mate- 
rial, just as there is a trend toward higher quality 
in those media, such as FM radio and certain pub- 
lishing houses, that cater to more specialized au- 
diences. 

If content analysis procedures are to produce 
more than descriptive findings, it is necessary to 
address the simple but basic question, “What does 
the content of mass communications mainly reflect 
—the characteristics of the mass audience or the 
characteristics and intentions of the communica- 
tor?” Undoubtedly, the contents of most mass 
communications reflect both of these elements. In 
any particular study it is difficult to separate out 
the relevant importance of each element, and 
therefore the significance and validity of content 
analysis remains decidedly problematic. But it is 
precisely by making assumptions about the condi- 
tions under which the contents of the mass media 
serve either as indicators of the intentions of the 
communicators or as reflections of the interests 
and values of the audiences that content analysis 


is transformed from a descriptive tool to a device 
for analyzing the process of mass communications. 
` The use of content analysis in order to infer the 
intentions of the communicator is best applied to 
highly purposive communications such as political 
content. Nathan Leites (1951) speaks of the “op- 
erational code,” that is, the basic assumptions and 
directives underlying the communications of politi- 
cal elites. The more knowledge an analyst has 
about the organizational setting of political com- 
munication the more feasible is such content anal- 
ysis. Thus, Gabriel Almond, in his study The Ap- 
peals of Communism (1954), is concerned with 
the contrast between the pattern of internal com- 
munications among Communist party leaders and 
their core members, on the one hand, and the 
content of their communications to larger external 
audiences, on the other, Leo Lowenthal and Nor- 
bert Guterman, in Prophets of Deceit (1949), ana- 
lyze the content of fascist output in order to probe 
its underlying intentions and to assess the limited 
extent to which these agitators propose an explicit 
program. This approach also makes the assump- 
tion that agitational propagandists have the ability 
to reflect the repressed aspirations of their particu- 
lar publics. While their’format and appeals may be 
extreme and their audiences small, their content 
reflects a measure of potential political desires of 
the larger society and therefore warrants the 
closest scrutiny. 

By contrast, if content analysis is to be used as 
a measure of the underlying values and sentiments 
of the audience, it is appropriate to use mass 
media content with wide appeals and to trace 
changes through time or to make comparisons be- 
tween different countries. We are dealing here 
with the notion of the “focus of attention.” One 
of the earliest efforts in this regard was that of 
Hornell Hart (1933), who studied changes in so- 
cial attitudes and interests through analyses of 
selected popular magazines published from 1900 
to 1930; he found a general decline for this period 
in the status of religion and religious sanctions, 
which he took as a measure reflecting changing 
attitudes toward religion. Leo Lowenthal (1961) 
has reported on the change in the heroes of popu- 
lar magazine fiction from idols of production to 
idols of consumption. Martha Wolfenstein and 
Nathan Leites (1950) have used the movies as 
an indicator for the comparative analysis of sexual 
attitudes in Great Britain, France, and the United 
States, while McGranahan and Wayne (1948) 
studied dramatic plays to compare the Zeitgeist of 
the United States with that of Germany. Content 
analysis of the mass media has become an element 


in the cultural analysis of traditional societies that 
are in the process of modernization. Because of 
the problems involved in making inferences about 
causality in content analysis, the burden of the 
task of analyzing the impact of mass media has 
shifted to the direct study of audience structure 
and audience response. [See CONTENT ANALYSIS.] 


Audience research 


One aspect of the direct investigation of the 
impact of the mass media is the description of the 
size and structure of the audience for each particu- 
lar medium of communication. There is a large 
amount of material, especially from the indus- 
trialized nations of Europe and North America, but 
also increasingly from all the nations of the world, 
that describes the “to whom” of mass communica- 
tions in terms of gross characteristics and media 
preferences. The Department of Mass Communi- 
cations of UNESCO has an active program for 
collecting basic statistical data on the development 
of the mass media and the size of audiences 
throughout the world. These world-wide reports 
rely mainly on statistical data about the number 
and circulation of newspapers, radio receiving sets, 
movie theaters, published books, and other meas- 
ures of audiences that derive from production 
figures. Important aspects of audience size can 
also be measured from such “built-in” or automatic 
measures of consumption as ticket sales, news- 
stand circulation, or subscription sales, However, 
a basic impetus to audience research has come 
from those media, such as radio and television, 
which lack such simple measures; the managers 
of these media frequently want to know more 
about the social characteristics of their audiences 
and to find out what particular segments of their 
output receive the greatest attention. 

Interest in audience research is particularly 
strong in the United States, where the mass media 
are heavily supported by commercial advertising 
revenues. Paul F. Lazarsfeld has been one of the 
leading experts in the development of such re- 
search, Although various mechanical and elec- 
tronic devices as well as self-reporting question- 
naires and diaries have been used, the standard 
approach is to make use of sample surveys to 
measure audience size and composition. Frequently 
these surveys are conducted by telephone and in- 
volve elaborate and rapid field work in order to 
measure the relative position of leading mass 
media performers. For the United States, Handel 
(1950) has summarized movie audience research, 
and Bogart (1956) has done the same for tele- 
vision. 
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In countries where radio and television are op- 
erated by the government, audience research is 
carried out for the purpose of understanding audi- 
ence reactions and as an aid to program planning. 
Thus, for example, the British Broadcasting Cor- 
poration and the various West German regional 
networks have extensive audience research pro- 
grams. Even in such one-party socialist states as 
Poland, radio audience research is undertaken both 
for scientific purposes and in order to take into 
account consumer preferences and tastes. 

A summary of the more quantitative findings 
of audience surveys will show some of the changes 
that have taken place as the content and roles of 
the media have changed. Although American audi- 
ences are by no means typical, they do reflect the 
pattern existing in those countries where there is 
an increasingly high level of mass media penetra- 
tion together with some choice between alternative 
offerings. In the United States television has 
emerged as the dominant medium, and national 
surveys in the mid-1960s reveal that the typical 
American family watches as much as four hours 
of television daily. Despite this extensive exposure 
to television, 85 per cent of the families read one 
or more newspapers regularly, and 60 per cent read 
one or more magazines regularly. As newspapers 
have transformed themselves from essentially polit- 
ical journals to purveyors of news, human interest 
features, and amusement materials, their reader- 
ship ranks have been broadened to become more 
representative of the whole population. Television 
has captured large segments of the former radio 
audience; however, radio broadcasting of music 
and news has re-emerged in response to more spe- 
cialized demands. The movies have suffered the 
most from the impact of television; but film attend- 
ance is still extensive among young people, and 
“super-features” attract mass adult audiences. The 
number of comic books sold each month exceeds 
the number of children in the country. Clearly, the 
term “mass audience” is no misnomer. 

Researchers have also devoted energy to the 
question of whether mass media exposure is com- 
petitive or cumulative (Columbia University .. - 
1946). Is exposure to one medium associated with 
exposure to others, or does one medium crowd 
out the next? No clear-cut answers are available, 
for at least two patterns can be discerned. In one 
sense, exposure is cumulative: with an increase in 
level of education, persons who are exposed to one 
medium (including television) are likely to expose 
themselves to newspapers, magazines, and books. 
In other words, as a person's field of interest is 
broadened by more education, his interests in the 
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mass media also grow. However, there is a point 
at which competition sets in, for even with the 
growth of leisure there are limits to available time. 
Especially among better educated persons, exten- 
sive involvement with television reduces the time 
spent on and interest in other mass media. For 
these groups, there appears to be some competition 
between the printed media ( newspapers, maga- 
zines, and books) and the electronic media ( tele- 
vision and radio). 

By contrast, the development of the mass media 
and audience exposure in the so-called new nations 
reflects limited levels of literacy and scarcity of 
technological facilities. However, most of these na- 
tions make large investments to increase literacy 
and to expand the mass media as part of the proc- 
esses of economic development and the formation 
of central political control. In many of them lit- 
eracy has risen at a faster rate than was the case 
for Europe during the nineteenth century. While 
these new mass media systems are not developed 
in depth, after a country achieves independence 
there is characteristically a striking increase in the 
capacity of these media to disseminate messages 
from the central political authority. In these na- 
tions the media and audience structures that 
develop are different from those of Western indus- 
trialized nations. Newspapers and magazines sup- 
ply crucial but limited channels of communication 
between elite groups, especially in the urban cen- 
ters, while radio emerges as the national mass me- 
dium because of its low cost and because it does 
not depend on the prior development of literacy. 

Audience research has developed in the direction 
of seeking to describe more precisely the social and 
psychological characteristics and the specific con- 
tent preferences of the persons who make up the 
audiences of particular media. Routine surveys of 
audience structure, especially those sponsored by 
commercial groups, proceed in terms of basic cate- 
gories such as age, sex, education, occupation, and 
income, But these categories are not refined enough 
to capture the complexities of contemporary social 
structure. Nor do the categories focus sharply 
enough on the web of group and associational life 
through which a person is integrated into the larger 
society. As a result, the explanatory power of con- 
ventional research into audience structure is not 
very great. 

Emerging lines of research are reflected in the 
work of university-based scholars, such as Harold 
Wilensky (1964), who have sought to classify au- 
dience membership in terms of more refined cate- 
gories, such as career patterns and work settings. 
This approach focuses on the content as well as on 


the amount of education. Emphasis is placed on 
the bureaucratic setting in which the person fol- 
lows his profession or occupation and on the role 
of voluntary associations in conditioning media 
exposure. 

Audience research has also come progressively 
to concentrate on social-psychological and person- 
ality characteristics. Both from a theoretical point 
of view and in the practical application of research 
findings, it is not enough to know the gross char- 
acteristics of those persons who are exposed to 
a particular channel of communication. It is 
equally important to isclate those social-psycho- 
logical predispositions that can be appealed to in 
order to mobilize new audiences or to change the 
exposure patterns of existing ones. 

Walter Lippmann’s classic book, Public Opinion 
(1922), in which he developed the term “stereo- 
type,” still remains a basic point of departure for 
the analysis of audiences. Human personality has 
a powerful capacity to simplify social reality and 
to select congenial elements from the media con- 
tent. The oversimplification of social reality, espe- 
cially when it is rigidly rooted in personal and 
social needs, is the essence of the process of stereo- 
typing. Following up on these basic insights and 
broadening the perspective to include the observa- 
tions of psychoanalysis resulted in a series of bril- 
liant studies by Herzog (1943), Warner and Lunt 
(1941), and Henry (1947). 

In recent years, advertisers have shown consid- 
erable interest in the study of the motive structures 
underlying exposure to various types of sale mes- 
sages. The term “motivation research” has been 
loosely used to refer to studies that seek to under- 
stand the social psychology of audience exposure 
to advertisements. Popular accounts of these devel- 
opments (see, for instance, Packard 1957) have 
attracted widespread public attention, but they 
have not answered the question of the extent to 
which such audience research has increased the 
power of advertising beyond that produced by 
forceful and imaginative practitioners operating 
without benefit of systematic research. 

At some point research on audience character- 
istics blends with the direct study of audience 
reactions and media impact. Audience research in- 
cludes studies of media preferences, interest in 
particular types of content and messages and in 
the mass imagery of the media, and the level of 
confidence and trust that the audience places in 
different media sources. These dimensions are not 
only audience characteristics but also indicators of 
the ongoing impact of the mass media. Thus in 
the United States, for example, surveys indicate 


that popular trust in television as a news source 
has gradually risen to a level comparable to that 
for newspapers, The increased confidence in tele- 
vision as a news source is concentrated among 
young people who have grown up in a television 
culture and thus accept this channel with fewer 
reservations than their parents. 

In the United States audience research spon- 
sored by commercial television companies also 
documents extensive audience criticism of televi- 
sion content, including complaints about the great 
amount of violence and the heavy emphasis on 
advertising. While such criticism reflects a popular 
culture that encourages critical remarks about tele- 
vision, and while this criticism does not lead to 
marked resistance to television exposure or articu- 
lated demands for changes in programming, these 
attitudes are noteworthy. These findings underline 
the conclusion of careful observers that television 
no longer has a single mass audience but, like the 
film audience, is becoming more specialized. Ac- 
cording to Steiner (1963) there are at least two 
major segments in the television audience, On the 
one hand, there is the mass audience that is satis- 
fied with a common fare and that exercises little 
or no selectivity in its viewing habits. On the other 
hand, there is another segment that consumes 
massive dosages of television but that actively 
searches for a more subtle media fare. (The dis- 
tinction is between the viewers of the nondescript 
westerns and those who require westerns that have 
historical, revealing, or thoughtful contents, such 
as “High Noon” or “Gunsmoke”; another example 
is the distinction between the viewers of an anthol- 
ogy of light drama and those who watched the 
“Play of the Week.”) Nevertheless, these same 
findings on audience structure indicate that as the 
national levels of education rise, there are marked 
changes in the mass media, but that because of the 
economic and organizational reasons described 
above in the section on media control, a lag per- 
sists in the capacity of the mass media to supply 
the demands of more discerning audience tastes 
and standards. 


Communications effects 


The findings of studies of communications ef- 
fects are strongly influenced by the research meth- 
ods that are employed. The three basic approaches 
are experimental studies, both laboratory and 
quasi-laboratory experiments; surveys based on in- 
terviews or questionnaires; and intensive case 
studies employing participant observation, infor- 
mal and group interviews, personal documents, 
and other sources of documentation. While the 
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technical aspects of effect studies have been great- 
ly refined and improved, these methodologies were 
already employed in the Payne Foundation studies 
of films which were completed during the early 
1930s. 

The first two methods are the most extensively 
utilized because of their presumed quantitative 
precision. In the experimental technique, persons 
are given controlled exposure to a communication, 
and the effects are evaluated on the basis of the 
measurement of attitudes before and after such 
exposure. This method also requires comparison 
with a control group that is not exposed to the 
message. In the sample survey method, data are 
collected by means of questionnaires or interviews 
dealing with media exposure and attitudes, opin- 
ions, and behavior. The goal is to derive conclu- 
sions from correlations obtained between the degree 
or conditions of exposure to various communica- 
tions and the measured attitudes and behavior. A 
sample survey can be a type of field experiment if 
the interviews are repeated during the period in 
which the population studied is exposed to the 
mass media. The third approach, that of the case 
study in depth, has emerged less as a specific 
method of research than as a strategy of evaluation 
and synthesis of material from a variety of sources. 

From the thousands of laboratory experiments, 
it has been demonstrated that under contrived 
conditions even brief messages can produce meas- 
urable changes in attitudes among selected groups. 
Because experiments deal with persons removed 
temporarily from their social group attachments, 
it is understandable that experimental findings are 
formulated as psychological and social-psychologi- 
cal propositions without regard to the cultural and 
social setting. Under the stimulation of Carl Hov- 
land and his associates (e.g., Hovland et al. 1953), 
this form of experimentation has become highly 
sophisticated, and the theoretical assumptions have 
been made explicit. This approach has generated 
some illustrative findings with regard to the condi- 
tions under which communications tend to be ef- 
fective, for example: communications are more 
effective when they seek to alter peripheral rather 
than central attitude patterns, when they are cu- 
mulative, and when they seek to reinforce rather 
than to convert existing attitudes. However, it is 
recognized that there are limiting conditions that 
are not included in the experimental design. 

Experiments have demonstrated a “sleeper” ef- 
fect, namely, that the consequences of exposure 
to even a brief message can be delayed and become 
manifest some time after the exposure. Consider- 
able research has been done on such topics as the 
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credibility and prestige of the source, the order of 
presentation of items, and the manner of presenta- 
tion of controversial issues. For example, experi- 
ments have shown that among better educated 
persons the presentation of both sides of an issue 
produces more attitude change than the presenta- 
tion of only one side, while less educated persons 
are more influenced by communications that em- 
ploy one-sided arguments (Hovland et al. 1949). 

Investigation of the question of whether atti- 
tudes are changed more by arguments that diverge 
a little or by ones that diverge a great deal from a 
person’s opinion has led to contradictory findings. 
The importance of the issue or its salience to the 
person may be an intervening variable; that is, for 
important issues arguments that diverge slightly 
may produce more change in attitudes than do 
marked differences (Hovland et al. 1957). Psy- 
chologists have been particularly concerned with 
the role of anxiety in producing or inhibiting the 
receipt of information and modifying attitudes, 
since a great deal of mass communication makes 
use of “scare” techniques. Laboratory research in- 
dicates that there is a point at which the anxiety 
produced by such appeals becomes so great that 
it actually inhibits attitude change (Janis & Fesh- 
bach 1953). 

The findings from experiments seem to show 
evidence of attitude changes that are greater than 
those reported by survey research studies. This 
discrepancy is due to the differences in the re- 
search technology and the research objectives, for 
neither approach supplies definitive and compre- 
hensive answers. Experiments deal with specific 
delimited messages, while surveys must of neces- 
sity focus on broader flows of communications. 
The experimental situation is seen as contrived, 
and therefore the results have been labeled by 
some specialists as unreal. In particular, because 
the experimental situation deals with specific mes- 
sages, it rules out those contradictory messages 
that a person encounters in the real world of com- 
munications. In other words, because the subjects 
are “forced” to be exposed to a message, the pro- 
cess of self-selection of content is weakened. It is 
precisely this mechanism of self-selection of mate- 
rial congruent with one’s existing opinion that re- 
duces the impact of mass communication. Sample 
survey findings are better able to cope with this 
process of self-selection. Moreover, experiments 
generally tend to deal with immediate reactions to 
mass communications, while surveys cover a longer 
period of time and therefore include the extinc- 
tion effect. 

Experimental studies tend to deal with relatively 
unimportant issues as compared with surveys. 


Again, this would lead to uncovering greater im- 
pact in the case of experiments, but there have 
been efforts to introduce important substantive 
issues into experimental work. Perhaps the most im- 
portant difference between the two methods is that 
in experimental work the salience of social group 
affiliation is eliminated and the audience is re- 
duced to a collection of individuals, a form of mass 
so to speak, and therefore becomes more liable to 
persuasion. Some experiments have built-in group 
process variables and have found that communi- 
cation effects are therefore reduced, An alternative 
formulation of the issue is the contention that most 
experiments deal with university undergraduates 
in an educational setting, that is, a population and 
a setting predisposed to change, and that this is 
not an adequate representation of social reality. 

Nevertheless, the experimental approach has 
important advantages because of its precision and 
ability to focus on specific variables. In contrast, 
because the survey approach cannot focus on spe- 
cific messages, it may overlook particular forms of 
mass media impact. For example, survey work on 
the impact of television has followed the same 
pattern as the earlier work on the impact of 
movies. In Great Britain, Hilda Himmelweit and 
her associates (Himmelweit et al. 1958) found 
that television did not have a profound effect on 
children’s behavior and academic achievement; its 
effect was more discernible on a small minority 
of overexposed children for whom there was reason 
to believe that television served as a form of de- 
fense, as a result of social and personal difficulties. 
However, experimental studies of exposure to tele- 
vision programs containing aggressive materials 
give a clearer indication that exposure to such con- 
tent produces disruptive and socially undesirable 
effects. It is as if the experimental approach op- 
erates as a magnifying glass in revealing subtle 
reactions that become obscured by more diffuse 
research tools. 

If the experimental method is subject to criti- 
cism because of its contrived and oversimplified 
character, the survey approach has the pervasive 
difficulty of adequately analyzing the host of inter- 
vening variables between the communications “in- 
put” and the resulting attitudes and social behav- 
ior. However, the strength of the survey method 
rests in its concern with group process, that is, 
both with primary group structures and the role of 
the opinion leader. Mass communication research 
has been influenced by observational and case 
studies, particularly in industrial sociology, which 
have underlined the crucial importance of pri- 
mary groups (face-to-face intimate associations ) in 
fashioning attitudes and morale. The impact of 


research findings on military morale during World 
War 11 served to refine and extend this type of 
sociological analysis. In particular, Edward Shils 
and Morris Janowitz (1948) observed that the 
German Wehrmacht was relatively immune to 
Allied propaganda appeals because of the effective- 
ness of primary group cohesion, which protected 
the rank and file from the appeals of outside 
sources. The hard-core noncommissioned officers 
constituted a cadre of “opinion leaders” who sup- 
ported the control structure; direct attachment to 
and trust in Hitler served as the basis of secondary 
attachments. Only when the primary group struc- 
ture was disrupted were German military person- 
nel accessible to the symbols of Allied propaganda. 

There has been a variety of survey researches 
that have elaborated the importance of primary 
groups and of rank-and-file opinion leaders in 
conditioning the impact of the mass media. Preoc- 
cupation with these concepts has led to an inter- 
personal or “two-step” theory of mass communi- 
cations. As set forth by Elihu Katz and Paul F. 
Lazarsfeld (1955), this theory asserts that the 
mass media have influence through informal lead- 
ers who have high exposure to the mass media and 
who in turn make their interpersonal influence felt 
on their close associates. In this view, the innova- 
tion of new practices takes place because the mass 
media supply ideas to these opinion leaders, who 
then rely on face-to-face contact as the mechanism 
of diffusion [see DIFFUSION, article on INTERPER- 
SONAL INFLUENCE]. 

Undoubtedly this is one process of audience re- 
sponse, but there are other processes through 
which the mass media have an impact. First, there 
is considerable research evidence that some persons 
are directly accessible to the mass media because 
they have weak rather than strong group attach- 
ments (see Daugherty & Janowitz 1958). These are 
persons characterized by a high degree of “anomie” 
and a limited degree of social integration. Often 
these individuals verbalize their distrust of societal 
institutions and the mass media; nevertheless, they 
come to depend heavily on selected channels for 
their opinions and social support. Second, on cer- 
tain issues, certain persons will find themselves 
under conflicting primary group pressures, espe- 
cially during political campaigns (Janowitz & Mar- 
vick 1956). These persons are likewise more acces- 
sible to the mass media than those persons who 
live in a politically homogeneous culture. Third, 
there are both temporary and more chronic social 
and psychological conditions, especially during 
periods of stress and crisis, that weaken the effec- 
tiveness of informal face-to-face pressures and 
controls; under such circumstances the mass media 
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can more directly impinge on the individual's 
attitudes, values, and behavior [see BRAINWASHING; 
PERSUASION]. 

The two-step theory of communication is neither 
precise enough in its conceptualization of opinion 
leaders nor detailed enough in the accumulation 
of empirical materials. The term “opinion leader” 
might best be reserved for the limited number of 
top-level figures in journalism, politics, economic, 
and professional life who are strategic in the intro- 
duction of new ideas. For the United States, the 
number would be in the hundreds at most, and 
such persons are not to be located by sample sur- 
veys. In addition, one can speak of mid-level and 
local community level opinion leaders of increas- 
ing numbers but of decreasing influence. Even 
local community level opinion leaders are limited 
in number and not identifiable by conventional 
sample surveys. Forms of sociometric and reputa- 
tional designation have been used to describe these 
leaders, especially in studies of community power 
structures. These researches indicate that, while 
there are general opinion leaders, leadership can 
vary on the basis of the specific issues involved. 
Moreover, formal office and official position also 
have elements of opinion leadership, since power 
to influence opinion is not confined to informal 
and interpersonal networks [see CoMMUNITY, arti- 
cle on THE STUDY OF COMMUNITY POWER]. 

The persons identified in sample surveys as 
opinion leaders might best be described as local 
“activists.” They do have higher levels of exposure 
to the mass media and greater involvement in local 
community affairs than other people. These acti- 
vists also tend to be better educated and of higher 
socioeconomic status than the population in gen- 
eral. Therefore, one important research task is to 
understand the dynamics of opinion formation in 
low income groups and in marginal groups, where 
the penetration of local activists is incomplete and 
fragmentary. There is reason to believe that for 
these groups such functionaries as trade union 
officials, teachers, and political party officials are 
more crucial than informal “opinion leaders.” More- 
over, because lower income people characteristi- 
cally have weak networks of social contact outside 
their familial settings, their incomplete views of the 
larger social order tend to be fashioned by the 
mass media. 


Mass media and social control 

The emphasis on discovering specific reactions 
and changes generated by media content tends to 
produce a basic distortion or limitation in their 
findings. The bulk of the content of the mass media 
is not designed to challenge or modify the social 
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and political structure of a nation, either in a one- 
party state or in a democratic society. This is not 
to underemphasize the ability of minority groups 
to have their points of view presented in the mass 
media; rather, this is to emphasize that a funda- 
mental impact of the mass media is to contribute 
to the patterns of social control. Therefore, the 
impact of the mass media must be judged not only 
in terms of the changes in attitudes and behavior 
produced but also in terms of the reinforcing effect 
on social norms and social behavior. 

To study mass media as a system of social con- 
trol, it is essential to encompass all of the elements 
of Lasswell’s formula. This requires the use of the 
case study, despite all of its scientific limitations. 
W. I. Thomas’ and Znaniecki’s classic study, The 
Polish Peasant in Europe and America, includes 
one of the first major theoretical and empirical 
analyses of the functional significance of the press 
as an instrument of social control (1920). Thomas 
demonstrated that for the submerged Polish com- 
munity under alien rule in Europe or in minority 
Status in Chicago the native language press sup- 
plied an important element of group integration 
and a linkage to the wider community. Robert E. 
Park continued this sociological perspective in his 
study The Immigrant Press and Its Control (1922). 
Lasswell and Kaplan (1950), in their analysis of 
political power, also assigned to mass communica- 
tions a crucial role in maintaining and fashioning 
the symbols of legitimate government. 

Theorists with such widely diverse formulations 
of contemporary social structure as Louis Wirth 
and Talcott Parsons have emphasized the impor- 
tance of mass communications systems as instru- 
ments of social control (see Wirth 1948; Parsons 
1942). Within this general frame of reference, re- 
search studies of the structure of industrial socie- 
ties have taken various forms. For example, Riley 
and Riley (1959) have sought to probe peer group 
structure and the consumption of the mass media 
as elements of normative structure among young- 
sters. In his study The Community Press in an 
Urban Setting, Morris Janowitz (1952) sought to 
fuse within a single work analyses of several as- 
pects of the press: its historical development, 
ownership and control, the social role of the pub- 
lisher, the image of the community as reflected by 
the contents of the press, and the functions of the 
local press for its readers. This research viewed 
the community press as one of the social mecha- 
nisms through which the individual is integrated 
into the urban social structure. Harold Wilensky 
(1964) has traced out the role of the mass media 
in leisure in order to distinguish those occupational 


groups for whom the mass media operate to 
strengthen their associational integration and those 
for whom the mass media serve as a substitute for 
group membership. This perspective has also pene- 
trated studies of political communication. The em- 
phasis of such research is less on the study of 
audience reactions to specific messages and the 
specific political decisions that are presumed to 
be generated than it is on the role of the mass 
media in defining the political issues of the day and 
in fashioning the relevance of politics for the indi- 
vidual (see, for instance, Lang & Lang 1959). 

The social control perspective also supplies a 
basis for the development of the comparative anal- 
ysis of mass media systems, a research theme that 
is likely to grow in importance in the years ahead. 
The first of such analyses were case studies of the 
Nazi social system by Harold Lasswell, Ernst Kris 
and Nathan Leites, and others, who analyzed the 
declining role of mass communications as the 
Nazis shifted to a heavier reliance on organiza- 
tional pressure and terror, Alexander Inkeles and 
Barrington Moore have studied the Soviet Union 
during different periods of political control and 
have emphasized the manner in which the elites 
have utilized the apparatus of mass agitation, not 
only as propaganda devices but also as imperfect 
and fragile channels for informing themselves 
about popular attitudes and loyalties [see PROPA- 
GANDA]. 

In the comparative analysis of mass communi- 
cations, the use of mass communications in the 
rise to power and the consolidation of power by the 
Chinese Communist party present fundamental 
issues for communications research. Historians 
will debate the question whether the Chinese com- 
munists’ seizure of power and their consolidation 
of power took place with less terror than in the 
case of the Russian Revolution. If there was less 
resistance to their consolidation of power, and 
consequently less need for terror, this may have 
been the result of the greater decay of the tradi- 
tional Chinese social structure. But in the process 
of political revolution, mass agitation and effective 
propaganda of the deed appear to be important 
techniques. While the rise of “thought reform” 
(brainwashing in popular jargon) is not confined 
to China, it has been practiced in China on such 
an immense scale that it has contributed to com- 
mon understandings and therefore has effectively 
articulated with organizational controls. As a result, 
during the first fifteen years after their seizure of 
power, the Chinese communists were able to rule 
without the mass purges that characterized the 
Russian Communist party. 


Social scientists have made efforts to integrate 
the study of the mass media as instruments of con- 
trol with the study of political and economic devel- 
opments in the so-called new nations of Africa 
and Asia. For example, Daniel Lerner (1958) has 
emphasized the general pattern of increase in 
standard of living, urbanization, literacy, and ex- 
posure to the mass media during the process of 
transition from traditional to modern society. Social 
research must guard against the danger of impos- 
ing categories grounded in Western experience on 
these mass communication processes. While there 
is a heavy emphasis on the expanding of the mass 
media in developing societies, the penetration of 
the central authority into the daily consciousness 
of the mass of the population has to overcome pro- 
found resistances. In this process of modernization, 
the resulting forms are not inevitably Western or 
European but are likely to include important neo- 
traditional elements, especially in the area of mass 
media and mass culture. 


The responsibility of social scientists 


Finally, research into mass communications has 
led to extensive debate about the moral implica- 
tions of social research and the professional respon- 
sibility of social scientists. A minority of social 
scientists, as well as outside observers, have ex- 
pressed concern that the findings might create the 
basis for extensive mass manipulation that would 
weaken and destroy democratic freedoms and val- 
ues. By contrast, the typical opinion among social 
scientists working in this field is that knowledge 
accumulates slowly and that this fear is greatly 
exaggerated. They rest their position on the histori- 
cal observation that even without the benefit of 
scientific research political agitators intuitively 
have succeeded in the worst imaginable forms of 
mass manipulation. 

A more careful and reasoned defense is to be 
found among those social scientists who are con- 
cerned with a “sociology of knowledge” position. 
These persons hold that the pursuit of scientific 
knowledge is a valid and legitimate human goal, 
provided the research procedures are carried out 
with due regard for human dignity. They accept 
the notion that agitators have succeeded without 
scientific knowledge, but they do not claim that 
this observation relieves the social scientist from 
both personal and professional responsibility about 
the use of his findings. Obviously, in a free society 
there is a limit to his control over the use of his 
findings, but he must take reasonable steps to pro- 
tect both himself and society. These steps involve 
seeking to insure that his findings are accurately 
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reported in both the professional and popular media 
and that his research is not made the permanent 
private property of a particular group or sponsor. 
He must also seek through professional associa- 
tions to establish and enforce adequate standards 
of performance. 

The “sociology of knowledge” position rests not 
merely on personal and professional responsibility 
but also on a theory of knowledge as well. In this 
view, the accumulation of knowledge is designed 
to assist understanding of the power, and therefore 
of the limits, of mass communications. It assumes 
that a deeper understanding of social and political 
processes can serve to reinforce a pluralistic society. 
In fact, research into mass communication has 
served to reduce the image of the omnipotence of 
communications. Mass communications operate 
within definable parameters, and when mass 
manipulation becomes excessive, although the re- 
sults can be disruptive and disastrous, there are 
other social processes at work as well. In short, 
research into mass communications has served to 
emphasize the underlying issue that “institutional 
building” is required for effective social change. 
There is no logical or demonstrated reason that it 
must of necessity create a basis for mass manipu- 
lation. 

Morris JANOWITZ 


[See also COMMUNICATION, POLITICAL; FILM; LITERA- 
TURE; PROPAGANDA.] 
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I 
CONTROL AND PUBLIC POLICY 


Every society controls its mass media in accord- 
ance with its policies and needs. The controls may 
be legal and political (that is, laws and censor- 
ship), economic (ownership and support), or social 
(criticism and the giving or withholding of patron- 
age). They may be positive, designed to obtain a 
desired kind of performance from the media, or 
negative, intended to prohibit a given kind of per- 
formance that might endanger the state, the rights 
of individuals, or the norms of society. The patterns 
and degree of control depend upon the political and 
economic orientation of the society. 


Government and the mass media 

All constitutional systems affirm the principle of 
freedom of expression. Thus, for example, the 
United States bases its policy in regard to the mass 
media on the first amendment to the constitution, 
“Congress shall make no law . . . abridging the free- 
dom of speech, or of the press,” and the fourteenth 
amendment, “No state shall make or enforce any 
law which shall abridge the privileges or immuni- 
ties of citizens of the United States.” Article 12 of 
the Spanish charter of July 13, 1945, proclaims 
that “all Spaniards may freely express their ideas 
as long as they do not advocate the overthrow of 
the fundamental principles of government.” Article 
125 of the constitution of the Soviet Union states 
that “the citizens of the U.S.S.R. are guaranteed by 
law (a) freedom of speech; (b) freedom of the 
press.” According to article 19 of the Universal Dec- 
laration of Human Rights, adopted by the UN Gen- 
eral Assembly in 1948: “Everyone has the right to 
freedom of opinion and expression; this right in- 
cludes freedom to hold opinions without interfer- 
ence and to seek, receive and impart information 
through any media and regardless of frontiers.” 
Obviously such unanimity of principle would not 
give rise to such differences in practice unless free- 
dom of expression was differently defined and 
understood in the several cultures. 


56 COMMUNICATION, MASS: Control and Public Policy 


The historical background. The first mass 
media came into being in western Europe under 
authoritarian governments that were already con- 
cerned over the emergence of an ambitious middle 
class and justifiably worried over what printed mat- 
ter might do to rouse the people against the power 
centers. Therefore, steps were soon taken to con- 
trol the new media. Patents, or licenses to publish, 
were given only to persons regarded as politically 
“safe,” and these people were permitted to exercise 
the privilege of publishing only as long as they 
refrained from rocking the ship of state. In the 
seventeenth century many books and periodicals in 
the areas of politics and religion were censored be- 
fore publication. When licensing and censorship 
became cumbersome, in the late seventeenth cen- 
tury, the governments found it more convenient to 
rely on the threat of punishment after publication 
—for example, prosecutions for treason or seditious 
libel, the former charge, as Siebert says, being “re- 
served for activities which shook the foundations 
of the state; sedition . . . for the irritating flea- 
bites of the dissident and the nonconformist” 
(Siebert et al. [1956] 1963, p. 23). 

These controls, of course, were aimed at pre- 
serving the government in power. But in a deeper 
sense they grew out of a long tradition of authori- 
tarian philosophy, from Plato, who thought the 
state was safe only in the hands of wise men (Re- 
public, especially books 6 and 8; Apology; Crito), 
to Hobbes, who argued that the power to maintain 
order is sovereign and not subject to individual 
objection (Leviathan, 1651), to Hegel, for whom 
“this substantial unity is an absolute unmoved end 
in itself, in which freedom comes into its supreme 
right, On the other hand this final end has supreme 
right against the individual, whose supreme duty 
is to be a member of the State” (Philosophy of 
Right, 1821). 

Freedom of expression—the classic doctrine. As 
the rulers of Europe had feared would happen, 
printed materials were in the forefront of a series 
of democratic revolutions in the seventeenth and 
eighteenth centuries. These revolutions brought in 
governments that based their information policies 
on the philosophy of the European Enlightenment, 
the concept of natural rights of man, and the con- 
fidence that rational man, given a fair chance, can 
distinguish truth from error. Thus France wrote 
into article 11 of its Declaration of the Rights of 
Man, in 1789 (and confirmed it in the preamble 
to the constitution of October 27, 1946), that “the 
unrestrained communication of opinion being one 
of the most precious rights of man, every citizen 
may speak, write, and publish freely.” John Milton 


(Areopagitica, 1644) spoke confidently of a self- 
righting process if free expression were permitted: 
“Let her [Truth] and Falsehood grapple; who ever 
knew Truth put to the worse, in a free and open 
encounter?” John Stuart Mill (On Liberty, 1859) 
said: “If all mankind minus one were of one 
opinion (and only one person were of the contrary 
opinion), mankind would be no more justified in 
silencing that one person, than he, if he had the 
power, would be justified in silencing mankind.” 

Out of such doctrine, in the eighteenth and 
nineteenth centuries, grew a_private-enterprise 
press, accustomed to publishing and criticizing 
freely, and with it the concept of a “free market 
place of ideas” from which the public could freely 
select what it felt to be right and true. The relation 
of mass media to government was almost precisely 
reversed. Instead of existing by sufferance as long 
as they supported and advanced the policies of gov- 
ernment, the media now came to be considered 
representatives of the public in keeping watch on 
government, and therefore to be kept as free as 
possible from control by government. 

Freedom of expression—modern variations, In 
the late nineteenth and the twentieth centuries ex- 
treme libertarianism has been modified in practice, 
for a number of reasons. For one thing, modern 
psychology identified many areas of irrationality in 
“rational man” and cast considerable doubt on his 
ability to distinguish truth from skillful propa- 
ganda. Furthermore, the growing concentration of 
ownership of the media made it more doubtful that 
all viewpoints would be adequately expressed in a 
free-enterprise market place. The emergence of films 
and the broadcast media presented new control 
problems, because of their supposed influence on 
morals and beliefs and because the allocation of 
broadcast channels could not be left to uninhibited 
competition—someone had to referee. Therefore, 
even in countries where libertarianism had been 
strongest, owners and operators of the media were 
asked to assume greater responsibility for their 
actions, and more controls were placed on the 
newer media—film, radio, and television—than on 
the press. 

However, nations in the modern world have so 
combined libertarian and authoritarian doctrine 
and practice that among the noncommunist states 
it is possible to discern almost a complete spectrum 
of systems, from the most libertarian to the most 
authoritarian—let us say, from England, where the 
doctrines of Milton and Mill still keep the govern- 
ment’s hands as much as possible off the media, to 
Spain, where the law of April 22, 1938, is still in 
force, saying that because of the essential functions 
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of the press it “could not be suffered to continue to 
exist in independence of the State . . . it is for the 
State to organize, supervise, and contro] that na- 
tional institution, the periodical press.” 

Totalitarian control of mass media. Since 1917 
another type of mass-media system has developed, 
beginning in the Soviet Union and based on the 
thinking of Hegel (Philosophy of Right, 1821), 
Marx (Capital, 1867-1894), and Lenin (1901). 
Lenin argued that private ownership is incompati- 
ble with freedom of the press. “Freedom is a sham,” 
he said, “as long as the best printing plants and 
the huge stocks of paper are in the possession of 
the capitalists” ({1919] 1935-1938, vol. 7, pp. 226- 
227). Therefore, said the Soviet delegate to the fifth 
session of the United Nations Economic and Social 
Council, “in order to enjoy effective freedom of the 
press, the highly complex technical means of mod- 
ern information must be controlled. . . . Only when 
the resources necessary for the control of the press 
are public property do the people enjoy effective 
freedom of the press.” What is the press to be free 
to do? Its purpose, said a Soviet theorist, “is not to 
trade in news but to educate the great mass of the 
workers and to organize them, under the sole guid- 
ance of the Party, to achieve clearly defined aims.” 
“We must and shall transform the press,” said 
Lenin in 1918, “into an instrument for the eco- 
nomic re-education of the masses” (last three quo- 
tations are from Terrou & Solal 1951, p. 51). 

In the Soviet Union and other communizing 
countries, the press is thus intended to serve an in- 
strumental purpose under the close control of the 
Communist party, which thinks of itself as a kind 
of general staff for the proletariat. None of the 
media are privately owned; they are either parts of 
the government or operated by an official organiza- 
tion. The result is a planned and systematic press, 
in contrast to the somewhat less systematic ones of 
countries where private competition alone has de- 
termined where a newspaper should survive. Need- 
less to say, the entire media structure is very closely 
supervised and integrated, and controversial events 
are reported and discussed for the Soviet citizen 
with a unanimity of interpretation to which a 
Western reader is unaccustomed. 


Methods of political and legal control 

The Commission on Freedom of the Press 
(19474, vol. 1, pp. 62-68) has listed most political 
and legal controls that have commonly been applied 
to the media. They include licensing in advance; 
censorship of offending material before publication; 
seizure of offending material; injunctions against 
publication of a newspaper or book or of specified 


content; requirement of surety bonds against libel 
or other offense; compulsory disclosure of owner- 
ship and authority; postpublication criminal penal- 
ties for objectionable matter; postpublication col- 
lection of damages in a civil action; postpublication 
correction of libels and other misstatements; dis- 
crimination in granting access to news sources and 
facilities; discrimination and denial in the use of 
communications facilities for distribution; taxes; 
discriminatory subsidies; and interference with 
buying, reading, and listening. 

Almost any mass-media system will, of course, 
be subject to certain basic statutory controls, among 
them a law designed to protect individuals or 
groups against defamation, a copyright law to pro- 
tect authors and publishers, a basic statute de- 
signed to preserve the common standard of decency 
and morality against infraction by the mass media, 
and another basic statute to protect the state 
against treasonable and seditious utterances. 
Where the systems chiefly diverge is in the extent 
to which they are subordinate to political authori- 
ties and the extent to which in their controls they 
go beyond the statutory restrictions just listed. 

Subordination to political authorities. There 
are different degrees of subordination, of course. 
But clearly subordinate to political authorities are 
communizing media systems like those of the 
U.S.S.R., classical authoritarian systems like those 
of Spain and Portugal, and new-nation systems like 
that of Ghana. Clearly not subordinate are systems 
like those of the United States, Great Britain, 
France, Australia, and other countries of Europe 
and the Americas where libertarian philosophy has 
been strong and of new states like India where an 
effort has been made to maintain libertarian ideas 
despite the stresses and strains with which all new 
governments have to contend. Between the two 
extremes are a very large number of countries in 
which the media are partly subordinate, partly not. 

Mass media subordinate to the political au- 
thorities are ordinarily subject to the classical 
authoritarian controls: licensing, censorship, and 
punishment for issuing material offensive to the 
government or the prevailing norms. Most systems 
have an official organization of some sort to super- 
vise the media and enforce the controls. 

In the Soviet Union, for example, there is Glavlit, 
a directorate established by an order of June 9, 
1931, “to exercise every form of politico-ideological, 
military and economic supervision of printed works, 
manuscripts, photographs, pictures, etc., intended 
for publication and distribution, and also to super- 
vise broadcasts, lectures, and exhibitions.” There 
are local branches of Glavlit, and their representa- 
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tives are placed in press and printing establish- 
ments (Terrou & Solal 1951, p. 119). Glavlit has 
the right to authorize a new periodical and to close 
it. A book or pamphlet must bear the imprimatur 
of Glavlit, and every page of proof must be ap- 
proved and signed by a representative of Glavlit 
before it may be printed. Furthermore, mass media 
in the communizing states are generally official 
enterprises, the editors of many papers are ap- 
pointed by the Central Committee of the Com- 
munist party at the appropriate level, and party 
propagandists and theoreticians are placed with 
influential publications. Thus, the mass media in 
such a system are integrated into the official struc- 
ture in a way that builds control into their every 
operation, 

In noncommunist authoritarian systems, owner- 
ship of the press may or may not be public, but as 
Terrou and Solal (1951, pp. 127-128) say of 
Spain, it makes no difference because “the man- 
agement of the enterprise is subject to a ‘power of 
oversight’ and publishing is hemmed in by a net- 
work of obligations and restrictions which make it 
quite unnecessary to lay down special provisions 
for the formation or operation of the enterprise. . . . 
The supervision to which it is subjected places it, 
in relation to the authorities, in a state of absolute 
hierarchical subordination.” The mechanism for 
this subordination, in the case of Spain, includes a 
State organization for management and super- 
vision, empowered to fix the size of circulation, in- 
tervene in appointment of members of the man- 
agerial staff, oversee the work of the staff, censor 
where necessary, and issue rules for the journalistic 
profession. 

Thus, at the authoritarian end of the political 
control spectrum it is perhaps less appropriate to 
speak of state intervention than of state integra- 
tion, inasmuch as the media are so completely 
interwoven with public policy and the administra- 
tive machinery of the state. 

Control in nonsubordinate countries. At the 
other end of the spectrum, however, in the coun- 
tries where the media are not subordinate to po- 
litical authorities, there is a conscious attempt to 
keep state intervention at a minimum. This is re- 
flected in a high proportion of private ownership in 
the media, a minimum of prior authorization, a 
maximum of freedom to criticize and to argue 
opposing viewpoints, and an attempt to keep official 
controls, if at all possible, in the courts. 

In the United States, for example, the newspaper, 
magazine, and book press, radio and television, 
and films are mostly privately owned; a variety of 


enterprises and a high degree of competition are 
encouraged. In most of the other countries where 
the media are not subordinate to political authori- 
ties, the press and films (at least the entertainment 
film industry) are likely to be privately owned and 
operated. 

Radio and television broadcasting. Broadcast- 
ing is organized in a great variety of patterns. In 
many of the Latin American countries, as in the 
United States, it is private enterprise. In such coun- 
tries as Great Britain, Japan, Canada, and Aus- 
tralia private and public broadcasting enterprises 
operate side by side. In these countries special 
efforts are made to keep the public broadcasting 
system free of governmental political influence. The 
British Broadcasting Corporation (BBC), for ex- 
ample, is a public corporation—neither a private 
nor a state enterprise—responsible to a board of 
distinguished citizens (Briggs 1961). In a majority 
of countries, however, broadcasting is a state mo- 
nopoly. In Belgium it is a public foundation; in 
France, a public service with its own budget; in 
Germany, a public establishment operating typical- 
ly under a large broadcasting council, representing 
many cultural, religious, and professional organiza- 
tions, that appoints the board of governors; in 
Italy, a company operating under concession from 
the state; in India and numerous other countries, 
an administrative service, 

Broadcasting, however, by its very nature is hard 
to keep completely free of government. Even in the 
United States, where every effort is made to keep 
the government’s hands off radio and television 
content, it is still necessary for the government, 
through the Federal Communications Commission 
(FCC), to choose among applicants for channels— 
which it does chiefly on the basis of their experi- 
ence and financial soundness and the programs 
they promise. Many other countries, because broad- 
casting is so important for reaching all the people, 
have felt it necessary to keep it a national service, 
under government management. 

Motion pictures and the press. Motion pictures 
are censored on moral grounds even in countries 
where the media are not subordinate to political 
authorities. In the United States this censorship has 
usually occurred at the state or municipal level; in 
some other countries, at the national level (Com- 
mission 1947c; Lyon-Caen & Lavigne 1957). The 
continuing economic crisis of the film industry in 
some countries has made it the beneficiary of 
government loans and subsidies to an extent un- 
matched by any other privately owned mass medi- 
um, but the prime motive of this government inter- 
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vention (whatever its eventual results) is to make 
the industry profitable, as well as a source of na- 
tional prestige (Political and Economic Planning 
1952). 

The press, in nonsubordinate countries, is gen- 
erally quite free from official control beyond the 
statutory ones mentioned earlier. It is not usually 
required to obtain a license to publish, beyond, per- 
haps, registering the names of the journal and the 
responsible persons. In peacetime, at least, it is 
ordinarily free of government censorship (for dis- 
cussion of these issues, see especially Johnson 
1958; Williams 1957). 

Freedom of political comment. The media of 
the nonsubordinate countries are typically quite 
free to criticize or oppose government policies. In 
time of war or national crisis, of course, any gov- 
ernment is edgy about criticism or uninhibited cir- 
culation of news. In World War 1, for example, 
U.S. news media accepted a voluntary censorship 
of news, presided over by a well-known newsman, 
Byron Price, but overseas the military services 
censored news originating in areas under their con- 
trol. A sense of national crisis leads many young 
and developing countries to restrict criticism from 
the media even in peacetime. However, in the case 
of older and more solid states in peacetime, a truly 
extraordinary amount of criticism is permitted— 
even encouraged. France provides an excellent ex- 
ample of a press full of vigorous and conflicting 
viewpoints (although under President de Gaulle 
there has been a large number of prosecutions for 
printing comments offensive to the chief of state), 
and during at least twenty of the last thirty years 
two-thirds or more of the daily newspapers in the 
United States have been editorially opposed to the 
president in office. In the United States Justice 
Holmes’ doctrine that “clear and present danger” 
to the country must be proved (Schenck v. United 
States, 1919) stands between most critics of the 
government and punishment for harsh words. 

Even in countries where criticism is encouraged, 
however, governments are frequently accused of 
controlling the release of and access to official news 
in the government's best interest (Rourke 1961). 
This is generally called “managing the news.” 


Economic controls 

When a Western newspaperman says that the 
Soviet press is not free, he is likely to mean that it 
is not politically free. When a Soviet newspaper- 
man says that the U.S. and western European press 
is not free, he is likely to mean that it is not eco- 
nomically free. Thus, Vyshinskii ({1938] 1948, pp. 


612-613) charged that the London Times “is the 
organ of banks, connected through its directors 
with Lloyd’s Bank, with the largest railroad com- 
panies, with insurance companies.” Hearst, he said, 
was “a big American capitalist.” The Western 
press, he charged, was free for capitalists only; true 
freedom “consists essentially in the possibility of 
freely publishing the genuine, not the falsified, 
opinions of the toiling masses, rather than in the 
absence of preliminary censorship.” 

Patterns of ownership. In a private-enterprise 
system, of course, control does go with ownership, 
and Breed (1952) and Rosten (1937) have shown 
how the wishes of the owner and top management, 
even when not directly expressed as orders, are still 
reflected in the performance of the enterprise. For 
this reason, and because of the importance, in 
libertarian theory, of representing a wide range of 
viewpoints in a “free market place of ideas,” some 
observers have been concerned over trends toward 
bigness and fewness in the United States media and 
also in the media of some other countries. Whereas 
newspaper competition was the rule fifty years ago, 
now less than 5 per cent of U.S. cities and towns 
with daily newspapers have competitive ownership 
(for a description of this trend, see Nixon 1949). 
Starting a city daily in the United States is now so 
costly that it is seldom attempted. The result is that 
ownership has been increasingly restricted and has 
come to represent the wealthier and usually more 
conservative strata of society. For example, al- 
though a majority of United States voters are regis- 
tered in the Democratic party, over two-thirds of 
the newspapers that endorse presidential candidates 
usually endorse the Republican candidate. 

There has been no decrease in the number of 
broadcasting stations, as there has been in news- 
papers, but much of their programming is provided 
by three television and four radio networks (Bricker 
1956). To prevent further concentration, the gov- 
ernment permits each network to own no more 
than five stations. Until recent years there was a 
concentration of film making in a few gigantic 
enterprises, which owned both studios and theaters. 
Government antitrust action requiring the separa- 
tion of studio and theater enterprises and the 
growth of new companies to make films for tele- 
vision have reversed the trend toward concentra- 
tion in the movie industry (Bernstein 1960). The 
wire news services to newspapers and broadcasting 
stations are provided mostly by two large news 
agencies, the Associated Press, a cooperative, and 
United Press International, a privately owned com- 
pany. In a famous case in 1945 the Associated 
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Press was required, under antitrust legislation, to 
accept clients for its services even though the new 
clients might be in competition with older ones; it 
thus became, in effect, a common carrier for news 
(Associated Press v. United States, 1945). 

Influence of advertisers. Directly or indirectly, 
advertisers have the power to affect private mass- 
media enterprises. Efforts to influence newspaper 
policy by withholding advertising have more 
often failed than succeeded in the United States 
(Schramm 1957, pp. 132-133), but the economic 
decisions of advertisers and agencies, quite apart 
from any political intent, have sometimes held the 
power of life and death over very large enterprises. 
For example, the decisions of many large firms to 
advertise elsewhere, especially on television rather 
than in print, together with rising costs, have 
caused the closing of some of the largest magazines 
in the United States and of the New York Mirror 
in 1963, when it had the second largest circulation 
among newspapers in the United States. 

In the case of commercial broadcasting, adver- 
tisers can exercise a certain amount of influence 
on content by deciding whether or not to sponsor a 
particular program. Some countries (Great Britain, 
for example) do not permit the sponsorship of a 
program and sell only the time between programs 
(see Wilson 1961 for the background to this com- 
promise). A certain amount of indirect economic 
influence on the content of broadcasting arises out 
of the importance of audience ratings (U.S. Con- 
gress, House Committee on Interstate and Foreign 
Commerce 1961). It is important to provide many 
advertisers with the largest possible markets, and 
this so-called tyranny of ratings tends to result in 
more attention to large-audience programs than to 
the needs of smaller audience groups and in a flood 
of imitations of successful programs. 

Other economic restrictions. Import restric- 
tions, basically economic but sometimes used also 
as political controls, decrease the flow of informa- 
tion and informational materials across many 
national borders. 

In countries where the mass media are not finan- 
cially sound, economic control is sometimes exerted 
by allocation of newsprint or other scarce resources 
to favored enterprises. In some countries bribes to 
reporters and editors are reported to be common. 
And one of the most common economic controls 
where the press is new or weak is subsidization by 
a political group or individual, 


Social controls 


Public opinion, especially when reflected in sub- 
scriptions, audience ratings, or film attendance, is 


a powerful control on private-enterprise media and 
to some extent a control on publicly owned media. 
Thus, for example, the BBC, although a public 
corporation, has continually modified its television 
programming as a direct result of comparisons be- 
tween its audience ratings and those of commercial 
stations. 

Because feedback to the mass media is usually 
slight, these enterprises are often influenced by 
letters or telephone calls from the public, A few 
hundred letters and calls about an offensive state- 
ment have been known to result in putting a U.S. 
network program off the air (Schramm 1957, p. 
152). Some critics are influential. The criticism of 
books is more highly developed than that of films 
or broadcasts, and there is almost no regular criti- 
cism of newspapers. The scarcity of such criticism 
led the Commission on Freedom of the Press, in the 
United States, to recommend “the establishment of 
a new and independent agency to appraise and re- 
port annually upon the performance of the press” 
(Commission [1947b] 1958, p. 100). 

Opinion has proved especially effective when it 
represents large organized groups. The Catholic 
Legion of Decency, for example, has influenced the 
movie industry with its recommendations of accept- 
able and unacceptable films and the implied threat 
of boycott by a large group of consumers (Facey 
1945). Members of opinion groups have sometimes 
been effective in influencing the selection and dis- 
play of books in libraries (Fiske 1959). Trade as- 
sociations and other affiliated groups have been 
alert to what they consider unfair treatment of 
their members, especially in films. Indeed, it has 
been remarked that if the film industry fully heeded 
the protests of special groups, it might very well 
not be able to make movies with people in them. 
More recently, however, the American film indus- 
try, doubtless as part of its attempt to recapture the 
audiences lost to television, but also encouraged by 
the commercial success of some imported “art” 
movies, has taken a more aggressive attitude 
toward the censors. How far this trend will continue 
and whether it is supported by a genuine change in 
public taste remain matters for speculation. 


Some public policy problems 


A high proportion of the mass media’s public 
policy problems grow out of either an attempt to 
balance freedom and responsibility in media per- 
formance or a conflict of rights. Countries where 
libertarian roots are deep and where most media 
are not subordinate to political authorities are likely 
to be preoccupied with keeping their media free; 
whereas authoritarian countries (although many 
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of them would deny that their media are not free) 
are more likely to be concerned with keeping the 
media responsible. 

In the United States this relationship between 
freedom and responsibility raises continuing policy 
problems. There is no great desire to return to the 
unbridled and irresponsible political writing of 
the early nineteenth century, which Mott called 
“The Dark Ages of Partisan Journalism” ({1941] 
1962, pp. 167-180). Libertarianism of that kind is 
obsolete. But many observers and critics feel that 
the media should not only be free from something; 
they should also be free for something. What the 
mass media should be free for, the kind of responsi- 
ble performance to be expected of them, has been 
studied during the last twenty years by a number 
of expert commissions, notably the Commission on 
Freedom of the Press, in the United States, and a 
series of royal commissions in the United Kingdom, 
Canada, Australia, and Sweden, among others. 
Many of the problems before these commissions 
have dealt also with conflicting rights: the public’s 
“right to know” versus individual rights, such as 
privacy and fair trial; the public's right to protect 
moral standards versus the artist’s right to be heard 
or seen and the adult public’s right to receive adult 
media; the media owners’ right to free expression 
versus the public’s right to a variety and balance of 
viewpoints and interpretations; and so forth. It is 
generally agreed that the best hope of resolving 
such conflicts in the public interest (in a country 
like the United States) is for the owners and oper- 
ators of the media to assume the responsibility for 
providing a free market place of ideas and a vigor- 
ous service sensitive to individual rights and 
minority as well as majority needs. 

Freedom and responsibility. If the media are 
not responsible, said the Commission on Freedom 
of the Press (1947b), someone will have to enjoin 
responsibility upon them. The implication is that 
the government would have to do this. And yet 
there has been the greatest reluctance on the part 
of critics, media, and government alike to permit 
the federal government to move into control of 
media content. For example, the so-called Blue 
Book of the FCC (see Siepmann [1950] 1956, pp. 
37-40) became a cause célèbre in 1946 chiefly be- 
cause it implied that the FCC, having granted a 
license partly on the basis of an applicant’s promise 
to deliver a certain kind of programming, should 
check up on whether he keeps his promise. The 
United States federal courts have more often proved 
themselves a resource for overturning local and 
state censorship than for confirming any govern- 
ment controls on content. Indeed, perhaps the most 


successful control a government agency has been 
able to place on content has been through the Fed- 
eral Trade Commission, which monitors television 
and radio advertisements for false claims. 

Self-regulation—some dilemmas. The emer- 
gence of professional organizations and of profes- 
sional codes of good practice has sometimes been 
hailed as a recognition of responsibility by the 
media. The newspaper code (“Canons of Journal- 
ism,” adopted by the American Society of News- 
paper Editors) is a highly generalized statement, 
requiring no specific adherence. The Production 
Code of the Motion Picture Association of America, 
on the other hand, contains a detailed list of speci- 
fications for avoiding offense to moral sensibilities 
and pressure groups. It is administered by an office 
supported by film makers, which is empowered to 
require changes in films, if deemed necessary, be- 
fore awarding a Seal of Good Practice. The radio 
and television codes are less specific. They are ad- 
ministered by an agency of the National Associa- 
tion of Broadcasters, which, so far as is known, has 
never dropped a member for nonadherence to the 
code. In both the film and the broadcasting indus- 
tries some enterprises do and some do not subject 
themselves to the code. A number of schools of 
journalism have, in a sense, established self- 
regulation by joining in a program to evaluate their 
curricula by professional standards. 

Whereas self-regulation is an evidence of re- 
sponsibility, still the working of the codes and regu- 
lating programs illustrates one of the problems of 
nations and media that operate in the libertarian 
tradition: it is relatively easier to apply negative 
than to apply positive controls. It is easier for a 
society to ensure that what it dislikes is not done 
by the media or to the media than to ensure that 
what it feels it needs from the media it gets. 


WILBUR SCHRAMM 


[See also CENSORSHIP; COMMUNICATION, MASS, article 
on TELEVISION AND ITS PLACE IN MASS CULTURE; 
CONSTITUTIONAL LAW.] 
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LLL 
TELEVISION AND ITS PLACE IN MASS CULTURE 


It is notoriously difficult to define mass commu- 
nications precisely, but television, among all the 
media, is likely to have the highest proportion of 
those characteristics which feature in any reason- 
able definition. It has by far the largest audiences 
of all forms of communication; indeed, it some- 
times reaches virtually the entire population of a 
nation—as, for example, in the United Kingdom 
during telecasts of great national interest, such as 
a royal message, a state funeral, or a football final. 
Radio, too, can have most of a nation for its audi- 
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ence, but it cannot cross language barriers. Televi- 
sion, being visual, does just that. Through the use 
of satellites it is now multiplying its audiences 
several times over. The same program that can be 
seen in the United States can also reach huge 
audiences across the countries of Europe up to the 
iron curtain or sometimes even beyond it. Further- 
more, it reaches these audiences at the same time 
or, give or take a few seconds, at virtually the 
same time; these enormous audiences are not made 
up of successive audiences, as are those of the 
cinema. 

Television’s audiences are more amorphous and 
less self-selected than those of the other media of 
communication, Most of its audiences are made up 
of general consumers of television, that is, people 
who sit down ready to be entertained for the greater 
part of an evening, without necessarily having 
chosen a specific program at a particular time out 
of a feeling that it has a special appeal to them 
as differentiated individuals. They do not divide 
themselves, nor are they often divided by the pro- 
viders of television, according to age, sex, class, or 
region. By comparison, the readership of the na- 
tional press in Great Britain, for example, is still 
roughly assignable to social class, though this char- 
acteristic is weakening under the impact of the 
newer “classless” media. 

Television reaches right into the living rooms of 
its amorphous audience. It seeks to hold them and 
to retain that hold, rather than being content to be 
reached for as the mood arises. This becomes less 
true if there is more than one channel and if the 
programs on those channels are arranged so as to 
contrast with each other and encourage selective 
viewing. However, where the funds for television 
come from advertising, the point holds even where 
there are several channels: they all narrow their 
programming toward those items which seem likely 
to keep the average consumer—viewer from switch- 
ing off or over. 

Television is an archetypal form of mass com- 
munication also because it is an industry—an in- 
dustry whose products are recreation, ideas, and 
education. Further, the men who produce the pro- 
grams—authors, scriptwriters, visual artists, direc- 
tors, producers, actors, program controllers—are 
physically and to some extent culturally separated 
from their audiences, and that separation is pro- 
gressively increasing. Mass-communication indus- 
tries, comprising large bodies of highly endowed 
and intensely professional people with more than 
average articulateness, inevitably become narcis- 
sistic toward their own trade; they progressively 
worship the machine that draws on their talents 


and increasingly prefer not to ask searching ques- 
tions about their more complex interrelations with 
their society and the individuals who compose it. 
Their link with the audiences is indirect; they put 
their programs into the camera, and the engineers 
beam them into millions of separate homes; there 
is no one-to-one relationship. The programs them- 
selves become consumer products for which the 
producers have a sort of pure professional or aes- 
thetic love; around them accrue much of the folk- 
lore and legend common to all such highly so- 
phisticated mysteries. 

Like all major mass communications, television 
has a continuous production belt. It is a recrea- 
tional, cultural, educational, and informational 
“sausage machine,” which must be fed day after day 
so as to produce at roughly the same times roughly 
the same amounts and proportions of material. 
Inevitably and progressively (unless the process is 
deliberately resisted) it turns out the same kind 
of material, which is all of the same density and 
texture although it appears with different packag- 
ing—material that is predigested, homogenized, 
sterilized, artificially colored and seasoned each 
and every day. The pressures toward this kind of 
uniformity arise from the persistent need to deliver 
interesting goods day after day, to meet the sched- 
ules, and to keep up the ratings. The servants of 
the machine may and do change, as the shifts 
rotate daily and weekly, and as personnel come 
and go over the years in the great factory. But the 
consumer product has built into it a range of estab- 
lished and recurrent characteristics, partly because 
of the necessary organizational structure of the 
factory, partly because of the inherent qualities of 
the medium itself, and partly because of pressure 
from the audience to provide them with what they 
have come to expect. There is, of course, more 
variation than has been suggested; however, there 
is much less than one might have expected. 

Questions about television’s possibilities for 
growth as well as its dangers are, then, questions 
about all mass communications. Moreover, these 
are not only questions about the medium in itself. 
No medium of mass communication can be ade- 
quately discussed solely in terms of itself; such a 
discussion inevitably leads to a discussion of the 
society which television reflects and affects and 
which uses it in this way rather than that. How- 
ever, this important outer area can be touched on 
only peripherally within the limits of this article. 


Television's relation to its audiences 


Although television is important and pervasive 
in its effects on attitudes and behavior, it is prob- 


ably not as important as it is commonly thought 
to be. There are not a great many reliable psy- 
chological studies on the effects of television, 
though the number is fast increasing, nor are their 
results in all respects definitive. All that we can 
say at present is that television probably cannot 
change taste radically or upset deep-seated assump- 
tions, but that where it follows the grain of per- 
sonal or social predispositions, it can reinforce 
them. Recurrent violence on television seems likely 
to encourage acts of violence only in the psychotic 
or mentally disturbed, and television programs 
which go along with the tastes and interests of a 
particular social group are most easily assimilated 
and tend to strengthen those tastes. These effects 
are increased when a given theme recurs often, 
is dramatically presented, and provides the viewer 
with the opportunity to identify himself with a 
character on the screen (see Klapper 1960; Hal- 
loran 1964). 

Even these limited findings have important in- 
dications for program planning. To what extent 
can one risk further unbalancing the unbalanced? 
Should a program planner settle for meeting the 
existing expectations of the largest majority? How- 
ever, these are small questions in comparison with 
those about which we are still almost wholly in 
the dark, For example, it may have been proved 
that television is likely to cause the acting out of 
violence only in those who are already disturbed: 
but is it also true that exposure to repeated vio- 
lence—as a thing simply to be enjoyéd—has no 
effect, at the deepest levels of the psyche, on the 
millions of “normal” viewers? 

Patterns of programming. What should tele- 
vision offer its audiences? There can be no simple 
answer. In practice, answers differ from country 
to country according to prevailing ideologies; these 
ideologies determine the administrative and finan- 
cial structure of the various television systems and 
so determine their patterns of programming. 

In Britain, both the British Broadcasting Corpo- 
ration and the commercially financed but publicly 
controlled channels (ITV) are required by law to 
“inform, educate, and entertain.” In practice, both 
systems entertain first, inform next, and educate 
least of all (in terms of time given to specific for- 
mal education). In the United States the place of 
entertainment is manifestly more prominent; in 
totalitarian countries information and education 
have a more important place than in Britain or the 
United States and tend, insofar as both are colored 
by ideological purpose, to be run into one another. 

But what functions in relation to their audiences 
might be said to be more or less fundamental to 
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any television system in a democracy? First, it 
should provide a good deal of reasonably straight- 
forward services for the known needs of the major- 
ity of the population. The broadcasters have to 
amuse most of us in the periods when we most 
seek amusement, and they have to keep us (though 
it can only be sketchily) abreast of what has been 
happening in the world. Thus they have to provide 
entertainment ranging from the equivalent of a 
variety show to that of a concert hall, as well as 
a range of informational sources varying from the 
equivalent of a morning newspaper to that of a 
weekly journal of opinion. Moreover, all this must 
be done in the light of the fact that television is 
very expensive and that therefore the number of 
channels available (so far as we can see at present) 
is not likely to be sufficient to allow very fine break- 
downs by audience taste. For much of its time, 
television will have to think, if not of absolute 
majorities, at least of sizable minorities. 

But even this approach is a different concept of 
programming from the one dominant in those 
countries in which the audience sought most of 
the time is the largest possible majority of the total 
population. It is not sensible to oppose this view 
with the notion of television as the servant of small 
minorities; but even with only one channel, a tele- 
vision system can serve a range of overlapping 
sizable minorities much of the time. This is the 
promise of educational television in the United 
States (Schramm et al. 1963). The word “over- 
lapping” is important. Large minorities are not 
discrete sections of the total population. The same 
people can belong to different large minorities at 
different times, depending on how they are feel- 
ing. Of course, programming can be so arranged 
as to hit the dead center of average taste most of 
the time and so discourage any form of selectivity 
in viewing. If the broadcasters are successful in 
obtaining this kind of consistent attention, then 
they are tempted to decide that their audiences are 
as uniform in tastes as their own programs. The 
history of some unexpectedly successful programs 
(for instance, in Great Britain), which were not 
designed for the “mass audience” but which never- 
theless got a surprisingly hefty minority audience, 
challenges that thinking. 

One can extend this idea that the taste of today’s 
audiences is actually more varied than program 
planners are led to think or lead themselves to 
think. The existing tastes of most of us are not 
simply a product of irremovable hereditary factors; 
they are to a large extent a product of our oppor- 
tunities, education, social class, available money, 
and where we happen to be born. It seems reason- 
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able to ask whether broadcasters should simply 
reflect the average range of interests which a great 
number of other environmental forces have to- 
gether produced at any particular time. If they 
decide to do this, they should realize that their 
role only appears to be a passive one. They will, 
in fact, be harnessed to the service of, and made 
to pull in the same direction as, many other power- 
ful forces whose aim—in commercial democracies 
—is to exploit the existing range of tastes and 
interests. In that apparently passive role, television 
will therefore not be passive at all; as seen above, 
it will by its nature reinforce the existing narrow- 
ness in range of taste and the existing assumed 
limits. Should the broadcasters not be free, within 
sensible limits, to cater also to the potential as well 
as to the actual tastes of their audiences? Thus 
they might decide to offer programs which some 
people, though not as yet a large majority or even 
a sizable minority, find imaginatively and intellec- 
tually exciting, in the hope that others, who might 
otherwise never have had a chance to know of 
such possible interests, may become interested too. 

This part of the argument is much confused by 
the clash of slogans. Shall broadcasters “give the 
people what the people want,” or shall they “give 
the people what they ought to have”? This fruitless 
contrast was usefully dismantled by the British 
Committee on Broadcasting (the Pilkington Com- 
mittee) in its report published in 1962. 


The choice is not between either giving the public 
what it wants, or giving the public “what someone 
thinks is good for it,” and nothing else. There is an 
area of possibility between the two; and it is within 
this area that the choice lies. The broadcasting au- 
thorities have certainly a duty to keep sensitively 
aware of the public’s tastes and attitudes as they now 
are and in all their variety; and to care about them. 
But if they do more than that, this is not to give the 
public “what someone thinks is good for it.” It is to 
respect the public’s right to choose from the widest 
possible range of subject matter and so to enlarge 
worth while experience. Because, in principle, the 
possible range of subject matter is inexhaustible, all 
of it can never be presented, nor can the public know 
what the range is. So, the broadcaster must explore it, 
and choose from it first. This might be called “giving 
a lead”: but it is not the lead of the autocratic or arro- 
gant. It is the proper exercise of responsibility by pub- 
lic authorities duly constituted as trustees for the 
public interest. (Great Britain 1962, p. 18) 


Any decision to put on programs other than 
those apparently justified by simple “feedback” 
from the market or by plain consonance with 
existing taste can be called paternal, patronizing, 
or propagandist. Indeed, such a decision is danger- 


ous. Program planners are also mortal men with 
circumscribed views and assumptions that are not 
always examined. Their interpretation of the need 
to offer their audiences the chance to widen taste 
may, for example, amount to no more than the 
desire to make as many people as possible into 
“middlebrows”; they may have very simple views of 
the relations between social class and cultivation. 

Criteria of value in programming. It is ex- 
tremely difficult to define a “good” program. “Good 
in relation to what?” is a useful question, but it 
is only the first question; relied on for long, it 
becomes the motto for an evasive relativism. 
“Counting heads” is not a sufficient test of a pro- 
gram’s success, but it is not irrelevant. When a 
program attracts an unexpectedly small audience, 
this may indicate that, though the producer did all 
he could to make it interesting without compromis- 
ing the integrity of his subject, only a tiny minority 
of people are at present willing to be interested; 
on the other hand, it may indicate that not enough 
thought—at least, not enough of the right kind of 
thought—went into the effort to communicate 
widely. Again, technical competence is important 
and technical brilliance admirable. But, in the face 
of the difficulty of defining “good television” with- 
out seeming either patronizing or a victim of the 
market, many television producers hold on to tech- 
nical virtuosity or “professionalism” as an end in 
itself. There are certain programs where the pos- 
sibilities of the medium are being extended through 
a fascinating exploration of its inherent character- 
istics. However, in general, technical skill is and 
must be the servant of the particular subject or 
theme which is being presented. 

In presenting that material, the broadcasters 
must aim at “objectivity” and “balance.” This is 
often said, and although it sounds easy, it is in fact 
difficult. The two words are not synonymous. “Ob- 
jectivity” means the decision not to push a partic- 
ular editorial line, but rather to present the rough 
and qualified contours of truth as accurately as 
possible. “Balance” means ensuring that all sides 
have a fair and a reasonable chance to be heard. 
Both “objectivity” and “balance” have to be posi- 
tively and flexibly interpreted, or they will become 
polite synonyms for sitting on the fence and for 
balancing one side against the other so nervously 
that all sides lose their force. The problem—which 
can be solved—is to avoid propaganda or partial- 
ity, to suggest the complextity of truth, and to 
allow the sense of the importance of making 
choices to be felt. 

“Objectivity” and “balance” are not sufficient cri- 
teria of value in programs. Furthermore, though 


remembering that tastes do differ provides a useful 
check to narrow-mindedness, it still leaves virtually 
all the difficult questions about standards unan- 
swered. We have seen that thinking in terms of 
fixed sociocultural hierarchies does not help much 
either. There is no room here to do more than has 
been done: to point to the complexity of the prob- 
lems, indicate some of their elements, and reiterate 
their importance, As a coda, another quotation 
from the Pilkington Report may be useful: 


Triviality is not necessarily related to the subject mat- 
ter of a programme; it can appear in drama, current 
affairs programmes, religious programmes or sports 
programmes just as easily as in light comedy or variety 
shows. One programme may be gay and frivolous and 
yet not be trivial. Another programme may seek pres- 
tige because it deals with intellectual or artistic affairs 
—and yet be trivial in its grasp or treatment; it may, 
for example, rely for its appeal on technical tricks or 
the status of its compere, rather than on the worth 
of the subject matter, and the depth of its treatment. 
Triviality resides in the way the subject matter is ap- 
proached and the manner in which it is presented. 
(Great Britain 1962, paragraph 98) 


On the more practical aspects of the relations 
between television and its audiences, the question 
is, What links should exist between the two? This 
question is best discussed in the final section of 
this article, which deals with structures. The point 
of mentioning it here is that it underlines once 
again television’s inherent trend toward centraliza- 
tion and thus its separation from its audiences. We 
have already noted that separation increases as 
television’s practitioners become more internally 
engrossed in their work, because by definition the 
practitioners are self-consciously professional and 
most members of their audiences are not. Television 
is not a home industry or a folk industry. Even its 
conventional art has to be carefully planned; it 
doesn’t just happen because someone just happens 
to feel that way. Television is a many-layered in- 
dustry with stage after stage interposed between 
an original idea and what eventually appears on 
the screen. Inevitably, it produces an ethos—a 
mystique—of its own. 


The qualities of the medium itself 


Television production is a several-stage process, 
and thus it is an even more complicated process 
than that of producing a finished performance of 
a dramatist’s final script in the live theater. More- 
over, this medium often uses several writers at 
the same time on the same script or teams of 
writers working concurrently on a number of 
scripts or a series; in other instances a story line 
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and thematic pattern are established and different 
men take over the script writing at different times, 
or the relationship between a writer and his pro- 
ducer is a continuous dialectic. All these differ- 
ences between television and live theater need not 
mean loss to a dramatist used to the ways of the 
theater; he may gain from them so long as he 
recognizes their possibilities as well as their limits. 
The medium has qualities which restrict those 
who use it or at least closely define their areas of 
movement, but so do all art forms and channels 
for the expression of art. As in other fields of art, 
the tension between the given limitations of the 
medium and its new and peculiar possibilities 
challenges the imagination of a writer and suggests 
different ways of exploring experience. The restric- 
tions of budget for any particular program, as well 
as problems of available space and personnel, are 
similar in kind to those technical and commercial 
pressures which were felt, say, in the development 
of the novel during the nineteenth century. The 
relevant considerations here can range from the 
simple and obvious to the most psychologically 
obscure: from merely circumventing a shortage of 
cameras to discovering new possibilities within the 
medium by means of a creative response to the 
challenge of that shortage. The parallels with, for 
example, the writing of sonnets are obvious. 
About the qualities of the medium itself not a 
great deal that is authoritative has been written, 
but there are some useful hints and guesses. Tele- 
vision has a peculiar “immediacy” and “fluidity”: 
it strongly suggests a sense that “it’s all happen- 
ing’—a sense of “thereness,” “thisness,” and “now- 
ness”—and it tends to break down pre-existing 
categories, It resists stage acting and even cinema 
acting, breaks through the picture frame or mental 
proscenium arch, and merges the spectator with 
the picture, on its sidelines if not at the center. 
On television everything tends to become a form 
of documentary. Television tends to seek “person- 
alities” of its own. Of course, television personali- 
ties help to build up “channel loyalty” and link 
disparate items, and so program planners like them. 
But the power of personalities is more than that: 
they are part of the self-validating and self-suffi- 
cient immediacy of television in its own right. 
Television has an exceptional sense of immediate 
history. When a great public occasion—for ex- 
ample, a state funeral such as Sir Winston Church- 
ill's—is seen on television, the sense of immediate 
historical presence is overwhelming. Furthermore, 
television can make a funeral in a back garden 
seem heavy with the meaning of life. It heightens 
even the most “ordinary” ordinariness of life and 
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seems to give it dramatic significance. Because of 
its sense of actuality, of its Brechtian breaking of 
distance, we are all sidewalk spectators of the 
drama of life when we watch television. Sometimes 
one really believes in the reality of an event only 
because it has been validated through being seen 
on television. 

We know little about what all this really means 
yet; it is too early to determine how far television 
may be encouraging a more plastic and fluid psy- 
chic responsiveness, or how far it may be the 
agent of a disconnected, discontinuous flow of 
value-free sensations (see McLuhan 1964). When 
it is not simply providing routine servicing, tele- 
vision is constantly seeking its own forms—its own 
kinds of drama, comedy, current affairs, and edu- 
cational programs. Much has been done, but tele- 
vision programming is still at no more than the 
end of the beginning. This exploration, plus the 
assumption, already discussed, that an audience's 
potential range is greater than its actual range, 
requires the emphasis to be put on openness and 
experimentation in program planning. About both 
the medium itself and its deepest relationships with 
its audiences we still know too little to be dogmatic. 


The formal structures of television 


There are three main kinds of formal structure 
for television, as for all broadcasting: the state- 
controlled, the commercially impelled, and the in- 
between, or—as I would prefer to call it—the 
potentially democratic. 

In the first, the communications system is an 
arm of government and so at bottom propagandist, 
no matter what its apparent face. It can be found 
in one form or another in communist countries, 
fascist countries, and in other types of authori- 
tarian countries, especially those in which the 
Roman Catholic church is strong. It stresses the 
party line and the national philosophy, and keeps 
a tight hold on the channels for debate. Under this 
system the greatest sufferers are information and 
education, since both are consistently distorted. 
Television is regarded chiefly as a channel through 
which things are done to people rather than as a 
medium with its own possibilities and character- 
istics. Therefore state-controlled television tends 
not to be experimental either about what the me- 
dium can best do or about what audiences might 
enjoy if they had the chance. 

In the second system—the commercially im- 
pelled—television is tied to commerce, usually be- 
cause it makes its money by selling advertising time. 
Such systems can vary from those in which ad- 
vertising is regarded (or said to be regarded) 


chiefly as a convenient means for financing the 
programs to systems in which (public professions 
notwithstanding) the main feature is advertising. 
In the former type of system, the programs are 
still the most important thing, whereas in the 
latter type they are regarded as carriers or bait for 
the commercials; however, in the end, the pres- 
sures on all such systems are very much the same. 
This emphasis on commercial values can happen 
under any system which allows advertisements but 
obviously is likely to be more extreme under di- 
rect sponsorship. 

The drive is to maximize profit by increasing the 
number of impacts delivered or persons reached; 
the audience is seen as a vast number of possible 
consumers whom it would be advantageous to hold 
as one large mass. The basic urge of this kind of 
television is, therefore, to keep the largest possible 
number of people content for the largest possible 
amount of time. Its natural drive is the reverse 
of that of state-controlled systems; it stresses en- 
tertainment rather than information and educa- 
tion. It tends to narrow the range of programs 
offered, so as to concentrate on types of programs 
considered likely to attract a mass audience, Thus 
it tends to ignore minorities, because minorities 
are subdivisions of the potential single mass audi- 
ence. A good example of this self-defeating con- 
centration on a narrow range of programs known 
to be popular can be seen in the programming of 
New York City’s numerous television channels dur- 
ing prime time. The commercially impelled system 
also reduces the urge to experiment, whether in 
regard to the potential tastes of the audiences or 
to the more freely imaginative possibilities of the 
medium itself; it inherently seeks to exploit exist- 
ing tastes, 

The “potentially democratic” system. I called 
the third system the “potentially democratic” be- 
cause I doubt that a full-fledged democratic broad- 
casting system exists anywhere in the world. At its 
best, the British Broadcasting Corporation some- 
times gets near to it. In such a system the state 
does not have day-to-day supervision or any execu- 
tive control (though in the public interest there 
must be some overriding guardianship); nor is 
the advertiser able to distort the uses of the me- 
dium, since there is no advertising. 

The two chief difficulties in such a system are 
paying for it and providing the necessary minimum 
of supervision without inhibiting the broadcasters. 
Advertising—with the advertisement revenue-re- 
ceiving body formally and genuinely separated from 
the program-providing body—might just be pos- 
sible at one end of the financial spectrum; the 


danger here is the steady spread of the pressures 
of advertising into the planning of programs. Mass- 
media advertising is like a seamless garment, and 
to think you can seal off a part of it is to be like 
the young lady of Riga who thought she could ride 
a tiger with impunity. At the other end of the finan- 
cial spectrum is a system of direct payment from 
public funds to the broadcasting authority, who 
would be entrusted to use it for broadcasting in the 
public interest, with “public interest” not too re- 
strictively defined. This has various possible forms, 
too, and some are less tricky than others. But ob- 
viously the danger here is that the state will exer- 
cise increasing, creeping pressure, as the direct 
provider of funds. 

In between these two ends of the financial spec- 
trum there are, again, several different ways of 
paying for broadcasting. On the whole, the most 
successful is the license fee, because it gives most 
scope for the broadcasters to serve the public inter- 
est in the wide and flexible sense described earlier. 
Although there are arguments against this approach 
—for example, that it is a form of “poll tax,” or 
that it is too expensive because the money has to 
be collected in millions of small packets—the argu- 
ments in its favor seem overwhelming. The license 
fee falls on all, and almost all use broadcasting; 
furthermore, it multiplies and subdivides the 
sources of money so that no one large agency is 
tempted, because it is footing the bill, to try to 
exercise consistent strong pressure or feels cheated 
if things do not go in just the way it wants. How- 
ever, once practically every family has a television 
set, the ceiling on new licenses has been reached, 
and from that time on, the amount of the license, 
if it is to serve its purpose, must be tied to increases 
in wages and incomes. 

The supervision of a “democratic” (nonstate and 
noncommercial ) broadcasting system is even trick- 
ier than the problem of paying for it. There must 
be a buffer between the broadcasting professionals 
and whatever party happens to be in power at any 
one time—a buffer which allows the broadcasters 
to stand on their own professional feet and exercise 
proper editorial independence but which also stops 
them from too narrowly, too remotely, or less than 
responsibly defining the public interest. Countries 
trying to operate this system vary in the buffer they 
use, but the British Broadcasting Corporation’s 
board of governors, for both its strengths and its 
weaknesses, is an interesting particular instance 
(for the problems of educational television in the 
United States, see National Educational Television 
1960). 

Below the government level, it is important that 
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broadcasters be kept responsive to public need. But 
“public need” is not easy to define, and some large- 
scale bodies which claim to speak for “ordinary 
opinion” are no more than philistine and restrictive. 
There seems to be a need for a range of responsive 
and well-informed bodies forming links between 
the producers and their audiences: not pressure 
groups in a political or economic sense but interest 
groups, specialist groups, and professional groups 
—all helping to form a texture of relevant critical 
understanding and of challenge and response be- 
tween the two sides of the operation, 

In all this, the overriding need is to find a struc- 
ture that will let professionalism in broadcasting 
(in the sense defined above) grow and become, as 
far as possible, responsible for itself. The fully dem- 
ocratic broadcasting system which might emerge 
if the twin pressures of the state and the advertisers 
were thus avoided is difficult to define precisely 
(and even more difficult to put into practice), but 
a rough workable definition is certainly possible. 
Such a system would not assume an automatic and 
close correlation between “cultivation” and social 
class; it would not think of itself as offering some 
fixed sociocultural status symbols downward to the 
masses. Nor would it reject the standards of civili- 
zation in the pursuit of popular support. Nor would 
it peddle “culture” as a form of marginal differen- 
tiation in the “affluent society.” It would be a very 
varied and pluralist organization with a strong but 
dispersed life. Furthermore, it would be highly pro- 
fessional in that it would be not only technically 
efficient but also would devote a great deal of 
thought to the possibilities of the medium and to 
its own varied and changing relationships with its 
audiences; and it would regard its audiences, too, 
as varied and changing. It would be editorially ob- 
jective but not inhibitedly given to fence sitting. 
Finally, it would combine a very high degree of 
freedom with a relevant sense of responsibility. 


RICHARD HOGGART 


[See also FiLM; Mass SOCIETY.] 
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Iv 
AUDIENCES 


The rise of the mass media has been inseparably 
linked to the growth of national cultures with a 
linguistic unity, to the increase of literacy, and to 
key events in the technological revolution—from 
movable type to the transistor. In these relation- 
ships, cause and effect intermingle. The rise of 
literacy in the Western world was accompanied by 
the birth of the first inexpensive newspapers, but 
mass literacy was required before any of the print 
media could win a wide public. Mass literacy in 
turn depends on an organized system of mass edu- 
cation, inconceivable in a purely feudal or tribal 
society. 

“Mass,” in this context, suggests an absence of 
social structure, although not necessarily of group 
identity. A principal characteristic of the mass 
audience is that it is dispersed, An audience for an 
FM radio broadcast may be far smaller than one 
for an open-air concert, but the former constitutes 
a mass audience in the sense that its members 
perceive themselves (and are perceived by the 
radio station’s manager) as potentially consisting 
of all persons within range of the station’s trans- 
mitter. The audience at a “live” concert quite evi- 
dently does not see itself in this way. The reason 
is that the messages of mass communication are 
open to all. The oldest form of such open com- 
munication is the public proclamation—a form 
that includes the public poster and the wall news 
bulletin. Yet, public announcements of this kind, 
with their implicit reliance on secondary transmis- 
sion of the news by word of mouth, are not ordi- 
narily thought of within the framework of the 
great mass media of today. 

Traditionally, the mass media have been re- 
garded as “merely means of transmission” (Hall & 
Whannel 1964). But as McLuhan (1964) points 
out, “the medium is the message,” in the sense that 
its own peculiar attributes help to determine the 
very meaning of the communication. In short, the 
essence of any medium is that it mediates. It is not 
so much a neutral channel through which a mes- 
sage flows as it is a distinctive material in which 
the message is recast. Such mediation is already 
performed by the written word as it appears in any 
society that has evolved beyond the level of folk 


culture; indeed, the transmission of culture de- 
pends first and foremost upon the written word. In 
this sense, the scribes of antiquity were the pre- 
cursors of Caxton and Gutenberg. The printing 
press was essentially a mechanical means for 
achieving on a larger scale what the medieval 
manuscript copyists (whose work, organized in the 
monasteries on a group basis, had some of the 
features of mass production) had already assured 
in principle, 

The invention of movable type did not in itself 
make the book into a mass medium, since two 
necessary conditions were absent: mass literacy 
and the means for widespread dissemination of 
identical communications content. The latter con- 
dition did not really come into existence until the 
arrival of mass-produced and inexpensive news- 
papers in the mid-nineteenth century. These were 
made possible by the high-speed rotary press and 
cheap wood-pulp paper and were later made still 
cheaper by the introduction of automatic type- 
casting machines, The dramatic effect of these in- 
novations can be inferred from the fact that from 
1846 to 1850 daily newspaper circulation nearly 
doubled. 

The first newspapers, in the seventeenth cen- 
tury, had been feeble affairs by modern standards, 
in terms of both content and number of readers. 
Yet, only an arbitrary cutoff point can be used to 
state when their circulations reached proportions 
that would justify calling them mass media, Ac- 
cording to one authority, in 1704 the average daily 
sale of all London newspapers combined was 
7,300, or about 2.7 million a year (Altick 1957, 
pp. 47-48), Whether this circulation is considered 
large or small for a country where the population 
hardly exceeded 6 million depends on the historical 
perspective from which it is viewed. At any rate, 
it is certain that the number of newspapers sold in 
England almost doubled between 1753 and 1780. 
Individual newspapers, however, had not reached 
appreciably larger circulations by the end of the 
century; over-all sales were greater simply because 
there were more newspapers (ibid., p. 48). More- 
over, the stamp duty on English newspapers made 
them too expensive for the average member of the 
public, although copies were available free to 
habitués of the clubs and coffeehouses that flour- 
ished in the larger cities, It was not until the nine- 
teenth century that newspapers became as cheap 
and as widely circulated as they are today. In the 
period 1860-1870, for instance, most daily news- 
papers in New York sold for four cents or less, and 
by 1872 at least three of them were credited with 
daily circulations in the range of 90,000 to 100,000; 
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comparable circulations were achieved by London 
and Paris newspapers during the same period 
(Mott [1941] 1962, pp. 403-404). 

Radio, in the days before tape recording, ex- 
tended the concept of what makes a mass medium 
beyond the criterion of mere reproducibility of com- 
munications content. Unlike the telephone (a 
medium of private communication made possible 
by mass facilities), radio has the same ability as 
print or cinema to put the same message before 
many people at the same time. It is precisely the 
direct extension of the communicator’s message 
beyond the scope of interpersonal contact that 
transforms private communication into a mass 
medium, Thus, the enormous distribution of phono- 
graph records, for instance, makes them a mass 
medium, particularly in the area of popular music, 
where fashions in style and in the fortunes of indi- 
vidual performers show sharp fluctuations and 
strong susceptibility to organized promotion. 

The great twentieth-century inventions—film, 
radio, phonograph records, and television—have 
made literacy unnecessary for understanding mass 
communication, but the spread of the mass media 
presupposes a society that has attained a degree of 
affluence, in which mass literacy is taken for 
granted. The mass media are in evidence through- 
out the mid-twentieth-century world, but the size of 
media audiences varies enormously from place to 
place. 

Current media statistics, such as those in 
UNESCO's World Communications, are rapidly out- 
dated, particularly in the case of the newer media. 
In 1955 there were about 6 million television sets 
in use in western Europe. By 1960 the number had 
quadrupled, and by 1965 it had grown eightfold. 
By contrast, in the relatively stagnant economies of 
Latin America and Africa the number of television 
sets showed only a very slow rate of growth (for 
these and related figures see U.S. Information 
Agency 1966, p. 4, table 1). 

It is highly tempting, but also misleading, to 
classify countries according to the incidence of 
radio or television sets, cinemas, or newspaper or 
magazine circulation per capita. The penetration 
of a medium may be diffused or concentrated in 
terms of either geography or social class. 

In certain countries of Africa, Asia, and South 
America the mass media are in full use only among 
upper-class and upper-middle-class urban elements, 
who represent a small part of the total population. 
However, if the absolute size of this minority is 
substantial (as in India), the media may still rep- 
resent a considerable cultural and political force. 
For instance, the news of President Kennedy’s 
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assassination was soon known by a very large pro- 
portion of the Indian people, even those in villages 
far from a radio set. As is documented by Lerner 
(1958), even in a folk society of illiterate peasants 
close to the ancestral soil and without access to 
radio, television, movies, or the printed word, there 
are usually found geographically dispersed mem- 
bers of the extended mass society. Deep in the 
jungle or on the plains, surrounded by villagers or 
tribesmen, there is always a trader, a government 
administrator, a teacher, a medical technician, an 
agricultural specialist, or an itinerant peddler. Any- 
one who has ties to the outer world sustained by 
reading, by radio, or by occasional excursions to 
the urban centers can act as a relay point through 
which new information and ideas reach those who 
live by the traditional order. 

But the newer media, which do not depend on 
the printed word, have direct effects quite inde- 
pendent of such secondary diffusion, In remote 
hamlets throughout the world the agents of govern- 
mental and international welfare organizations 
erect their movie screens each night to show train- 
ing films on health, agriculture, sanitation, or poli- 
tics. In ten thousand tropical open-air cafes the 
radio blares from dawn to late at night, within 
hearing range of all who pass by. In the big cities 
of even the most backward lands crowds of slum 
dwellers gather each night in taverns and clubs and 
before the windows of radio stores to watch the 
television shows. They constitute a regular tele- 
vision audience even though the price of a set is 
well beyond their means. 

The very universality of the mass media appears 
to serve contradictory functions. The diffusion of 
the same information to different groups enforces 
social cohesion by creating common heroes and 
symbolic reference points. At the same time, this 
very diffusion may be a socially divisive force 
insofar as it emphasizes particular subcultural in- 
terests in a complex and heterogeneous society. 
Mass media may help a group to define its own dis- 
tinctive character, as may be seen in the cases of 
the French-speaking audience of the Canadian 
Broadcasting Corporation, the native readers of 
Drum in the Republic of South Africa, or audi- 
ences of Negro weekly newspapers and radio sta- 
tions in the United States, 

The institutions of modern political democracy 
are inconceivable without mass media, through 
which political information and opinion can be 
disseminated and discussed. Yet, paradoxically, 
the totalitarian governments of the twentieth cen- 
tury could not have established or maintained 
themselves in power without making use of press, 


film, and radio, Autocracy, whether in eighteenth- 
century France or nineteenth-century Russia, has 
long used censorship of the media as a device for 
stifling dissident voices. But only recently have 
state-controlled media been used, as in the Ger- 
many of Goebbels or the Ghana of Nkrumah, to 
create mass support for the ideology of the regime 
in power. 

In countries with representative governments 
radio and television have added significant new 
dimensions to the operations of the political proc- 
ess by introducing new criteria for evaluating can- 
didates—their ability to project their personalities 
with charm and vigor, in a context dominated by 
professional entertainers. Broadcasting has also 
brought a vicarious sense of direct participation in 
the political process to a broadly scattered elector- 
ate that relied in the past on the more impersonal 
contacts of print. The intrusion of cameras and 
microphones into political meetings, demonstra- 
tions, and conventions has brought a new self- 
consciousness to the behavior of the participants 
and altered the very character of such events. 


Comparative mass media systems 


If nations are to be classified according to the 
prevalence of mass media, we must first set aside 
countries such as Yemen or Mali, which lack in- 
digenous media and therefore depend on radio sig- 
nals or occasional publications that cross their 
national boundaries. Other countries can be ranked 
according to the level of their print culture, as 
measured by the rate of literacy and the figures 
for circulation and readership of newspapers and 
magazines. 

The highest per capita readership of newspapers 
is found in the United Kingdom, where a news- 
paper is sold for every two people every day (United 
Nations . . . 1964, p. 331). Short distances enable 
the morning national newspapers of that country 
to attain overnight distribution throughout the 
nation. Thus, the distinctions between the news- 
papers tend to be based on politics and social class 
differences rather than on local interests, although 
some national dailies publish regional editions. In 
contrast, the United States has no national news- 
papers and the major part of even the big-city 
dailies is made up of local news and advertising. 

In general, areas with a strongly developed print 
culture, as in Europe, North America, and Japan, 
are also those with the most advanced broadcast- 
ing culture. But this is not always the case. Some 
Asiatic and South American countries emerged into 
the age of radio and television, while bypassing the 
spread of universal literacy. In many Latin Ameri- 


can slums television antennas are marks of social 
status, which sprout from the roofs of dismal 
shanties whose illiterate inhabitants lack running 
water and other basic necessities. In some coun- 
tries (e.g., Israel and the Republic of South Africa) 
reasons of politics or economic need have kept out 
television as a matter of official policy, in spite of 
public demand. 

The relative importance of the various mass 
media in any country cannot be judged by the ex- 
tent to which people have access to them. Coun- 
tries with comparable populations (including com- 
parable literacy rates) vary substantially in the 
number of newspapers or magazines actually read 
by the average reader. Broadcast listening and 
viewing hours vary with national cultural values, 
tastes, and behavior styles even when the propor- 
tions of set ownership are comparable. Listening 
patterns also reflect government action and the 
state of the economy, which together determine the 
hours of the broadcast day, the number of stations 
on the air, the quality of programming output, and 
the intensity of competition. It should also be re- 
membered that the penetration of a mass medium 
extends beyond its physical presence, through mes- 
sages relayed by word of mouth—the “two-step 
flow,” described by Katz and Lazarsfeld (1955). 
The number of copies printed may give no clue to 
the real influence of a publication, Addison and 
Steele's Spectator may have had a far greater num- 
ber of readers per copy than any newspaper or 
magazine in England today. In many areas of the 
world publications are still passed from hand to 
hand or, as in the Soviet Union and China, placed 
on billboards for all to see. 

The extent of media penetration in a country 
cannot be predicted solely from knowledge of its 
educational or economic level. For instance, in the 
Congo, where a third of the population could read 
in 1950, two newspapers were circulated for every 
1,000 people, and there were six radios for the 
same number (United Nations . . . 1964, p. 80). 
But in Nigeria, with a lower level of literacy (about 
11 per cent), the penetration of the press was four 
times as great and of radio twice as great (p. 102). 
Other contrasts may be found in Latin America. 
Argentina, with 86 per cent literacy, had 155 copies 
of the press for every 1,000 people (p. 177). Uru- 
guay, with the same proportion of literates, had 
260 copies per 1,000 (p. 194). While Argentina 
had 167 radios for every 1,000 people, Uruguay 
had 354, but Argentina, which introduced tele- 
vision earlier, had 42 sets per 1,000 people (p. 
177) and Uruguay only 24 (p. 195). Venezuela, 
with half its population illiterate, had only 96 
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newspapers circulating per 1,000 population but 
had 194 radios and 60 television sets, more than 
Argentina (p. 196), Another interesting contrast 
in rival economic systems was afforded by East 
Germany and West Germany, both with virtually 
universal literacy. The number of newspapers per 
1,000 people was much higher in East Germany 
(456) than in West Germany (307). The propor- 
tion of radios was about the same (about 3 for 
every 10 people), but West Germany led in tele- 
vision sets—139 per 1,000, compared with 90 per 
1,000 in East Germany (pp. 286, 290). 


The concept of audience 


Comparative studies of mass media are some- 
what obscured by ambiguous terminology, which 
reflects changing uses of the word “audience” it- 
self, The term audience (Latin audientia) connotes 
a situation in which a powerful, but passive, audi- 
tor (such as a king or magistrate) listens to an 
advocate or suppliant actively presenting his case. 
Hence comes its later application to the theater, In 
the classic tradition drama exists only in relation 
to the audience, That heightening and purging of 
the emotions of which Aristotle wrote in his Poetics 
is inconceivable except in the mutual empathic re- 
lation of actors and onlookers bound together in 
an interplay of feeling, even though the words and 
gestures are all on one side of the stage. Since the 
audience is an integral part of the dramatic spec- 
tacle, it must also be regarded as a body that is 
collective in nature and that has social character- 
istics of its own apart from the characteristics of 
its individual members. Aristotle was no doubt de- 
scribing an “ideal type” rather than rendering a 
literal description of the Athenian theatergoers of 
his day. By definition, an audience, in the Aris- 
totelian sense, must be conscious of its presence 
and purpose in being where it is and of its own 
reactions, There is a social cohesion among the 
members of the audience, but this rises out of the 
spectacle, which is their common experience, Their 
unity is forged by the spectacle and exists only as 
long as the spectacle continues, Drama arouses the 
feelings of awe and pity only to the degree that it 
can play on those universals in human experience 
and aspiration which are intensified in a collec- 
tive setting. 

From its early associations with the sense of 
hearing, the word “audience” in the nineteenth 
century began occasionally to be applied to the 
reading public as well. To Charles Dickens on the 
lecture platform there may hardly have seemed to 
be a major distinction between the people who 
read his books in serial form and those who gath- 
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ered to hear his readings. But it was only with the 
motion picture that the term “audience” became 
applicable to a mass medium in the classical (i.e., 
theatrical) meaning. The movie theater is, after 
all, superficially identical with the legitimate the- 
ater in its seating arrangements, its fixed spatial 
relationship between spectacle and public, and its 
rule that the audience must sit in darkness and for 
the most part in silence. 

There are two new elements in the motion pic- 
ture experience. In the first place, the movies can 
do without the conventions that ordinarily dis- 
tinguish the world of the drama from the world of 
reality. Events on the theatrical stage are always 
directly perceived as re-enactment, as fiction. The 
very presence of flesh-and-blood actors makes it 
impossible for the audience to confuse the depicted 
action with the truth. But movie photography, with 
its ability to collapse time and to broaden scenic 
perspectives to take in all the complex settings, 
landscapes, backgrounds, and moving multitudes 
of the real world, can create the illusion that what 
is being shown is somehow a direct representation 
of something that actually did happen. The second 
critical difference between the cinema and the 
theater (and the one that most concerns us in this 
context) lies in the absence of feedback. The com- 
munication is disembodied, it flows in one direc- 
tion. Except for the long-range influence of box- 
office receipts, fan mail, and the articles of critics, 
there is no way for actors, directors, or producers 
to know at first hand whether they have succeeded 
or failed, because there is no regular way for the 
audience to express itself either during or immedi- 
ately after the performance. 

Radio and television, like the movies, are deriva- 
tions of the theater inasmuch as they are spectacles 
arranged for the benefit of a specific group. Also 
like the movies, they communicate content in an 
order and at a rate over which the audience has 
no control (often with a time lag between the origi- 
nal performance and the exposure) and reach 
people who cannot transmit their reactions directly 
to the performers. The listeners to a particular 
broadcast (and to some extent the regular listeners 
to a series of broadcasts) share a common experi- 
ence and a certain consciousness of each other’s 
presence. Quite early in the evolution of radio pro- 
gramming the studio audience was introduced as a 
device to promote empathy on the part of the scat- 
tered listeners. It also served to provide the live 
entertainers with the feedback reactions considered 
necessary to evoke their best efforts. As technology 
became more sophisticated, the latter aspect was 
given less importance. But it was still found neces- 


sary to produce the illusion of a studio audience, 
in order to stimulate the response of the unseen 
listeners or viewers. This was made possible by the 
use of recorded applause and laughter that could 
be adjusted to specified degrees of loudness and 
enthusiasm, 

There is a fundamental difference between the 
feedback from audience to producers of films or 
broadcasts and that from audience to writers or 
editors of print media, Newspapers have a long- 
standing tradition of publishing a selection of 
letters from readers; eight per cent of the people in 
the United States report they have written at least 
one letter to a newspaper (Audits and Surveys 
Company, Inc. 1961), The reader who writes in 
does so, not so much to enlighten the editor or to 
convince him to change his mind, as to see his own 
opinions in print and presumably to influence other 
readers. In this sense, he is aiming at a mass 
audience for his opinions. 

In the case of mail from listeners, viewers, or 
movie-goers, the writer wants to communicate di- 
rectly with the star or producer to whom his letter 
is addressed, Not always is he merely approving, 
complaining, or suggesting changes, as a fan or 
critic, Often the fan letter is a device for achiev- 
ing human contact with the admirable or powerful 
figures he imagines stars to be. He may ask for 
help with his personal problems; he maintains the 
illusion that the star feels as close to him as he 
does to the star, whose private life may be a mat- 
ter of public record and interest (Bogart 1949). 

The letters column on the editorial page pro- 
vides the newspaper editor with a mechanism by 
which to evade concern with the letters’ content. 
Since broadcasters do not have a comparable mech- 
anism, they are bound to take letters more seri- 
ously, particularly critical letters, to which they 
often assign a weight beyond true proportion. This 
tendency is supported by the apparatus of com- 
mercial broadcasting, in which the sponsor of a 
program—and the whole business establishment of 
the broadcasting industry—normally finds it both 
unnecessary and commercially distasteful for even 
a single viewer to be antagonized, The media pro- 
ducer’s perceptions and preoccupations with the 
public’s reaction to his efforts have been better 
described in thinly disguised “fictional” narratives, 
such as Foreman’s (1958), than by professional 
sociologists. 


Broadcast-audience measurement 


The measurement of audiences for television 
and radio programs is a major industry in the 
United States and is carried out on a highly sys- 


tematic and continuing basis in western Europe, 
Japan, and several countries of Latin America. 
Identical techniques are generally applicable to 
television and radio, although the advent of the 
easily portable transistor radio has created a highly 
mobile audience, which is difficult to measure. 

Audience as a measurement concept for radio 
evolved some years after the term “radio audience” 
had become a conventional part of the broadcaster's 
vocabulary. Initially, the yardstick of radio pene- 
tration was the count of receiving sets. This paral- 
leled the use of a tangible unit of measurement for 
newspaper and magazine circulation: printed sold 
copies. Later, various formulas were used to esti- 
mate the number of listeners in ratio to the number 
of letters, and surveys were made to measure just 
what stations people could receive on their sets. 

The measurement of program audiences by the 
survey method was a by-product of commercial 
broadcasting in the United States, as described by 
Head (1956) and documented in the Harris Com- 
mittee hearings (U.S. Congress . . . 1963). Stations 
and networks sold air time to advertisers in units 
which appeared to be fixed, but which actually var- 
ied greatly in the size and character of the listener 
groups they yielded. The sponsor was not really 
buying an hour of time on a station or a half-hour 
on a network, but rather the opportunity to present 
a sales message to a certain number of listeners. 
That number would vary according to the signal 
strength and popularity of the station and the 
timing and appeal of the sponsor's program. 

As advertisers made larger and larger invest- 
ments in radio, they became increasingly concerned 
with the need for accurate measurement. Thus, 
radio research in the United States evolved from 
one-shot, unsystematically done surveys to con- 
tinuing studies, first directly sponsored by the 
broadcasting industry and then produced as syndi- 
cated services by individual research firms, which 
in some cases developed into institutions of con- 
siderable wealth and power. Curiously enough, the 
evolution of audience research by the noncommer- 
cial British Broadcasting Corporation underwent a 
parallel evolution, from special to continuing stud- 
ies (Paulu 1956). 

What was encompassed in the term “audience” 
could only be defined pragmatically, in terms of 
the statistics yielded by particular research meth- 
ods. “Audience” could be the number of people who 
said they “ever” listened to a particular show, or it 
could be the number who said they had listened to 
that show last week. It could be the number who 
reported they were listening to the show at the very 
time they were interviewed in person or on the 


COMMUNICATION, MASS: Audiences 75 


telephone. It could be the number who selected the 
show from a roster of programs that had been on 
the air the previous night or who listed the show 
in a radio-listening diary. “Audience” could encom- 
pass not only the respondents, who were the 
source of information, but the other family mem- 
bers on whose listening the respondents also re- 
ported. “Audience” could be the people who listened 
to any part of the program, those who listened for 
at least five minutes, or only the average number 
listening per minute of program. 

An enormous range of statistics was produced 
by such a variety of definitions and research meth- 
ods, Each measurement assigned a single quantita- 
tive value to the term “audience,” yet what was 
being measured represented more than a single 
kind of listening experience. The members of the 
audience are continually drifting in and drifting 
out; some of them are more attentive than others. 
A rather considerable variety of media-exposure 
patterns were thus lumped together only for the 
sake of convenience. Measurement now consisted 
of estimating the intangible “listening experience” 
rather than the tangible quantities of radio sets or 
of letters to the station. 

In most countries where broadcast-audience 
measurement exists it is provided either for an 
entire nation or for one or a few principal cities 
where receivers are concentrated, In the United 
States the highly competitive system of commercial 
broadcasting has created a number of services 
which measure audiences nationally and locally in 
hundreds of cities. Measurements are made of total 
set usage at various times of the day, the size of 
the total audience for a program, and the number 
of people listening or watching at a particular min- 
ute, Still further measurements are made to ascer- 
tain the composition or characteristics of the 
viewers and the audience stability (that is, the 
degree to which the audience shifts over time from 
show to show). 

Distinctions may also be made between different 
levels of viewing interest. These correspond to the 
difference between the primary audience (the ac- 
tual subscribers or purchasers) of print media and 
the secondary, or “pass-along,” audience. As more 
television families have come to own more than 
one television set—21 per cent in the United States 
as of August 1965 (Advertising Research Founda- 
tion 1965, p. 10)—-viewing has tended to get more 
and more remote from its original character as a 
group experience. This is comparable to the evolu- 
tion of radio from its central position in the family 
living room to a personal, highly portable instru- 
ment, The degree to which people like individual 
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programs does not show a consistent relationship 
to audience size, as measured by ratings. 


Print-audience measurement 


The evolution of broadcast-audience research 
had a profound effect upon the study of print 
media, in which the established criteria of meas- 
urement involved that most tangible of yardsticks, 
paid circulation. 

The drive for measurement of audiences for 
print media was, as in broadcasting, a by-product 
of the sale of advertising. The editorial vitality or 
political influence of a publication, like that of a 
broadcast program, has generally reflected, not its 
size, but rather the fire, talent, and ingenuity that 
go into it. Advertising, however, has tended to be 
most responsive to sheer numbers, although en- 
vironment is of course not neglected. But circula- 
tion alone is no longer generally considered an 
adequate yardstick of a publication’s appeal. In the 
United States a large part of the circulation of print 
media is today delivered to the people who buy their 
subscriptions on a long-term basis, often at vastly 
reduced rates. This stems from the continuous 
drive to gain greater and greater numbers of 
“guaranteed” readers to sell to the advertisers. 

Years after the newspapers and magazines of 
the United States had found advertisers satisfied 
with audited paid circulation as the basis for com- 
paring one publication with another, they suddenly 
found that the audience figures for radio programs 
were of a magnitude that few publications could 
match, The term “audience” was applied to print 
media in an effort to match these large figures by 
demonstrating that a newspaper or magazine nor- 
mally had several readers for every copy printed 
and placed in circulation. The rise of mass-circu- 
lation magazines, particularly the picture maga- 
zines with large “pass-along” or casual readership, 
made it possible to use the general opinion survey 
as a way to locate readers and to compare their 
total numbers from one publication to the next. 
It was through these competitive demands of Amer- 
ican advertising media that audience became a 
commonplace concept in the field of communica- 
tions research. The overwhelming preponderance 
of effort and expenditure in the study of mass 
media in Europe and the United States has been 
devoted to the measurement of the size and char- 
acteristics of media audiences. In the United States 
a number of “syndicated services” report on a vari- 
ety of different media through periodic surveys. 
Personal interviews and self-administered ques- 
tionnaires have been used to measure the extent to 
which different magazines are read by the same 


people or to which individual programs are watched 
by readers of certain periodicals. 

In an economy where the mass communications 
media are operated privately for profit, measure- 
ments of audience size are inevitably the basis of 
decisions that affect media content. Other things 
being equal, most mass media operators seek to 
broaden rather than restrict their coverage. They 
would prefer that their influence be greater rather 
than less. 


Differentiation of audiences 


“Audience” has been commonly used as a term 
that aggregates units of equivalent value, so that 
a major research interest has been to count these 
units rather than to differentiate them. Audience 
figures for print and broadcast media are dealt with 
comparatively, as though communication in space 
and time could somehow be reduced to a common 
basis, and audience figures for each medium have 
been lumped together without regard to the setting 
in which different people experience each one or 
the different qualities of communication each may 
represent to different groups. 

In the United States one of the problems in eval- 
uating audience data on the mass media is that 
surveys are not usually designed to permit sub- 
groups to be distinguished except in a very coarse 
way. Survey samples are usually not large enough 
to isolate the small segments of the population that 
represent the true market for elite culture. The 
conventional way of dividing an audience into in- 
come or education groups may isolate the upper 
fourth or the upper fifth under each heading, while 
the key target of elite culture may be no more than 
the top 1 or 2 per cent. When these people are 
averaged in with the others the results may appear 
to suggest that income and education have less 
effect on audience taste or choice than they in fact 
do. For example, a study made by W. R. Simmons 
for Harper-—Atlantic Sales, Inc., in 1965 found that 
readers of Harpers magazine differ markedly in 
travel and consumption styles from other persons 
of “identical” income, education, etc. (Blair 1965). 

Media audiences may be differentiated at three 
levels: exposure to the medium, to the vehicle, and 
to its content. At the first level an» analyst must 
deal largely in terms of broad-scale demographic 
generalizations. Certain kinds of people may be 
said to be strong newspaper readers or television 
watchers or the first to acquire a television set when 
the medium is new. Within this framework it be- 
comes necessary to further differentiate the audi- 
ences for individual publications or programs, 
whose exposure patterns may be quite different 


from those of the average user of the medium. 
Further, different kinds of people vary in their 
selective attention to the features or articles of a 
periodical or to the segments of a broadcast; a pub- 
lication will not be read with the same degree of 
thoroughness by every reader and not every item 
in it will be read by the same kinds of people. 

Media audiences or publics are often thought of 
as mutually exclusive (as in the case of programs 
which come on the air at the same time on dif- 
ferent stations or of rival periodicals with radically 
opposite political viewpoints), But far more com- 
monly, mass media audiences really consist of dif- 
ferent aggregations or combinations, into which 
the same people may be shuffled, regrouped, or 
subdivided. Such subdivisions may occur for many 
different reasons. For instance, they may reflect 
stages in the life cycle. In an advanced media cul- 
ture young children are exposed to broadcast media 
before print media and begin to select their own 
radio and television programs before their parents 
begin to read newspaper items or cartoon captions 
aloud to them. In the United States 44 per cent are 
reading newspapers themselves by the age of seven 
(Schramm et al, 1960), 

Studies done by the Japanese Ministry of Edu- 
cation have shown that 11 per cent of the children 
in Tokyo were watching television by the age of 
two, 40 per cent were watching by the age of three, 
and among five-year-old children the percentage 
rose to 96 (Furu 1965, p. 61). As children mature, 
both their media habits and their preferences for 
different types of media content undergo substan- 
tial change. This is illustrated in a classic study of 
small children’s shift from animal to human fan- 
tasy in comic book preferences (Wolf & Fiske 1949), 
as well as in numerous studies of shifting prefer- 
ences for various types of television program con- 
tent (summarized in Bogart 1956). 

Adults also differ in their characteristic media 
patterns, For example, a study of workers in Lenin- 
grad, reported by Durham (1965), found weekly 
television viewing among young adults to be under 
five hours a week, while older persons watched 
nearly two hours more per week. A study by the 
English Institute of Practitioners in Advertising 
found that 44 per cent of the 16-24 age group saw 
a film at least once a week, as compared with only 
5 per cent among those over 65 (Hall & Whannel 
[1964] 1965, p. 346). The Newsprint Information 
Committee’s study of newspaper reading in the 
United States found that the percentage reading a 
paper on the average weekday increased from 72 
per cent among teen-agers to 83 per cent in the 
40-54 age group and then declined to 78 per cent 


COMMUNICATION, MASS: Audiences 77 


among those 55 and older (Audits and Surveys 
Company, Inc. 1961). A high proportion of popular 
recordings are bought by adolescents, particularly 
in the affluent postwar society of the United States 
or western Europe. The volatile musical tastes of 
the adolescent subculture have thus come to domi- 
nate popular music in general. 

Audience differences may also reflect the mere 
distribution of access to a medium and the eco- 
nomic forces that affect it. Thus, in a country 
where television sets are expensive and radios are 
relatively cheap, the typical audiences for television 
and radio will have different incomes. The audi- 
ences for different media may also be sharply dif- 
ferentiated by social class; indeed, social class may 
be one determinant of the degree of reliance on 
different media. For example, Steiner (1963, p. 
75) reports average daily television viewing in the 
United States ranges from under 3 hours among 
the college educated to nearly 44 hours among 
those with less than four years of high school. 
There may also be ethnic or religious differences 
in the audiences of different media vehicles. Ac- 
cording to an unpublished survey made by Data 
Inc. in 1964, each of the three afternoon papers 
then published in New York drew its readers pre- 
ponderantly from the members of a different 
religious group. 

Moreover, people who are identical in income 
and education may choose different media for psy- 
chological reasons. For example, it has been found 
that blue-collar workers who watch educational tele- 
vision are more apt than their neighbors to identify 
themselves as members of the middle class, as 
“opinion leaders,” and as individuals with cultural 
and civic interests (Schramm et al, 1963), 

Broadly speaking, the great mass media in most 
countries strive for popular appeal to all sectors of 
society. But in practice their appeal is differenti- 
ated both with respect to specific media vehicles 
(individual programs, films, publications) and to 
the different types of content (sports, finance, 
food, fashion) presented by each medium. Curi- 
ously, since television is a pastime activity, the 
number of hours spent viewing appears to be inde- 
pendent of the number of different programs avail- 
able at the same time (Bogart [1956] 1958, chapter 
4). The differentiation of media audiences cannot, 
therefore, be explained wholly in terms of economic 
opportunity and direct consumption cost; the role 
of different tastes and interests is equally important. 

There are also variations in physical opportunity 
for exposure to media—variations that correspond 
to different specialized sex, age, or interest roles, 
A magazine sold only at the check-out counter of 
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supermarkets is bought primarily by the shopping 
housewife; a television program that comes on in 
the late afternoon, when the mother is preparing 
dinner and the father has not yet returned from 
work, attracts a great many young children regard- 
less of its content. In Muslim countries religious 
tradition keeps most women, apart from the urban 
elite, out of the motion picture theater. 

Content and physical opportunity are likely to 
complement each other. Broadcast programs aimed 
at uplifting the morale of the housewife are placed 
on the air in the morning precisely because the 
housewives will make up the bulk of the listeners 
or viewers. 

But content also determines audience appeal, 
inasmuch as there may be a symbolic identification 
between the roles or interests of the audience and 
those of the protagonists and characters, fictional 
or real, Clearly, the level of taste or intellectual 
complexity of media content will also influence 
audience composition, Everyone is more or less 
conscious of his own interests and exposes himself 
to the mass media accordingly, In effect, people 
read the publications and watch or listen to the 
programs that are most in conformity with their 
tastes, predilections, and opinions (Berelson et al. 
1954, chapter 11). But the audience is also selec- 
tive at a level below the threshold of consciousness, 
inasmuch as it selectively perceives and retains in 
its memory the most relevant, congruent, or com- 
fortable part of the media content to which it is 
exposed [see COMMUNICATION, MASS, article on 
EFFECTS; see also Bogart & Tolley 1964]. 

Audiences may also be differentiated in terms of 
political orientation. This has been a traditional 
function of the press in most democratic countries, 
where a variety of viewpoints are available. In 
much of the Western world, choice of a newspaper 
is an act of political self-expression. At the same 
time, it may indeed relate to social status just as 
political party alignments do. The reader of The 
Times of London, of Le Monde of Paris, of Rio de 
Janeiro’s Correio da Manhã is apt to be conserva- 
tive not merely because of what is on the editorial 
pages of those papers but because their total 
substance and tone appeals to the conservative 
business communities of those cities. In the United 
States the reduction of competition in all but the 
very largest cities has altered the traditional role 
of newspapers as organs of conflicting opinion. 
But many newspapers that enjoy a monopoly posi- 
tion offer a diversity of viewpoints through the 
syndicated columnists on their editorial or maga- 
zine pages. 

All the mass media have in common an ability 
to provide information, to stimulate or to relax, to 


pass time or to kill it, to persuade, to act as rallying 
points for diverse bodies of taste and opinion, There 
are, however, profound differences between print 
and the audio-visual media, with regard to their 
communicatory capacities. 

In the time-bound media (broadcasting and film) 
the audience has no control over the rate of input 
as it does with print, Thus, the selective separation 
of irrelevant “noise” from meaningful information 
takes place at a slower pace and with more focused 
attention. The reader must “work harder” than the 
listener or viewer, but he remains in control and 
sets his own pace. This helps explain why print- 
oriented people tend to be better educated than 
those who are broadcast-oriented, both in the hey- 
day of radio and in the age of television. It may 
also explain why empathy, identification, and 
catharsis appear to be the peculiar properties of 
the time-bound media, while reflection, internal 
rehearsal, and fantasy are the attributes of the 
print media (Wright 1960). 


Mass culture and mass media 


By common definition, mass media are those 
that attract people over a broad range of intellec- 
tual or cultural levels, Yet media-audience differen- 
tiation often carries an implicit classification of 
media content as “mass” or “elite” in terms of 
classical aesthetic canons. In this area, it is im- 
possible to make sharp conceptual distinctions. 

One broad distinction that is often made is be- 
tween generalized and specialized organs of mass 
communication. Under the latter heading might be 
included trade publications, local periodicals, and 
broadcast programs or publications in such rel- 
atively specialized areas as fashion, sports, or edu- 
cation; many such publications or programs may 
reach vast audiences, yet their appeal is consciously 
directed or restricted. A good example of the former 
would be the kind of lavishly produced film epic, 
pioneered by Cecil B. De Mille, that aims to include 
something to suit every taste, while offending none. 
But even this distinction is hard to apply consist- 
ently. If an opera singer cuts a recording of an 
operatic aria in an edition of twenty thousand press- 
ings, this would generally be conceded to fall into 
the realm of elite culture. But if she appears on a 
television variety program and sings the same aria 
before millions of viewing families, this might be 
said to bring the same content into the domain of 
mass culture. Similarly, a scholarly publication 
with five thousand readers might not ordinarily be 
considered a mass medium, but a community 
weekly newspaper with the same circulation would 
fall under that heading. 

It therefore seems more appropriate to charac- 


terize as mass media all media that use techniques 
making for a loss of direct personal communication 
between the communicator and his public, This 
would include all printed publications, including 
some with highly restricted publics: hard-cover 
books, “little” magazines, the specialized organs of 
voluntary associations, corporations, trade unions, 
and other groups. Under the same heading, “art” 
films, educational television, phonograph record- 
ings, FM radio, and other audio-visual media with 
limited appeal would have to be considered mass 
media, in contrast to the plastic and live perform- 
ing arts, even though all of these are a part of elite 
culture. 

The popular culture manifested in the mass 
media is quite distinct from folk culture; for in- 
stance, the existence of a folk culture implies direct 
personal participation in art forms such as music 
or the dance. The “high culture” elite that pre- 
serves the traditional spirit of the arts is to be dis- 
tinguished from the power elite that controls the 
content and distribution of the mass media. Of 
course, it is possible to be a member of both these 
elites at once; the well-educated television producer 
who refuses to have a television set in his house is 
not a wholly mythical figure. 

The relation of the mass media to contemporary 
popular culture is commonly conceived in terms of 
dissernination from the elite (masters of the media) 
to the mass (consumers), There are periods when 
this process is reversed, when popular culture in 
its most vulgar aspects enters into the culture of the 
elite as a display of disenchantment with prevailing 
social forms or symbols. During the eighteenth cen- 
tury it was the utmost chic for the aristocrats of 
the French court to assume the guise of shepherds 
and peasants in their festive outings, as immor- 
talized in the paintings of Boucher and Fragonard. 
Somewhat reminiscent of this studied reversal of 
social-class roles, fashions in the 1960s have in- 
cluded the celebration of the horrendous, the com- 
mercialized cliché, or the recently outmoded under 
the rubric “camp.” The high prices commanded by 
pop art or the craze in fashionable society for the 
discothèque symbolize the transformation of pop- 
ular culture into an elite genre; they offer a coun- 
terpoise to the spectacle of hillbilly pop tunes 
composed in Times Square office buildings by white- 
collar suburbanites. 

Elite culture varies in physical accessibility, ac- 
cording to the medium through which it is diffused. 
Good bookstores, art cinemas, and other specialized 
channels of dissemination tend to be restricted geo- 
graphically to major cities and intellectual centers, 
while periodicals and broadcasts are more wide- 
spread. In most countries, whether or not the broad- 
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casting system is state-run, elite culture on tele- 
vision is confined to the very early or very late parts 
of the broadcast day or to those parts of the week- 
end during which most people do not watch tele- 
vision at all, because of its limited audience appeal. 

In any given week a substantial number of pro- 
gram hours may be devoted to elite-culture mate- 
rial, but as Minow points out (1964), this is 
usually isolated in time, so that the interested 
viewer must search for it. He must expend energy 
to find interesting programs just as he would to 
find the particular books he wants to read, the 
motion pictures he wants to see, or the art exhibits, 
plays, or concerts he wants to attend. Most mem- 
bers of the elite-culture audience presumably do 
not want to expend this kind of energy, because 
they view television as a pastime, much like any- 
one else. At any rate, it is known that high-brows 
on the whole do not watch mainly high-brow shows, 
This has been documented in several studies of the 
educational television audience (see, for instance, 
Schramm et al. 1963), Similarly, more upper- 
income New Yorkers read the Daily News than the 
New York Times (Simmons . . . [1955] 1964, p. 38), 
and among upper-income Londoners, 36 per cent 
read the Daily Express and only 10 per cent The 
Times (Hall & Whannel [1964] 1965, p. 234). 

The relation between critical success and popu- 
lar success is difficult to trace in any mass medium, 
but in television it does not seem to exist at all. 
The reason is that the functions of the critic have 
been transferred to the audience, whose choice of 
program is recorded and analyzed by commercial 
rating organizations such as the A, C. Nielsen Com- 
pany. The influence of these ratings on program 
content can be inferred from the fact that about 
two-thirds of United States network television shows 
are replaced each season on the basis of ratings 
rather than of press notices, 

In a highly developed mass-media culture like 
that of the United States the audiences for all the 
major media reflect the general distribution of pop- 
ulation, even though individual vehicles are directed 
at one sex or age group. Mass-media content tends 
to be pegged at a level that appeals to the largest 
number of people and is correspondingly inoffen- 
sive or uncritical of the established order, Its hall- 
marks are the familiar, the hackneyed, and the 
stereotyped. 

In most countries with a commercial system of 
broadcasting, audience measurements indicate that 
public choice in programming comes very close to 
the entertainment formula established years ago by 
American radio and motion pictures. Countries with 
a state-operated system of broadcasting face the 
continuing problem of balancing programs aimed 
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at social uplift with others that will assure the 
presence, night in and night out, of a large body 
of attentive viewers, This is complicated when 
commercial and state-operated or educational chan- 
nels exist side by side. A kind of Gresham’s law of 
popular preference operates to make no more than 
a minority available for an elite-interest or educa- 
tional program when it competes with less demand- 
ing amusements. The BBC’s famous “Third Pro- 
gramme” draws 1 per cent of the audience (Paulu 
1961). 

At the same time, the minority audience for elite 
culture purveyed by the mass media may encom- 
pass far more people than would be reached by the 
same content through traditional means or forms. 
Berelson (1964) estimates that the average United 
States college graduate spends nearly twelve hours 
per month “in the presence of culture,” while the 
person who has not completed high school spends 
two or less. He estimates that commercial television 
accounts for 22 per cent of the total amount of 
time spent in this way by all segments of the 
population. 

Educational television in the United States has 
not been able to attain massive audiences in com- 
petition with commercial television, Only 10 per 
cent of the people in areas covered by ETV watch 
it at all in the course of a week (National Educa- 
tional Television and Radio Center 1966, p. 7). 
Yet, in many countries ETV has demonstrated its 
value both as an adjunct to regular classroom in- 
struction in schools and universities and as a means 
of independent adult education. In eastern Europe 
substantial audiences are assured for educational 
telecasts at times when they are the only programs 
available to the viewer. 

In commercial broadcasting, advertising may 
become an important influence over programming 
decisions, since it is in the interest of broadcasters 
and advertisers alike to attract large audiences to 
the advertising messages and so obtain the largest 
possible yield for the advertiser's investment. Ad- 
vertising also occupies a substantial amount of 
broadcast time and thereby becomes an important 
element in the actual symbolic content of the 
medium. The information advertising conveys and 
the fantasies and desires it generates are also sig- 
nificant factors in the substance of print publica- 
tions throughout the noncommunist world, As 
McLuhan (1964) suggests, advertisements are 
often prepared with more thought than the mate- 
rial that surrounds them. 

The mass media have brought the members of 
diverse societies into a common universe of dis- 
course and have been an increasingly powerful force 


for the international unification of ideas and values. 
Certain symbols (Mickey Mouse, the Shell sign, the 
cowboy) have been diffused and universalized to 
the point where they lack any specific cultural 
reference. Schramm (1959), analyzing fourteen 
prestige newspapers around the world, found the 
proportion of news originating outside the country 
or area ranged between 14 per cent and 50 per 
cent, with a median of 32 per cent. Even though 
news media in nearly all countries are highly paro- 
chial and nationalistic in perspective, major news 
events are perceived as having broad general sig- 
nificance, regardless of their immediate relevance 
to the particular country. 

But if the mass media tend to break down bar- 
riers to international understanding, they also tend 
to create new ones. The media have made possible 
the emergence of specialized bodies of lore and in- 
formation which are by-products of an increasingly 
complex technology and a more intensive division 
of labor. 

The growth of the mass media in Europe and 
the United States makes increasing demands on 
leisure time, which is itself increasing. This tends 
to deepen the chasm between the commonplace 
and the exceptional, between the popular culture 
of comic strips, pop records, and soap opera, and 
the elite culture of fine opera, drama, poetry, and 
the visual arts, At the same time, the broad-scale 
dissemination of the elite culture through the mass 
media makes its character at least superficially fa- 
miliar to vast populations to whom it would other- 
wise be utterly unknown. 

LEO BOGART 


[See also FILM; JOURNALISM; MASS SOCIETY.) 
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v 
EFFECTS 


The effects, alleged effects, and possible effects 
of mass communication are numerous and varied, 
direct and indirect. The development and extent 
of modern radio and television networks and the 
growth of the media industries have obviously af- 
fected the technological and economic nature of 
modern society and have nurtured revolutions in 
the use of leisure time and in processes ranging 
from pedagogy to marketing. The mere classifica- 
tion of the effects of mass communication presents 
a formidable methodological problem, which has 
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as yet defied attempts to construct exhaustive sche- 
mata. 

No such range of effects can be discussed within 
the scope of this article. The effects that are here 
considered are those with which the behavioral 
sciences are perhaps most concerned, namely, the 
effects of modern mass communication upon the 
attitudes and behavior of its audiences. 

Even within this area, certain exclusions have 
been necessitated by limitations of space and by 
the specialized treatment which such discussions 
would require. Thus, the article does not deal with 
the use of mass media in formal pedagogy, whether 
presented in classrooms or on closed or open 
broadcast systems. Relatively little mention is made 
of either the effects of mass communication in 
international psychological warfare or the effects 
of domestic mass communication in countries 
other than the United States; the various and dis- 
similar ways in which foreign communications 
systems are organized and controlled, and the dif- 
ferent cultural milieus in which their audiences 
exist, preclude casual generalization of the proc- 
esses here described to foreign arenas. 

Finally, no attempt is here made to deal with 
the effects of mass media as instruments of con- 
sumer advertising. The goals of such advertising 
and the psychological significance to the audience 
member of the decisions involved are often so 
unlike the goals and decisions of the kind of per- 
Suasive communications here discussed that gener- 
alizations from the one topical area to the other 
are hardly permissible. 

The article will first consider the effects of mass 
communication as an agent of persuasion. The 
principles established by findings in this area will 
be found to be reflected in the topical areas here 
discussed, namely, the effects of mass media upon 
audience views of public issues, the effects of mass 
media upon public taste, and the effects of depic- 
tions of crime and violence. Some of the vast areas 
of ignorance about the effects of mass communica- 
tion are noted, along with some suggestions as to 
how some of them might be explored. 

The research literature here cited includes both 
surveys and laboratory studies that bear upon the 
press, motion pictures, radio, television, and, in 
some cases, laboratory approximations of such 
media. No attempt is made to indicate precisely 
what medium is under discussion in reference to 
every cited finding. Many of the studies bear on 
mass communication as a whole rather than on 
any one medium, and in any case the findings here 
cited are believed by the author to be applicable, 


except when otherwise noted, to any or all of the 
media. 


Persuasive mass communication 


It has been repeatedly demonstrated that by far 
the most common effect of mass communication 
is to reinforce its audience's pre-existing interests, 
attitudes, and behavior and that the least common 
effect is to convert audience attitudes and behavior. 
In addition, pertinent attitudes of a considerable 
proportion of the audience are modified in intensity 
or salience but not in direction, that is, not to the 
point of conversion. 

The relative frequency of these types of attitude 
change was first documented in a now classic study 
(Lazarsfeld et al. 1944) of the origin and fate of 
vote intentions during the 1940 U.S. presidential 
campaign. Employing panel techniques, these re- 
searchers interviewed several hundred residents of 
Erie County, Ohio, prior to the nominating conven- 
tions and at approximately monthly intervals there- 
after, right up to election day. The study found 
that after exposure to months of mass media cam- 
paigning and partisan propaganda, 53 per cent of 
the respondents voted exactly as they said they 
would in May, before the candidates were even 
nominated. To whatever degree mass communica- 
tion affected these voters, it did so in the service 
of reinforcement. By contrast, less than 10 per cent 
had been converted to the opposition candidate. 
Practically all the other respondents had undergone 
minor attitude changes short of conversion. 

Later research has gone beyond the identification 
of effects and is able to describe much of the 
dynamics of their occurrence. Central to the study 
of these processes is the fact that mass communi- 
cations rarely work directly upon their audiences 
but function within a nexus of other factors that 
mediate the potential effects. These mediating 
factors tend, for the most part, to make mass com- 
munication an agent, but not the sole cause, of 
reinforcement. 

It is not possible within the scope of this article 
to cite and discuss all such factors. Nor, indeed, 
are all of them yet known. Among the most basic 
and most important, however, are audience predis- 
Positions and certain selective processes they en- 
gender; the influence of group norms; the opera- 
tion of a process variously called the two-step flow, 
personal influence, and opinion leadership; and the 
position of mass media in a free enterprise society. 

Predispositions and selective processes. The term 
“predisposition” requires little if any explanation. 
As used here, it carries its common denotation as 


, 


a “tendency toward.” More precisely, “predisposi- 
tion” can be viewed as an attitude “set” that is 
pertinent to the issue at hand and that exists prior 
to the audience member's exposure to whatever 
mass communication is under surveillance. Thus, 
a person may be predisposed to vote Republican, 
to enjoy classical music, or to commit acts of vio- 
lence. It is the substance of such predispositions 
which mass communication tends to reinforce, not 
in and of itself alone but rather as a contributory 
agent, working through the selective processes 
noted below. 

Selective exposure. Research has repeatedly 
shown that people tend to expose themselves to 
mass communication material that is in accord 
with their existing attitudes or interests and to 
avoid communications that express contrary atti- 
tudes or are outside their fields of interest. Persons 
who do not like classical music, for example, are 
most unlikely to expose themselves to broadcasts 
of classical music and will tend to avoid them in 
favor of others more in accord with their existing 
tastes (see, e.g., Suchman 1941). By the same 
token, persons with Republican predispositions are 
more likely to expose themselves to Republican 
campaign propaganda than to Democratic cam- 
paign propaganda, and the same is true of Demo- 
crats (see, e.g., Lazarsfeld et al. 1944 and, in 
reference to television, Schramm & Carter 1959). 
In brief, the audience member uses the medium 
much as he would a large supermarket: he selects 
from its many offerings the fare he knows he likes, 
and he does not as a rule purchase products which 
he dislikes or with which he is unfamiliar. The 
process of selective exposure is thus a process of 
feeding existing appetites and thus reinforcing 
dietary preferences, Applied to mass communica- 
tion, selective exposure minimizes the likelihood 
of the predisposed audience member's being ex- 
posed to the other side of the issue and maximizes 
his continued exposure to sympathetic communi- 
cations. It is therefore a cause of behavior that 
tends to make mass communication a reinforcing 
rather than a converting influence. 

Selective retention. The tendency of audience 
members who have been exposed to unsympathetic 
material to recall it less well than they recall sym- 
pathetic material, and to forget it more rapidly, is 
known as selective retention. The phenomenon was 
well documented as early as 1943, when Levine 
and Murphy observed such effects following the 
exposure of students to political tracts variously 
supporting and attacking the students’ own points 
of view. The occurrence of selective retention in 
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at least some degree has been widely regarded as 
obvious and has aroused little research curiosity 
in recent years. Its occurrence continues to be 
documented, however, among the findings of 
studies devoted to other objectives (e.g., Zimmer- 
man & Bauer 1956, later replicated by Schramm 
without published report). 

This kind of subconscious erasure obviously 
tends, like selective exposure, to reinforce the 
audience member's existing views and to minimize 
the likelihood of his being converted. 

Selective perception. The audience member's 
court of last resort is selective perception, or the 
tendency to perceive material as sympathetic to 
his predispositions, even when the material pre- 
sented is not, in fact, sympathetic. Since it occurs 
both concurrently with and after exposure to un- 
sympathetic communication, it is hard to disen- 
tangle chronologically from selective retention. 
Having been exposed to unsympathetic communi- 
cations that they recall reasonably well, audiences 
tend so to distort the content as to render it, in 
their own minds, supportive (and thus reinforcive), 
even though it is in fact a statement of opposition. 
Thus, communications designed to advance racial 
tolerance have been perceived by prejudiced per- 
sons as affirmations of ethnic inequality (Columbia 
University . . . 1946; Cooper & Jahoda 1947). In the 
course of informal inquiry, the present author 
personally observed that a poster proclaiming “It 
takes all races to make our city run” was perceived 
by prejudiced persons as a confirmation of their 
views that minority group members were needed 
to perform the menial chores necessary for the 
majority groups to live in comfort. More recently, 
persons who witnessed the Kennedy—Nixon debates 
were found able to recall statements or views with 
which they disagreed but to assign such statements 
to the candidate they shunned, regardless of which 
candidate actually made the statement (Katz & 
Feldman 1962). Thus, selective perception, like 
selective exposure and selective retention, is an- 
other phenomenon that mediates the effects of 
mass communication and does so in the direction 
of reinforcement. 

Group norms. Social psychologists have re- 
peatedly demonstrated the profound influence that 
group norms assert upon the attitudes and opinions 
of individuals who are or wish to be members of 
the group in question (an excellent review of the 
basic pertinent literature will be found in Katz & 
Lazarsfeld 1955). 

Briefly, membership in groups has been found 
to serve as an anchoring point of many attitudes 
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and, for the most part, to act in a variety of ways 
as a deterrent to opinion change. Both formal and 
informal groups tend to possess or develop consid- 
erable homogeneity of values, attitudes, and opin- 
ions in reference to specific issues. Persuasive mass 
communications that take positions alien to those 
of the group are therefore likely to be resisted by 
the individual member, not only because they are 
in conflict with his own predispositions but also 
because conversion would endanger such psycho- 
logical and social benefits as he derives from good 
standing in the group. Discussion of issues with 
other group members has been found likely to 
reinforce both the individual's original attitudes 
and his resistance to the unsympathetic communi- 
cation. Such discussion has also been shown to 
exercise a “refresher” function: by making the 
norms more conspicuous, the discussion serves to 
draw unconscious drifters back to orthodoxy (Mc- 
Keachie 1954). 

By way of example, predispositions on the part 
of respondents toward opinions characteristic of 
their family groups, and even a readiness to adhere 
to a given point of view for no other reason than 
that it is family-anchored, have been noted in 
various studies, including the previously mentioned 
investigation of the 1940 U.S. election campaign 
by Lazarsfeld and others (1944). So strong was 
the tendency toward family homogeneity found in 
this study that “only 4% of the 413 panel members 
who voted claimed that someone in their families 
had voted differently” (Lazarsfeld et al. [1944] 
1960, p. 141). Similar if less marked homogeneity 
was found among co-workers and among comem- 
bers of formal organizations. 

It would seem logical that the strength of these 
tendencies would be correlated both with the sali- 
ence of the question to the group and with the 
degree to which the audience member values his 
membership. Pertinent research, however, has 
indicated that this is not always the case. Experi- 
ments are reported, for example, in which the 
correlation appears in reference to one issue and 
one group but fails to appear in reference to 
another issue of presumably equal salience or 
among members of a presumably similar group 
(Kelley 1955; Kelley & Volkart 1952). To the best 
knowledge of the author, this anomaly has not yet 
been clarified or resolved. 

The two-step flow of communication. A phe- 
nomenon often integral to the influence of mass 
communication has been called personal influence, 
interpersonal influence, the two-step flow of com- 
munication, and opinion leadership. The process 


and its history are described in detail by Katz 
(1957) and will here be treated only as necessary 
to establish its influence as a mediator of mass 
communication effects. 

The important aspect of the phenomenon, from 
this point of view, is that in reference to decisions 
in various areas of attitude and behavior, people 
have been found to be influenced by specific other 
individuals as strongly or more strongly than they 
have been influenced by mass communication. 
These “others” have variously been termed opinion 
leaders, gatekeepers, influentials, initiators, inno- 
vators, and taste makers. They do not usually oc- 
cupy formal positions of leadership, being charac- 
teristically neither political officeholders, teachers, 
preachers, nor any other type of elite. Their chief 
demographic characteristic is that they have no 
distinctive demographic characteristics. They are 
simply people who, consciously or unconsciously, 
critically influence the attitudes and opinions of a 
handful of people demographically like themselves. 
Their following is characteristically small, involv- 
ing only a handful of persons; their influence is 
typically exerted in informal face-to-face discourse 
and may or may not be purposive; and their leader- 
ship is characteristically exercised within a limited 
topical sphere. The opinion leader in reference to 
public issues, for example, is not likely to be an 
opinion leader in reference to aesthetic issues or 
in reference to fashion. 

Where opinion leadership exists, it has been 
found typically much involved with mass commu- 
nications. Thus, the opinion leader is usually more 
exposed to topically appropriate mass communica- 
tions than are his followers, to whom he may or 
may not pass on the information he has gleaned. 
(This capability is reflected in the term “gate- 
keeper” and in the concept of the two-step flow of 
communication.) The opinion leader may direct 
followers to mass communication to document or 
promote his opinions or, vice versa, he may inter- 
pret to his followers what mass communication 
on a given topic really means. The essential point 
is that where personal influence exists, it often 
mediates the effects of mass communication, 

The direction of this mediation was long ob- 
scured by the fact that the phenomenon first came 
under research scrutiny in reference to change: 
changes in vote intention, changes in fashion, the 
adoption of new agricultural techniques, and the 
like. Studies of the role of the opinion leader in 
such circumstances led to his being widely if im- 
plicitly conceived as primarily an agent of change. 
It has recently become apparent, however, that per- 


sonal influence probably functions more often as 
an agent of stasis than as an agent of change. 
Thus, the opinion leader has been found to be an 
embodiment of group norms, whose leadership is 
effective precisely because it provides satisfaction 
in terms of the shared values of the informal groups 
to which both leader and follower belong. Even 
the nature of personal influence in certain proc- 
esses of change appears to be more a matter of 
counteracting departures from the norm than of 
initiating such departures: for example, personal 
influence in changing vote intentions appears to 
be primarily exerted on persons who had previously 
changed away from group norms, and the effective 
exercise of opinion leadership apparently serves in 
the main to bring such persons back into homo- 
geneity with the group. In sum, recent re-evaluation 
of earlier research suggests that opinion leadership, 
or interpersonal influence, probably works in the 
direction of stability and reinforcement more often 
than it does in the interests of change. Insofar as 
this is true, and insofar as it touches on mass 
communication, personal influence (or opinion 
leadership, or the two-step flow) appears to be 
another of the factors that mediate the effects of 
mass communication in such a way as to render 
the media primarily agents of reinforcement, [See 
DIFFUSION, article on INTERPERSONAL INFLUENCE. ] 

Mass media in a free enterprise society. Mass 
media that function as profit-making enterprises 
in a free, competitive economy must strive to please 
enormous and highly variegated audiences and to 
avoid offending any significant proportion of the 
population. Although a sizable portion of media 
offerings—particularly news, commentaries, docu- 
mentaries, and other informational programs—deal 
with highly controversial subjects, the major por- 
tion of mass media offerings are designed to serve 
an entertainment function. These entertainment 
products generally tend to avoid controversial is- 
sues and, implicitly or explicitly, to reflect beliefs 
and values already sanctified by the mass audience. 
To whatever degree this occurs, the media them- 
selves serve to reinforce such beliefs and values. 
This course is the more likely to be followed by 
those media, such as television networks, whose 
investment and production costs are high. Small- 
budget media, such as specialized magazines or 
individual radio stations, can afford to address 
themselves exclusively to selected, small audiences, 
but national television networks cannot do so and 
remain economically viable. Put another way, mass 
communication that is profit-oriented must be 
directed primarily to mass tastes and must reflect 
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mass values. When a mass medium ceases to do 
this, it ceases to be a mass medium. 

The creation of opinion on new issues. Rein- 
forcement and conversion can, of course, occur 
only where there is an opinion to reinforce or 
oppose. It cannot occur in the absence of opinion. 
Although there has been relatively little research 
on the subject, the media appear to be extremely 
effective in creating opinions. By way of a com- 
mon-sense example, a few months before Fidel 
Castro came to power, probably less than 2 per 
cent of the American people so much as knew his 
name, let alone his political leanings. A year there- 
after, however, the American public knew a great 
deal about him and his political behavior and were 
rather homogeneous in their opinions about him. 
The source of their knowledge and the bases of 
their opinions were obviously restricted, for all 
practical purposes, to the mass media. 

It is not difficult to see why the mass media are 
extremely effective in creating opinions on new 
issues. In such a situation the audiences have no 
existing opinions to be guarded by the conscious 
or subconscious play of selective exposure, selec- 
tive retention, or selective perception. Their refer- 
ence groups are likewise without opinion, and 
opinion leaders are not yet ready to lead. In short, 
the factors that ordinarily render mass communi- 
cation an agent of reinforcement are inoperative, 
and the media are thus able to work directly upon 
their audiences. The attitudes that the audiences 
derive from the media thereafter enjoy the rein- 
forcive effects described above. 


Areas of public concern 


Persuasion on public issues. Data revealing the 
vast predominance of reinforcement over conver- 
sion in the area of political persuasion have been 
noted above (see Lazarsfeld et al. 1944). Similar 
findings have emerged from research on the influ- 
ence of mass communication in many political 
campaigns (see especially Berelson et al. 1954). 
A modest study pursued by Schramm and Carter 
(1959) revealed that the audience for a Republi- 
can-sponsored election eve telecast contained twice 
as many Republicans as Democrats and succeeded 
in changing the vote intention of only one person 
among the hundred or so respondents. Star and 
Hughes (1950) found that a multimedia campaign 
designed to increase knowledge and approval of 
the United Nations attracted an audience consist- 
ing for the most part of persons who knew a good 
deal about the United Nations and were already 
favorably disposed toward it. Wiebe (1958) found 
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that the broadcasts of the Army—McCarthy hearings 
were selectively perceived by most of the audience, 
whose existing opinions about the senator and the 
issues were accordingly reinforced. Other similar 
findings are numerous. 

It must not be assumed, however, that reinforce- 
ment, which involves no change in opinion, is 
synonymous with lack of effect. Reinforcement is 
an effect, and an extremely important one. It 
strengthens the resolve in question, produces a 
type of immunity to counterpropaganda, and nurses 
straying sheep back into the fold (see, for instance, 
Lazarsfeld et al. 1944; Berelson et al. 1954; Katz & 
Feldman 1962). 

Furthermore, none of the foregoing is to gainsay 
the fact that conversions do occur. It is, rather, 
to suggest that conversion occurs infrequently and 
usually under very special conditions, and it is 
often a kind of indirect reinforcement. In brief, 
convertees are often found to have been, as it were, 
predisposed to conversion. An excellent example 
occurs in research literature bearing upon the ef- 
fects of Voice of America broadcasts to European 
satellite countries (International Public Opinion 
Research, Inc. 1953). Defectors from these coun- 
tries to the “free world” were found, in many in- 
stances, to have lived for years in relative content- 
ment under Stalinistic communism and either not 
to have listened to the Voice of America enough to 
make a difference or in any case to have been 
unaffected by the broadcasts. At first blush, it 
appeared that exposure to the Voice of America 
had suddenly become effective among persons for 
whom it had been previously ineffective and had 
rapidly made them not only intense anticommu- 
nists but warm and knowledgeable exponents of 
Western views as well. Deeper research revealed 
that many of them had become disaffected for 
wholly nonpolitical reasons. Some, for example, 
were young men facing the draft. Some were fac- 
tory managers who had been demoted for not 
achieving their production quotas. The future of 
life at home had suddenly become unattractive. 
Seeking a better future, they began searching 
about. At this point the Voice of America became 
a convenient and effective agent of conversion. 
Put in other words, these people had become pre- 
disposed to change, and the Voice of America was 
available to help them implement the change. 
Somewhat similar findings have been reported in 
reference to the effect of Allied propaganda ap- 
peals to Nazi soldiers (Shils & Janowitz 1948) 
and in reference to the effect of UN information 
campaigns upon communist troops participating in 
the Korean conflict (Schramm & Riley 1951). 


Public taste. Concern over the effects of mass 
communication on the level of public taste has 
been expressed in reference to virtually all Ameri- 
can mass media. Social critics have variously feared 
that mass communication would hinder the devel- 
opment of good taste among children and debase 
the existing tastes of adults. The mass media have 
also been called upon to assume the primary re- 
sponsibility for developing the aesthetic tastes of 
their audiences. 

Research indicates that mass communication, in 
and of itself, neither raises nor lowers the tastes 
of any significant portion of the audience (see, 
for example, Lazarsfeld 1940; Suchman 1941; 
Wolf & Fiske 1949; Klapper 1949; Himmelweit 
et al. 1958). Here again, as in reference to persua- 
sion, mass communication serves primarily as an 
agent that reinforces whatever levels of aesthetic 
taste its audience members bring to it. Persons 
who do not like classical music, or do not like 
programs on relatively high cultural levels, simply 
do not expose themselves to such programs and 
are in fact often unaware of their existence. In- 
creasing the incidence of such programs is thus 
unlikely to change the tastes of any significant 
portion of the audience. 

Research reveals, however, that given a predis- 
position to develop cultural and aesthetic tastes, 
people can and have done so by watching or listen- 
ing to appropriate media offerings. A now classic 
study (Suchman 1941) revealed, for example, that 
persons who were not particularly interested in 
classical music were sometimes impelled to listen 
to it on the air because they felt that their igno- 
rance was inappropriate to some status they had 
recently attained. Their motivation to begin listen- 
ing to the classical music was totally extraneous 
to the aesthetic quality of the music itself. By and 
by, however, they came to like the music for its 
own sake, 

The interesting question also arises as to whether 
group norms serve to debase aesthetic tastes. It is 
conceivable, for example, that some persons who 
like media material on a high aesthetic level may 
find such tastes a subject of ridicule and thus a 
social handicap in groups that are otherwise con- 
genial. The norms of such groups may demand 
familiarity with material of a lower quality. The 
individuals concerned may therefore undertake to 
view such aesthetically inferior material at the 
cost of the higher-quality material they prefer, and 
in the course of time they may come to like such 
material for its own sake. It seems unlikely that 
such a process would actually kill anyone’s taste 
for aesthetic material, but it might decrease his 


exposure to it. The phenomenon, if it occurs at all, 
does not seem so likely to involve an actual nega- 
tion of existing tastes as, rather, to produce a 
broadening of tastes in a downward direction. Sev- 
eral authors have suggested, in speculative essays, 
that some such process occurs (e.g., Friedan 1963; 
Tamiment Institute 1961). Others (in particular 
Steiner 1963; Berelson 1964; Wilensky 1964) have 
documented the universal participation, by intel- 
lectuals as well as nonintellectuals, in mass culture 
and mass media products. But no research known 
to the author really faces the question of whether 
any change in the communication consumption 
habits of cultural elites has occurred since the 
development of the media of mass entertainment. 

In any case, however, it is clear that mass com- 
munication serves for the most part to reinforce 
existing tastes. While it is not usually the prime 
mover, it can play a major role in taste develop- 
ment by providing a means of implementing such 
predisposition to change as may derive from extra- 
media sources. Accordingly, research into the oper- 
ation of these extramedia sources would appear to 
be a prerequisite, or at least a necessary and inte- 
gral part, of any campaign aimed at the develop- 
ment of audience taste. 

Crime and violence. The American public, and 
indeed other publics as well, has always feared that 
media depictions of crime and violence would lead 
children to commit violence or at least to regard 
its commission with equanimity. The concern has 
been expressed not only in reference to television 
but, before that, in reference to comic books, 
movies, radio, and even newspapers. 

The effect of depictions of violence on attitudes 
and behavior (as opposed to immediate emotional 
effects) has been the topic of inquiry of several 
studies, and additional studies are now being pur- 
sued. Prior to about 1960, this research typically 
consisted of surveys in which children who were 
heavy media users or heavy consumers of media 
depictions of crime and violence were compared 
with children who did not rate heavily on either 
count. These surveys typically resulted in one of 
two sets of findings. Either the two groups of chil- 
dren were found not to differ in reference to such 
criteria as were employed in the study, or the heavy 
users were found to differ in regard to characteris- 
tics that for the most part antedated or were irrele- 
vant to their exposure to the media material. Thus, 
an elaborate study of the effects of television on 
British children (Himmelweit et al. 1958, p. 215) 
“found no more aggressive, maladjusted, or delin- 
quent behavior among viewers than among [non- 
viewing] controls.” These findings are in accord 
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with earlier studies, for example, that of Lewin 
(1953), who found no differences among heavy 
and light violence consumers in tendencies toward 
delinquency, school attitude or achievement, or 
conduct. 

As noted above, other studies did find differences 
between heavy and light consumers of violence. 
Ricciuti (1951), for example, found that children 
who particularly liked programs containing crime 
and violence scored somewhat lower than their 
colleagues on IQ and school achievement and were 
slightly below average in general happiness and 
interpersonal adjustments. At the same time, 
Ricciuti found no differences between the groups 
in nervous habits, fears, daydreams, and frustra- 
tion reactions. A complex study by Bailyn (1959) 
revealed that certain types of depicted violence 
were particularly liked by boys (not girls) who 
were both highly exposed to such depictions and 
who exhibited certain psychological characteristics, 
including numerous problems relative to them- 
selves, their family, and their peers, as well as a 
tendency to blame difficulties on others rather than 
themselves. Bailyn considers that these factors 
antedate the children’s partiality for the aggressive 
material and thus “would not logically be thought 
of as effects of exposure” (ibid., p. 3). She notes 
that these psychological characteristics are very 
similar to traits that were previously found by 
Sheldon Glueck and Eleanor Glueck to play a role 
in the “causal complex of delinquency” (ibid., 
p. 37). She points out, however, that these char- 
acteristics do not in themselves differentiate be- 
tween high and low exposure and that it is only 
those boys who are both highly exposed and pos- 
sess these characteristics who are especially partial 
to the media material involved. High exposure, she 
found, was primarily associated with lack of pa- 
rental restrictions on media usage and with low IQ. 

Various other studies suggest that preference for 
crime and violence material is not a cause but a 
correlate of maladjustment. Eleanor Maccoby, for 
example, found that children who were mildly 
frustrated immediately before viewing films re- 
membered more of the aggressive content than 
did the nonfrustrated children (unpublished study, 
described in Bogart 1956, p. 154). 

The most extensive study of television in the 
lives of American children supports the findings 
of the earlier studies cited above and concludes 
“that the kind of child we send to television, rather 
than television itself, is the chief element in delin- 
quency . . . . [The] most that television can do is 
to feed the malignant impulses that already exist” 
(Schramm et al. 1961, pp. 165-166). 
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Within the last few years, the survey literature 
has been supplemented by a host of laboratory 
experiments whose findings are regarded by some 
people as indicating that media depictions of vio- 
lence or aggression elicit aggressive behavior on 
the part of children or young adults. The typical 
procedure involves the exposure of subjects, usu- 
ally for not over ten minutes, to a film clip depict- 
ing “aggression” and thereafter seeking the effect 
of the exposure as manifested by predetermined 
criterional behavior (see, for instance, Bandura 
et al, 1961; Lévaas 1961; Mussen & Rutherford 
1961). 

In almost all the experiments involving child 
subjects, both the stimulus material and the cri- 
terional behavior involve laboratory simulations of 
“aggression” which are so distant from aggression 
in the consensual sense of that term that generali- 
zation to the topics of social concern seems quite 
unjustified. Thus, stimulus materials have in- 
cluded films of adults attacking a plastic doll, 
cartoons in which animals wrestle, and the like. 
Criterional behavior has also typically involved the 
rough handling of toys or even, in one instance, a 
statement that it would be fun to see a balloon 
popped. The occurrence of effects is sought imme- 
diately after exposure, and the persistence of any 
observed effects is very rarely an object of inquiry. 

In one of the experiments which involved chil- 
dren and in which some approach to interpersonal 
aggression was undertaken (Bandura 1963), the 
child subjects were exposed to a stimulus film in 
which one adult, dressed like a child, seized the 
toys of another adult, who was also dressed like 
a child, precipitating a mild fight. A significant 
degree of spontaneous imitative aggression oc- 
curred among children who saw a version of the 
film in which the aggressor won, but not among 
those who saw a version in which the victim was 
triumphant. These data are regarded by Bandura 
and others as an indication that children will imi- 
tate depicted violence if it is shown to be fruitful. 
However, the only study in which children were 
enabled during criterional behavior to “attack” 
each other with rubber daggers or similar play 
weapons revealed no significant increase in violent 
or aggressive behavior, as these terms are com- 
monly understood (Siegel 1956). 

Several experimenters have pursued parallel 
laboratory studies with adult subjects (e.g., Ber- 
kowitz 1962; Berkowitz et al. 1963; Walters et al. 
1962; Walters et al. 1963; Walters & Thomas 
1963). In many of these, the stimulus films por- 
trayed real interpersonal violence. However, the 
criterional behavior involved, in its most “violent” 
form, very small increments in the duration and 


intensity of electric shocks, which the subjects had 
been told to administer in the first place. [See Ac- 
GRESSION, article on PSYCHOLOGICAL ASPECTS.] 

The vast difference between the types of “vio- 
lence” or “aggression” in these studies and the 
commission of real physical violence has led many 
researchers, including the authors of one of the 
studies (Mussen & Rutherford 1961), to deny the 
validity of generalizing the experimental results to 
the likelihood of committing real violence in a 
naturalistic situation. The present author agrees 
with this (see also Hartley 1964 for a review and 
critique of experiments in this tradition). 

Other areas of effect. The scope of this article 
does not permit discussion of all the attitudinal 
and behavioral effects that have been subjects of 
social comment, concern, or, in some cases, hope. 
Prominent among the questions we are thus unable 
to discuss is whether television viewing promotes 
passivity or, conversely, whether it stimulates new 
interests and activities; another common concern 
is that it reduces time spent on reading, In gen- 
eral, the pattern of effect that has been described 
throughout this article appears applicable to these 
other areas. Thus, research indicates that television 
neither changes active children into passive chil- 
dren nor changes passive children into active ones 
(Himmelweit et al. 1958). Rather, it provides an 
easy way for passive children to be passive, and 
it is a source of new interests and activities for 
children who are naturally curious and active 
(Himmelweit et al. 1958; Schramm et al. 1961). 
Although the data are not completely clear, the 
effects of television on reading appear to follow a 
similar pattern (see, for example, Himmelweit et 
al. 1958; Witty 1966). In those areas, as in others, 
reinforcement appears to be the dominant effect. 


Research needs 


Despite all that has been said above, our knowl- 
edge of the effects of mass communication is still 
limited, and there is need for additional research. 
Perhaps the most crucial need is for a series of 
inquiries on what children of various ages actually 
perceive when they look at the cinema or television 
screen or when they view or read other mass com- 
munications. It is only in the last three or four 
years that communication researchers have begun 
rigorously to question the frequent if implicit view 
that children perceive either what adults perceive 
or what adults think children perceive. These as- 
sumptions, once made explicit, appear to be ob- 
viously naive and probably invalid. 

A second topic demanding research might be 
called the fate of minor attitude change. It has 
been noted above that such changes, falling far 


E 


short of conversion, are quite common. But, for 
all we now know, these minor changes may be 
steppingstones on the path to major changes that 
have been neither sought nor documented. The 
concept of “sleeper effects,” that is, the possibility 
that major effects will become visible only weeks 
or months after exposure to the communications, 
was first formulated by Hovland, Lumsdaine, and 
Sheffield (1949), but the conditions promoting or 
inhibiting such effects and the patterns of their 
development have not as yet been adequately de- 
scribed. The recently developing interest in “learn- 
ing without involvement,” perhaps best set forth 
by Krugman (1965), seems likely to be found 
pertinent to this general area, as does also the 
recent work pursued by Festinger and Maccoby on 
“resistance to persuasive communications” (1964). 

A third area of ignorance is closely connected to 
the second. Much research has been devoted to 
determining the effects of a specific mass com- 
munication program, or of a series of such pro- 
grams, on audience attitudes and behavior. But in 
actual life the audiences of mass media are ex- 
posed to hundreds or thousands of communica- 
tions, many of them similar in content or point 
of view to the few specific programs whose effects 
are assayed, Our knowledge of the over-all effects 
of mass communication will remain limited unless 
and until research can be directed upon the effects 
of cumulative exposure to mass communications. 
This is tantamount to saying that there is urgent 
and critical need for “longitudinal studies” that 
keep audience members under scrutiny for years 
or decades. The methodological problems involved 
in the pursuit of such a study are formidable and 
have not yet been reduced to malleable proportions. 

There is need also for inquiry into the variety 
of functions that the same communication may 
serve for different members of its audience. By the 
same token, there is equal need for inquiry into 
the apparent ability of diverse and dissimilar types 
of communications to serve the same functions for 
specific audience members. The old methods of 
classifying programs as “informative” or “enter- 
taining,” or in such gross categories as “light 
drama” and “serious drama,” are beginning to ap- 
pear somewhat naive. The classification of audi- 
ences primarily or exclusively in terms of such 
time-honored demographic categories as sex, age, 
educational achievement, and the like is probably 
equally naive. The development of our knowledge 
as to how mass communication affects its mass 
audiences will probably be handicapped until func- 
tional patterns of communication can be identified 
and defined and until audience subgroups can be 
defined in terms of the functions that various types 
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of mass communication serve for them, The po- 
tential fruitfulness of such a research approach 
has been discussed by Wright (1960), Klapper 
(1960), and Katz and Foulkes (1962). 

JoseP T. KLAPPER 


[See also ADVERTISING; ATTITUDES, article on ATTI- 
TUDE CHANGE; COMMUNICATION, POLITICAL; MAR- 
KET RESEARCH, article on CONSUMER RESEARCH; 
PERSUASION; PROPAGANDA.] 
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COMMUNICATION, POLITICAL 
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1. INTRODUCTION 
i, INTERNATIONAL ASPECTS 


I 
INTRODUCTION 


Definition. Both the word “communication” 
and the word “political” may be defined in various 
ways. A broad definition of “communication” is 
any transmission of signs, signals, or symbols be- 
tween persons. A broad definition of “political” is 
of or pertaining to a process whereby intentional 
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changes are effected in the rules governing the 
relations between individuals. By such broad defi- 
nitions “political communication” is a category that 
includes a large proportion of all deliberative and 
hortatory activities that take place outside of the 
household. An international ultimatum or the 
speeches of a candidate are, of course, political 
communications; but so, by this definition, are an 
employee’s request that his superior address him 
as “Mr.” rather than by his first name, or a letter 
from a club to its members telling them that the 
dues are to be raised. 

By a narrower definition, however, “political 
communication” refers only to the activity of cer- 
tain specialized institutions that have been set up 
to disseminate information, ideas, and attitudes 
about governmental affairs. This narrow definition 
is often implicit in institutional studies of political 
communication. For example, studies of psycholog- 
ical warfare may focus on across-the-lines broad- 
casts and leaflet distribution (Lasswell 1927; 
Lerner 1949; Daugherty & Janowitz 1958). Studies 
of election campaigns may focus on the use of 
television, posters, and speeches (Herring 1940; 
Ostrogorskii 1902; Childs 1965). Studies of legis- 
lative communication may focus on letters to the 
Congress, committee documents, and floor speeches 
(Schattschneider 1935). Studies of administrative 
management may focus on office memoranda and 
letters (Simon [1947] 1961, chapter 8). 

Implicit in such studies is the notion that certain 
institutions have as their primary function the 
facilitation of the exchange and dissemination of 
messages, and that the characteristics of such 
institutions constitute the special province of the 
student of communications. But it is equally clear 
that although the rate of flow of political messages 
may be distinctly lower in other environments than 
these institutions it is not nil. Some students of 
political communications expand their scope to 
consider under that rubric much more than just 
the characteristics of specialized communications 
institutions (Berelson et al. 1954; Hovland et al. 
1953; A Plan of Research . . . 1954; Schramm 
1964). 

To report the infinity of facts that make up even 
the smallest social event is impossible. Every phe- 
nomenological statement about society is, as his- 
toriographers are fond of pointing out, really a 
report of certain selected indices that are believed 
to reflect what is happening in the whole. The 
particular indices that a scholar selects to report 
is a matter of his style and his discipline. Yet some 
indices are more widely useful than others. In its 
broad definition “communication” belongs among 
that limited set of concepts which provides con- 


venient handles for describing social life in its 
entirety. 

Just as the economist focuses his antenna on 
exchange relations (either in the particular in- 
stitution, the market, where such relations are 
most active or, alternatively, as a way of concep- 
tualizing any or all of social life), and just as the 
political scientist focuses his antenna on power 
relations (either solely in the state or as a way of 
conceptualizing any or all of social life), so too the 
student of communication may use the exchange 
of messages either as an index by which to describe 
institutions (such as the press) that are specifically 
set up for the purpose of message dissemination 
or may use the universal social act of communi- 
cating as a powerful index for describing any and 
all aspects of social life. The domain of the study 
of communication can thus legitimately be the 
entire domain of the social sciences. 

If one takes the communications approach to 
the study of society as a whole, then the study 
of political communication becomes just one par- 
ticular approach to the study of all of politics. 

Some classical contributions. One can docu- 
ment the breadth of the topics that may be an- 
alyzed as political communication by reviewing 
some of the classical contributions to the field. 
Among works prior to 1914 that a student of com- 
munication would have to consider as major con- 
tributions to his field would be Plato's Gorgias, 
which considers morality in propaganda; Aristotle’s 
Rhetoric and Mill's System of Logic, which an- 
alyze the structure of persuasive argumentation; 
Machiavelli's The Prince and Lenin’s What Is to 
Be Done? which are handbooks of political com- 
munication for the securing of power; Milton’s 
Areopagitica and Mill’s On Liberty, which consider 
the systemic effects of permitting individual vari- 
ation in the flow of political messages; Dicey’s The 
Development of Law & Opinion in England in the 
19th Century, which considers the effect of the 
ideological context on public actions; and Marx’s 
German Ideology, Sorel’s Reflections on Violence, 
and Pareto’s The Mind and Society, which distin- 
guish the social function from the true value of 
beliefs. 


Propaganda and persuasion 


The study of political comunication underwent 
a major efflorescence after World War 1. A number 
of factors stimulated that growth. The Allies, par- 
ticularly after Wilson’s Fourteen Points, had under- 
taken a substantial psychological warfare effort 
against Germany (Lasswell 1927). The German 
ultra-right, unwilling to admit the facts of defeat, 
perpetrated the myth that the German soldier had 
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been undefeated on the field of battle but had had 
victory snatched away by an Allied propaganda 
“Schwindel” that fooled Germany into giving up. 
This myth was expressed in a large body of Ger- 
man literature that overestimated the power of 
propaganda (Thimme 1932). 

Such illusions about the vast powers of the “hid- 
den persuaders” (Packard 1957) were in the same 
period reinforced in the United States by the growth 
of advertising and public relations. The men in 
these new businesses, eager to win clients, over- 
stated their own powers over the public mind. They 
acted as propagandists for propaganda. 

While the illusion that clever propagandists with 
the aid of the mass media could achieve great 
manipulation of the public became widespread, both 
research and experience were showing how pain- 
fully hard it was to educate the public to preferred 
civic attitudes. Notable was the National Opinion 
Research Center's study (1948) of a large-scale 
United Nations week in Cincinnati, which despite 
Herculean efforts affected an almost unnoticeable 
segment of the population. Such observations of 
the ineffectuality of propaganda led students of 
political communication to seek by field and ex- 
perimental studies to understand the conditions 
under which persuasion does change attitudes and 
the conditions under which it does not. 

Among the field studies perhaps the most no- 
table were the studies by Lazarsfeld, Berelson, and 
others, of the impact of election campaigns. They 
demonstrated that relatively few minds are truly 
changed by a campaign but that a campaign serves 
other important functions; it may define the issues 
and mobilize interest and partisanship. 

Communication has many effects besides that of 
persuading people of the thing said. It also affects 
attention, information, interest, and action. It often 
does so without causing a person to decide that 
what he previously thought to be false is true, or 
vice versa (Lerner 1951; Pool et al. 1956). None- 
theless, a large part of the sociological and psy- 
chological literature on political communication 
has dealt with the conditions of persuasion. 

Laboratory studies, most notably those of Carl 
Hovland and his associates at Yale, have shed sub- 
stantial light on the conditions under which mes- 
sages persuade. Certain individuals show a general 
tendency to be more easily persuaded than others. 
Informed and intelligent persons are more apt to 
be persuaded permanently by a presentation that 
refutes the arguments on the other side as well 
as presenting its own arguments, while uninformed 
and unintelligent persons are moved more by a 
one-sided presentation. Threatening communica- 


tions, where circumstances permit, are apt to be 
disregarded and forgotten more than communica- 
tions that present encouraging information. Argu- 
ments on matters of attitude (not fact) when 
presented to persons already predisposed to accept 
the arguments are apt to have more effect on the 
hearer’s attitude after a passage of time than they 
did immediately after the presentation. Factual 
information and also attitudinal material that goes 
against the hearer’s predisposition lose part of their 
impact with time (Hovland 1959). 

Psychological experiments have also shown that 
when a person's opinion structure is dissonant or 
unbalanced, his opinions are apt to change. If a 
person simultaneously believes two things that are 
hard to reconcile, and if circumstances force at- 
tention to this imbalance, then a person is apt to 
change an attitude or redefine the situation to 
avoid such imbalance (Abelson & Rosenberg 1958; 
Brown 1962; Festinger 1964; Attitude Change 
1960). The classic political case is cross-pressure 
in an election campaign, when a voter favors a 
candidate in one respect and dislikes him in an- 
other (Berelson et al. 1954). Balance may be re- 
stored by deciding that only one of those aspects 
is important, or by forgetting one of them, or by 
reversing one of the perceptions of the candidate, 
or by changing one’s own evaluation on one of 
the points. 

However, the results of laboratory studies of at- 
titude change, as Hovland pointed out (1959), 
diverge from the results of field surveys in one 
important respect. In the laboratory the messages 
communicated usually have an effect on the sub- 
ject; in the field the usual finding is that propa- 
ganda makes little discernible difference in any 
democratic, that is, competitive communication 
situation, That is because the paid or otherwise 
controlled subject in the laboratory is in a forced 
communication situation. In real life, on the other 
hand, exposing oneself to communication is gen- 
erally a voluntary act. Persons listen and read 
selectively, and they do not readily expose them- 
selves to communications that will change their 
minds. In politics voters attend primarily to their 
own candidate. They seldom listen to the opposi- 
tion and even when they do so may selectively 
misperceive or forget things that are said that they 
disagree with. So, in real-life situations the directly 
persuasive effects of political communication are 
much less cogent than they are in the laboratory. 
A book that summarizes comprehensively and well 
what social scientists know about the impact of 
communications on their receivers is Joseph Klap- 
per's The Effects of Mass Communication (1960). 
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Contents of communications 


Political scientists have long been interested in 
effects of communications other than their imme- 
diate persuasive power over their receivers. They 
have often been interested in describing, for ex- 
ample, the contents of the information flow per- 
meating a society. Dicey’s study of the spread of 
collectivist ideas in England is a classic example. 
Perhaps the foremost contemporary exponent of 
such studies is Harold Lasswell, who has either 
made or instigated the most systematic attempts 
to provide surveys of the social distribution of at- 
tention. In the 1930s he initiated the use of content 
analysis to compare the political propaganda out- 
put at different times and places (Lasswell & 
Blumenstock 1939), an effort expanded at the Li- 
brary of Congress during World War 11. The studies 
at the Hoover Institute, originated by him, pro- 
duced comparisons of the political symbols used in 
editorials in major papers in five countries over 
a sixty-year period (Lasswell et al. 1952; Pool et 
al. 1952). 

A study of the contents of political communica- 
tions in a country may be motivated by purely 
descriptive purposes. It is interesting to note, for 
example, the growth of attention to world affairs 
in the United States or the decline of attention to 
economics in political campaigns as the great de- 
pression has receded into the past. But study of 
the distribution of attention in society can have 
more than just descriptive interest. Content anal- 
ysis of political media can be useful both for intel- 
ligence purposes and for social scientific purposes. 

During World War 11 content analysis of enemy 
broadcasts was effectively used by both British and 
American agencies to decipher Nazi military plans 
(George 1959). Kremlinology is a present version 
of the same technique (Griffith 1963; Rush 1958). 
The order in which leaders are named, the disap- 
pearance of one formula and the appearance of 
another, or the allusion to some past analogical 
event may provide clues to major political develop- 
ments. Needless to say, this kind of analysis is 
superfluous when a political movement is willing 
to engage in free discussion of its problems and 
precluded when it can afford total silence. 

The deciphering of Aesopian meanings becomes 
important when a political group feels simulta- 
neously obliged to communicate and constrained 
not to communicate frankly. That happens under 
a variety of circumstances. It happens when dis- 
sidents attempt to express their views in ways that 
will avoid repression by the powers that be. It hap- 
pens when elites wish to communicate to fellow 


cognoscenti without revealing their hand to naive 
hearers. It happens when a politician wishes to 
convey a message without unduly disturbing estab- 
lished conventions. 

Historical examples of such Aesopian political 
communication have been analyzed most exten- 
sively by Leo Strauss (1948). The logic of such 
analysis has been most fully dissected by Alexander 
George (1959). Actual analyses of Soviet covert 
debates are many, but little careful attention has 
been given to just why it is that the Soviets choose 
to engage in a mode of political communication 
that is probably no longer well adapted to their 
needs. It reveals much that they would like to keep 
secret, while making the conduct of their policy 
discussions inefficient. Various relevant factors 
may be adduced to account for their discussion of 
policy questions in covert but revealing ways in 
their major public organs. Among these are the 
heritage of an illegal revolutionary past, but also 
a past full of ideological debate; the heritage of the 
authoritarian tsarist past; and an irrational reac- 
tion-formative preoccupation with secrecy in the 
Russian culture. There is also the need to give 
directives to thousands of middle-echelon persons 
on how to behave, for a complex society, even a 
centralized planned one, must find a way to have 
millions of independent decisions made in a so- 
cially functional way. Finally, informal communi- 
cations are so severely hampered by Soviet fear of 
uncontrolled social action that discussion in major 
media often becomes just as easy as more private 
kinds of informal discussion. For example, only 
one central mimeograph facility is allowed in each 
major department, all stencils are numbered, and 
a card on the use of each kept by the police au- 
thorities. 


Effects of political communication 


The ways in which Aesopian political material 
can provide insights into a political system may 
also demonstrate how the distribution of communi- 
cation content can be a matter of interest to social 
scientists concerned with the effects of commu- 
nication. Clearly, there are both systemic and 
individual consequences to the distribution of at- 
tention, quite different from the persuasive conse- 
quences of individual messages. 

One example is the emergence of political al- 
ienation when the contents of communications in 
the official and the mass media do not correspond 
to the perceptions and the interest of their au- 
diences (Kris & Leites 1947; Levin 1962). Modern 
totalitarian societies, unlike historical authoritarian 
ones, are very public-opinion conscious and put out 
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vast amounts of political communication to the 
public. The publics, however, may learn to con- 
sider such communications unreliable, often 
trusting more to rumor (Bauer et al. 1956); they 
then become less involved in political matters and 
devote minimum attention to the mass of material 
thrown at them. But the depolitization is, of course, 
never complete, and people who are given highly 
censored versions of the news acquire great skill 
at reading between the lines and become quite 
energetic at seeking information by such means 
as listening to foreign broadcasts. The non-Asian 
communist countries are now virtually covered 
by short-wave radios (about twenty million in the 
Soviet Union); and listening to foreign broadcasts 
is fairly universal. A very different situation exists, 
of course, in some underdeveloped countries, where 
large parts of the public do not know about major 
outside world events and trends. Nowadays, since 
every major nation broadcasts internationally the 
latest trends in culture, art, and popular music, 
political matters too become universally known, 
among interested persons, without much delay. 
The political consequences of such a change in 
the attention situation have already been profound 
and may be accentuated as communication satel- 
lites make international communication even easier. 
Practicing propagandists are generally aware 
that it is much easier to change people's distribu- 
tion of attention than to change their values and 
attitudes. Virtually all propaganda efforts serve 
only to focus people’s attention on certain issues 
rather than to reverse their previous views on those 
issues. Advertisers seldom seek to change the de- 
sires of the public; they try to convince the public 
that their product meets those desires. Successful 
psychological warriors do not try to convert the 
enemy nationals into rejecting their own father- 
land and joining in the cause of the propagandists. 
They succeed only when their target forces are 
falling apart anyway and they can simply focus 
the attention of the enemy troops on that fact and 
inform them how to save themselves (Lerner 
1949; Shils & Janowitz 1948). In general, it may 
be said that persuasion as such is only a small part 
of political communication. Most of it consists of 
modifying the information on which people will 
act, given their own values and preferences. 
The mass media. The growth of the mass media 
has had a major impact on the conduct of political 
activity. In the United States, for example, radio 
and, even more, television have significantly re- 
duced the power of the local political machine and 
drastically reduced the use of rallies and mass 


meetings. Since Franklin Roosevelt's fireside chats 
on radio in the 1930s it has been possible for the 
president or the presidential candidate to establish 
a direct campaign relationship with every individ- 
ual voter. In a presidential campaign this relation- 
ship is far more significant than the remarks of a 
local politician. Television, of course, adds an extra 
element of visibility and probably some extra im- 
pact. There is some evidence that different political 
personalities are more effective on radio than on 
television, but this has not been much investigated 
(Pool 1959). Both media certainly give an oppor- 
tunity for the exercise of political charisma. 

Furthermore, the cost-effectiveness of the mass 
media is likely to be greater than that of more in- 
dividualized campaign methods, at least in terms 
of spreading information. Expensive as television 
time may be, it is apt to be cheap per person 
reached. This is true even if one disregards the 
mammoth audiences reached on such special oc- 
casions as the Kennedy—Nixon debates in the 1960 
presidential campaign. 

However, it would be a mistake to make any 
simple comparative statement about the relative 
political effectiveness of the mass media versus 
face-to-face organization. They serve different func- 
tions and for maximum effectiveness they must 
be linked together. That is a principle that skillful 
political organizers have generally understood. In 
What Is to Be Done? Lenin developed the notion 
of a disciplined party of professional revolutionists, 
but in the same pamphlet he also strongly ad- 
vocated the establishment of an all-Russian news- 
paper to serve as a “collective organizer.” Even 
today in the Soviet Union, the several million oral 
agitators are serviced by a special magazine, Blok- 
not agitatora (“The Agitator’s Notebook”). 

Indeed, the growth of political parties and the 
growth of the press went hand in hand in most 
countries of the world until recent decades. One 
of the difficulties of establishing effective party 
systems in some of the newly emerging nations is 
that the newer mass media, particularly radio, can- 
not meet the needs of the small partisan group as 
well as the revolutionary newspaper put out in a 
small print shop. 

A number of major social science studies have 
dealt with the relationship of word-of-mouth com- 
munication to the mass media. It has been fairly 
well established in a variety of cultures that 
whereas the mass media serve effectively to diffuse 
information, people seldom act on that information 
without confirming their impulse by checking with 
an opinion leader with whom they are in face-to- 
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face contact (Katz & Lazarsfeld 1955; Rogers 
1962; Pool 1963). In India, for example, villages 
in which people listened to agricultural broadcasts 
in groups and then discussed them were compared 
with similar villages that received the same broad- 
casts but had no organized listening groups, In 
both sets of villages learning of the information 
was comparable, but only where there were or- 
ganized listening groups with face-to-face discus- 
sion was there any significant amount of adoption 
of the new practices that were learned (Mathur & 
Neurath 1959). The implications for the comple- 
mentarity of the mass media and political organiza- 
tion are clear. 

ITHIEL DE SOLA PooL 


[Directly related are the entries PROPAGANDA; PSYCHO- 
LOGICAL WARFARE. Other relevant material may be 
found in ATTITUDES; COMMUNICATION, MASS; PER- 
SUASION; PUBLIC OPINION; PUBLIC RELATIONS.] 
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Ir 
INTERNATIONAL ASPECTS 


The international system can be conceived in the 
terms of reference of a very large and complicated 
communication network. The prime argument for 
viewing international relations in this way is that 
the communication approach reveals aspects of the 
subject that hitherto have been neglected; and, 
therefore, it promises to advance knowledge of 
international behavior. 

A simple communication system consists merely 
of a message, a sender of the message, a channel 
through which the message can travel, and a re- 
ceiver of the message. There are, however, some 
further requirements if this simple structure is to 
be made to work. The system must be in contact 
with an environment changeful enough to give the 
sender the occasion for transmitting a message; 
the sender must have the ability to scan and select 
from the variety of things produced by a changeful 
environment; and both sender and receiver need to 
have somewhat comparable capabilities to engage 
in encoding and decoding the messages that are 
sent and received. From the standpoint of the inter- 
ested social scientist, a means for observing is an 
essential part of the system (Cherry 1957, p. 89). 

Beyond these requirements, a communication 
system can be elaborated almost endlessly by the 
incorporation of additional components. For ex- 
ample, the system may be equipped in some way 
to discover if the messages sent were received as 
intended. Filtering and condensing parts can be 
added to diffuse the transmissions to many receivers 
and to gather the messages from many senders to 
a single receiver. Provision may be made for storing 
the records of past communications and for re- 
trieving them when they are again needed. Multi- 
ple channels may be used or a single channel can 
be arranged to carry several different streams of 


messages. In brief, the opportunities to enlarge 
and elaborate a simple communication structure 
usually appear to be limitless. 


Scope 

It may be thought that to characterize a part or 
the whole of social phenomena such as the rela- 
tions among nations is merely to carry over an 
analogy to social science from the realms of me- 
chanics and electronics. This is not the case, how- 
ever; because relations among nations are carried 
along on a flood of words, both written and spoken, 
Further, many international events that are more 
physical than the acts of speaking and writing are 
symbolic in their purposes and effects and fall, 
therefore, into the category of communication. The 
communication aspects of international relations 
may be appreciated more fully by the consideration 
of a comprehensive definition of international com- 
munication and of the topics that fit within that 
definition. 

Let us define international communication as in- 
cluding both the structure and the content of the 
stream of social messages transmitted over time 
and across national boundaries. Diplomatic ex- 
changes and negotiations between national govern- 
ments come easily within the definition. No conflict 
between states could be carried on without a flow 
of information bearing ideas, feelings, and inten- 
tions between the antagonists. To make a proposal, 
to issue a warning, or to deliver a protest in inter- 
national politics is to commit a communicative act. 
The essential purpose of moving a fleet to a troubled 
area or of sending the head of state abroad on a 
good-will tour is generally to relay some specific 
information to other peoples and countries. The 
traditional functions of diplomacy are to gather 
and evaluate information from abroad and to carry 
on conversations and negotiations with foreign 
governments. Unquestionably, international com- 
munication is pervasive in the practices of inter- 
national politics. It extends well beyond official 
relations of governments, however. Smith has illus- 
trated eloquently what else is encompassed in 
international communication. He includes the fol- 
lowing: 


. . . the relatively disinterested activities of interna- 
tional newsgathering agencies; the creation of impres- 
sions abroad by tourists and other migrants; the 
probably massive but generally unplanned impact of 
books, art works, and movies distributed in foreign 
countries; the international contacts of students, edu- 
cators, scientists and technical assistance experts; the 
negotiations and correspondence of international busi- 
ness interests; the activities of international mission- 
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aries and religious movements; the work of interna- 
tional pressure groups, such as trade unions, chambers 
of commerce, and political parties; international phil- 
anthropic activities, like the Ford Foundation’s “private 
Point Four” program in India; the “propaganda of the 
deed” implicit even in the unpublicized activities of 
leaders and collectivities, as perceived by various audi- 
ences; and a great many other processes by which 
information and persuasion are consciously or uncon- 
sciously disseminated across national and cultural 
boundaries. (Smith 1956, pp. 183-184) 


Recent research 


The foregoing illustrates the types of subjects 
that come within the scope of international com- 
munication or that can be considered in a communi- 
cation context. Next we shall consider the question 
of what has been added in recent years to the store 
of knowledge of international affairs through re- 
search on communication. It is difficult, for several 
reasons, to offer a compact survey of the research 
accomplishment. No effective theory of communi- 
cation in international relations has emerged to 
organize and simplify the facts and to locate the 
various constituting segments of the subject. Re- 
searchers from several disciplines, including po- 
litical science, psychology, sociology, education, 
anthropology, and journalism, have engaged in 
relevant work; but the different interests and per- 
spectives of these fields complicate the task of sum- 
marization. Nor are the theoretical foundations of 
international relations sufficiently stable to offer a 
satisfactory organizing framework. What seems 
most reasonable under the circumstances is to dis- 
cuss, from more than a single vantage point, some 
of the scholarly literature on international com- 
munication, It is best to view the subject according 
to several sets of organizing categories. The situa- 
tion is somewhat puzzling because large expanses 
of the subject have been investigated very little but, 
at the same time, the volume of the literature has 
become very large. In a survey covering the period 
1943-1955, Smith and Smith (Bureau . . . 1956) 
identified over 2,500 articles, books, and reports 
of substantial character on international com- 
munication, 

Senders, channels, and receivers. The simple 
communication system structure that was de- 
scribed at the beginning of this article has been used 
frequently to guide reporting on the areas of com- 
munication research. Thus, one may think in terms 
of studies of senders, messages, channels, and re- 
ceivers. The senders, or communicators, of inter- 
national messages have been studied less frequently 
than the receivers, i.e., the audiences or “targets,” 
of communication. Cohen’s (1957) analysis of 


John Foster Dulles’ activities in the negotiating of 
the Japanese peace treaty is an outstanding ex- 
ample of the study of international communicators. 
The conceptual scheme of Snyder et al. (1962) for 
foreign-policy decision-making analysis stresses the 
importance of the communication elements that 
influence decision-making groups in the formula- 
tion of policy. The Snyder and Paige (1958) study 
of the 1950 U.S. decision on Korea provides numer- 
ous data for understanding the actions of interna- 
tional communicators. Nevertheless, only a sketchy 
knowledge exists of how communicators scan, se- 
lect, and encode information from the international 
environment for the transmission of messages to 
others. About the same situation exists in the area 
of channel studies. 

The channels of international communication 
are complex systems in their own right and are 
highly specialized for the particular functions they 
serve. Most of the research that has been done has 
centered on two topics: (a) the flow of world news 
through the mass media and (b) the growth of 
national communications in transitional and mod- 
ernizing societies. UNESCO has sponsored studies 
of how different parts of the world are served by 
the news agencies, the press, and other mass circu- 
lation media. Schramm (1959) and Kayser (1953) 
have provided some materials for studying the con- 
tent of the news in the press of different countries, 
and Pool et al. (1952) have shown the role played 
in international relations by certain “prestige” news- 
papers. Analyses have been made to determine how 
much international and foreign news appears in 
newspapers, and a beginning has been made in 
studying the relay stages and selection processes 
in the transmission of world news from the point of 
occurrence of a significant international event to 
the newspaper on the breakfast table. It is clear 
that the public channels of international communi- 
cation are highly selective so that some message 
flows are let through the system while others are 
turned back through choices made by “gatekeepers” 
located at various points in the transmission chain 
(Carter 1958). 

Some of the most important work of recent years 
has centered on studies of the growth patterns of 
communication networks in countries changing 
from traditional social forms to modern national 
organization. The spread of the mass media and 
the changes in the habits of social communication 
have been considered an indicator of national de- 
velopment and also a vital ingredient that must 
combine with other social, economic, and psycho- 
logical elements if national development is to con- 
tinue (Lerner 1958; Conference on Communi- 
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cation . . . 1963). The pioneering work that 
brought attention to the forms and the structures 
of social communication and their importance in 
the building of political communities was that of 
Karl Deutsch (1953). 

As one might anticipate, the effects of communi- 
cation on audiences have interested researchers 
perhaps more than any subject in the field. The 
result is that a great mass of information is avail- 
able about public opinion on many international 
issues, about popular views on war and peace, and 
about attitudes and images concerning foreign 
peoples and foreign governments. The specific re- 
sults of educational efforts in world affairs, of 
propaganda campaigns, and of different kinds of 
people-to-people communications have been re- 
ported in large numbers. Several volumes of the 
series of studies “Citizen Participation in Interna- 
tional Relations” (Hero 1959a; 1959b; 1960a; 
1960b) assemble much of the data about American 
audiences and foreign affairs. 

The expansion to world-wide dimensions of sur- 
vey research on public opinion has made in- 
creasingly apparent the possibilities of carrying on 
periodic cross-national samplings of opinion on 
international matters and of charting long-term 
trends of opinion change. Indeed, valuable cross- 
national comparative studies have already been re- 
ported, for example, those of Buchanan and Cantril 
(1953), Gillespie and Allport (1955), Almond 
(1954), and Free (1959). As yet no approach has 
been sufficiently developed to bring together the 
data of thousands of surveys of opinion on foreign 
affairs that have been collected over the years in 
dozens of countries. Pool (1961) has urged that 
these idle resources be put to use and has suggested 
some methods for establishing systematic and 
quantitative criteria for assembling this knowledge. 

Some general conclusions concerning audiences 
and international relations can be set forth on the 
basis of the research that has been done on public 
and international affairs. There is overwhelming 
testimony to the low state of factual knowledge 
about foreign affairs in mass publics in all coun- 
tries and to the lack of understanding in national 
populations of the meanings of even very important 
international events. The early insight (Lasswell 
1935; Lasswell et al. 1952) that in all nations 
foreign affairs enlist the attention, knowledge, 
skills, and participation of small elites has, in gen- 
eral, been supported by the results of audience re- 
search. A modification of elite theory has been 
made, however, in recognition of the fact that na- 
tional populations are organized in large numbers 
of special publics and special interest groupings. 


Thus, the concept of a single small elite group 
largely in charge of foreign relations has been 
widened somewhat to encompass numbers of 
special publics and their “opinion influentials.” 

In recent years, the outstanding hypothesis ad- 
vanced to explain both the apparent insensitivity 
of mass publics and the role of leaders and influ- 
entials in the communication process is the concept 
of the “two-step flow” (Katz & Lazarsfeld 1955; 
Katz 1957). This hypothesis states that the effects 
of communication from the mass media take place 
among relatively few people but that among those 
affected are persons who convey opinions and 
ideas to others by direct and personal conversation. 
The first step of the communication flow is via the 
mass media, and the second step operates through 
face-to-face communicating and personal influence. 
The failures of many vigorous world affairs pro- 
grams to educate the general public (for an ex- 
ample, see the study of Star & Hughes 1950) may 
be related to the slighting of the second step of 
communication, 

Communication and foreign policy. In locating 
additional research on international communica- 
tion, it is advantageous to make reference to the 
categories now commonly employed in describing 
the general field of international relations. When 
the study of international relations is divided into 
two parts, one concerned with the making of for- 
eign policy and the other with the interactions and 
organization of the international political system, 
it will be found that some communication studies 
occupy both sectors. The framework of Snyder et 
al. (1962), which was mentioned previously, is a 
major guide to the study of the making of foreign 
policy, and it calls for careful attention to the 
processes of communication within national socie- 
ties. Rosenau (1961) has made a significant con- 
tribution in his conceptualization of the foreign- 
policy process as a system of mutual influence 
involving the government, the influentials, and the 
attentive publics, carried forward by means of a 
circular flow of communication. Data supporting 
this conceptual scheme have been published by 
Rosenau (1963). The place of the press in the 
making of foreign policy has been discussed fre- 
quently but studied rarely. An analysis of the press 
and foreign policy by Cohen (1963) has partially 
corrected the situation. Almond’s study (1950) 
still stands as the best general treatment of public 
opinion and its relationship to U.S. foreign policy. 

A novel approach to the communication aspects 
of the exchanges and interactions between nations 
in the international system has been developed by 
North and his associates (1963). By devising a 
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program of content analysis for a large computer, 
the North group has been producing measures of 
the kinds of perceptions and the amount of emo- 
tion being carried, during crises, in the statements 
and messages of national decision makers. The 
practices of diplomacy, otherwise, have received 
little attention from the communication research 
standpoint. Proceedings of international confer- 
ences offer rich materials for content analysis, but 
little work has been done in this direction. Spanier 
and Nogee (1962) have interpreted recent dis- 
armament negotiations from the propaganda angle, 
however. 

Governments are interested in persuasive com- 
munication as one of the instruments of foreign 
policy. Many studies have been done under govern- 
ment auspices to evaluate information programs 
and propaganda campaigns conducted abroad. War- 
time needs to counter enemy propaganda and to 
explore all avenues of propaganda and psycho- 
logical warfare were instrumental in stimulating 
propaganda analysis. Surveys of these wartime ac- 
complishments and of the progress of the art and 
science of propaganda are readily available (Lerner 
1951; George 1959; Holt & van de Velde 1960; 
Daugherty & Janowitz 1958). 

Psychological factors. Two clusters of theory 
and research that are contributing increasingly to 
current knowledge of international communication 
have their sources in psychology and anthropology. 
These contributions may be related conveniently to 
the phenomena of the encoding and decoding of 
messages, referred to early in this discussion. 
When psychological and cultural considerations are 
taken into account in the study of international 
relations, they enrich the understanding of foreign 
policy and international politics; but they also tend 
to complicate matters and sometimes to confuse 
them. The introduction of “the human dimension 
in international relations” (Klineberg 1964) pre- 
sents some problems in integrating new knowledge 
with old. It is the communication approach that 
appears to offer the best connecting link between 
psychocultural and political knowledge. 

Political scientists have ordered their knowledge 
of international relations principally around the 
concept of power. Further, they have identified 
decision making as the most vital of the problems 
involved in the use of power. As the foregoing has 
indicated, communication research has concen- 
trated on the study of the structures and processes 
of message exchange. At first sight, the political 
perspective and the communication perspective ap- 
pear to belong to entirely different realms. This is 
not the case, however. Communication includes not 


only the study of psychological and cultural varia- 
bles in information processes; it also includes 
branches of theory and research concerned with 
persuasion and influence and with the making of 
decisions under conditions of uncertainty. The 
latter concerns lie very close to the political scien- 
tists’ preoccupations with power and political de- 
cision making. Thus, the possibility of connecting 
these two areas of interest and communication 
theory is of real significance. The following discus- 
sion is a sketch suggesting how the possible con- 
vergence of perspectives can be developed. 

Conventional teachings of international politics 
hold that the participants in international affairs 
are guided in their talk and actions by the universal 
language of power politics. International actors 
may make errors and fall into misunderstandings, 
but these are the mistakes in the use of the calcu- 
lus of power. The basic model of international poli- 
tics is a distillation of centuries of historical ex- 
perience, and it portrays a relatively small number 
of national states interacting as units according to 
national interests and under circumstances deter- 
mined by the distribution of power among the 
units. This inherited model cannot be cast aside 
lightly, yet it has little room for the ideas and data 
of the human dimension that psychologists and 
anthropologists have cultivated. Factors of public 
opinion, the influence of personality variables, the 
differences in national character, the consequences 
of sheer ignorance about the world, and the dy- 
namic interplay of conflicting beliefs and values 
among nations are squeezed into the model only 
with great difficulty. Thus, the accommodation of 
the human dimension would appear to require 
either the abandonment of the traditional model 
in favor of a new one or its expansion to encompass 
the additional interests. Let us see, tentatively, 
how the model might be enlarged. 

Men act according to the information they have. 
As has been said many times, they act according 
to their perceptions of what is so. If the question 
of the differences in the availability of information 
is put aside, there remains the matter of what an 
individual does with the information that comes 
to him. Psychological theory and research have 
done much toward explaining the processing of 
information within human beings. Some messages 
reach the individual and are internalized because 
they are taken as significant or salient. Others may 
be noted casually, and still others are turned away 
and not internalized. The generalization is that 
men possess processing equipment to handle the 
receiving, selecting, categorizing, and storing of 
reports received about the world. This total re- 
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source has been called the “image” (Boulding 
1956) or cognitive structure. 

Two facts about the image are to be noted espe- 
cially. Individuals act to keep a reasonably well- 
organized view of the world. The psychological 
theories of how this balance is maintained have 
been summarized conveniently by Maccoby and 
Maccoby (1961). One's cognitive organization must 
be understood as the product of personal experi- 
ence, but its structure is also the reflection of 
upbringing and social training—the outcome of 
living in a particular cultural milieu. Osgood 
(1958) and Miller, Galanter, and Pribram (1960), 
among others, have extended the knowledge of the 
many intricacies in the workings of the image in 
human beings. How much of the individual's 
communication processing capability is shared 
among all men and how wide its variations are 
because of different upbringing and cultural con- 
ditioning are questions of interest to the students 
of international behavior. The answers are by no 
means clear at the present time. Cultural anthro- 
pologists have a strong opinion on the subject, 
however. 

Cultural factors. The stock in trade of anthro- 
pology is the culture concept. The interrelated 
wholeness of a culture has been much emphasized 
(Benedict 1934), When most anthropologists think 
about international relations, their foremost thought 
is that international interactions involve people 
who carry different and distinctive cultural traits 
and patterns. It is small wonder that vigorous com- 
plaints have arisen from the anthropological quar- 
ter against the political analyses of international 
relations that disregard or slight the cultural factor. 
On the other hand, strong doubts have been ex- 
pressed on the creditability of anthropologically 
oriented interpretations of American, German, 
Japanese, Russian, and other national cultures 
(Benedict 1946; Mead 1951; Hennessy 1962). 
Criticisms and countercriticisms notwithstanding, 
cross-cultural studies have been made on subjects 
more specific than whole national cultures. Inves- 
tigations have been carried on of the experiences 
of foreign study (Selltiz et al. 1963); of modern 
acculturative situations (Lord 1958; Scotch 1961 J; 
and of specific linguistic, emotional, and gestural 
misperceptions and misunderstandings (LaBarre 
1947; Haring 1951). Even some training on psy- 
chocultural situations has been introduced in edu- 
cation for foreign service. Hall (1959) has made 
available many fascinating examples of cultural 
influences on international communication and, in 
addition, has developed a conceptual scheme for 
studying cross-cultural phenomena. Enough is now 
known about the cultural factor that a place for 


it must be provided in a general model of interna- 
tional relations. 

In the perspective of communication research, 
the actions of individuals and the influences of 
cultures have multifarious effects on the fidelity of 
the transmission of messages through the inter- 
national system. What effects occur in the encoding 
and decoding of international information at the 
level of the individual? What happens in the proc- 
essing as messages pass through one culture and 
enter another? These are, perhaps, the fundamen- 
tal questions for which specific answers are needed 
from research on international communication. 
The human dimension of international relations 
finds its place as the coding process in a commu- 
nication system. It remains now merely to indicate 
how else the concept of the international system 
as an elaborate communication network may en- 
large conventional views of international politics. 


Communication and international politics 


The governments of sovereign states exert power 
to serve their national interests. This is the conven- 
tional statement. The relationships that arise from 
the pursuit of national interests have many prop- 
erties and attributes. They may be extensive or 
slight, friendly or hostile, attentive or indifferent, 
peaceful or violent, stable or changing, and so on. 
However these different states of relationships are 
distinguished, they are recognized by the makers 
of foreign policy only from information that can 
be gathered about occurrences and events taking 
place in international relations. It follows that the 
central foreign-policy problem of a government is 
to decide, on the basis of the information at hand 
about the international environment, what to do 
either in making a demand on another government 
or in responding to a demand made by another gov- 
ernment (Sprout & Sprout 1962, pp. 141-177). 

The task of making decisions in foreign rela- 
tions is never easy for at least three fundamental 
reasons: (1) there is never enough information 
to undergird a decision with complete certainty; 
(2) the outcome of the decision depends on the 
responding action of other governments; and (3) 
the outcome of one decision may impose the 
difficulty of making further decisions that were 
not anticipated at first. In brief, the source of dan- 
ger is the uncertainty in the situation. To reduce 
uncertainty is to diminish the surprise factor in 
international relations. The better a decision maker 
becomes in deciding what to do—without being 
surprised by the result—the better are his chances 
of being able to advance the interests of his state. 

Information theory (Cherry 1957; Pierce 1961; 
Garner 1962) was devised precisely for the system- 
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atic choosing of courses of action in the face of 
uncertainty. In order to use information theory, 
one needs to have stable sources of action (or of 
messages), an understanding of what could hap- 
pen, and a reasonably extensive record of what 
has already happened. Foreign policy and interna- 
tional politics furnish data to these specifications. 
The data of international relations should prove to 
be amenable to analysis according to information 
theory. 

All that must be done, then, to enlarge the con- 
ventional model of international relations is to 
regard national states as complex organizations 
that process incoming information through net- 
works of communication and transmit abroad their 
reactions in the form of demands and responses. 
The national states are communication sources, 
and their demand and response actions generate 
the messages flowing between states. The units of 
international relations are the thousands of dis- 
crete messages that are exchanged. Only these few 
reconceptualizations are needed to modify the 
traditional model of power politics in international 
relations and to reorient understanding to the 
terms of reference of communication systems. 

It becomes the task of the study of international 
relations in the communication perspective to trace 
the flows of communication; to determine the 
structure and content of these flows through the 
international system; to measure the intensity, 
volume, and variety in the messages; and to dis- 
cover the methods best suited for using informa- 
tion, thus reducing uncertainty, in the making of 
decisions. The theory of power is translated into 
the theory of communicative influence (Dahl 1957; 
Singer 1963). Here are the ingredients for an ex- 
panded model of international relations. By bring- 
ing international communication into the main- 
stream of international studies, the researchers of 
international relations will advance the knowledge 
of their field and increase the possibilities of pre- 
diction and control in the international system. 


CHARLES A. MCCLELLAND 


[See also FOREIGN POLICY; INTERNATIONAL POLITICS; 
INTERNATIONAL RELATIONS.] 
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The articles under this heading deal with the 
political aspects of contemporary communism. For 
a detailed guide to related topics, see the articles 
listed under MARXISM. 
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I. SOVIET COMMUNISM 

11. NATIONAL COMMUNISM 
nI. THE INTERNATIONAL MOVEMENT 
1V. THE INTERNATIONAL SYSTEM 


I 
SOVIET COMMUNISM 


The origins and nature of Soviet communism 
are, and promise to remain, a subject of debate. 
Some scholars emphasize indigenous formative 
influences, stress factors of continuity with the 
Russian past, and tend to regard the Soviet political 
system as a modern form of traditional Russian 
despotism. Others view Bolshevism primarily as a 
Western heresy, whose roots can be traced to 
eighteenth-century French utopianism, Messian- 
ism, and Jacobinism but which, in a more imme- 
diate sense, was an offshoot of the activist, 
revolutionary ingredients in classical Marxism. 
Those who stress discontinuities with the Russian 
past argue that Russia in the late nineteenth and 


early twentieth century was embarked on a path 
of constitutional development, which the Bolshevik 
seizure of power aborted, and that in any case, 
Soviet totalitarianism, with its pretensions to all- 
encompassing control of society and world revolu- 
tionary objectives, must be sharply distinguished 
from tsarist authoritarianism, which functioned as 
a national state and operated a much more limited 
and inefficient control system. 

As is not uncommon in scholarly polemics, there 
are elements of truth in all these contrasting inter- 
pretations, and no measuring rod exists to give 
precise value to their respective contributions. 
Soviet communism clearly owes much to its French 
revolutionary and Marxist forebears, but the way 
in which these borrowed ideas were adapted and 
transformed represented a response to the urgen- 
cies and peculiarities of the Russian environment. 

Classical Marxism. Classical Marxism may it- 
self be viewed as a response to the dislocations 
and suffering of the working class in the early 
stages of the industrial revolution in Great Britain 
and western Europe, In the Marxian scheme the 
spread of capitalism and the intensification of 
working class misery were the necessary prelude 
to a socialist revolution; Marx’s panorama of 
economic development clearly implied that the 
socialist revolution would arrive first in the most 
advanced industrial countries, where the prole- 
tariat or working class was expected to be most 
numerous, most highly organized, and ripe for the 
seizure of power. 

Nineteenth-century Russia, with its overwhelm- 
ingly peasant population and low level of indus- 
trial development, hardly fit the specifications for 
a Marxist-type revolution, Indeed, through the 
1880s the Russian revolutionary movement, which 
was dominated by the Narodniki, or Populists, 
looked to the peasant as the primary revolutionary 
force to overthrow the autocracy and dreamed of 
building socialism around the peasant commune, 
or mir. While Narodnik philosophers were familiar 
with the works of Marx and Engels and largely 
responsible for translating them into Russian, they 
regarded the evils of industrialization and prole- 
tarianization, which Marx and Engels described so 
graphically, as dangers to be avoided rather than 
stages to be traversed. Nor were Marx and Engels 
at first certain that Russia would have to recapitu- 
late the economic development of the West. As late 
as 1882 they still thought that the mir might 
“serve as a starting-point for a communist course 
of development,” but only “if the Russian revolu- 
tion sounds the signal for a workers’ revolution in 
the West, so that each becomes the complement of 
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the other” (Engels [1890] 1963, pp. 264-265). 
By 1892, however, Engels had written off the mir 
as a Narodnik illusion; Russia seemed to him to 
be embarked on an irreversible capitalist course. 

Russian Marxism. Russian Marxism as an in- 
dependent political movement originated in the 
split in 1879 of the Narodnik organization Zemlya 
i Volya (Land and Liberty). The seceders, who 
stood for propaganda and agitation as opposed to 
terrorism, established a rival organization, the 
Chernyi Peredel (Black Repartition), to propagate 
their doctrines. One of their leaders was Georgii 
Valentinovich Plekhanov, soon to be known as the 
father of Russian Marxism, but then still clinging 
to the Narodnik belief in the peasant as the driving 
force of revolution. The failure of the peasantry 
to respond either to agitation or terror and the 
rapid industrial development that was beginning 
to take place in the last decades of the nineteenth 
century impelled Plekhanov to re-examine his views 
and to turn to the new industrial working class— 
the proletariat, so-called—as the revolutionary 
hope of the future. The search for a new faith led 
him to Marxism. In 1883 Plekhanov, Paul Axelrod, 
Leo Deutsch, and Vera Zasulich, all of whom had 
been members of the Chernyi Peredel, joined in 
establishing the first important Russian Marxist 
organization, known as Osvobozhdenie Truda 
(Emancipation of Labor). By the 1890s the works 
of Marx and Engels were being widely and eagerly 
read by the more radically inclined students and 
intellectuals and propagated in workingmen’s cir- 
cles. One of the new converts was Vladimir Ilyich 
Ulyanov (Lenin), who was to change the face of 
world history. 

The Russian Social Democratic Labor party, of 
which Lenin’s Bolshevik faction was an offshoot, 
held its first congress in 1898 and its second in 
1903. At the 1903 congress, divisions developed 
within the party. The split between “hards” and 
“softs” centered on the character of party organi- 
zation. Lenin wanted a select, closed party of 
dedicated revolutionaries operating in strict sub- 
ordination to the center and serving as a vanguard 
of leadership for the masses of workers who would 
follow the party without necessarily belonging to 
it. Martov (Iulii Osipovich Tsederbaum), leader 
of the “softs,” pressed for a broad party open to 
anyone who believed in its program and was will- 
ing to work under its direction. At the 1903 con- 
gress, the views of Martov on party organization 
registered a temporary triumph. But in the election 
of officers at the end of the congress, Lenin’s fac- 
tion carried the majority and became known as 
Bolsheviks (in Russian, “majority men”). Lenin’s 
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opponents were dubbed Mensheviks (“minority 
men”). 

The differences between the two factions were 
not confined to organizational matters. A central 
issue that was to divide the party grew out of the 
problem of Russia’s industrial backwardness and 
the political consequences to be drawn from it. 
How should Marxism, which provided a recipe for 
socialist revolution in the most advanced industrial 
countries, be applied in the Russian setting? The 
Mensheviks, who prided themselves on being ortho- 
dox Marxists, saw the arrival of socialism in Rus- 
sia as the climax of a long process of industrial 
development. Impressed by the weakness of the 
Russian proletariat, they concluded that a socialist 
Russia was a matter of the distant future and that 
their first charge as good Marxists was to help the 
bourgeoisie discharge its historical responsibility 
to overthrow the autocracy, Meanwhile, they could 
only wait for the further growth of capitalism and 
the proletariat to establish the conditions for a 
successful socialist revolution. 

At the opposite extreme from the Menshevik 
conception was the theory of “permanent revolu- 
tion,” developed by Parvus (Alexander Helphand) 
and adopted by Leon Trotsky during and after the 
1905 revolution. For Parvus and Trotsky the indus- 
trial backwardness of Russia was a revolutionary 
asset rather than a liability. Because of the weak- 
ness of the Russian bourgeoisie and its dependence 
on the state, they looked to the proletariat to spear- 
head both a bourgeois and a socialist revolution. 
Once the proletariat had won power, its responsi- 
bility was to hold onto power and keep the revolu- 
tion going “in permanence.” The Russian revolution, 
Trotsky thought, would ignite a series of socialist 
revolutions in the West, which would in turn guar- 
antee the success of the socialist revolution in 
Russia. Thus Trotsky’s prescription for Russia’s 
retarded economy was a new law of combined 
development. The two revolutions—bourgeois— 
democratic and proletarian—socialist—would be 
combined, or telescoped, into one. The working 
class would assert its hegemony from the out- 
set and leap from industrial backwardness into 
socialism. 

Leninism. Lenin’s own views underwent an 
interesting transformation. He began by accepting 
the orthodox two-stage conception that Russia 
would have to pass through a bourgeois~democratic 
phase before it was ripe for a socialist revolution. 
But, unlike the Mensheviks, Lenin turned his back 
on the bourgeois liberals and looked to the peas- 
antry as allies of the working class. Lenin’s formula 
envisaged two tactical Stages: first, “the revolu- 


tionary-democratic dictatorship of the proletariat 
and the peasantry” to complete the bourgeois— 
democratic revolution, and second, an alliance of 
the proletariat and the village poor to initiate the 
socialist revolution. Like Trotsky, Lenin came to 
believe that the two stages of the revolution could 
be telescoped into one; at the height of the 1905 
revolution he pronounced, “We stand for uninter- 
rupted revolution. We shall not stop half way.” 
Despite many intervening conflicts, the bond with 
Trotsky was to be sealed by the events of 1917. 
The dialectic of backwardness was “resolved” by 
the Bolshevik seizure of power. 

Out of that experience a new theory of revolu- 
tion was to be developed with world-wide applica- 
tions. The Leninist theory of revolution, as Stalin 
christened it, was a far cry from orthodox Marxism. 
Its prerequisites were no longer industrial develop- 
ment, a mass proletariat, or the completion of a 
bourgeois-democratic revolution. As Stalin summed 
it up, “The front of capital will be pierced where 
the chain of imperialism is weakest, . . . and it 
may turn out that the country which has started 
the revolution, . . . is less developed in a capitalist 
sense than other, more developed, countries, which 
have, however, remained within the framework of 
capitalism” (Stalin [1924] 1954, p. 37). 

Thus Marx was “adapted” to the task of revolu- 
tion making in the underdeveloped countries of 
the world. Industrial backwardness was trans- 
formed from obstacle to opportunity. The concept 
of the dictatorship of the proletariat shifted from 
a weapon of the majority into a tool of minorities. 
Consciousness triumphed over spontaneity, and 
the way was cleared for the organized and disci- 
plined revolutionary elite capable of transmuting 
the grievances of a people into a new instrument 
of absolute power. 


Early organizational development 


“Give us an organisation -of revolutionaries,” 
said Lenin, “and we will overturn Russia” (Lenin 
[1902] 1961, vol. 5, p. 458). Lenin forged the in- 
strument, but the prototypes of the professional 
revolutionary as the strategic lever of political up- 
heaval were planted deep in Russian history and 
were nurtured by the conditions of the revolution- 
ary struggle against the autocracy. It was against 
this background that the organizational concepts of 
Lenin took shape. By 1902, with the publication 
of What Is to Be Done? they were fully developed. 

Four ideas stand out in this seminal source of 
Communist organizational doctrine. First, there 
was the fear of spontaneity, the notion that the 
working class could not be trusted to discover its 


own true interests and that, if left alone, it would 
follow the path of trade-union bargaining rather 
than commit itself to a revolutionary struggle for 
socialism. Second, there was the unquestioned as- 
sumption that the working class had to be guided 
and directed by a politically conscious revolutionary 
vanguard possessed of a superior knowledge of the 
laws of history. Third, there was the idea that this 
vanguard should consist of a small, carefully se- 
lected party of professional revolutionaries oper- 
ating under highly centralized direction and disci- 
pline. Finally, there was the concept of political 
monopoly, that no other organizations should be 
permitted to compete with the party in obtaining 
access to the masses and that all mass organiza- 
tions, such as trade unions, must be brought under 
the ideological influence and, if possible, the direct 
control of the party. 

Lenin sought to put these ideas into effect in his 
struggle with the Mensheviks for ascendancy in 
the Russian Social Democratic Labor party. While 
a façade of surface unity was at first preserved, 
the factional strife became increasingly bitter, and 
finally in 1912 Lenin and his followers broke away, 
organized a “pure” Bolshevik central committee, 
and laid claim to the party title. The name “Com- 
munist” was not adopted until 1918, after victory 
was won, when the designation of the party was 
changed to Russian Communist party (Bolshevik ) 
to distinguish it from the socialist or social- 
democratic parties of western Europe, whom the 
Bolsheviks charged with betrayal of the revolution- 
ary cause during World War 1. 

The early organizational history of Bolshevism 
has more than historical interest. The experience 
of the formative years stamped the character and 
future development of the party. The elitism which 
was so deeply ingrained in Lenin, the theory of the 
party as a dedicated revolutionary order, the tra- 
dition of highly centralized leadership, the tighten- 
ing regime of party discipline, the absolutism of 
the party line, the intolerance of disagreement and 
compromise, and the manipulatory attitude toward 
mass organization—all these patterns of behavior 
were to continue to shape the code by which the 
party lived and the way in which its institutional 
structure was organized. 

The Bolshevik victory in November 1917 may 
fairly be described as one of the most remarkable 
triumphs of revolutionary engineering in human 
history. On the eve of the March revolution the 
total membership of the Bolshevik party was gen- 
erously estimated at 23,600. In the short space of 
eight months, this small group was able to accumu- 
late sufficient support to seize power in a nation 
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of over 150,000,000 people. Russia, to be sure, was 
ripe for revolutionary action. The war, with its vast 
losses of men, territory, and resources, its revela- 
tions of incompetence and even degeneracy in the 
very highest circles, and the mounting war-weari- 
ness, hunger, and deprivation, induced a mood of 
desperation. The Bolsheviks alone among revolu- 
tionary parties were able to turn the resulting dis- 
organization to their advantage. One major source 
of the strength of the Bolshevik party was its high- 
ly centralized organization, its activist disciplined 
membership, and the determination of its leader, 
Lenin, to seize power at the first opportune mo- 
ment. Another source of strength was its success 
in exploiting all of the accumulated dissatisfac- 
tions in Russian society. The provisional govern- 
ment that replaced the tsar was weak, vacillating, 
and slow to respond to popular grievances. The 
Bolsheviks were willing to promise what the masses 
wanted—land to the peasants, bread to the hun- 
gry, and peace to the war-weary army. Finally, the 
Bolsheviks concentrated their efforts on building 
power where it would be strategically effective— 
among the sailors of the Baltic fleet, in the Petro- 
grad garrison, and in the armed workers’ Red 
Guard in the factories. The enemies of the Bolshe- 
viks were far more numerous, but they were poorly 
organized, divided, and ineffective. As Lenin sub- 
sequently observed, “The Bolsheviks did have an 
overwhelming preponderance of force at the de- 
cisive moment in the decisive points” (Lenin [1919] 
1932, vol. 29, p. 635). Relying on these tactical 
advantages, they succeeded in taking power. 

The next problem was whether they could hold 
onto power in the face of the successive onslaughts 
of the Germans, the White armies, and the Allied 
intervention. The first decision of the Bolsheviks 
was to sue for peace with the Germans. The terms 
were harsh, but Lenin argued that there was no 
alternative except to bow. In the Treaty of Brest- 
Litovsk (March 1918) the Bolsheviks temporarily 
signed away to the Germans a third of their coun- 
try and more than half of their industry. But the 
breathing space the treaty was designed to win 
never materialized. Although German pressure re- 
laxed with the signing of the armistice in Novem- 
ber 1918, the Bolsheviks had still to cope with the 
White armies, the Allied interventionist forces, and 
a war with Poland. After three years of civil war, 
the Whites were defeated, the Allies (the United 
States, Japan, France, and Great Britain) withdrew 
their forces, and the Bolsheviks survived. 

Coincidentally with the civil war, the Bolsheviks 
proclaimed a state of siege, and all opposition 
parties were suppressed one by one. For a very 
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brief period (from late December 1917 until March 
1918), there were three Left Socialist Revolution- 
aries (who stood close to the Bolsheviks) in the 
Council of People’s Commissars, the cabinet. But 
they left the government in protest against the 
Treaty of Brest-Litovsk and grain confiscations 
from the peasants, and the Soviet government re- 
verted to its original exclusively communist com- 
position, which it has since retained. Lenin at the 
time made no bones about the necessity for dic- 
tatorship and suppression of opposition groups. 
“There is no other way to Socialism,” he insisted, 
“but the dictatorship of the proletariat. . . . Vio- 
lence, when it is committed by the toiling and ex- 
ploited masses, is the kind of violence of which we 
approve” (Lenin 1918, vol. 35, p. 265). 

Since in Lenin’s view it was the Communist 
party that was the sole custodian of the interests 
of the masses, the logic of his position led inex- 
orably in the direction of the one-party state and 
the use of the Cheka (security police) to extermi- 
nate enemies of the regime. By abolishing freedom 
for opposition parties, the communists transformed 
the Soviets, the trade unions, and other forms of 
mass organization into transmission belts, as they 
came to be called, for carrying out the will of the 
monopoly party. 

From the one-party state it was only a short step 
to the establishment of dictatorship within the 
party. This was uniquely Stalin’s achievement. But 
Lenin had set the master precedent with his ultra- 
centralist ideas on party organization. In attacking 
them as far back as 1904, Trotsky had propheti- 
cally observed that in Lenin’s scheme the party 
takes the place of the working class; the party or- 
ganization displaces the party; the central com- 
mittee displaces the party organization; and finally, 
the “dictator” displaces the central committee 
(Trotsky 1904, p. 54). 


The Stalinist era 


It remained for Stalin to provide the final vindi- 
‘cation of Trotsky’s prophecy. In 1922 Stalin was 
appointed general secretary of the party, a post of 
key importance in controlling party patronage. His 
recommendations became increasingly decisive in 
appointments of regional and local party secre- 
taries, who later returned to party congresses as 
voting delegates. When Lenin became ill in 1922- 
1923, the most prominent contender for the suc- 
cession was Trotsky. But the man who commanded 
the party machine was Stalin, He joined with two 
other party leaders, Grigori E. Zinoviev and Lev 
B. Kamenev, who held strong positions in the party 
organization, and together they proved powerful 


enough to shear Trotsky of a considerable degree 
of his authority. The troika, or triumvirate, that 
they established dominated the party Politburo 
after Lenin’s death in 1924. 

In that period Stalin was building up his per- 
sonal machine and soon felt strong enough to dis- 
pense with the support of Zinoviev and Kamenev, 
who moved over in 1925 to join Trotsky in form- 
ing the so-called Left Opposition. In order to defeat 
them, Stalin allied himself with the right wing in 
the Politburo—Nikolai I. Bukharin, Mikhail P. 
Tomski, and Aleksei I. Rykov. Together, they 
crushed Trotsky and his new allies in 1926-1927. 
Not satisfied to share authority, Stalin then turned 
round in 1929 to rid himself of the right wing and 
ejected Bukharin, Tomski, and Rykov from lead- 
ing positions in the party. Even though Stalin’s 
position as the “supreme leader” of the party was 
now virtually unquestioned, he proceeded in the 
mid-1930s to carry through a series of bloody 
purges which left no corner of Soviet society 
untouched. 

Stalin’s drive to rid himself of all opposition was 
not confined to the Old Bolsheviks, Many of the 
most faithful of his erstwhile supporters were 
caught up in the fury of the purges. Perhaps the 
most dramatic and authoritative testimony on the 
damage the purges did to the top stratum of party 
leadership was provided by Khrushchev, when he 
revealed in his “secret” speech to the twentieth 
party congress in 1956 that “of the 139 members 
and candidates of the Party's Central Committee 
who were elected at the XVIIth Congress (1934), 
98 persons, i.e., 70 per cent, were arrested and 
shot (mostly,in 1937-1938) . . .” and that “of 
1,966 delegates with either voting or advisory 
rights” at the same congress “1,108 persons were 
arrested on charges of anti-revolutionary crimes 
...” (Columbia University, Russian Institute 1956, 
pp. 22-23). 

With the consolidation of Stalin’s power, the 
figure of the infallible dictator emerged as the 
operative principle of Communist party leadership. 
His colleagues in the Politburo functioned as ad- 
ministrative henchmen and assistants on a high 
level; the central committee went into a shadowy 
eclipse; party congresses became rallies of the 
faithful; and the party apparatus operated as a 
subservient instrument of Stalin’s will. 

While the particular form of despotic rule that 
Stalin developed was no doubt peculiar to him and 
a reflection of his personality, the desperate desire 
of both Lenin and Stalin to overcome Russian 
backwardness reinforced reliance on authoritarian 
expedients. When the Bolsheviks seized power, 


Russia was still a largely peasant country in the 
early stages of industrialization. Among Bolshe- 
viks, consciousness of Russian backwardness was 
so strong that many feared it would be impossible 
to move forward to socialism, or even hold onto 
power, unless help was provided by successful 
proletarian revolutions in the advanced industrial 
nations of the West. When revolutionary aid from 
the West was not forthcoming, the Bolsheviks were 
thrown back on their own resources and faced with 
the task, as Lenin put it in 1921, of “adopting dic- 
tatorial methods to hasten the copying of Western 
culture,” and of using “barbarous methods in com- 
batting barbarism.” Restated in non-Leninist terms, 
the historic burden that the communists assumed 
was that of modernizing an underdeveloped coun- 
try by dictatorial and totalitarian means. 

After the exhaustion of the civil war and the 
famine that accompanied and succeeded it, the 
communists were compelled to beat a retreat, to 
make concessions to the peasantry, and to permit 
a partial revival of private trade, During the years 
of the NEP—the New Economic Policy, which 
lasted from 1921 to 1928—prewar industry was 
rebuilt, but the regime encountered the greatest 
difficulty in accumulating the investment funds to 
finance a further expansion of the industrial 
sector. Agricultural output grew but so did peasant 
consumption, and efforts to increase the tax bur- 
den on the more well-to-do peasants met for- 
midable resistance. The resulting stalemate was 
eventually resolved by forcing the peasants into 
collective farms, which operated essentially as 
collection agencies to extract grain from the peas- 
antry at low prices. Thus a large part of the bur- 
den of accumulating an industrialization fund was 
transferred to the countryside. 

Beginning in 1928 with the first five-year plan, 
Stalin launched Russia on a program of forced- 
draft industrialization. The decision carried with 
it a train of consequences: emphasis on heavy in- 
dustry at the expense of light industry; the sup- 
pression of the claims of consumption; the “revo- 
lution from above,” as Stalin termed it, by which 
the peasantry was brought to heel through collec- 
tivization; a new emphasis on technical education; 
a reorganization of the incentive system to reward 
those who contributed to industrial productivity; 
and finally, a strengthening of the coercive and 
totalitarian features of the regime to deal with the 
discontent that the new program generated. 

Inherent in the industrialization drive was also 
a powerful nationalist, or patriotic, ingredient. As 
the prospects of world revolution dwindled in the 
1920s, Russia, the home base of the revolution, 
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became the central preoccupation of the party 
leadership, and the need to strengthen its defenses 
and build up its power loomed as a more and more 
important task. In the process the legacy of Rus- 
sian national interests became inextricably inter- 
twined with broader Soviet goals and objectives. 
Industrial backwardness operated as a barrier both 
to the assertion of Russian national claims and to 
the realization of communist revolutionary hopes. 
To overcome this barrier was to ensure survival 
and to open the door to greatness. As Stalin put it 
in a speech to industrial managers in 1931: 
«_. . we are fifty or a hundred years behind the 
advanced countries. We must make good the dis- 
tance in ten years. Either we do it, or they crush 
us” (Stalin [1931] 1954, p. 456). The speech was 
to prove prophetic. Exactly ten years later came the 
Nazi attack on the Soviet Union. Had it not been 
for the industrialization drive and the military 
power that it produced, it is doubtful whether the 
Soviet Union could have survived the onslaught. 

The Stalinist years were a period of “practical 
tasks,” rather than of great doctrinal ferment. The 
transformation of the Bolsheviks from a revolu- 
tionary into a governing party meant that Leninism 
(or Marxism-Leninism, as it came to be called) 
had become a state ideology. Its primary task was 
to explain, rationalize, and defend the decisions 
taken by the party leadership. Lenin himself in the 
prerevolutionary years had had little to say about 
how society would be organized under Bolshevik 
auspices, and after power was won, both Lenin and 
Stalin felt free to make practical adjustments with- 
in the framework of a general commitment to 
socialism. 

Stalin’s theoretical pronouncements, if they can 
be described as such, were essentially statements 
of operative ideology, which undertook to justify 
the policies he was pursuing. His theory of “build- 
ing socialism in one country,” which he developed 
in his polemic against Trotsky’s doctrine of “perma- 
nent revolution,” went beyond glossing over a dis- 
agreeable necessity to a positive affirmation of 
Russian strength and self-sufficiency. It laid the 
basis for his doctrine of Soviet patriotism, that 
strange amalgam of traditional nationalism and 
pride in Soviet achievements by which Stalin 
sought to unify the nation in the face of foreign 
dangers. His repudiation of egalitarianism and 
other “utopian” elements in the heritage of classical 
Marxism was closely geared to the requirements 
of productive efficiency as he understood them. His 
dialectical “discovery” that the state must grow 
stronger before it could wither away and his glori- 
fication of the state superstructure reflected the 
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powerful role he assigned to the state apparatus 
in reshaping and directing Soviet society. His 
theory that the class enemy would become more 
desperate and dangerous as communism ap- 
proached was nothing less than a justification of 
the great purges. 

As Stalin neared the end of his days, his mega- 
lomania mounted. The controls that he exercised 
became increasingly rigid and stultifying. His 
totalitarian grip embraced every facet of Soviet 
life. Literature, drama, music, art, and every 
branch of learning were purged of any trace of 
independence and were forced to join in choruses 
of sycophantic adulation to the “supreme leader.” 
The supercentralization on which Stalin insisted 
became increasingly anachronistic. It induced con- 
gestion at the center and paralysis below. Because 
authority was so concentrated in Stalin’s person, 
in the last years of his life there was a perceptible 
decline in the influence of the party organization 
as such. The party apparatus was faced with in- 
creasing competition from an expanding govern- 
ment bureaucracy and an already inflated police 
apparatus. Meanwhile, Stalin stood at the apex of 
each hierarchy—party, government, and police— 
playing them off against one another, using their 
competition to reinforce his own authority, and 
relying on his system of calculated insecurity to 
keep them all submissive and responsive to his 
commands. 

Stalin’s legacy to his successors was replete with 
problems. His impressive achievements in forcing 
the pace of Soviet industrialization, building mili- 
tary power, and expanding his domain into eastern 
and central Europe were all purchased at a heavy 
price. Soviet agriculture remained backward and 
stagnant, and the food available to the Soviet pub- 
lic was monotonous, scarce. and high priced. 
Stalin’s obsession with the development of heavy 
industry meant that light industry was ignored and 
underdeveloped, and shortages of consumer goods 
and housing were acute and widespread. The sys- 
tem of terror on which Stalin relied to protect his 
own security and to enforce his regime of depriva- 
tion and sacrifice had its debilitating effects. Fright- 
ened bureaucrats shrank from exercising initiative; 
there was a frozen and congealed quality about 
Soviet life which tended to rob it of all dynamism 
and revolutionary appeal. 


The Khrushchey period 


The death of Stalin marked the end of an era 
and opened the way to new policy initiatives and 
political and administrative arrangements better 
suited to the more mature stage of industrializa- 


tion, which Stalin himself had done so much to 
create. Once the succession crisis had been re- 
solved by Khrushchev’s purge of his competitors, 
the outlines of a new model of Khrushchevian rule 
began to crystallize. It could best be described as 
a form of “enlightened,” or rationalized, totalitar- 
ianism, which sought to eliminate or mollify the 
worst grievances of the Stalinist epoch and ration- 
alize the system of administrative and economic 
controls, while preserving the substance of totali- 
tarian power itself. Khrushchev's formula of gov- 
ernance relegated terror to a much less central 
position than it had occupied under Stalin, and 
the welfare reforms that he sponsored were de- 
signed to broaden popular support for the regime. 
But Khrushchev’s vision of society remained total, 
and he saw nothing within it that could be per- 
mitted to remain free from the party's paternal 
guidance and control. 

In contrast to Stalin, Khrushchev used the party 
as his primary instrument of rule. Within the 
party he sought to revive what he called Leninist 
norms—more active participation by the party 
rank and file in party discussions, encouragement 
of greater criticism from below, more frequent as- 
semblages of party congresses, committees, and 
other important party organs, and greater empha- 
sis on the recruitment of workers and collective 
farmers in order to strengthen the party's popular 
roots. But the revival of the forms of intraparty 
democracy did not extend to its essence. Khrush- 
chev, like Stalin before him, jealously guarded his 
authority, and insisted that the party function as 
a monolith in executing his will. Party function- 
aries, nevertheless, flourished under Khrushchev. 
He depended heavily on them to strengthen his 
control of the armed forces and the police, to 
achieve centralized direction of industry and agri- 
culture, and to provide the coordinating framework 
that held Soviet society together and forced it to 
march in step. 

Khrushchev’s use of the party apparatus as an 
integrating force was combined with a pragmatic 
willingness to experiment with forms of decentral- 
ized administration and new incentive systems 
where they promised more effective operational 
results, His recognition that the supercentraliza- 
tion of the Stalinist era was ill suited to the rational 
management of an increasingly complex economy 
drove him to shift the weight of supervisory au- 
thority much closer to the grass roots. Under 
Khrushchev the Soviet administrator was given 
more elbow room to exercise initiative and his con- 
ditions of work became more normalized, but he 
had still to adjust himself to goals and priorities 


that were centrally determined, and the ability to 
meet them remained the ultimate criterion of 
success, 

Despite Khrushchev’s repudiation of the Stalinist 
legacy of terror, the model which he held out for 
Soviet society was no Liberty Hall where indi- 
vidualism would run rampant. In his vision of 
Soviet society there was no place for the heretic, 
rebel, or skeptic. Even when communism was 
fully realized, he insisted, the Soviet Union would 
remain a highly organized, ordered, planned, and 
disciplined society in which the party would retain 
its leading role. The community for him was es- 
sentially a production enterprise where “each per- 
son must, like a bee in a hive, make his own con- 
tribution to increasing the material and spiritual 
wealth of society” (Khrushchev 1963a, p. 4, col. 
1 ff.). For Khrushchev the key to mass freedom 
was the self-imposed discipline of willing workers, 
imbued with the love of work and profoundly be- 
lieving in the cause to which they devoted their 
efforts. 

What most sharply distinguished Khrushchev 
from Stalin was his apparently sincere faith that 
a society of the Soviet type could be governed with- 
out reliance on large-scale repression. The new 
“populism” that Khrushchev articulated was evi- 
dent in his assiduous efforts to create an image of 
himself as a leader who was close to the people. In 
striking contrast to Stalin, who rarely ventured 
forth from the Kremlin, Khrushchev was the agi- 
tator par excellence, jn constant motion, address- 
ing meetings from one end of the country to the 
other, visiting collective farms and factories, speak- 
ing the language of the people, and reaching out 
for popular support. His efforts to mobilize the 
energies of the masses by providing for their more 
active participation in the tasks of communist con- 
struction reached out in many directions. It was 
manifest in his repeated calls for the recruitment 
of “leading” workers and collective farmers into 
the party, in his revival of comrades’ courts to im- 
prove workers’ discipline, in his use of peoples” 
guards (druzhiny) to help enforce public order, 
and in his reliance on neighborhood assemblies to 
mete out punishments to parasites and other social 
deviants. It was strikingly evident in a number of 
the theoretical formulations that he devised to sig- 
nal the Soviet entry into the stage of building com- 
munism. Thus the theory of the dictatorship of the 
proletariat was replaced by the concept of an all- 
people’s state, while the party of the proletariat 
became the party of the whole people. The 1961 
party program promised that as the state withered 
away, agencies of public self-government would 
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gradually replace state organs, and it pointed to 
already announced transfers of authority to trade 
unions, sports societies, the Komsomol, and Soviets 
as marking the pathway of future development. 

While the party program spoke of wider mass 
participation in administrative functions, it also 
made clear that such participation would be sub- 
ject to strong party guidance. No withering away 
of the party was contemplated in the communist 
stage; indeed the party program declared that it 
would become stronger than ever, performing the 
indispensable role of directing, guiding, and con- 
trolling all the activities of society, including “the 
organs of public self-government.” Thus this effort 
to square the political circle by combining the ap- 
pearance of popular control with the reality of 
party rule constituted still another in a long series 
of efforts to enlist mass energy and initiative in 
support of party-determined objectives. 

As the Khrushchev era drew to a close, there 
were increasing signs that his popularity was 
waning. Under his aegis the Soviet Union con- 
tinued for a time to make rapid industrial progress 
and score space triumphs, but there were also dis- 
turbing indications toward the end of his reign 
that the industrial growth rate was slowing and 
that agricultural output was declining. The prom- 
ises of rapid improvement in living standards that 
he had made to his own people were belied by food 
shortages and an apparent inability to master the 
agricultural problem. Khrushchev's efforts to re- 
solve intractable economic problems by periodic 
reorganizations of the state and party machinery 
only served to compound confusion. His unsuccess- 
ful efforts to dislodge the Western powers from 
West Berlin, the rebuff he received during the 
Cuban missile crisis, the increasing bitterness of 
the Sino-Soviet dispute, and declining Soviet influ- 
ence in Eastern Europe and in the international 
communist movement contributed to undermin- 
ing Khrushchev’s prestige. Thus, his early accom- 
plishments and bold destalinization initiatives tend- 
ed to be taken for granted and forgotten, while he 
found himself increasingly measured by the ex- 
pectations that he had aroused and failed to fulfill. 
In October 1964 his associates in the party pre- 
sidium deposed him and instituted a new period 
of collective rule with its own short-term problem 
of maintaining a precarious power equilibrium and 
its longer-range challenge of coping with changes 
in Soviet society. 


Trends in Soviet communism 


Although any attempt to assess the direction of 
development of Soviet communism must neces- 
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sarily be highly provisional, some trends can be 
identified that promise to leave their stamp on the 
future. 

Industrialization. Perhaps the most important 
single development is the continuing transforma- 
tion of the Soviet Union from a predominantly 
agrarian into a highly industrialized society, Indus- 
trialization has set new forces into motion. It has 
enhanced the importance of skilled labor at the 
expense of unskilled or semiskilled labor and has 
made necessary widespread literacy and a com- 
mand of basic technical skills in the labor force. 
In bureaucratic terms, it has meant a vast ex- 
pansion of managerial, engineering, technical, and 
scientific personnel and a recognition that they 
constitute an “industrializing elite” who must be 
appropriately rewarded for their crucial contribu- 
tion to the industrialization process. It has required 
a heavy emphasis on scientific training and re- 
search and a recognition that the dynamic mo- 
mentum of industrialization is intimately inter- 
twined with and dependent on scientific creativity. 

As the Soviet Union became a more and more 
highly industrialized society, dependent on its 
scientists, engineers, and managers to maintain its 
ongoing technological momentum, some redefini- 
tion of influence within the society appeared in- 
evitable. The authority of scientific knowledge 
could not be denied without doing damage to the 
society’s prospects. While this development did not 
necessarily challenge the party's formal monopoly 
of political power, it did mean that the party lead- 
ership had to come to terms with the scientific 
community and that party functionaries were be- 
ing increasingly equipped with sufficient technical 
and scientific knowledge to exercise their control- 
ling roles intelligently. It also opened up the pros- 
pect of gradual erosion, adaptation, and even out- 
right rejection of ideological dogmas which operate 
as barriers to technical progress. A party that had 
embraced forced-draft industrialization as a key 
to its salvation promised to be transformed by the 
very burdens it had assumed. 

Welfare totalitarianism. With advancing in- 
dustrialization came other changes. The austere 
production ethic of the early stages of the indus- 
trialization process was increasingly challenged by 
a consumption ethic to which concessions had to 
be made. After decades of deprivation and sacri- 
fice under Stalin, there was a widespread and in- 
sistent demand for improved incentives and more 
amenities. The spread of elementary and higher 
education stirred rising aspirations and presented 
the regime with new problems in adapting its sys- 
tem of controls to these expectations. 

One of the keys to an understanding of the post- 


Stalinist era lies in the recognition that these aspi- 
rations could no longer be ignored. Unwilling as 
Khrushchev was to part with substantive authority, 
he did recognize that there were grievances to be 
remedied. His decision to mitigate Stalin's terror 
and provide greater welfare benefits for the Soviet 
people represented an effort to establish his regime 
on a more rational and popular basis. One of the 
questions still to be determined is whether this 
formula of “popular,” or “welfare,” totalitarianism, 
which epitomized the internal policies of the 
Khrushchevian period, will prove viable, whether 
the aspirations to which Khrushchev partially re- 
sponded and which he helped activate will acquire 
a momentum of their own that will transform 
Soviet society in directions that his successors can 
neither fully anticipate nor control. 

Peaceful coexistence. Equally indeterminate is 
the effect that relations with noncommunist states, 
as well as with other communist states, are likely 
to have on the development of Soviet communism. 
Unlike Stalin, Khrushchev operated under a com- 
pelling necessity to adjust the strategy and tactics 
of Soviet foreign policy to the realities of the ther- 
monuclear age. These realities not only brought to 
the fore the common interests that he shared with 
Western leaders in avoiding mutual destruction; 
they also impelled him to seek out ways of advanc- 
ing the communist cause that would minimize the 
risks of igniting a thermonuclear holocaust. They 
caused him to replace Lenin’s doctrine of the 
inevitability of war between the Soviet Union and 
the so-called imperialist states with the new for- 
mula that such wars were not “fatalistically inevi- 
table.” They led him to reaffirm the theory of 
peaceful coexistence, even though peaceful coex- 
istence as interpreted by Khrushchev did not imply 
a static acceptance of the existing correlation of 
forces between the camps of communism and 
capitalism, nor did it exclude Soviet aid to so-called 
national liberation movements. Despite these ca- 
veats, Khrushchev was not prepared to support 
a reckless and adventurous revolutionary strategy 
that would pose unacceptable risks of thermonu- 
clear extinction. Charged with safeguarding Soviet 
interests and promoting the communist cause in a 
thermonuclear age, Khrushchev envisaged his 
main tasks as those of building up Soviet power, 
demonstrating the superiority of the Soviet system, 
and counting on the weaknesses and vulnerabilities 
of the noncommunist world to yield opportunities 
for communist advance within the framework of 
a nuclear stalemate. 


Khrushchev’s strategic posture reflected the rela- 
tively conservative interests of a mature commu- 


nist power with a strong vested interest in pre- 
serving its hard-won industrial gains from total 
destruction. This strategy posited a prolonged, if 
perhaps uneasy, peace with the West, during which 
Soviet society would continue to evolve and develop 
without catastrophe. However pleasing this pros- 
pect might be from the Soviet point of view, it 
offered small comfort to the more militant ele- 
ments in the international communist movement 
who saw their salvation in advancing the timetable 
of world communist triumph. It was particularly 
suspect to the Chinese communists, who believed 
that their own interests and ambitions were being 
sacrificed to promote Soviet development. It was 
against this background that the Sino—Soviet dis- 
pute intensified in bitterness, and separate eastern 
and western communist empires began to take 
form. The stage was also set for the emergence of 
communist forces that sought to escape the disci- 
pline of both. The world perspectives of Soviet 
communism promised to be increasingly restricted 
by two parameters: the strength of polycentric 
tendencies within the communist camp and the 
inhibitions that the thermonuclear strength of the 
West imposed on Soviet freedom to maneuver. 
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LLI 
NATIONAL COMMUNISM 


The term “national communism” has been loosely 
used to describe certain modern political systems 
and ideologies. The term has been applied, for in- 
stance, to indigenous forms of communism, as 
represented by individual national parties in or out 
of power; to national variations of communism, 
representing adaptations of a single ideology to 
different national settings, with or without the as- 
sertion of national autonomy in policy determina- 
tion; to defiance of supranational or foreign com- 
munist guidance on the part of a communist party 
or government; and to the upsurge of national- 
ist sentiments within a communist framework, 
whether directed against a communist or noncom- 
munist external foe. 

Despite this wide range of meanings and the lack 
of precise content and notwithstanding the ab- 
sence of a systematic doctrine of national commu- 
nism, two aspects appear to constitute the core of 
the term, as it is commonly used: (1) independ- 
ence from outside control—notably, from the So- 
viet state and its ruling party; and (2) national 
distinctiveness of significant aspects of ideology, 
social institutions, or political strategy. 

Historically, the primary content of national 
communism has been the effort to shake or reduce 
Soviet control, or hegemony, over other communist 
parties and states. With the increasing fragmenta- 
tion of world communism, its various national 
units are bound to reflect the growing diversity 
of communism—as a political movement, as an 
ideology, and as a mode of political practice. 


Historical development 


National communism has arisen in situations 
where uncertainty has existed in regard to one or 
more of the following questions of communist the- 
ory and practice: (1) what is or should be the 
relation of class to nation and the relation of na- 
tionalism to internationalism? (2) is the belief in 
unilinear historical development and in “general 


laws” compatible with the support of “multiple 
paths” of revolutionary strategy and tactics? (3) 
are monolithic party unity and centralist discipline 
compatible with permissiveness regarding national 
diversities? 

The early period. Lenin accepted the Marxist 
view of nationalism and the nation-state as mani- 
festations of the capitalist era, and he shared the 
Marxist belief in the future evanescence of na- 
tional differences and antagonisms. But he came 
to recognize the revolutionary potential of national 
aspirations. Precisely this recognition has been at 
the root of the dual Soviet approach to nationalism 
—an effort to use it for their own ends but also a 
fear of being used by it; a vigorous endeavor to 
harness nationalism abroad against common foes 
(notably “imperialism”) but also a firm though 
differentiated hostility to manifestations of nation- 
alism as a divisive force within the Soviet Union 
and within the international communist movement. 

The Leninist “concession” to national sentiment 
—a shrewd, tactical exception to the Bolshevik im- 
pulse toward centralization—aroused severe criti- 
cism from the extreme “left” even prior to the 
Russian Revolution. It equally failed to satisfy those 
non-Russian communists who, after the Soviet 
state was established, demanded greater self-deter- 
mination or actual autonomy than Moscow per- 
mitted (e.g., parts of the Ukrainian and Georgian 
communist elites during the early years of Soviet 
rule: Pipes 1954). Their protests may be consid- 
ered precursors of national communism. 

The emergence of the Soviet Union as the sole 
communist state and of its party as the sole ruling 
one was bound to lead to a subordination of “pro- 
letarian internationalism” to Soviet state interests, 
as interpreted by the Soviet leaders. The stage for 
this development was set in Lenin’s day with the 
“21 conditions” of the Comintern and the subse- 
quent Bolshevization of the International. A variety 
of foreign communists protested, left, or were ex- 
pelled, precisely because at some point they clashed 
with the Soviet striving for hegemony. Some went 
on to establish national communist groups or par- 
ties in their own countries (Borkenau 1938; Dallin 
1962). 

For the communist opponents, domestic and for- 
eign, of Moscow's hegemonial tendencies and of 
Stalinism, national enmity was but one of many 
elements in their opposition, and usually a sub- 
ordinate or even insignificant one. Then, as later, 
many of them had no opportunity to organize or 
express their hostility, because they lacked an au- 
tonomous power base. By the same token, the mere 
existence of nation-states and the establishment 


of communist parties within their boundaries were 
to provide a framework for the expression and 
organization of intracommunist opposition to Stal- 
inism—regardless of the presence or intensity of 
specifically national ingredients. 

The term “national communism” seems first to 
have appeared in connection with two phenomena 
which, unlike the above conflicts between national 
and communist impulses, stemmed from efforts to 
find a basis for joint action of communist and na- 
tionalist forces against a common foe. In the West 
this foe was the “Versailles system,” and the en- 
deavors were antiliberal, anti-British and anti- 
French. In Asia the foe was the colonial system, 
and the efforts were anti-imperialist. 

In the confusion of German radical politics after 
1918, some extremists of the left sought an al- 
liance or an ideological amalgam with nationalist, 
right-wing extremists. At the same time, some mili- 
tarists and nationalists, hardheaded or romantic, 
explored the possibilities of a national communist 
movement. While the label commonly attached to 
these explorations is “national Bolshevism,” “na- 
tional communism” was the name given in 1919 
to a Hamburg group and also the title of a book 
(published in Munich in 1920) that character- 
ized national communism as “Germany's bulwark 
against East and West.” The notion of a united 
national communist party was discussed until 1933 
—but without significant political results. The com- 
munists made no serious overtures to these groups 
(Schiiddekopf 1960; Paetel 1965). 

World War 11 and its aftermath. World War 1 
changed the nature of and the conditions for na- 
tional communism. The dissolution of the Com- 
munist International in 1943, while a formality, 
made the communist parties in each country offi- 
cially sovereign units, presumably free to interpret 
the doctrine and determine their own policies. War 
conditions also helped to crystallize national feel- 
ings (particularly under enemy occupation, e.g., 
in France, Yugoslavia, China). Hence, there en- 
sued a series of partly opportunistic communist 
moves to identify their goals and values with na- 
tional themes and purposes. Also, at the end of the 
war the foundations were laid for the establish- 
ment of communist regimes in some ten countries 
and thereby for the creation of a communist inter- 
national system in eastern Europe and in east Asia. 

Until 1947 Stalin gave evidence of approving of 
indigenous—i.e., national—variations in the states 
that had come under communist control. Largely 
because of their weak positions, communist lead- 
ers in eastern Europe felt compelled to stress na- 
tional themes, and Moscow often made them “go 
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slow” with radical reforms and deal gently with 
much-needed allies. These concessions were part 
of a “rightist” policy, which Moscow had always 
meant to be temporary and in fact substantially 
reversed in 1947 and 1948. The first years after 
the communist take-over likewise saw a substantial 
dose of what has been called “domesticism” ( Brze- 
zinski 1960). That is, priority was often given to 
domestic needs and interests by otherwise loyal 
pro-Moscow leaders who did not wish to sacrifice 
or jeopardize the broader needs and interests of 
world communism. On the whole, party leaders 
who had been put into power thanks to Soviet 
force and political pressure tended to follow Mos- 
cow’s lead—as did the parties out of power. 

Different conditions obtained in those few states 
where communists came to power substantially on 
their own: Yugoslavia, Albania, China, and at a 
later date North Vietnam and Cuba. Those re- 
gimes, however unpopular, had a broader, more 
truly national base of support than did communist 
regimes in the countries where the authority of 
local communist elites was established by the So- 
viet Union (Lowenthal 1964). Moreover, the local 
elites of such self-established regimes were con- 
scious of their relatively autonomous position and 
were prepared to assert it. None of them initiated 
a break with the Soviet Union, but in the first dra- 
matic contest of wills, Marshal Tito in 1948 re- 
sponded to the Soviet challenge by defying requests 
that would have spelled increasing Soviet domina- 
tion. While the dispute was not fundamentally due 
to Yugoslav nationalism—in fact, until 1948, the 
incorporation of Yugoslavia into an expanded So- 
viet Union was given serious thought—Tito was 
prepared to turn his proud patriotism against Mos- 
cow. As he wrote the Soviet leaders, no matter how 
much communists of other countries loved the 
Soviet Union, the land of socialism, they could in 
no case love their own country less (Soviet—Yugo- 
slav Dispute 1948, p. 19). 

After Stalin. The general relaxation of con- 
trols at the center of Soviet power, which set in 
after Stalin’s death in 1953, inevitably provided an 
opportunity for the assertion of greater authority 
and initiative by the other member states of the 
communist bloc. The following decade was to see 
a remarkable though uneven slipping of Soviet 
authority and control and, thus, a far-reaching 
assertion of national sovereignty by the various 
communist parties and regimes. 

In the previous phase the official motif had been 
to make each communist country a little replica 
of the Soviet Union. Now the reaction against the 
centralism of the Stalinist era brought greater rec- 
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ognition of national diversities. Indeed, Moscow 
could scarcely deny its loyal allies and dependents 
in eastern Europe what it was prepared to grant 
renegade Yugoslavia. In the course of its reconcili- 
ation with Tito, the Soviet government in 1955 
pledged to be guided on ideological, economic, and 
political questions by a policy of noninterference 
in internal affairs, saying that matters of internal 
organization and differences of social systems 
and of concrete forms of building socialism were 
exclusively the affairs of the individual countries. 
During the following 18 months other declarations 
went even further. But uncertainty about the limits 
of Soviet tolerance helped set the stage for the 
crises in Poland and Hungary in 1956. Here there 
were, for the first time, overt nationalist compo- 
nents in the outlook and motives of some of the 
protagonists. 

The consolidation which followed proved to be 
temporary and largely formal. Soviet efforts to meet 
part way the anti-Soviet nationalism of foreign 
communists probably stimulated it even further. 
In effect, Moscow failed to provide a formula that 
could both satisfy the nationalist demands of the 
other communist regimes and assure its own con- 
tinued control. 

The weakening of Soviet authority among com- 
munist parties and states contributed to the erup- 
tion of the Sino-Soviet dispute, which in turn 
weakened Soviet prestige even more. The dispute 
bared the conflicting content of national interests, 
as interpreted by various communist elites, and it 
undermined confidence in the wisdom and leader- 
ship of both China and the Soviet Union. In over 
thirty countries communist parties split and rival 
organizations emerged, many of which turned 
against Soviet leadership and some against the 
Chinese as well. The Sino-Soviet rift dramatized 
the evanescence of the single, universal communist 
perspective: thereby it made each communist party 
“national.” It provided opportunities for other states 
either to defy Soviet leadership and survive even 
when expelled from the fold (as in the case of 
Albania) or to tacitly and gradually pursue policies 
sharply at variance with those of other communist 
States without formal rupture or rebuke, as in the 
case of Rumania. 

Rumania, in particular, provided evidence that 
a state at the borders of the Soviet Union and 
seemingly dependent on it could resist pressures 
for economic integration and coordination (Mon- 
tias 1964), could undo a variety of measures that 
it considered cultural and political “Russification,” 
and could have the central committee of its com- 
munist party resolve in 1964 that the sovereignty 


of the socialist state requires that it effectively and 
fully hold in its hands all the levers of economic 
and social life; that transmitting such levers to the 
competence of a superstate or extra-state bodies 
would make of sovereignty an idea without con- 
tent; that there are not and cannot be unique pat- 
terns and recipes; and that no one can decide what 
is and what is not correct for other countries or 
parties. 

Meanwhile, the cases of Kerala and Cuba had 
shown that communist movements could triumph 
without significant Soviet support and establish na- 
tional communist regimes at a considerable dis- 
tance from the Soviet bloc. 


The national ingredient 


The specific weight of national components in 
the national communist amalgam has been a 
source of recurring perplexity. There are no sure 
criteria or methods by which to assess them. The 
problem is further complicated by the confusion 
regarding the definition of “nationalism.” It is im- 
portant to note that the term “national” need not 
mean “nationalist” but may merely be descriptive 
of a nation-state serving as the vehicle of a political 
system. In common modern usage, “nationalism” 
is also taken to have an oppositional affective con- 
tent, expressing an “anti” feeling—against a colo- 
nial power, against a rival nationalism, or against 
a supranational movement, 

It is useful to distinguish between national 
factors antedating the communist experience and 
those identified with it. The earlier factors, which 
may re-emerge or reassert themselves under com- 
munist rule, include historical enmities (e.g, 
Polish-Russian, Russian-Chinese, Slavic-German), 
ethnic minority problems (e.g., in Czechoslovakia, 
China, Poland, Rumania), territorial conflicts and 
irredentism (e.g., Macedonia, the Soviet Maritime 
Territory, Bessarabia, Cieszyn, Trieste). Many of 
these factors have clearly reappeared and added to 
tensions (e.g., in the case of Rumania and Russia), 
but nowhere have they determined whether rela- 
tions between two parties or states improve or 
worsen. Soviet relations with Hungary and Poland 
improved considerably after 1956 despite tradi- 
tional enmity. Generally, traditional national con- 
flicts appear to be vital only when they are ex- 
acerbated by fresh and more sensitive tensions 
(e.g., in the case of Yugoslavia and Albania). 

As for the new elements introduced by the ex- 
perience with communism, resentment of the So- 
viet treatment of foreign parties, especially during 
the Stalin era, was the obvious price Moscow had 
to pay for its various offenses to national pride, 


its economic exploitation, its clandestine surveil- 
lance, and its peremptory dictation of general do- 
mestic and foreign policy and even specific eco- 
nomic decisions in the satellite states. 

In turn, the recognition of multiple “paths” in 
the pursuit of communism and the lessening of 
Soviet controls in the post-Stalin era stimulated the 
assertion of national consciousness and distinc- 
tiveness in these countries. 

The shift of economic policy from making each 
communist state an autarkic replica of the Soviet 
model to (at least theoretically) a division of labor 
among the states involved implied a recognition of 
significant variations from country to country and 
approval of them by the Soviet Union. 

Attempts at coordinating policies and ideological 
postures within the bloc have created new conflicts 
of interest, most visible in the economic sphere, 
where they have evoked a reassertion of national 
sovereignty in response to attempts at suprana- 
tional integration or the national policies of other 
communist states (e.g., the relations of Rumania 
with Czechoslovakia and East Germany). 

In the Stalin era there was little evidence of 
genuine nationalism among the ruling cadres of 
communist parties. The leadership in eastern Eu- 
rope was, after all, overwhelmingly composed of 
Moscow-trained and Moscow-oriented “internation- 
alists” dependent on Soviet power rather than 
responsive to popular will or sentiments. Signifi- 
cant exceptions to this pattern were those commu- 
nist regimes which came to power substantially 
without foreign assistance, specifically Yugoslavia, 
China, and Albania. 

By contrast, among the greater range of diverse 
communist types which have emerged, even in 
eastern Europe, since 1956, some may be consid- 
ered not merely national in form but also nation- 
alist in content. It has been observed that any 
regime, in order to remain in power, will sooner 
or later seek to identify itself with the defense of 
traditional interests and values of the country it 
claims to rule. Moreover, the gradual devolution 
of decision making from Moscow to the indigenous 
communist leadership prompted a “nativization” in 
each individual instance, including a greater sensi- 
tivity to national opinion and sentiment. Here was 
an opportunity to forge a unity of support on a 
broader, more truly national base than had pre- 
viously been possible for communist regimes (e.g., 
in Poland, Hungary, Rumania). No doubt the re- 
sult has been a growth of “true” nationalism 
among communist cadres ( Shoup 1962). 

At the same time, it is well to consider the de- 
ceptiveness of appearances with respect to nation- 
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alism. Much of “Titoist revisionism” is national only 
in the sense that it was introduced within the 
framework of the Yugoslav nation-state: it was not 
uniquely or specifically Yugoslav. The Soviet Union 
and other communist states did actually adopt a 
number of “Titoist” doctrines and policies at a 
later date. Similarly, all communist regimes and 
parties (e.g., China) have repeatedly shifted from 
“tight” to “left” communism and back: neither 
posture can be considered distinctively national, 
although any specific decision to make such a shift 
may be determined by national considerations. 

Asian communism. In the contest between 
communism and nationalism neither has swallowed 
the other. Modern nationalism has, even in the 
West, proved to be far more compatible with com- 
munism than had been surmised by either its 
friends or its enemies. There is evidence that com- 
munism has proved to have a particularly strong 
appeal for ethnic minorities suffering from na- 
tional and socioeconomic discrimination. With due 
allowance for situational differences, this has been 
true both in eastern Europe (Burks 1961) and in 
Asia (Zagoria 1965). 

It is commonly assumed that among the variants 
of modern communism a place of special impor- 
tance is occupied by what may be called non- 
Western communism. One of its main character- 
istics is supposed to be the fusion of national and 
revolutionary objectives, since the communists 
themselves became the carriers of “national libera- 
tion.” But the evidence suggests that this is not 
really a distinctively non-Western or Asian phe- 
nomenon, 

Both in social composition—an intelligentsia 
leadership and a peasant mass—and in national 
appeal there are significant parallels between com- 
munist parties in eastern Europe and those in 
certain Middle Eastern, south Asian and Far East- 
ern countries. Compared with the “proletarian” 
mass parties of the West (such as the French or 
Italian), the “national-revolutionary” parties do, 
however, exhibit some distinguishing features. 
A number of them are characterized by a lack of 
discipline, ideological vagueness, espousal of vol- 
untarism, and a proclivity for factionalism and 
“direct action.” 

While Asian communism may prominently fea- 
ture an anti-Western animus combined with a 
commitment to modernization, it does not follow, 
as has been argued by some scholars, that the na- 
tionalist content has displaced the communist or 
that communism is merely a form of or a façade 
for nationalism (Kautsky 1962; Johnson 1962). 
While many of the non-Western parties have given 
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prominent recognition to the national factor and 
made themselves spokesmen for the national cause, 
the specific models advocated and the strategies 
pursued by them have not necessarily been re- 
flective of particularities in their national situation 
or heritage. 

For a long time, the spokesmen for the priority 
(or even the propriety) of a distinctive anticolo- 
nial, non-Western model of communism were losers 
in the international movement. While some, like 
Sultangaliev, regarded industrial society itself as 
the enemy of the colonial peoples (Bennigsen 
1960), others, like M. N. Roy in India and Tan 
Malaka in Java, ultimately broke with the Com- 
munist International and attempted to organize 
national communist movements of their own. 

With the rise of Mao Tse-tung to the leadership 
of Chinese communism, the adaptation (“Sinifica- 
tion”) of the common creed became more explicit 
and in turn was rapidly elevated into a model for 
other non-Western peoples to emulate in their 
struggle against “imperialism” (Schram 1963; Bar- 
nett 1963; Scalapino 1965). In other cases ele- 
ments of Marxist-Leninist theory have been eclec- 
tically combined with national, tribal, or racial 
ideological ingredients into novel and not always 
viable amalgams, which have been labeled “syn- 
creto-Marxism” (Feuer 1964). 

All elements of the communist movement have 
recognized the growing potency of nationalism. 
The Soviet Union and the parties identified with 
its policies have aimed at alliances with national 
movements, hopeful of converting, subverting, or 
exploiting them on the road to power, without nec- 
essarily insisting on communist hegemony in the 
alliance. The Chinese communists developed at an 
early date a strategy of appropriating the nation- 
alist content and subordinating the nationalist al- 
lies to themselves. In both instances there remain 
considerable uncertainties in the application of 
general strategic prescriptions. While the Soviet 
approach has permitted greater flexibility in com- 
munist alliance strategy, far greater successes have 
thus far been registered where the communists 
have themselves appropriated the national cause. 

The substantive differences among the national 
components of international communism cannot 
even be adumbrated here: they range from explicit 
doctrinal and strategic variations to unarticulated 
divergencies in values and style. Suffice it to say 
that the extent or direction of a given party's 
“particularity” on one issue need not correspond 
to its position on any other issue. The categories 
of national differentiation do not coincide at all 


with the classification according to degree of free- 
dom from Soviet control (Skilling 1964, p. 36). 
No single formula will account for the way in 
which various national communist parties have 
lined up on any particular issue. Some communist 
analysts have correctly identified certain factors 
without trying to assess their specific weight in 
individual instances: differences in “objective con- 
ditions” and historical and cultural traditions, in 
geographic situation and political circumstances, 
and in levels of socioeconomic development. Among 
additional factors, particularities in the history of 
a given communist party, its relations with Mos- 
cow, and the characteristics of its leadership should 
be listed. All in all, the divergent perception of 
national needs goes far toward explaining the out- 
looks and policies pursued by the individual parties. 


Theory and ideology 


Communist formulations regarding national com- 
munism, national “particularities,” and “separate 
paths” in the pursuit of socialism and communism 
have invariably followed political zigzags. This has 
been as true of Soviet pronouncements as of those 
from Yugoslavia or China. While there has no 
doubt been a strong urge to find basic principles 
and consistent doctrinal formulas on these sub- 
jects, communist pronouncements have actually 
amounted to little more than post-factum legitima- 
tions of reality. 

In the post-Stalin era, Soviet arguments have 
been consistent in rejecting the whole notion of 
national communism as nonsensical (for, by defi- 
nition, communism must be internationalist) and 
in acknowledging that, while there are certain 
universal laws or “regularities” governing societal 
and political change, certain national particular- 
ities exist because of objective conditions in a 
given country. The limits of proper national dis- 
tinctiveness, however, have remained appropriately 
vague and flexible. They do leave Moscow consid- 
erable discretion in determining what, in any given 
case, is to be deemed orthodox or deviant. 

The most clear-cut Soviet generalization was 
formulated in terms of priorities. Soviet officials 
reportedly told Italian communists in 1957 that 
they condemned national communism, “not be- 
cause it seeks out national particularities, but 
because it affirms on the theoretical level that na- 
tional particularities are more important than gen- 
eral laws” (Kogan 1958, p. 662). In practice the 
situation has rarely been so clear-cut. Soviet writ- 
ers have been quick to label national communism 
a major “weapon” of imperialism, an expression 


of petty-bourgeois narrowness or “bourgeois na- 
tionalism.” Until 1961 it was considered a “right- 
ist” phenomenon, i.e., a facet of revisionism. Since 
then, Soviet and pro-Soviet writers have argued 
that the Albanian and later the Chinese case have 
shown that a “nationalist” line can also be part 
of a militant, “pseudoleftist’” communist posture 
(Moscow, Akademiia Obshchestvennykh Nauk 
1960; Akademiia Nauk SSSR 1964). 

Among non-Soviet communists, Yugoslav politi- 
cal scientists went furthest of all in systematizing 
their own views, but their theorizing, too, tended 
to follow political developments. From 1950 to 
1952 they did explore similarities between Stalin- 
ism and fascism, the existence of “state capital- 
ism” in the Soviet Union, and the persistence of 
the national problem even after the communists 
came to power. While the hostile analysis of Soviet 
conditions was soon toned down, the new program 
of the League of Yugoslav Communists developed 
a number of doctrinal pronouncements which were 
at variance with Moscow’s views, e.g., that the 
uneven development of socialism and the wide 
diversity of its paths and forms produce a number 
of internal contradictions; that the state may turn 
into a factor of stagnation; that there is, in effect, 
no two-camp dichotomy in international affairs; 
and that as long as nations exist, communism 
must remain within the beneficent framework of 
a system of nation-states (Savez Komunista Jugo- 
slavije 1958). The Yugoslav communists remained 
committed to proletarian internationalism—as they 
saw it—and continued to condemn national com- 
munism (Hoffman & Neal 1962). 

Since 1956 the Italian communists have played 
a leading role in encouraging the autonomy of all 
communist parties. It is significant that in his 
last statement to the Soviet leadership, in 1964, 
Palmiro Togliatti chose to dwell on the revival of 
nationalism in the “bloc” as one of the forms of 
centrifugal tendencies there. He reiterated what 
Tito had stated earlier: 


The national sentiment remains a permanent factor 
in the working class and socialist movement for a long 
period, also, after the conquest of power. Economic 
progress does not dispel this; it nurtures it. Thus in 
the socialist camp, too, one needs perhaps . . . to be on 
one’s guard against the forced exterior uniformity and 
one must consider that the unity one seeks to establish 
and maintain must lie in the diversity and full auton- 
omy of the individual countries. (Togliatti 1964, 
p. 80) 


But it was not so much the verbal pronounce- 
ments that mattered, although even here subtle 
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semantic nuances spelled significant political di- 
vergencies. Perhaps the one fraught with greatest 
potential for conflict was between those who 
seemed to accept national variations as a perma- 
nent fact, to which communism must adjust, and 
those who saw the nationalist gambit as a reluc- 
tant strategy, to be jettisoned when no longer 
needed. Echoes of both lines may still be heard. 

To what extent is the Soviet experience manda- 
tory or significant for other communist parties? 
Since 1954 Moscow has been inconsistent or vague 
on this matter, while the other parties have in- 
creasingly felt free to ignore the Soviet model in 
principle and in practice. In the early years after 
the Russian Revolution, those in the West who 
rejected it (e.g., Rosa Luxemburg, Paul Levi, and 
Hermann Remmele) found themselves in open 
conflict with Moscow; more recently, however, 
not only Yugoslav but also Italian, Polish, Swedish, 
and other communists have been able to assert, 
while remaining loyally on the Soviet side, that 
the Russian record, that of a lone socialist country, 
industrially backward, experiencing rapid moderni- 
zation (as well as Stalinism) under special condi- 
tions, was too exceptional to constitute a model for 
other societies. The Chinese communists and their 
allies have likewise rejected the Soviet model of 
seizing and using power, claiming that it is irrele- 
vant for the underdeveloped and non-Western 
parts of the world. 


In summary, then, national communism must 
be considered a transitional phenomenon, not a 
stable sociopolitical system, because both the na- 
tional and the communist elements in the twin 
formulas are susceptible to various meanings; be- 
cause either may be real or a façade; and because 
either may prevail over the other. National com- 
munism emerged in the context of two broad pro- 
cesses: (1) the linkage of national and social 
revolutions; and (2) the disintegration of “mono- 
lithic” world communism, which precipitated in 
its wake an increasing variety of communist ex- 
perience, ideology, doctrine, and conduct. 

The content and nature of national communism 
have changed over time and have become more 
varied. Basically, the term “national communism” 
remains a misnomer, for it gives a single label to 
a variety of significantly diverse phenomena. But, 
however defined or delimited, national commu- 
nism would appear to belie the conventional image 
of communism as a monolithic phenomenon, in- 
variably possessed by universalist aspirations. 
Many, although not all, national communist states 
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and parties are content to seek limited objectives, 
involving less than total control at home and lim- 
ited ambitions abroad. 

While by no means all national communism 
reflects true nationalism, nationalist feelings have 
also been growing in communist ranks. With the 
rise of a new generation of communist leaders 
everywhere, the future elites are more likely to find 
it important to respond to national sentiments 
than their elders did. The “nationalization” of com- 
munism may lend it greater appeal and authority; 
it has already, almost universally, freed the parties 
from some of the earlier odium of being “agents of 
a foreign power.” But it also opens up the parties 
to gradual transformations. It would be foolish to 
predict whether, in the end, national communism 
will strengthen communism as a political force or 
help to destroy it. 

ALEXANDER DALLIN 


[Directly related are the entries NATION and NATION- 
ALISM. Other relevant material may be found in 
CHINESE SOCIETY; ECONOMIC DATA; PLANNING, ECO- 
NOMIC; and in the biography of LUXEMBURG. ] 
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m 
THE INTERNATIONAL MOVEMENT 


All political movements have been subject to 
change; the international communist movement is 
no exception. While the nature and thrust of world 
communism appeared unchanged and uncompro- 
mising in the light of hostile cold-war attitudes, its 
revolutionary ideology, strategic objectives, and 
organizational concepts have, nevertheless, not re- 
mained static. The debates in the international 
communist movement touched off by the Sino- 
Soviet dispute, which developed in the late 1950s, 
dramatically exposed the division over policy and 
orientation in the movement and marked a water- 
shed in its history. 

The emergence of change is often obscured by 
the tendency of language to lag behind deed. In 
political affairs deviation from some principle is 
customarily accompanied by reaffirmation of that 
very principle; in the communist world no leader 
is yet willing, explicitly, to acknowledge the attenu- 
ated revolutionary possibilities of the Communist 
parties or to cast doubt on the proposition that com- 
munism will replace capitalism. Like other political 
movements, however, communism has responded 
to changes in the environment and bowed to neces- 
sity. Beginning in 1918 with the goal of world revo- 
lution and with Soviet Russia as but one compo- 
nent of the international movement, international 
communism was turned inside out to become 
merely one of the instrumentalities of Soviet diplo- 
macy, while Soviet Russia came to be regarded as 
the incarnation of the movement. In the post-1945 
era, to an extent as yet unclear, this inverted rela- 
tionship was again upset, altering the priority of 
loyalties and the structure and orientation of world 
communism. 


International communism between the wars 


Communism as an international movement, con- 
sidered in organizational terms, is a product of 
Lenin’s concept of party organization and the par- 
ticular direction given the concept by Stalin’s poli- 
cies. Lenin’s motives in creating a new organiza- 
tion were to pre-empt the mantle of international 
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revolutionary Marxism beforg the European Social 
Democrats could re-establish their authority after 
World War 1, and to create the instruments for a 
seizure of power, particularly in western Europe, 
to insure the survival of the Bolshevik revolution. 
The last point may strike the contemporary reader 
as incongruous, but the Bolshevik leaders of the 
time suffered a double insecurity. Having seized 
power in a backward country, they looked to revolu- 
tion, notably in Germany, to provide the technical 
wherewithal for the transformation of their own 
society. They also feared, not wrongly, that in isola- 
tion Russia would be exposed to assault from the 
capitalist countries. Lenin therefore attempted to 
fashion a revolutionary international instrument 
modeled on his notion of the elitist party, ideologi- 
cally pure, hierarchically organized, knit together 
into a single international party under strictest 
discipline, and dedicated to world revolution. 

If the predisposition to conspiratorial Moscow- 
oriented organization was created by Lenin, the 
specific order of loyalties and behavior patterns was 
molded by Stalin. In Lenin's time—and perhaps 
till the end of the 1920s—the idea of the autono- 
mous and fraternal character of the Communist 
parties still persisted, as did the preoccupation with 
internal, indigenous revolution. The failure of revo- 
lutionary attempts in Hungary, Germany, and, 
later, China and the general weakness of the Com- 
munist parties conditioned their acceptance of 
Soviet authority. In addition, the elimination of 
Trotsky, plus the successive purges of dissident 
elements, contributed to the weakening of the 
international revolutionary wing of the movement. 
In turn, Stalin exacted from the international 
movement precisely the same behavior he required 
internally, i.e., unquestioning obedience enforced 
by terror, purge, and intimidation. The role of the 
Communist parties was transformed, therefore, 
from one geared to their own requirements and 
prospects for revolution to one of supporting Soviet 
foreign and domestic policy. Whatever the line of 
Soviet policy, it was generalized for all parties and 
tactically adapted to the respective countries. Agil- 
ity, flexibility, opportunism, and cynicism became 
hallmarks of communist political behavior. That 
the communist movement survived at all outside 
Russia is in the nature of a political miracle. 

Nevertheless, if power is the criterion of success, 
communism as an international movement has 
vindicated Lenin’s decision to break with the re- 
formist Socialist (Second) International during 
World War 1. While the parties of social democracy 
have been absorbed into the mainstream of Western 
industrial society, influencing it with ideas of social 
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justice and often becoming indistinguishable from 
other political parties, international communism 
has become the most powerful international politi- 
cal movement of the modern world. From a hap- 
hazard union of left-wing groups and parties in 
1919, it has become a movement that in 1964 could 
boast the allegiance of over 40 million Communists, 
united in some 90 parties, 12 of them holding the 
reins of state power (not including Cuba and Yugo- 
slavia). For this, however, the movement has had 
to pay its own dialectical price; the very prolifera- 
tion of communist states and important parties in 
Europe and Asia has introduced disintegrative tend- 
encies into the movement whose outcome cannot 
yet be precisely foreseen. The international com- 
munist veneer has been pierced by the edge of 
national interest, which remains as much a feature 
of the communist as of the noncommunist world. 
The conflicting interests of the communist states 
have, in addition, created the necessary conditions 
for the emergence of the particularist interests of 
the Communist parties out of power and have been 
joined to the general trend away from Moscow 
domination and from centralization. These centrif- 
ugal forces spell the end of communism as an or- 
ganized international movement in its Stalinist 
form, 

The Third International. In its “ideal” Stalinist 
form, international communism was a global net- 
work of Communist parties bound by a common 
ideology, codified and interpreted by the Soviet cen- 
ter. “Proletarian internationalism” was the organiz- 
ing principle of the movement, providing it with 
policy coordination and unity of action on the basis 
of unconditional support and subservience to the 
Soviet Union. Its structure and behavior patterns 
were shaped by the Communist (Third) Interna- 
tional, or Comintern (1919-1943). 

As its designation, the “Third International,” sug- 
gests, the communist movement's antecedents were 
rooted in the earlier socialist movement, which had 
twice formed international organizations: the Inter- 
national Working Men's Association (1864-1876), 
subsequently referred to as the First International, 
and the Second (or Socialist) International ( 1889- 
1914). 

Patterned on Lenin’s organizational principles, 
the Comintern was a global extension of his con- 
cept of a Bolshevik party. Unlike the Second Inter- 
national, which was a loose federation of parties, 
the Comintern was specifically designed as an 
international Communist party in which each of 
the national parties constituted a section. Relations 
between the national units and headquarters were 
governed by the concept of “democratic centralism,” 
adopted from the party usage that, in theory, pro- 


vided for freedom of discussion and criticism 
(democratic) and absolute unity of action and pur- 
pose once decisions had been taken (centralism), 
By 1930, certainly, this pyramid of authority had 
been inverted; decisions were imposed on the par- 
ties by the Moscow center. 

In sum, Russian domination of the Comintern, 
while not premeditated, became actual as a conse- 
quence of the success of the Russian Revolution— 
the only successful communist revolution between 
the wars. The organization, located perforce in 
Moscow, was dependent on the Russians for its 
staffing and financing. To paraphrase Trotsky’s 
criticism of Lenin’s concept of party organization, 
the organization of the Comintern took the place of 
the Comintern itself; the Soviet Communist party 
took the place of the organization; and finally the 
dictator took the place of the Soviet Communist 
party. 

In its heyday, the Comintern developed an elabo- 
rate organizational network to bind the parties to 
it. Its executive committee played the role of an 
international staff translating the policies of the 
Soviet Politburo into programs for the international 
movement. The central committees of the national 
parties, in theory subordinate to their respective 
national congresses, were also subordinated to the 
executive committee, which was empowered to 
amend decisions of national congresses, pass on 
their programs, and expel individuals and, indeed, 
entire sections from the movement. Various Com- 
intern “commissions” organized on a regional basis, 
supervised the activities of the parties, and such 
units as the International Liaison Department 
(OMS) were charged with the organization and 
operation of an international clandestine commu- 
nications network, complete with undercover 
agents, facilities for forging documents, clandes- 
tine radio stations, etc. Supporting the Comintern 
were a number of special-purpose organizations 
or interest groups, such as the Youth International, 
some of which developed into “front” organizations 
to gain the support of those who did not wish to 
make the full commitment to communism. The 
executive committee itself, in expanded form, as- 
sumed the function of the world congresses, which 
met only twice after 1924 (in 1928 and 1935). 

This complicated apparatus began to fall apart 
in the 1930s, first under the assault of the Nazis 
in Germany, where much of the plant and activity 
was centered, and then under the impact of Soviet 
collective-security policy, which required the Com- 
munist parties to cast off their conspiratorial and 
sectarian character in favor of cooperation with 
socialist and even bourgeois parties in antifascist 
fronts. By 1943, Stalin could abolish the Comintern 
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as a gesture to his wartime allies without sacrific- 
ing anything of value. As for the Communist par- 
ties themselves, the calculation was that, schooled 
to loyalty over the years, they could be counted 
upon by Moscow without the insurance of formal 
organizational mechanisms. 


Growth after World War u 


World War u, however, decisively altered the 
shape of the international communist movement, 
changing its power structure and its operational 
mechanisms. Before the war, the movement had 
been essentially Western-oriented. It had looked 
for revolution in the advanced capitalist countries 
(except for abortive attempts in China in the 
1920s), and whatever strength it had, had resided 
in the Western communist movements. With the 
breakdown of metropolitan control over the colo- 
nies during the war and their achievement of inde- 
pendence, Communist parties proliferated and grew 
throughout the world. The largest Communist party 
not in power developed in Indonesia, and influen- 
tial parties appeared in Burma, India, Japan, the 
Philippines, Iraq, and Guatemala, in some cases 
backed up by guerrilla forces or significant labor 
movements. Soviet strategy was formulated and 
reformulated to meet the changed circumstances, 
playing now on the aspirations of the newly inde- 
pendent states and now on the potential of the 
local communist movements, 

The seizure of power by the Chinese communists 
in 1949 and the establishment of the North Korean 
and North Vietnamese regimes transformed com- 
munism into an Asian power. 

In eastern and central Europe communist revo- 
lutions were effected from outside and above. The 
Red Army and cadres loyal to Stalin imposed a 
revolution on those areas which the Allies had in- 
formally designated as part of the Soviet sphere 
of influence. Instead of the traditionally indirect 
exercise of power, organic control was introduced. 
Revolution by military conquest replaced the ortho- 
dox Marxian concept of indigenous proletarian 
revolution. This was the pattern for all countries 
except Albania and Yugoslavia, where the commu- 
nists came to power more or less on their own, and 
Czechoslovakia, where a combination of political 
miscalculation on the part of the democrats, an 
aggressive policy pursued by the dominant Com- 
munist party, and a favorable international cli- 
mate, resulted in a successful communist coup. By 
1948, then, Poland, Hungary, Rumania, Bulgaria, 
Czechoslovakia, Albania, Yugoslavia, and East 
Germany had come under communist control. 

In western Europe and, to some extent, in other 
parts of the West, the wartime alliance with the 
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Soviet Union served to cover the Communist parties 
with a certain respectability and stimulate their 
growth. In France and Italy, where the communists 
had been active in the underground, parties of 
major importance emerged, claiming, in 1946, 
850,000 and 2,300,000 members and polling 26 
and 19 per cent of the popular vote respectively 
and dominating the trade-union movements of their 
countries. (In fact, with the inclusion of the Nenni 
socialists, who were then the communists’ close 
allies, the Italian radical left polled 40 per cent.) 

New circumstances did not, however, affect 
Stalin’s method of dealing with these Communist 
parties; they were simply to be manipulated in 
support of Soviet policy. And, allowing for the ex- 
panded theater of Soviet operations in central and 
eastern Europe, Stalin’s policy was again essentially 
isolationist, that is, focused on the area under his 
control. Priority of policy went to the rehabilitation 
of the Soviet Union, devastated by war, and con- 
solidation of the central and eastern European 
states through social revolution under the aegis of 
a loyal Stalinist apparatus. The parties outside the 
communist-controlled orbit merely played a sup- 
porting role. In western Europe, they were used, 
not primarily to consolidate and increase their 
power, but to impede rehabilitation and rearma- 
ment. In Asia, they were employed in civil war and 
violence to weaken the rear, so to speak, of the 
western European countries and disrupt the transi- 
tion of power to the nationalist regimes. 


Strategy after World War 1 

The postwar strategy of the international com- 
munist movement falls into three slightly overlap- 
ping periods; the aggressive “forward” period asso- 
ciated with the Cominform; “peaceful coexistence,” 
Stalin-style; and Khrushchev’s era of “competitive 
coexistence,” characterized by his emphasis on eco- 
nomic competition and the avoidance of war. 

The “forward” period. The terms of reference 
for the communist movement during the first period 
were contained in the “two-camp” doctrine, which 
proclaimed a world divided between the Soviet 
Union, the communist countries and parties, and 
the national liberation movement, together with 
the scattered “peace” forces, on the one hand, and 
the “imperialist” United States and its allies, on the 
other. Strategically, the Communist parties were to 
disrupt the American policy of rehabilitating and 
rearming western Europe and to undermine the 
new nationalist regimes in Asia. Neither aspect of 
the strategy paid off: western Europe recovered 
phenomenally, while the Communist parties paid 
the price of opposition in decline of membership 
and influence. In Asia, where communist tactics 
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took the form of civil war and violence, the revolts 
were broken or contained to the point where the 
Communist parties, beginning in 1950-1951, be- 
gan to bid for a return to normal status. 

The international organizational manifestation 
of the “forward” period was the Cominform, the 
Information Bureau of the Communist and Work- 
ers’ Parties, established in September 1947. Com- 
posed of the Communist parties of the Soviet 
Union, Yugoslavia, Bulgaria, Hungary, Rumania, 
Poland, and Czechoslovakia ( Albania, then a Yugo- 
slav fief, was excluded), and the two mass parties 
of France and Italy, the Cominform was designed 
as an instrument of Soviet control to consolidate 
the new communist states and disrupt western 
Europe. As an organization of Communist parties, 
it bore little resemblance to the Comintern in mem- 
bership or apparatus. If the Cominform had any 
potential for development as an effective commu- 
nist international, it was nullified by the expulsion 
of Yugoslavia; that action, in effect, marked its 
demise. The visible evidence of the Cominform'’s 
existence was its weekly journal, For a Lasting 
Peace, For a People’s Democracy! which purveyed 
the Soviet line to the international communist 
movement until the organization was dissolved on 
April 17, 1956. 

Stalin’s “peaceful coexistence.” By 1950-1951, 
if not sooner, the “forward” policy had been played 
out. Tito had successfully defied Stalin. Western 
Europe was on the road to recovery and drawing 
together more closely in political and military agree- 
ment. The Communist parties had failed to under- 
mine the governments of western Europe and, in 
their attempts to do so, had suffered losses in pres- 
tige, membership, and electoral support. In Asia, 
with the exception of Vietnam, violent tactics 
failed: the revolts in Indonesia, the Philippines, 
Burma, and Malaya were either crushed or con- 
tained; in India, limited guerrilla warfare was sup- 
pressed and contributed to the weakening of the 
Indian party's following. The fortunes of the Jap- 
anese Communist party, until 1950 quite promis- 
ing, were reversed when it was forced into violent 
action, undoubtedly imposed on it to hamper logis- 
tical support for the American forces in Korea. 

The failure of communist strategy was tacitly 
conceded in certain labyrinthine doctrinal pro- 
nouncements in connection with the nineteenth 
congress of the Communist party of the Soviet 
Union. The sharper edges were rubbed off the two- 
camp formula, and the Communist parties of west- 
ern Europe were urged to adopt a more “national” 
position within their respective countries. Europe 
remained the chief geographical target of Soviet 


diplomacy, and the broader tactical flexibility per- 
mitted the Communist parties was still couched in 
terms of a “united front from below” directed 
against American power on the Continent. 

In Asia, the problem of the Communist parties 
under Stalin’s broad direction was to reintegrate 
themselves into the normal political life of their 
respective countries. For the Philippine and Malay- 
an parties, readjustment was painfully slow; in 
Burma, however, by 1956 the party had recouped 
to the point where the National United Front, a 
communist-dominated coalition, polled one-third of 
the popular vote. The Indian party's return to re- 
spectability was underscored by its electoral victory 
in Kerala, where it proceeded to organize the pro- 
vincial government. The experience of the Indo- 
nesian party was unique. Although its revolt in 
Madiun in 1948 had been crushed, the party suc- 
ceeded in rebuilding a substantial base of support 
to become the most powerful political organization 
within the country. 

Khrushchev’s “competitive coexistence.” If Sta- 
lin’s last move had been a relaxation of interna- 
tional policy, his successor, Nikita S. Khrushchev, 
steered the international communist movement on 
a new course whose direction came to be dictated 
more by events than by intention. Khrushchev’s 
major pronouncements affecting the international 
movement were presented at the twentieth CPSU 
congress in February 1956. Surviving the vicissi- 
tudes of the ensuing years, they were reiterated at 
the party's 22d congress in October 1961, and in- 
corporated into its new program: the “new com- 
munist manifesto.” War between capitalist and 
communist states was officially proclaimed to be 
avoidable. Revising the Leninist doctrine on war, 
Khrushchev underwrote the orthodoxy of nonviolent 
communist activity as one of the “roads to power.” 
Different roads to socialism were validated, bowing 
to the Yugoslavs, whom Khrushchev sought to 
bring back to the fold. This point, coupled with 
sanction for peaceful and parliamentary capture 
of power, gave the Communist parties a greater 
range of tactical flexibility. The denigration of 
Stalin, reiterated publicly at the 22d CPSU con- 
gress, reinforced the impression that the Commu- 
nist parties were to be allowed their own choice of 
tactics. 

Communist policy also experienced a dramatic 
shift in regard to the underdeveloped areas. Backed 
by personal diplomacy and relatively massive grants 
of aid and trade, the Soviet Union entered the 
former Western preserve as a new competitor. The 
newly independent countries and nationalist move- 
ments were accepted by and large as expressions 
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of a progressive development deserving of com- 
munist support, whose aim was to insure, at the 
very least, anti-Western neutralism and, at best, 
ideological affinity. To complement Soviet diplo- 
macy, the Communist parties, in their turn, gave 
open backing to nationalist leaders, such as Nehru, 
Sukarno, and Nasser. In the case of India, for ex- 
ample, the Communist party moved “right” through 
the political spectrum to support Nehru on all 
essential aspects of both his foreign and domestic 
policy. With the exception of certain countries 
closely linked to American security arrangements 
(e.g., Thailand), communist strategy in the under- 
developed areas became virtually a “united front 
from above.” 


The splintering of the communist movement 


The shift in Soviet policy was the result of a 
number of factors. The death of Stalin in 1953 was 
the catalyst; the exercise of power and authority 
was totally invested in his person. Since succession 
of leadership and legitimization of authority had 
not been institutionalized in the Soviet system, 
Stalin’s death opened the way to a scramble for 
power. Khrushchev was successful in neutralizing 
and eliminating his opponents partly because his 
bid for support capitalized on the revulsion against 
Stalinist practices. His turn in domestic policy was 
aimed at establishing a rule more responsive to the 
requirements of an advancing industrial society 
stripped of organized terror as “incentive” to per- 
formance and “normalized” in other respects as 
well. 

Internationally, the Khrushchevite leadership 
came to terms with the realities of nuclear weapons 
systems and the resulting equilibrium of power be- 
tween the United States and the Soviet Union by 
attempting to negotiate a détente with the United 
States. It also broke sharply with the Stalinist past 
in retooling its policy to win influence with the 
burgeoning nationalist regimes in the formerly 
Western-held colonial areas. The de-emphasis on 
violence as an instrument of policy and the en- 
dorsement of multiple roads to socialism in a bid 
to recapture Yugoslavia or to associate it more 
closely with world communism implied the toler- 
ance of greater diversity and, with the attack on 
Stalin’s method of rule, a greater degree of autono- 
my for the Communist parties. In effect, the shift 
was no less momentous than that of 1935, when 
the Comintern formally adopted a strategy of co- 
operation with leftist and bourgeois parties to sup- 
port the Soviet policy of collective security against 
the fascist powers. Implicit in that strategy had 
been the more forthright identification of the Com- 
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munist parties with the national interest of their 
countries and a wider measure of freedom in pur- 
suit of allies. What was not at stake in 1935 was 
the authority of the Soviet leadership or the struc- 
ture of the international movement. 

The policy turn in 1956, however, took place in 
entirely different circumstances with regard to both 
the international environment and the structure of 
the communist movement itself. First and fore- 
most, it did not reckon with the force of national- 
ism within the communist orbit and the aspiration 
for independent exercise of power by the leaders of 
the communist states. The successful Yugoslav re- 
sistance to Stalin’s demands for complete subordi- 
nation in 1948 foreshadowed the nationalist devia- 
tion. But the Yugoslav case was thought to be 
unique. Special circumstances—the indigenous con- 
quest of power in a country not bordering on the 
Soviet Union—made the Yugoslav defection pos- 
sible. In two other cases—China and Albania— 
such defection seemed theoretically possible, but 
was discounted by most Western observers on the 
grounds that these countries required economic 
support from, and protection by, the Soviet Union. 
Assertion of independence by the other communist 
countries was ruled out because of the Soviet mili- 
tary presence, Overlooked in this view were the 
operation of the irrational in politics and the subtle 
possibilities of extending the area of national de- 
cision making even within the framework of 
ultimate Soviet hegemony. 

However that may be, the twentieth congress set 
off a sequence of events that almost led to the use 
of Soviet force to forestall the establishment of a 
more nationalist regime in Poland and did lead to 
its use in Hungary. These events underscored the 
intensity of anti-Soviet nationalism in eastern Eu- 
rope and demonstrated to what extent the relation- 
ship between the east European communist states 
and the Soviet Union rested on sheer force. The 
challenge to Soviet authority spilled over the bor- 
ders of eastern Europe, particularly to the Italian 
Communist party, whose leader, Palmiro Togliatti, 
questioned Khrushchev’s analysis of Stalin’s mis- 
deeds, implying that something had gone basically 
wrong in the Soviet system as a whole. These 
events dramatized the urgency of finding a new 
organizing principle to preserve communism as a 
cohesive and united international movement. But 
the search for such a principle has been hopelessly 
complicated by China’s bid for leadership in the 
international movement. 

The Sino-Soviet controversy. The Chinese com- 
munists never accepted Khrushchev’s denigration 
of Stalin or his guidelines for world communism as 
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expressed at the twentieth congress. Soviet solici- 
tation of Chinese help in settling the disturbances 
in eastern Europe brought the Chinese into the 
process of decision making for the international 
movement at a crucial time. Chinese prestige 
reached a peak in November 1957, when the 
Chinese joined with the Russians in drafting a 
“declaration” of the twelve Communist parties 
(Yugoslavia declined); the declaration was de- 
signed to establish principles applicable to all coun- 
tries developing a socialist society. With regard to 
the organization of the communist movement the 
declaration endorsed the dictum of proletarian in- 
ternationalism, that is, loyalty to the Soviet Union. 
However, attempts at the November meeting to re- 
establish a formal international organization were 
rejected—the best illustration of weakening Soviet 
authority, A compromise was reached by designat- 
ing bilateral exchanges and multilateral party 
meetings as the mechanism for ironing out differ- 
ences and by launching a new international jour- 
nal—World Marxist Review: Problems of Peace 
and Socialism, subsequently established in Prague 
under general Soviet editorship. 

Containment of the centrifugal tendencies was 
of short duration, however; the deterioration of re- 
lations between the Soviet and Chinese communists 
split the international movement wide open. In 
retrospect it is ironic that it was the Chinese who 
insisted that the parties at the November meeting 
recognize the CPSU as the “head” of the communist 
camp. To the Chinese there was no contradiction 
between their willingness to exalt the CPSU as 
head of the movement and their proviso that poli- 
cies prescribed for the movement meet with Chinese 
approval. 

The Soviet-Chinese quarrel was compounded of 
differing national requirements and consequently 
differing views on international communist strate- 
gy. In most genera] terms, the Chinese have at- 
tacked the Soviet policy of peaceful coexistence 
geared to détente with the United States and the 
concomitant attenuation of uncompromising, revo- 
lutionary struggle by the Communist parties. The 
Chinese charge the Russians with subordinating 
the interests of the Communist parties and com- 
munist states, as well as those of the so-called na- 
tional liberation movements in underdeveloped 
areas, to the Soviet policy of cooperation with the 
imperialists. The Chinese have been angered by the 
low level of economic assistance the Russians have 
provided them, by Soviet reneging on an agree- 
ment to help build China’s nuclear capability, by 
the absence of strong support of their ambitions in 
Formosa, by Soviet criticism of their incursion 


into India, and by other Soviet actions. They re- 
jected Khrushchev’s attacks on Stalin at the twen- 
tieth CPSU congress and again at the 22d congress 
as ill advised, his emphasis on peaceful transition 
to socialism as mistaken, his tolerance of revision- 
ism of the Titoist variety as criminal, and his en- 
tire outlook as, in a word, disastrous to the com- 
munist movement. 

The erosion of Soviet authority and contro] over 
the movement was dramatically revealed at the 
November 1960 meeting of 81 Communist parties 
in Moscow. Here the Russians attempted to call a 
halt to Chinese politicking against Khrushchev in 
the Communist parties and front organizations and 
to their support of Albania, which, suspicious of 
Khrushchev’s rapprochement with Yugoslavia, had 
opted to defy Soviet policy. The Soviet attack took 
the form of a draft declaration binding the parties 
to accept the rule of democratic centralism—in this 
context, the majority opinion of Communist parties, 
which Khrushchev knew he could command. The 
declaration also explicitly banned “fractional” ac- 
tivity, that is, the Chinese attempt to win over 
parties or groups and individuals within parties. The 
Chinese refused to comply, stating that democratic 
centralism was a proper organizational principle 
within parties but not between them, The Chinese 
declaration of independence went beyond this by 
rejecting the time-honored custom of generalizing 
Soviet party statements for the communist move- 
ment. The Chinese insisted instead on the rule of 
unanimity in making international decisions, which 
implied their right to veto any decisions they found 
unpalatable. The final statement was a transparent 
compromise which left the question of authority 
open. So it remains. 


The future of international communism 


Unlike the Tito-Cominform split in 1948, the 
Soviet-Chinese split has resulted in the fraction- 
alization of the international communist move- 
ment. Although the Yugoslav defection may be 
viewed historically as the harbinger of “polycen- 
trism,” or pluralization of the communist move- 
ment, that process was temporarily contained for 
a number of reasons. First of all, Stalin was in full 
command and did not hesitate to use purge and 
violence to prevent further eruptions. The commu- 
nist states were too new and their leaders too 
insecure to challenge the Soviet leadership. More- 
over, tension in world politics was at a high point 
and communist unity was an overriding concern. 
Finally, Yugoslavia did not have the magnetic at- 
traction of China, a large country of potentially 
great power, with a special appeal for non-Euro- 
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peans and for all those who were looking for a 
more dynamic policy. 

Monolithism has succumbed to bicentricism. 
Nevertheless, the pattern of party alignments is 
still unclear. Those which have opted to support 
Peking, for instance, do not fall into a coherent 
pattern based on race, stage of political develop- 
ment, or type of leadership. However, a regional 
configuration does seem to be emerging; the com- 
munist states of North Korea and North Vietnam, 
together with the communist parties of Indonesia, 
Japan, Malaya, Burma, Thailand, and New Zea- 
land, have backed Peking explicitly or, at the very 
least, implicitly by withholding support from Mos- 
cow on crucial issues. On the other hand, the Aus- 
tralian Communist party, whose problems do not 
differ significantly from those of the New Zealand 
party, has supported Moscow in spite of opposition 
from ranking leaders. The leadership of the Indian 
Communist party, also, has maintained traditional 
loyalties abetted by the Indian government, which 
imprisoned the pro-Peking radicals during the 
Sino-Indian border crisis. In addition, Peking has 
developed pockets of support throughout the 
world communist movement: dissident pro-Chinese 
groups came into existence rather early in Belgium, 
Brazil, Mexico, Peru, and Venezuela. 

How far the fragmentation process will go is 
hard to say, Most of the Communist parties are 
observing their old habits of loyalty to Moscow. 
The dominant position in world communism is 
held by the Soviet Union, which, after all, is the 
traditional leader and a world power second only 
to the United States. Yet the Chinese defection ap- 
pears, under present circumstances, permanent; 
Chinese inroads into Moscow’s authority and con- 
trol over the world movement are serious; and 
divisive forces in the movement seem to be growing 
stronger. 

One of the by-products of the Sino-Soviet split 
has been the return of the movement to politics. 
Debate at the November 1960 meeting was wide 
open; parties have voted against Moscow or ab- 
stained; and individual communist leaders have 
taken it upon themselves to attempt to influence 
the decisions of the big parties. The Chinese com- 
munist lobby, by financial inducement, military 
support, propaganda, and cultivation of individual 
leaders, has contributed to a revival of independ- 
ence within the movement. Rumania, for example, 
has capitalized on the situation by asserting its own 
national economic interests against Soviet-backed 
coordination of the Comecon countries. The Italian 
Communist party, sympathetic to Khrushchev’s 
policy, has nevertheless used the controversy to 


125 


further its own program of “structural reform,” i.e., 
a policy of piecemeal socialism within the consti- 
tutional framework of Italian politics. To parties of 
some strength and influence within their respective 
countries, the circumstances offer considerable 
range and opportunity for improving their domes- 
tic fortunes without excessive interference from 
the center. For the smaller, more distant parties, 
whose existence depends on identification with a 
sharply defined revolutionary movement, there is 
little advantage to be gained from the disruption 
in the movement. 

The coincidence of the erosion within the inter- 
national communist movement and the emergence 
of radical nationalist movements in the underde- 
veloped areas suggests that a new dimension may 
be added to world communism. Symbolic testimony 
to this development was the presence of the ruling 
parties of Ghana, Guinea, and Mali as observers to 
the 22d congress. Leninist forms of organization 
and forced economic development seem to appeal 
to such parties, and though they may not favor 
integral association with the communist move- 
ment, they may nevertheless join in what might 
be called associative membership. Soviet—Chinese 
competition for their favor permits a looser ar- 
rangement than would have been possible at any 
time since 1928. Castro's position may be illustra- 
tive; since he declared himself a Marxist-Leninist 
and communist only after the Cuban revolution, 
his association with the international communist 
movement has been unique. 

In sum, the Khrushchevian era in the Soviet 
Union represents a transition from Stalinism to a 
form of social organization whose features are still 
blurred. The international communist movement, 
changed in structure since the interwar period and 
beset by nationalist rivalries is similarly in a transi- 
tional phase. The vaunted unity of the communists, 
which, Stalin warned, should be safeguarded as the 
“apple of one’s eye,” has been shattered. What this 
will mean for the world at large still remains to 
be seen. 

BERNARD S. Morris 
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Iv 
THE INTERNATIONAL SYSTEM 


The contribution made by students of interna- 
tional relations to the understanding of commu- 
nism as a world-wide political movement is re- 
viewed in the present article. Communism has 
been seen alternately as a system of communist 
states (a “communist international system”) and 
as a world system of political parties. These dif- 
ferent approaches offer two perspectives and pro- 
vide the first two sections of this article; the third 
and fourth sections discuss the main problems 
facing communism as a world system. 

Marxist theory contains no permanent place for 
the role of the state and it therefore has no place 
for an organization of states; nor does Marx- 
ism recognize political autonomy as an explicit 
value; the concept of an international system and 
an international-relations approach are, therefore, 
basically alien to it. The primary units of analysis 
are class and the relations of production; the state 
is secondary, derivative, and liable to wither away. 
On the world scale, too, society is viewed first of 
all as a world market, while its political institu- 
tions, such as diplomacy, are seen as no more than 
a necessary evil. Contrasting with the underem- 
phasis of politics is the consistent and strong 
awareness of the role of international class solidar- 
ities. Thus, the powers of the “capitalist world 
market” are to be defeated by the “international 
workers’ movement” organized by communist par- 
ties. The several “internationals” have been the 
practical embodiment of this concern, but the 
literature of solidarity has been inspirational rather 
than analytical. 

For a time, Soviet federalism was an implicit but 
not very influential model for relations between 
communist states (Goodman 1960). This might 
have served as the prototype of a world state, a 
possible world union of soviet socialist republics. 
But developments since World War 11 have brought 
about for the first time the simultaneous coexist- 
ence of a number of communist-ruled states, and, 
thus, the need has arisen for new conceptualiza- 
tions. The concept now emplaced in Soviet ideology 
is that of a “world socialist system,” defined in the 
1961 program of the Communist party of the Soviet 
Union as “a social, economic and political com- 
munity of free sovereign peoples pursuing the 
Socialist and Communist path, united by common 
interests and goals, and the close bonds of inter- 
national Socialist solidarity.” In a manner entail- 
ing some revision of classic theory, Kuusinen’s 
text (1961, especially chapter 25) paints a picture 
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of the “commonwealth of free and equal states” 
equally developed and freely associated in relation- 
ships unmarred by war, and it recalls Lenin’s dic- 
tum that “national and state distinctions between 
peoples and countries would continue to exist for 
a very long time” (p. 769). However, the Chinese 
Communist party has neither accepted these pro- 
nouncements as authoritative nor clarified its own 
position on these points. 


A system of states 


The communist world appears above all as a 
system of states, possibly an international system 
in embryo. In 1947 the system could be said to 
have had about a half-dozen members; by 1965 
membership had increased to 13 (the Soviet Union, 
Albania, Bulgaria, Czechoslovakia, East Germany, 
Hungary, Poland, Rumania, China, North Korea, 
North Vietnam, Mongolia, and Cuba). The prin- 
cipal international organizations of that system, 
the Warsaw Treaty Organization and the Council 
for Mutual Economic Assistance (CEMA), are both 
confined in membership to the European area (ex- 
cept that Mongolia belongs to CEMA). 

For a time, students of politics characterized 
the system as primarily an expansive empire, an 
array of satellite governments dominated from 
one, and later possibly from two, centers. Others 
have seen it as a bloc of states assuming a place 
in a world system of bipolarity (Lasswell 1945; 
Kaplan 1957). It was then observed, however, that 
a potentially global political system might be de- 
veloping whose constituent units nevertheless pos- 
sessed a degree of independence. Modelski (1960) 
therefore described it as a partial international 
system that had the following attributes for be- 
coming universal, while maintaining its essentially 
international character: 

(1) Aspiration to universality. The ideologies 
of ruling communist states contain a claim to 
universal validity, and this is held to justify the 
principle of proletarian internationalism, accord- 
ing to which all communist parties have the obli- 
gation of mutual support in the “world proletarian 
struggle.” Communism could, therefore, become a 
world-wide system of states controlled by commu- 
nist parties. The constant reminder of this possi- 
bility is the existence, the world over, of close to 
one hundred communist parties on five continents, 
each upholding the principle of proletarian inter- 
nationalism. 

(2) Segmentation of authority. The interna- 
tional character of this potentially universal system 
is accounted for by the segmentation pattern ob- 
taining among the political authorities within it 
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(that is, the number and distribution of units 
within the system, their relative size and political 
potential). Especially since the establishment of 
the Chinese Peoples’ Republic, the formerly pre- 
ponderant influence of the Soviet Union has dimin- 
ished and, if an elementary play-off mechanism 
is postulated, greater autonomy has in consequence 
accrued to other members of the system. 

(3) Separate identity. Politically, communism 
is a system that is characteristically distinct and 
isolated. There are “boundaries” which separate 
communist from noncommunist states (an “iron 
curtain”) and mechanisms which maintain this 
separateness, especially the solidarity of the move- 
ment and the distinctive political culture which it 
upholds. 

(4) Self-maintenance. To persist through time, 
systems need structures to ensure their survival 
and operation. International systems require ar- 
rangements for performing functions of legitimate 
authority and for strengthening solidarity, main- 
taining cuture and communications, and allocating 
resources. Modelski (1960) described the ways in 
which these functions are performed in the system 
of communist states, and he stressed the impor- 
tance of arrangements for authority and leader- 
ship. Organizational devices, such as conferences 
of ruling parties and assemblies of both ruling 
and nonruling parties, have exploited the party 
channels but have not produced a framework 
of even mild formal authority such as is charac- 
teristic of modern international systems. Respon- 
sibility for initiating and pursuing policies aimed 
at the common interest still devolve upon individ- 
ual authorities and, in particular, upon the most 
powerful parties and states, We now see that the 
leadership role held for many years by the Soviet 
party, the Soviet state, and, in effect, by Stalin 
has been implemented in a hegemonial manner 
and frequently without regard to the interests of 
other parties and governments. No accepted rules 
govern the succession to that role, and China’s 
leaders as well as parties anxious for a more dis- 
persed system of authority have contested the 
Soviet right to an exclusive occupancy of the lead- 
ership position. The achievement of modern struc- 
tures of leadership—as well as the maintenance 
of solidarity—are among the most crucial prob- 
lems of the system. 

(5) Conflict containment. Despite the diverse 
causes for international conflict within the com- 
munist system, there has been considerable suc- 
cess, over a long period, in “containing” these 
conflicts within the system through the strength 
of solidarity. Communist states have hesitated to 
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call upon outsiders for aid in their intrasystem 
differences, and proletarian internationalism re- 
quires mutual support in case of conflict with the 
world outside. Failure to contain rather than fail- 
ure to abolish conflict could become the most pro- 
found cause of change and, ultimately, transfor- 
mation of the system. 

Developments in the Sino-Soviet dispute have 
led observers to see the growth of semipermanent 
regional subsystems in Europe and in Asia, based 
on the Soviet Union and on China, respectively. 
In this context, the Council for Mutual Economic 
Assistance has attracted some attention, being seen 
as the counterpart of the European Common Mar- 
ket and having, like the latter, aroused the hostility 
of some of its members for being an instrument 
of excessive integration. But far from being sub- 
divisions or regional sections of a world system, 
these new institutions are symptoms of declining 
solidarity and semipermanent alignments on intra- 
system issues. They may be seen in the traditional 
light of spheres of influence of major organizing 
powers, 


A system of parties 


Communism may also be viewed as a world 
system of parties. This perspective is peculiarly 
congenial to political scientists, but it is one that 
students of world politics in particular must de- 
velop. The system of communist states cannot be 
properly understood without regard being paid to 
the structure of parties upon which it is built. 
Communist parties perform basic functions in the 
system of states: they facilitate communications 
and contact against the background of common 
culture, and they guard solidarity, legitimize the 
rule of governing parties, and serve as justification 
for claims to universality. The communist inter- 
national system could not exist without the parties, 
but the parties could be, and have been, in exist- 
ence on the world scene even before there was a 
system of communist states. That is why they 
deserve separate attention as an independent phe- 
nomenon of world politics. 

The world system of communist parties has 
grown from 7 national parties with 400,000 mem- 
bers in 1917, to 56 parties and 4.2 million mem- 
bers in 1939, and to about 90 parties with more 
than 44 million members in 1964 (U.S. Depart- 
ment of State 1965). 

Jan Triska (Triska et al. 1964, pp. 20-31) ana- 
lyzed the data for 1963 with respect to nonruling 
parties. He found that most of these parties were 
small (34 out of 76 parties had less than 2,000 
members ). Most nonruling parties attract less than 


0.1 per cent of their countries’ population to mem- 
bership. (The figures for ruling parties are, of 
course, considerably higher.) Electoral and com- 
parative figures for the years 1953 to 1963 do not 
disclose any significant trends except an over-all 
rise in membership largely attributable to increases 
in world population as a whole. 

The organization of the world system of com- 
munist parties has passed through a number of 
forms—the most important of them being the 
Comintern which lasted from 1919 to 1943 (Bor- 
kenau 1938; Nollau 1959). The Communist In- 
ternational was the most thoroughly international 
party known so far. It was world-wide and so 
highly centralized that its directing organs were 
empowered to issue binding orders to individual 
parties, officially known as mere “sections” of the 
Comintern. The ruling bodies of the organization 
initially reflected a variety of views from member 
parties, but within a few years the Soviet party 
assumed a position of absolute control. The Com- 
intern of 1948 to 1955 (Brzezinski 1960), Europe- 
oriented and much weaker organizationally, was, 
again, merely one of the cloaks veiling Soviet con- 
trol over eastern Europe. Since 1956 the organi- 
zation of the party system has been a bone of 
contention, and efforts to re-create a centralized 
organization or even to institutionalize world-wide 
conferences have been handicapped not only by 
the memories of the Comintern but also by the 
accession to power of a number of parties each 
intent on preserving some autonomy and freedom 
to choose its own “road to socialism.” 

The Comintern can be categorized as an inter- 
national pressure group on account of its interna- 
tional composition and world-wide sphere of action 
(Meynaud 1961); but communism as such does 
not seem to fit neatly into the “pressure group” 
category because of its primary character as a 
political movement, its diffuse objectives, and its 
revolutionary proclivities. 

Propositions describing the behavior of commu- 
nism as a world party system have been few, as 
most students adopt a single national party as a 
focus of analysis. Borkenau (1938, p. 413A) drew 
attention to a regular alternation between a “left” 
and a “right” tendency in communist policies and 
suggested some explanations, but swings have been 
much less marked since the dissolution of the 
Comintern and the declining centralization of the 
system. Nor is Soviet policy any longer a simple 
guide to the behavior of the communist system. 

In modern world politics we can see the incip- 
ient growth of “parties” as a process similar to 
that suggested by Duverger for the national sphere 
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(1951). Such world “parties” are formed when 
delegates meet for international meetings and 
divide on contentious issues; they can be observed 
most clearly in international organizations. The 
“party” formed on such occasions by delegates of 
communist states may be seen as the reflection of 
a world system of communist parties. This “world 
communist party,” is, in Duverger’s terms, extra- 
parliamentary in origin and bears most of the 
characteristics attributed by him to national extra- 
parliamentary parties: greater centralization, re- 
duced influence of parliamentary representatives, 
and lesser dependence upon the international po- 
litical process. 

Studies of voting and other political behavior 
at the United Nations (Hovet [1958] 1960, pp. 
47-55; Dallin 1962, especially pp. 108-111) have 
shown that communist representatives caucus to- 
gether, coordinate their behavior, and later vote 
and otherwise act in unison. They may therefore 
be regarded as forming a party whose potential 
“electorate” is world-wide. 

Communism as a political party on the world 
stage should also be considered in relation to other 
parties on this stage. How extreme are its public 
views as compared with those of its rivals? In 
other words, is it an evolutionary or a revolutionary 
party? Does it promote a normal or a polarized 
international system? In origin communism was 
undoubtedly revolutionary in character, and its 
influence upon the structure of the international 
system has clearly been toward polarization. But 
other influences have been bearing upon it, and 
some evolutionary changes have apparently been 
at work, at least in the case of the Soviet Union; 
this at any rate is the burden of the Chinese com- 
plaint. It is hard to predict whether communism 
as a whole will maintain a revolutionary character 
or whether, like socialism half a century ago, it 
will separate into evolutionary and revolutionary 
wings. 


Major problems 

Solidarity. A condition of the persistence of an 
international system is maintenance of solidarity 
within certain limits. We might postulate that if 
solidarity falls beneath a certain critical level, the 
system dissolves, and that if solidarity rises above 
a certain level, the international system becomes a 
supranational bloc. 

In fact, little has been done to measure system 
solidarity or to establish appropriate indices; at 
most we can say that solidarity has been surpris- 
ingly strong in the past but that as the system 
grew, and as world society at large changed, this 
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solidarity has declined. Another problem may have 
been created by the fact that communist leaders 
had aspired to a degree of solidarity higher than 
was compatible with the maintenance of an inter- 
national system, being seemingly incapable of ac 
cepting the fact that monolithic styles of politics 
cannot (except at times of crisis) be reconciled 
with the universally strong demand for autonomy; 
they have been unable to accept the politics of 
what they still regard as “factionalism.” 

Solidarity may be maintained by an equitable 
distribution of benefits and resources. The feeling 
that the Soviet Union was inclined to appropriate 
an excessive share of such benefits and resources 
and was unwilling to redistribute them, through, 
for example, foreign aid, or in the form of military 
assistance, probably reduced solidarity and was 
conducive to “self-reliance” and, hence, to auton- 
omy. For the Soviet Union, this might or might not 
have been an excessive price to pay. Solidarity may 
also be maintained through control of deviance by 
means of sanctions. Brzezinski (1962) has ex- 
plored this problem by drawing on methods used 
by the Catholic church in the late sixteenth and 
seventeenth centuries to contain Jansenism and to 
manage disputes between religious orders. But this 
analysis posits the persistence of a “center” in one 
authority or location and reduces the problem of 
deviance to marginal proportions. A study of the 
schism between the Eastern Orthodox church and 
the church of Rome in the eleventh century might 
have been perhaps more useful, because control of 
deviance presents more serious problems in case 
of a major challenge: a bid for central authority. 

Growth. A revolutionary system must expand 
if it is to maintain its momentum, and the pres- 
sures on communism to maintain its growth re- 
main strong. 

Recent Soviet doctrine foresees both violent and 
nonviolent methods of “transition to socialism” 
(Black & Thornton 1964, pp. 417 ff.), but Soviet 
practice has in effect been a policy of seeking sup- 
port of the “national democratic states” (now re- 
ferred to as “revolutionary democracies”) in the 
“third world,” such as Algeria, Burma, Mali, the 
United Arab Republic, Ghana, and Guinea. Some 
degree of community of action, which also tends 
to fluctuate, has been achieved, but none of these 
states may be said to have become members of the 
system. Triska (Triska et al. 1964, pp. 32-37) 
has proposed indices for judging the degree of 
“affiliation”; they are trade, aid, and voting in the 
UN. These indices would have to be followed on a 
continuous basis, and they need to be refined, both 
to account for the influence of China and in the 
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light of world-wide and comparative studies of 
cohesion and conflict. 

Since 1960 the Chinese leadership has been on 
record as preferring the violent and revolutionary 
methods of promoting communism; despite cau- 
tion in the execution of policy, it has advocated 
“wars of national liberation” even at the risk of 
an atomic conflict and has directed its appeal to 
the regimes in the new Afro-Asian nations. It 
remains to be seen whether this more extreme 
tendency will pervade the whole system or whether 
a separation might occur between a revolutionary 
and an evolutionary trend. 

In their more optimistic moments, Soviet writers 
foresee a gradual yet irreversible growth of the 
system—through accretions mainly in the third 
world—as the result of a high rate of economic 
development in countries of the communist sys- 
tem. Looking some twenty years ahead, Strumilin 
(1961, pp. 390-391) wrote in terms of a shift of 
30 per cent of the population of the “neutral” and 
10 per cent of the population of the “imperialist” 
countries toward the “socialist” system, which lat- 
ter system would by 1980 comprise 54 per cent of 
the world’s population. Yet the possibility of con- 
traction cannot be ignored. Yugoslavia was ex- 
pelled from the communist system in 1948-1949, 
and in 1956 the government of Hungary was pre- 
pared to leave abruptly. Developments in Sino- 
Soviet relations may lead to drastic changes in the 
allegiance and viability of nonruling parties in 
particular. Indeed “reversibility” (that is, the pos- 
sibility that accession may later be followed by 
withdrawal) could be the only condition which 
would make the entry of new members into the 
communist system acceptable to the noncommu- 
nist world. 

Links to world society. International systems 
need to be related to the societies in which they 
function. In general terms the communist system 
might be described as one variant of the interna- 
tional systems of modern industrial world society 
(Modelski 1961, p. 141). But first a number of 
questions must be answered. How significant is 
this variant? How might the significance be meas- 
ured? Is it a significant difference in structures 
of society or in the more restricted sphere of polit- 
ical organization? Is the difference so significant 
that communist states form in fact a separate 
world society? While its adherents claim that com- 
munism is a distinctive social system and while 
others voice impressionistic theories of conver- 
gence, there is little concrete research in this field. 

Other problems relate to the interdependence of 
communist states as contrasted with their links to 


world society. Recent research shows that the ex- 
tent to which trade takes place within the system 
varies from over 95 per cent for Albania to 50 to 
65 per cent for Poland but that as of 1961—1962 
all system-members, except Yugoslavia, conducted 
over 50 per cent of their trade within the system, 
As the foreign trade of communist countries is a 
government monopoly, these figures would seem to 
bear more directly upon the intensity of political 
relations than upon the existence of a rich pat- 
tern of social contact constituting an inclusive 
society. Data on social and cultural interaction 
are too sketchy to warrant any conclusions. But 
available information indicates the growth of sep- 
arate European and Asian spheres and therefore 
again throws doubt upon the separateness of the 
communist system as a whole. 

The distinctiveness and separateness of com- 
munist societies have of course a close bearing 
upon the degree of upheaval caused by a country’s 
entry into or withdrawal from the.system and, 
therefore, upon the revolutionary nature of such 
changes. 


Future transformations 


The communist international system is an inter- 
pretation of the international experience of com- 
munism. Implicit in it is the assertion that, given 
certain conditions, communist states would be- 
have as though they formed a distinct system of 
independent yet solidary states—possibly the nu- 
cleus of a future international society. This concept 
fitted to a good degree the development of com- 
munism in the decade following the death of 
Stalin. Yet it might also be argued that this was 
an oversubtle interpretation of a transitional situa- 
tion: the inevitable disintegration of the commu- 
nist monolith and the gradual dissipation of revo- 
lutionary fervor. There might have been no more 
than a fleeting moment in which the emergence 
of a genuinely international communist system 
was a realistic possibility. 

The argument that communism will not produce 
a genuinely international system is supported by 
what may be seen as a major cause of communist 
intrasystem difficulties: a doctrinal disdain for 
governmental and diplomatic forms of action and 
a contemptuous disregard for tried and tested 
methods of maintaining close relations in condi- 
tions of basic autonomy. Conceivably, a revolu- 
tionary nucleus cannot be held together with any but 
party—political links which ignore state institutions 
and the niceties of diplomatic behavior. But it is 
at least as conceivable that the communist inter- 
national system poses the conditions which, if 
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observed, would ensure the survival and possibly 
the expansion of such a system. We must leave 
this question unresolved. Even if the post-Stalin or 
post-Khrushchev communist system were to be es- 
sentially altered, the analysis of its experience 
would have served a more general purpose: it 
would have afforded an example of the operation 
and growth of extraparliamentary international 
systems that are built upon a revolutionary ideol- 
ogy and that have universal aspirations directed 
against an existing world society. 

For the present let us consider the transforma- 
tion possibilities confronting the communist inter- 
national system, other than the possibility of uni- 
versal extension. The possible transformations are 
(1) a monolithic bloc, (2) dissolution, and (3) 
entry into a “normal” world party system. A bloc 
would be likely to ensue if distinctiveness and 
separateness increased, authority structures be- 
came regularized and strengthened, economic inte- 
gration advanced, the segmentation pattern was 
changed toward greater centralization, and conflict 
containment was improved. Dissolution would be 
the consequence of opposite trends, especially if 
distinctiveness declined and the structures of au- 
thority, solidarity, and culture lost power. If indices 
could be established for each of these character- 
istics, the transformation processes could be fol- 
lowed accurately. 

The third possible transformation falls some- 
where between the two other alternatives and is 
also distinct from the “revolutionary international 
system” concept discussed above. It would involve 
the development of at least part of the communist 
system into a looser yet still ideologically oriented 
association. If the revolutionary system were to 
mellow and shed some of its more extreme mem- 
bers, the remaining elements could find allies 
among noncommunist states by an appeal to 
Marxist ideas and by a policy of mutual support 
based on a modicum of organization. If a non- 
polarized party system were to gain strength on 
a world scale—and such a trend is arguable on 
account of the growing complexity of global poli- 
tics and the resulting need for greater regularity 
and predictability—a communist “party” could be 
one of its components. The need to exercise in- 
fluence on the world stage through and with others 
makes it necessary for all governments to have 
associates, Communist participation might thus be 
acceptable as long as the aim is world influence and 
not absolute power or the overthrow of the world 
system, 
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[See also INTERNATIONAL ORGANIZATION; PARTIES, 
POLITICAL; SYSTEMS ANALYSIS, article on INTERNA- 
TIONAL SYSTEMS.] 
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I 
OVERVIEW 


“Communism” is a term with various meanings. 
When we speak of a modern state as communist, 
however, we usually mean that political power 
there is concentrated in a single party, which is 
typically controlled from above through a bureau- 
cratic structure and has gained, and ultimately 
maintains itself in, power by authoritarian means. 
Elections to public office, so far as they are held, 
are thus apt to have only a relatively formal char- 
acter. Power was often seized initially, however, 
with appreciable popular support and the ruling 
party adheres to a social philosophy derived from 
Marx. The nature of this ideology is subject to con- 
tinuing reinterpretation (and in a manner which 
would sometimes have surprised Marx), but access 
of the party to power has in fact been followed by 
extensive nationalization of private property, and 
much of the means of production, at least in the 
more strategic sectors, such as industry and trans- 
port, comes sooner or later to be publicly owned. 
The party also seeks to direct economic life 
generally. 

These are cardinal features of the contemporary 
communist state, though persons considering them- 
selves communists would no doubt find fault with 
this characterization. Curiously, in communist the- 
ory strict usage also dictates that a society that 
we would designate communist should be referred 
to otherwise. Depending chiefly on the degree of 
economic advance in it and the extent of socializa- 
tion as measured by public and cooperative owner- 
ship, such a society is called either socialist or a 
people’s democracy. As for the term “communism,” 
this is reserved to describe a stage of social devel- 
opment which admittedly is yet to be achieved. 
This is the stage depicted long ago by Marx (Marx 
& Engels [1875-1891] 1938, p. 10), one which will 
be realized “after the enslaving subordination of 
individuals under division of labor, and therewith 
also the antithesis between mental and physical 
labor, has vanished; after labor from a mere means 


of life has itself become the prime necessity of life; 
after the productive forces have also increased with 
the all-round development of the individual, and all 
the springs of cooperative wealth flow more abun- 
dantly.” Only at this stage “can the narrow horizon 
of bourgeois right be fully left behind, and society 
inscribe on its banners: from each according to his 
ability, to each according to his needs.” [See MARX- 
ISM.] In this sense communism represents a form 
of utopia where there is free sharing of material 
goods. Such a society has also been called com- 
munist in the West, but our concern here is with 
the modern communist state and particularly with 
the organization of economic life in such a state. 


The Soviet model 


Communism first came to hold sway in a mod- 
ern state in Russia after the revolution of Novem- 
ber 1917. Moreover, other countries where this 
social system has since come to prevail have tended 
in economic affairs to emulate the U.S.S.R. They 
have also diverged from the Soviet model, some- 
times radically, but the organization of economic 
life in the U.S.S.R. remains of particular interest. 

Public ownership. In the U.S.S.R. public owner- 
ship of the means of production is indeed ex- 
tensive. Such property was nationalized in a whole- 
sale way shortly after the revolution, and public 
ownership has been extended even further since 
that time. Thus, a pervasive feature is the legally 
defined public enterprise (predpriiatie), which ad- 
ministers a publicly owned plant or other works in 
behalf of the government. Moreover, even where 
ownership is not public, the difference almost in- 
evitably tends to be perfunctory. To a marked de- 
gree this is true even of the collective farm, which 
with its mixed public and cooperative ownership 
represents by far the chief departure from public 
ownership as such. 

Ownership other than public, however, also 
exists in the nonfarm sector, on which attention 
is focused here, and is sometimes of some magni- 
tude. Thus, the so-called consumers’ cooperatives 
conducted 28.1 per cent of all retail trade in 1964. 
Familiar to all tourists to the U.S.S.R. is the col- 
lective-farm market, where the government allows 
the collective farm to sell surpluses which it does 
not distribute to its members and the members in 
turn to sell such as they wish of distributed sup- 
plies and of the produce from small household plots 
assigned them. The collective-farm market thus has 
much of the character of a peasant bazaar and 
represents another departure from public enter- 
prise in retail trade. Housing is usually thought of 
as a consumer good rather than a means of pro- 
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duction, but it should be observed that the govern- 
ment has also permitted extensive private owner- 
ship of such property. 

Bureaucracy. Given public ownership of the 
means of production, economic life in a communist 
state might still be organized in various ways. The 
economic system of the U.S.S.R. has long been 
characterized by three other basic features. The first 
is the pervasive bureaucracy. Public enterprises 
are directed by an extraordinarily complex adminis- 
trative apparatus beginning with the hierarchy of 
general organs of government conducting public 
affairs in all spheres: the Council of Ministers in 
Moscow, associated legislative and judicial organs 
of the all-union government, and corresponding 
bodies in 16 constituent republics and subordinate 
local administrations. In the administration of pub- 
lic enterprises in different economic sectors use has 
also been made of further complexes of specialized 
agencies, In industry, for example, authority over 
public enterprises has been vested at different times 
in such agencies as all-union and republican 
branch ministries and their subordinate depart- 
ments and all-union, republican, and regional eco- 
nomic councils. Sometimes several enterprises are 
administered through a further agency, the firm. 

The above are agencies through which general 
organs of government exercise operational control 
over public enterprises in different sectors. In the 
administration of such enterprises, use is also made 
of still other complexes of agencies of a more func- 
tional or staff nature, though inevitably these often 
have duties not very different from those of agen- 
cies exercising operational control. Among the 
more important bodies with such staff and func- 
tional responsibilities are the State Planning Com- 
mittee of the U.S.S.R., or Gosplan, which, together 
with corresponding republican and local agencies, 
has major responsibilities for formulating and ob- 
serving the fulfillment of economic plans; the 
Central Statistical Administration, which compiles 
and publishes statistical data; and the Ministry of 
Finance and the State Bank, which are concerned 
with money and finance. 

While directing public enterprises in different 
sectors, general organs of government also oversee 
cooperative and in some degree even private eco- 
nomic activities. For this purpose, they rely on Gos- 
plan and other agencies with functional and staff 
responsibilities in the public sector and sometimes, 
though not always, on those exercising operational 
control over the public sector. Consumers’ coopera- 
tives, however, are nominally administered by an 
apparatus of their own, at the apex of which is 
Tsentrosoiuz. 
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Planning. The second basic feature of the 
Soviet economic system is the all-embracing com- 
plex of plans which are intended to guide the 
economy. The most famous of these is the Five- 
Year Plan, the first of which went into effect in 
October 1928. But the Five-Year Plan is only one of 
a number of programs that the government has 
employed in directing economic affairs. From time 
to time it publishes plans of a very long-run sort, 
the most recent of these being the twenty-year pro- 
gram that avowedly is to establish “a material basis 
for communism” by 1980. Among plans of shorter 
duration, the last that was promulgated for a five- 
year interval was to have run from 1956 to 1960. 
This was superseded by the Seven-Year Plan, which 
was formulated for the years 1959 to 1965. Long- 
term plans are supposedly reflected in the plan 
drawn up for still another interval, the year. It is 
the latter plan that bears most immediately on cur- 
rent operations. For purposes of administration, 
however, the annual plan is further broken down 
into quarterly and, at least in some spheres, month- 
ly installments. 

Whatever the planning interval, ordinarily there 
is not one program but a complex of programs of 
varying scope and detail affecting operations at 
different bureaucratic levels. Thus, in industry 
there are programs for the U.S.S.R. as a whole, for 
different regions and branches. Finally, there is 
also a plan for each enterprise, that for the year 
being the tekhpromfinplan, or “technical—industrial— 
financial plan.” 

Monetary regimen. The third feature is per- 
haps not nearly as noteworthy as the first two, but 
it too is basic. This is the monetary regimen. Trans- 
fers of goods and services not only between house- 
holds and enterprises but between one enterprise 
and another ordinarily take place at established 
money prices. The household, in other words, is 
paid money wages for its labor by the enterprise 
employing it, and the household in turn is charged 
money prices for the consumer goods supplied it, 
though some goods, chiefly education and health 
care, are by and large supplied the household with- 
out charge. Similarly, each enterprise pays other 
enterprises for goods it obtains from them and is 
paid, in turn, for goods it supplies. Within the 
enterprise, records are kept of costs and income, 
and it is considered not at all incongruous that in 
a communist state the resulting profit is also 
calculated. 

Though they are largely supplied the household 
free of charge, education and health care generally 
are of such a nature that they might also have 
been made available at money prices. In addition 
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to such “communal services,” however, there are 
various “collective goods” (defense, internal se- 
curity, etc.) which because of their nature must be 
made available to the community at large without 
direct charge. In the Soviet Union the government 
finances all such activities through various money 
taxes and other levies on both public and other 
enterprises and households. Of these, the chief levy 
by far is the notorious turnover tax (nalog s 
oborota). While imposed in different ways, this is 
in effect a sales tax. The government has also seen 
fit to finance other activities through its budget 
rather than through the monetary regimen. Prin- 
cipal among these is much of the new investment 
of public enterprises. 

The monetary regimen described represents but 
a facet of a larger phenomenon, the system of 
khozraschét, or economic accounting. This system, 
other features of which will be discussed below, 
provides a basis for a distinction that otherwise 
might be difficult to make, that between economic 
agencies that may be viewed as institutions con- 
stituting the government as such and economic 
agencies acting on its behalf in the conduct of the 
public sector. Thus, under the khozraschét system 
the public enterprise usually is expected to finance 
out of its own revenue and hence without recourse 
to the government budget much or all of its cur- 
rent expenditures. Sometimes this has also been 
true of agencies immediately superior to the public 
enterprise, but generally such organizations have 
their current expenditures financed by appropria- 
tions from the government budget. As a rule, these 
budgetary organizations seem properly viewed as 
government institutions as such, while the former 
extrabudgetary organizations are seen as agencies 
acting on the government's behalf. In the U.S.S.R., 
extrabudgetary organizations are often referred to 
as being on khozraschét, or as khozraschét organ- 
izations. 


How the system functions 


Given the foregoing cardinal features, how does 
the system work and how well? The manner in 
which the economic system of the U.S.S.R. func- 
tions is a large, complex, and still often obscure 
theme. We can do little more than suggest the 
more significant patterns of operation that may be 
discerned. 

Enterprise management. An outstanding fea- 
ture of the economic system is the notable cen- 
tralization of decision making, but authority is 
unavoidably dispersed in some degree. The eco- 
nomic plans become increasingly detailed as they 


descend the administrative ladder, but even the 
agency at the bottom, the public enterprise, must 
be allowed some discretion in their implementation. 

Furthermore, contrary to a common supposition, 
the plans are not simply the creation of Gosplan, 
presented in final form for implementation to sub- 
ordinate agencies exercising operational control, 
Nor, though this is somewhat nearer the mark, are 
the plans simply the creation of Gosplan with due 
regard to directives of the party leadership and the 
all-union Council of Ministers. Rather they are the 
product of an intricate administrative process in 
which not only Gosplan, the party leadership, and 
the all-union Council of Ministers but all subordi- 
nate agencies concerned participate. Targets form- 
ulated for each administrative level are subject to 
negotiation between those in posts of responsibility, 
their superiors, and, especially at higher levels, 
Gosplan. Sometimes administrators at one level 
take the initiative in proposing targets to superiors. 
It is also the task of administrators at any level to 
elaborate general directives into more detailed 
plans. 

So far as decision making is decentralized, it 
must, of course, be subject to controls. Among 
these controls, first and foremost are the plans 
themselves. The annual plan particularly serves as 
a “success criterion” for managerial performance 
at all levels, and managerial personnel, by all ac- 
counts, act accordingly. Among the many targets 
in the plan, management has focused chiefly on 
that for total output, for apparently it has been 
especially in terms of performance in this respect 
that success has been judged. By fulfilling and 
overfulfilling the target for output, the manager of 
the enterprise and his chief assistants might earn 
sizable bonuses. Since 1959, however, priority has 
often been accorded another, related target, that 
for the reduction in costs per unit of output, and 
bonuses for management of the enterprise have 
been determined correspondingly. 

Whether the concern has been with output or 
costs, managerial personnel understandably have 
sometimes been more interested in fulfilling than 
in overfulfilling the target, for with overfulfillment 
they may only be confronted with still higher tar- 
gets in the future. Understandably, too, zeal in ful- 
fillment of the plan does not carry over into its 
formulation. A recurring complaint in the U.S.S.R. 
concerns the “safety factor,” which the manage- 
ment of the enterprise seek to establish for them- 
selves by negotiating for low plan targets. The 
agency superior to the enterprise may try to coun- 
ter this proclivity, but this agency too has reason 
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to seek to establish a safety factor in negotiating 
with agencies at still higher levels. 

As a plan target, total output usually must be 
calculated by the aggregation of a variety of prod- 
ucts in terms of their prices, and such calculation 
must also underly that of the further target for the 
reduction in cost per unit of output. In seeking to 
fulfill the plan, therefore, enterprise management 
has had to consider the relative prices of different 
products, and, as will be discussed below, such 
prices have their limitations. Partly for this reason, 
as incessant complaints in the Soviet press testify, 
the resultant assortment is very often odd. Enter- 
prises have reportedly produced, for example, an 
inordinate output of small-sized men’s shoes rela- 
tive to large-sized boys’ shoes, of bandage cloth 
relative to shirt cloth, of large electric bulbs relative 
to small, and so on. Assortment of course is itself 
subject to plan targets, but managers apparently 
find it expedient to subordinate such targets in the 
interest of fulfilling that on total output or on cost 
per unit of output. 

Quality too is a variable subject to controls, but 
these have been especially difficult to enforce. In 
the U.S.S.R. complaints about defective quality are 
hardly less numerous than those on violations of 
the assortment plan. 

Reference has been primarily to the public sector 
and to industry, but working arrangements of a 
broadly similar sort usually prevail in the nonfarm 
sphere generally. 

Coordination. At superior levels, especially in 
Gosplan, a cardinal concern must be with coordi- 
nation. This is a task with many facets, of which 
one of the chief is industrial materials supply, the 
coordination of targets for industrial goods used as 
materials in industry. For important products of 
this sort, Gosplan applies its famous “method of 
balanced estimates.” Essentially a balance is drawn 
up of supplies and requirements for each good. 
Supplies are calculated to reflect provisional tar- 
gets for production on the part of supplying enter- 
prises and for other availabilities. Requirements 
similarly are calculated to reflect provisional tar- 
gets for production on the part of consuming enter- 
prises and for other dispositions. Where supplies 
and requirements are found not to balance, the 
plan is suitably adjusted to remove the disparity. 
Such adjustments for any one product, of course, 
might necessitate further adjustments for others, 
and these might do so for still others. But adjust- 
ments apparently are imposed primarily where 
such secondary effects will be avoided: on supplies 
allotted to final uses, on production targets of sup- 
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plying enterprises without any accompanying 
change in material inputs, or on material inputs 
of consuming enterprises without any accompany- 
ing change in their production targets. 

In the latter cases, the adjustments in effect en- 
tail changes in material input coefficients. Under 
khozraschét such coefficients are supposed to take 
into consideration the possibilities of economizing 
money costs through the substitution of one ma- 
terial for another, and so far as this is done in 
practice, the coefficients must depend on the rela- 
tive prices charged for different goods. Yet prices 
of industrial materials tend to be fixed simply to 
cover average costs in a branch (including, under 
Soviet accounting practice, charges for labor, ma- 
terials, and depreciation, but not for interest on 
fixed capital), together with a small “planned 
profit,” and when once fixed are changed only in- 
frequently. To some extent, however, such prices 
are varied in ways to restrict disparities in supplies 
and requirements. In effect, while coordination is 
achieved primarily by direct administrative con- 
trols, the task is sometimes eased by use of indirect 
market controls. 

Unavoidably, however, market controls have a 
more important place in certain other spheres. The 
numerous and sometimes draconian restrictions 
which the government at different times has im- 
posed on the workers’ freedom of choice as to occu- 
pation and place of work make a story in them- 
selves, but such of these as survived the war were 
very largely abandoned by 1956. Since then the 
worker has enjoyed substantial freedom in these 
choices, though there are still some impediments 
(for example, administrative assignment of col- 
lege graduates to jobs during an initial three-year 
period). In order to recruit workers in required 
numbers in different occupations and branches, 
therefore, the government relies chiefly on a money 
wage system under which rates of pay are system- 
atically differentiated between occupations, de- 
pending on such features as skill, complexity, and 
arduousness, and between branches, depending on 
their social importance. The worker, of course, also 
has much discretion over the effort he exerts in 
any pursuit. To control this aspect, the government 
makes extensive use of piecework (in 1961 about 
60 per cent of the workers in industry were re- 
munerated on this basis) and premium systems. 

Similarly, in distributing among households 
goods for which money prices are charged, the gov- 
ernment has relied on rationing during protracted 
intervals, but the last such period was brought to a 
close in December 1947. At that time the govern- 
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ment re-established an open market where house- 
holds could choose freely among different con- 
sumer goods, and such a market has prevailed ever 
since. At the same time the government relies on 
prices to limit consumer demand to available sup- 
plies. However, for various reasons the government 
has more often than not held prices below clearing 
levels. Hence, such familiar features exist in the 
U.S.S.R. as unavailability of supplies, queues, and 
the premiums over official prices prevailing in the 
collective farm market. In the collective farm mar- 
ket, prices are unregulated by the government and 
thus tend to be fully responsive to pressures of 
demand. 

Technology and production capacity. In addi- 
tion to coordination, superior agencies are largely 
responsible for the determination of technology, in- 
cluding the crucial aspect of capital intensity and 
of production capacity in different branches. Capi- 
tal intensity has been settled on various bases and, 
for a long time, often without reference to any inter- 
est charge on capital. If by additional investment a 
given output could be produced at less cost, ex- 
clusive of interest, the investment supposedly was 
to be made, at least so far as the available supply 
of capital permitted. Recently, however, increased 
use has been made of an interest-like criterion, 
though under another name. Thus, for any two 
projects under consideration, a coefficient of capital 
effectiveness is calculated by comparing the differ- 
ence in their respective capital requirements with 
the corresponding difference in current operating 
costs. The project requiring the greater volume of 
capital is chosen only if the coefficient of effective- 
ness exceeds some rate taken as a norm. While the 
reciprocal of the coefficient of effectiveness is re- 
ferred to in the U.S.S.R. as a “period of recoup- 
ment,” this differs from the payoff period familiar 
in Western business practice insofar as in the 
Soviet Union operating costs include depreciation 
expense. 

The responsible authorities must somehow take 
into account also that, fixed in the manner de- 
scribed, prices of industrial materials in terms of 
which capital outlays and operating costs are cal- 
culated fail to reflect “scarcity values.” This is also 
true of the prices of durable capital goods, which 
are fixed similarly. In addition to the coefficient of 
effectiveness, therefore, administrators have been 
told in a methodological handbook issued by the 
Academy of Sciences to consider pertinent “physical 
indicators, characterizing the productivity of labor 

» ., expenditure of fuel, power, materials, utiliza- 
tion of equipment and productive space, applica- 


tion of progressive construction designs, and so 
forth” (Bergson 1964, p. 263). 

The principles applied in the determination of 
production capacity in different branches are not 
very clear. Among industrial branches producing 
substantially for household consumption, the re- 
sponsible agencies apparently consider the diver- 
gence of prevailing retail prices from propor- 
tionality to average branch costs, though such a 
divergence may be viewed as “normal” (for example, 
for children’s things prices are viewed as properly 
low). The degree to which supplies to households 
have fallen short of their demand at the prevailing 
prices is also considered. Recently, consideration 
has been given to “rational norms” of consumption 
for “healthy, culturally developed people,” which 
have been worked out by the Nutrition Institute of 
the Academy of Medical Sciences and by other 
agencies. 

The criteria used elsewhere must be similarly 
varied and often, too, not very incisive. Among 
basic industrial branches, for example, steel mani- 
festly has long been an unquestioned favorite, but 
in November 1962 Khrushchev attacked Gosplan 
for allowing “steel blinkers” to blind it to econo- 
mies that might be realized by use of competing 
materials, especially plastics, and apparently the 
latter products are now receiving more attention— 
though Khrushchev has been criticized in turn by 
his successors. There are reasons to think that at 
one time Gosplan also wore “coal blinkers,” but for 
some years now the Russians have been shifting 
to the more extensive use of oil which their re- 
source position permits, 

Capital formation. Last but not least, the share 
of the community's total output that is allotted to 
capital formation is also determined by superior 
agencies. Indeed, the very highest authorities seek 
to reserve this matter almost exclusively for them- 
selves, and on the whole probably succeed. A car- 
dinal concern obviously has been to provide the 
wherewithal for rapid growth and in this way to 
permit realization of the Soviet imperative “to over- 
take and surpass” the advanced capitalist countries. 
Pursuit of this goal avowedly reflects various more 
basic concerns. Thus, according to its latest (Oc- 
tober 1961) program, the party seeks in economic 
affairs “to develop the production of material goods 
for the highest satisfaction of all needs of Sovict 
man,” “to win the economic competition with capi- 
talism,” and “to maintain the defenses of the 
country at a level allowing us to smash any ag- 
gressor” (Khrushchev 1961, p. 3). 

How the party has weighed these diverse aims 
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in practice must be judged from facts such as the 
following: under the Five-Year Plans the Soviet 
economy has indeed grown rapidly, though, accord- 
ing to independent calculations, not so much as 
official claims suggest. During the period from 
1928 to 1960 the gross national product of the 
U.S.S.R. grew at an estimated average rate of 
4.5-6.3 per cent a year, or by 5.2-7.3 per cent if 
the entire growth is imputed to the 28 peacetime 
years. Of the total output measured in constant 
prices, the share devoted to investment rose over 
the same period from 12 per cent to perhaps 35 
per cent, while that devoted to personal (as distinct 
from communal) consumption declined from 80 
per cent to about 45 per cent. Though the share of 
consumption has thus fallen sharply, with output 
as a whole increasing as it has, the Soviet house- 
hold has been able to realize a marked increase in 
standards. For the most part the increase has oc- 
curred only very recently, since the years under 
Stalin were years of denial, The average Russian 
in 1962 still enjoyed a per capita consumption of 
but $485 per year. 

While seeking rapid growth, the U.S.S.R. has 
also sought to limit its dependence on foreign coun- 
tries, particularly capitalist ones. Since Stalin this 
policy of autarky has been relaxed, but in 1962 
exports still represented but 2.7 per cent of the 
national income. Hence, the U.S.S.R. must produce 
at home the great bulk of the capital goods re- 
quired to realize the high rates of capital formation. 
Also production capacity must vary correspondingly 
as between consumer-goods and investment-goods 
branches. As the rate of investment has risen, there- 
fore, the output of investment goods necessarily 
has increased more than that of consumer goods. 
In the U.S.S.R. this disparity of tempo has been 
described as an economic law. Often, however, the 
reference is actually to divisions within industrial 
output rather than within total output. 

As the foregoing bare bones only begin to sug- 
gest, superior agencies in the U.S.S.R. bear oner- 
ous responsibilities. That the patterns described 
sometimes seem of a rather arbitrary sort must be 
read partly in this light. Indeed, as the activities 
of the superior agencies are examined more closely, 
arbitrariness only becomes the more manifest. In a 
modern, complex economy such as that of the 
Soviet Union, the responsibilities might have been 
onerous in any case, but they become the more so 
because of the pervasive limitations in money 
prices. Given these limitations, superior agencies 
must somehow digest and react to even more in- 
formation than would confront them otherwise, 
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information not only of a summary, monetary sort 
but of a detailed physical sort. Alternatively they 
must risk miscalculation by relying on the former 
alone, and this too occurs. 

Labor theory of value. In the economy gen- 
erally, to what extent have operational patterns 
conformed to the labor theory of value? [See 
VALUE, LABOR THEORY OF.] In the U.S.S.R. the 
government has been held to be the master rather 
than the servant of “economic laws,” and among 
the laws in question is Marx's “law of value.” Yet 
the law has a place, though apparently now a 
diminishing one. Consider, for example, the prac- 
tice of calculating costs without any allowance for 
interest on fixed capital and also the related prac- 
tice of neglecting such a charge in the choice of 
technology, though here ideology seems by now to 
have been largely vanquished by scarcity, The labor 
theory is often not especially incisive regarding 
economic calculation, but it has tended to stand in 
the way of its more incisive competitor, marginal 
value theory as now understood in the West. 

Comparative efficiency. In economics, how well 
an economic system works usually is taken to turn 
primarily on its efficiency, or the effectiveness with 
which available resources are used to produce 
wanted goods and services. The Soviet economic 
system manifestly must often leave something to 
be desired from this standpoint. This is true of the 
nonfarm sector, and needless to say, the perform- 
ance is no better when agriculture, particularly that 
part under the notorious collective farm, is con- 
sidered. But the chief rival system, Western pri- 
vate enterprise, has its limitations, too. These in- 
clude, perhaps most notably, monopolistic business 
practices; trade union restrictions; distorting gov- 
ernment measures, including farm controls, sales 
and income taxes, and tariffs; and cyclical and 
other extrafrictional unemployment. Which of the 
two systems is the more efficient? 

In speculating about this complex question, we 
may consider, in addition to the sources of waste 
in the two systems, some related quantitative evi- 
dence: in 1960 the net national product of the 
U.S.S.R. per unit of factor inputs, including labor 
and reproducible capital, was less than one-half 
that of the United States. Depending on the 
method of measurement, the Soviet output per unit 
of factor inputs might be less than one-third that 
of the United States. Reference is to the net na- 
tional product less certain items for which a satis- 
factory comparison of productivity is infeasible: 
personal services in education, health care, govern- 
ment administration and the like, and housing serv- 
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ices. Correspondingly, the labor and capital em- 
ployed in providing these services are omitted from 
factor inputs. 

However, factor productivity and economic ef- 
ficiency are not the same thing. Without any dif- 
ference in economic efficiency, two economies 
might differ in factor productivity as calculated be- 
cause of differences in labor skill, quality and quan- 
tity of farmland, and so on. Then, too, comparative 
data on factor productivity reflect the compara- 
tive volume of output but not how desirable it is in 
structure. The comparison of factor productivity 
has been held, however, to create a presumption 
that Soviet communism is less efficient than West- 
ern private enterprise, at least as the latter is ex- 
perienced in the United States (Bergson 1964, 
pp. 340 ff.). 

In the U.S.S.R., however, has not the cardinal 
concern been growth rather than efficiency? And 
from this standpoint is not the Soviet performance 
impressive after all? As we saw, the Soviet econo- 
my has grown rapidly under the Five-Year Plans. 
At least this was so until around 1958; since that 
year the tempo has fallen. But for the inefficiency, 
however, the government might have been able to 
invest and hence expand output still more, though 
the intense pursuit of growth no doubt has some- 
times been itself a cause of inefficiency. Even from 
the government's standpoint, therefore, efficiency is 
the more basic standard. What goods are wanted 
and to what extent depends, of course, on the ends 
sought, and hence so must efficiency. But the 
Soviet economy must often fall short of its poten- 
tial to produce goods of any sort that might be 
desired. Moreover, the ends sought in the U.S.S.R. 
are special, but not so much as is sometimes sup- 
posed. Thus, the government values investment 
goods highly, but obviously it also values consumer 
goods in some degree. 

Rapid growth, of course, does not itself signify 
efficiency—though it is sometimes maintained that 
it does. Rather, growth in the U.S.S.R. has been 
induced by various factors different from, though 
not unrelated to, efficiency. The high tempo at 
most testifies not that efficiency is high but that it 
probably is rising from an initially low level. 
Among the chief growth-inducing factors are the 
high rate of investment, which has been the more 
potent because of the relatively limited capital 
stock initially available; the extraordinary oppor- 
tunities to borrow Western technology, which have 
compounded still more the effect of the high rate 
of investment; and probably the opportunity to 
raise labor quality rapidly from an initially low 
level. 


Is not the economic system, however, in the 
process of change? And are not new measures even 
now being introduced or considered that will raise 
its efficiency? The Soviet economic system has 
always been in flux, but lately it has been espe- 
cially so. While we have sought to describe it as it 
has prevailed recently, interesting changes are in 
progress and further ones no doubt are in the offing. 
Among other things, in order to facilitate planning, 
the government is beginning to use mathematical 
techniques and electronic computers. Also, indus- 
trial enterprises are being given greater autonomy, 
with sales and profitability becoming major criteria 
of their success. The government is also reforming 
financial practices and industrial prices, But how 
such changes might affect efficiency is a question 
which must be left for the future to answer. 

Income distribution. In addition to efficiency, 
how well an economic system works is usually 
judged in terms of the equity of income distribu- 
tion. Here too the precise standard to be applied 
depends on the ends. Moreover, while the degree of 
inequality of incomes is of interest from the stand- 
point of almost any ends, this is a matter that is 
difficult to judge in the case of the U.S.S.R. because 
of the government’s secrecy regarding earnings, 
especially of the more favored groups. As we have 
noted, however, wages are differentiated in a man- 
ner more or less similar to that obtaining under 
private enterprise. Sizable differences in earnings 
have been reported in many spheres. The inequality 
of incomes generally must be less than in, say, the 
United States, but at least if account is taken of 
taxes and perquisites, one wonders whether the 
difference is really very marked. 


The merit of an economic system such as that 
of the U.S.S.R., where public ownership is pre- 
dominant, has long been a subject of theoretic de- 
bate in economics. So far as we may judge from 
the Soviet experience to date, critics of this system 
would seem closer to the mark than proponents. 
As research proceeds, it may be hoped that we will 
gain further insight into this weighty theme. 

The U.S.S.R. has served as a model in economic 
affairs for other communist countries, though they 
have sometimes diverged markedly from it. Among 
the more important of such countries, those in 
eastern Europe—with the exception of Yugoslavia 
—have economic systems that are broadly similar 
to that of the U.S.S.R. However, private enterprise 
in agriculture in the form of peasant farming is 
still predominant in Poland. This is also true in 
Yugoslavia, where, in addition, decision making is 
relatively decentralized. Correspondingly, there is 
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greater reliance than in the U.S.S.R. on indirect 
market controls. Reforms entailing increased de- 
centralization and reliance on markets are also be- 
ing introduced elsewhere in eastern Europe and 
reportedly are to be especially sweeping in Czecho- 
slovakia and Hungary. Despite doctrinal differ- 
ences with the U.S.S.R., China continues to organ- 
ize its economy in a manner often similar to that 
of the U.S.S.R., but the complex and rapidly shift- 
ing scene there defies easy summarization. 


ABRAM BERGSON 


[See also PLANNING, ECONOMIC, article on EASTERN 
EUROPE; ECONOMIC THOUGHT, article on SOCIALIST 
THOUGHT; ECONOMIC DATA, article on THE SOVIET 
UNION AND EASTERN EUROPE; and the biography of 
BaRONE.] 
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n 
AGRICULTURE 


Until very recently agriculture has nearly always 
been ranked toward the bottom in the ordering of 
priorities in communist economies. The leaders of 
communist states have generally decided that rapid 
economic growth and the development of economic 
power required a concentration of resources in in- 
dustry, and especially in heavy industry, In pursu- 
ing their goal of rapid growth of heavy industry, 
they have demanded great sacrifices from their 
peoples and have tended to neglect the development 
of agriculture, which produces consumer goods and 
raw materials. Capital for the development of in- 
dustry has been obtained from agriculture by con- 
fiscation of land and compulsory deliveries of farm 
products to the state, often at prices below cost. 
Exports of agricultural commodities have been used 
to pay for imports of industrial equipment. Invest- 
ment and current inputs into agriculture have been 
severely restricted, and the movement of the better 
educated and more productive labor from the coun- 
tryside into industry has been fostered. 
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In recent years, the communist leaders have 
made some revisions in their economic priorities. 
In the Soviet Union the stagnation of agriculture 
had become a threat to the future rate of industrial 
growth by the time of Stalin’s death, and during the 
period 1954-1958 Khrushchev sought to achieve 
better balance between agriculture and industry. 
The proportion of total investment going to agri- 
culture was increased, the prices the government 
paid farmers for their products were raised, and 
a number of changes in the organization of agri- 
culture were made. Similar changes were reflected 
in the policies of the European satellites, 

In the Soviet Union a large share of the increased 
resources devoted to agriculture during this period 
went into the expansion eastward into the new 
lands. These measures, together with good weather, 
brought about a substantial increase in Soviet agri- 
cultural output by 1958. However, since 1958 there 
has been little increase, and in 1961 and 1962 fur- 
ther reorganizations of the agricultural administra- 
tive apparatus were undertaken in an attempt to 
stimulate the lagging agricultural sector. 

In Communist China a drastic drop in farm out- 
put after 1958 and the spectacular collapse of the 
“great leap forward” in 1960 brought about the 
adoption of more realistic measures, with a shift 
in priority from the development of heavy industry 
to the improvement of agriculture. Collectivization 
was relaxed, and measures were undertaken to pro- 
vide greater incentives for farm production. 

Domestic importance and foreign trade. Agri- 
culture is a major sector of the economy in all 
countries under communist rule. In the Soviet 
Union agriculture uses about 45 per cent of the 
total labor force and in recent years has received 
about 15 per cent of the total investments, while 
producing about a fourth of the national income. 
In the European satellites the proportion of the 
labor force engaged in agriculture ranges from 
about 18 per cent in East Germany to over two- 
thirds in Rumania, while agriculture’s contribution 
to the national income ranges from less than 10 
per cent in Czechoslovakia to one-third in Bulgaria. 
In Communist China more than 80 per cent of the 
labor force is occupied in agriculture, which pro- 
duces about 50 per cent of the national income. 
North Korea and North Vietnam are also primarily 
agricultural countries. 

Farm products and products manufactured from 
raw materials produced by agriculture are impor- 
tant in the trade of the communist countries. In 
the period 1955-1961 agricultural commodities ac- 
counted for about 21 per cent of the Soviet Union's 


total exports and 25 per cent of its total imports, 
Grains and cotton have made up the bulk of the 
Soviet Union's exports of agricultural commodities, 
Although normally a large grain exporter during 
the postwar period, mainly to the European satel- 
lites, in 1963 the Soviet Union became a net im- 
porter of grain. This was a result of a very poor 
grain crop following several mediocre harvests, 
which probably had caused a decline in stocks 
of grain. 

China has exported farm products and products 
manufactured from agricultural raw materials to 
pay for imports of machinery and heavy industrial 
equipment. Until the ideological break with the 
Soviet Union the trade of Communist China had 
been oriented mainly toward the Soviet bloc. How- 
ever, the Soviet Union's imports of farm products 
from China declined 90 per cent between 1959 and 
1961. Since 1960 China has been forced by serious 
food shortages to import large amounts of grain, 
mainly from Western countries. 

Organization in the Soviet Union. Between the 
revolution of 1917 and the year 1928 small-scale 
peasant farming predominated in the Soviet Union, 
with only a very small proportion of farm labor 
working on state and collective farms. In 1929 
Stalin initiated mass collectivization of agriculture, 
which was carried through rapidly by various 
means of coercion, including mass deportation and 
heavy taxation. In the early 1930s the present 
forms of agricultural organization became domi- 
nant. The collectives were given a legal basis by 
the Collective-Farm Statute of 1935, which re- 
mained essentially unchanged until 1956. 

Agriculture in the Soviet Union is made up mainly 
of the world’s largest farms (the state farms and 
the collective farms) and some of the world’s small- 
est agricultural enterprises (tiny private plots that 
belong for the most part to farm workers). There 
is a less important group of medium-sized farms 
owned by factories and other government organi- 
zations. 

The collective farms, which occupy nationalized 
rent-free land, are nominal cooperatives with elected 
chairmen and management committees. However, 
the state and party exercise close control over the 
collectives, and party officials can readily determine 
the “election” of the chairman as well as reprimand 
or dismiss him (Nove 1961, p. 47). 

The collective farm is the dominant form of 
agricultural unit in the Soviet Union. However, the 
proportion of the total sown area encompassed by 
collective farms declined from 84 per cent in 1953 
to about 60 per cent in 1960 because of the con- 
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version of collectives to state farms and to the pre- 
dominance of state farms in the new lands to the 
east. As a result of the amalgamation into larger 
farms and the conversion to state farms the num- 
ber of collective farms also declined, from more 
than 250,000 in 1950 to 53,400 in 1960. In 1960 
the average collective farm encompassed nearly 
6,800 sown acres and was worked by the labor 
from nearly 400 households. 

The collective farmers are not paid wages as are 
workers in state-owned enterprises, but rather are 
residual claimants to the income of the collectives 
after the state’s claims, production expenses, and 
investment needs have been met. The share which 
the collective farmer receives varies with the skill 
required in the work he performs. In recent years 
steps have been taken to distribute the collective 
farmers’ share in quarterly or monthly payments 
rather than in a single annual payment. 

Until 1958 most of the farm machinery used 
on the collectives was operated by state-owned 
Machine Tractor Stations (MTS), which supervised 
much of the collective farms’ activities and per- 
formed machinery work on contract. The farms 
paid for the machinery services of the MTS with 
produce, In January 1958 it was decided to liqui- 
date the MTS and sell their machinery to the col- 
lectives (the kolkhoz). 

The state farm, or sovkhoz, which is considered 
in the Soviet Union to be ideologically the most 
desirable form of organization for agriculture, is 
becoming more important, accounting for a third 
of the total sown area in 1960. The state farms are 
owned by the state, and the farm workers are paid 
wages like factory workers. State farms tend to be 
more specialized than collective farms. They have 
a higher labor productivity, mainly because they 
have received most of the investment which the 
state has made in agriculture (Walters 1963, p. 
8), have not been burdened with excess labor, and 
have offered higher incentives than have the col- 
lectives. The state farms are huge organizations, 
with an average in 1960 of nearly 22,500 sown 
acres and about 750 workers per farm. 

The private plots which the farm workers are 
permitted to cultivate and the livestock which they 
own are remnants of legal private enterprise in the 
Soviet Union. The plots are considered to be ideo- 
logically undesirable but are tolerated for prag- 
matic reasons. Although they occupy only a little 
more than 3 per cent of the total sown area, they 
produce a significant share of the total output of 
some important items of food. The farmers and 
their families consume part of the produce from 
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the plots, and the remainder, when sold—largely 
in free markets—contributes a significant share of 
the farmers’ money income. The ideological un- 
palatability of the private plots and their competi- 
tion for the labor time of the farmers have at times 
prompted the authorities to limit their operation 
through acreage restrictions, taxes, and other 
means. 

Organization in eastern Europe. With the ex- 
ception of Yugoslavia and Poland, the organization 
of agriculture in eastern Europe is generally pat- 
terned after that of the Soviet Union. In Yugo- 
slavia and Poland, although collectivization of 
agriculture remains a long-run goal, 85 to 90 per 
cent of the farmland still is privately farmed. In 
the rest of eastern Europe a drive to complete the 
collectivization of agriculture, which had lagged in 
several countries, has been undertaken in recent 
years, and by 1962 the collectivized sector encom- 
passed about 90 per cent or more of the arable 
land. Collectivization has been accomplished by 
the expropriation or purchase of privately owned 
land, taxes on private farmers, and a preferential 
policy toward the collectivized sector in the alloca- 
tion of inputs and in the prices paid by the state 
for farm products (U.S. Department of Agricul- 
ture . . . 1964, p. 100). 

Agriculture was socialized slowly in Poland, and 
despite great effort less than one-fourth of the 
total agricultural area was socialized by the middle 
of 1956. After the disturbances of October 1956, 
most of the collective farms were dissolved and 
the total area in state farms was reduced. 

In Yugoslavia forced collectivization was aban- 
doned in 1951. Most of the collective farms were 
disbanded by the end of 1953, and by 1962 the 
socialist sector occupied only 12 per cent of the 
arable land. Peasants’ farms averaged less than 12 
acres in size in 1962. They were limited by law 
to a maximum of 25 acres. Expansion of the so- 
cialized area in Yugoslavia is accomplished by 
means of the purchase of privately owned land by 
collective and state farms. 

Organization in Communist China. The agri- 
cultural policy of China has been designed to in- 
crease agricultural output (both for export and for 
feeding the growing urban population) with mini- 
mum state investment, while maintaining central 
political control over the countryside. The Chinese 
communists have made strenuous efforts to achieve 
a breakthrough in agricultural as well as in indus- 
trial production. During the decade from 1950 to 
1960 the leaders experimented with various com- 
binations of centralized and decentralized control 
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of the rural areas. China’s problem was more dif- 
ficult than that facing the Soviet Union before 
collectivization in that Chinese agriculture did not 
produce substantial surpluses which could readily 
be expropriated by the state. 

By 1952 the landlord class was essentially elim- 
inated and their land was distributed to poorer 
peasants. In 1954 and 1955 compulsory delivery 
quotas were established to provide the state with 
more farm products, and by the winter of 1955 the 
regime had decided on full collectivization. 

In 1955 and 1956 private farmland was abol- 
ished and cooperatives were established with party 
cadres in charge. It was expected that the cadres 
would ensure the meeting of the state’s needs for 
agricultural commodities, set aside larger shares 
for investment, and make effective use of season- 
ally unemployed labor in the construction of roads, 
irrigation works, and other projects. It was thought 
that collectivization would permit more rapid intro- 
duction of improved technology and more efficient 
use of the land (Perkins 1964, p. 219). Also, col- 
lectivization was desirable for political and ideo- 
logical reasons. 

Although regulations permitted the continuance 
of private plots, their size and the time available to 
work them were reduced, and the free market was 
closed. The output and availability of products from 
the private plots fell so drastically that the free 
market was soon reopened, however, and the size 
of private plots was increased. During 1957 some 
of the larger cooperatives were reduced in size and 
some authority was shifted to the subunits, the pro- 
duction brigades. 

The net effects of the collectivization and cen- 
tralization were unimpressive. In addition to the 
fostering of bureaucratic mismanagement, the in- 
centives of the individual farmers to produce had 
been affected adversely. 

In 1958 the regime embarked on the rapid estab- 
lishment of the communes, a more radical form of 
organization of agriculture. Where the average co- 
operative had two hundred families, the typical 
commune had four thousand to five thousand. The 
communes took on additional functions, including 
responsibility for expansion of large-scale rural 
construction projects and control over local com- 
mercial, credit, and manufacturing establishments. 
Private plots and the free market were eliminated. 

The communes were not subjected to effective 
financial or administrative restraints while the cam- 
paign atmosphere prevailed, and the pressure for 
spectacular results was such that ill-considered 
projects and procedures were widely adopted. Ex- 


treme demands were made on the farm people, and 
fantastic claims of success were made. Responsi- 
bility for the collection of statistics on agricultural 
production was given to the communes, and the 
Statistics became extremely unreliable. The com- 
munes widened the distance between the workers’ 
efforts and their rewards even further than did the 
cooperatives because wages were established on the 
basis of the income of the entire commune (Per- 
kins 1964, p. 231). 

Despite the shortcomings of the communes, the 
harvest of major crops in 1958, aided by excellent 
weather, was good. However, the regime later ad- 
mitted that early claims of agricultural production 
were greatly exaggerated. The unusually good har- 
vest of 1958 and the lack of reliable statistics made 
it difficult to ascertain the effects of the establish- 
ment of the communes, but by early 1959 the 
regime decided that output of the products formerly 
grown on the private plots had been seriously af- 
fected. The rural markets were reopened, but with 
tighter controls than formerly, and in 1960 private 
plots were restored. 

The revolutionary development of the communes 
ended with the restoration in 1961 and 1962 of 
agricultural ownership and control to lower levels 
and with a decision to cut back on investment in 
heavy industry in favor of agriculture. Controls 
over production decisions and wage units were 
passed down to the subunits of the communes, 
until the rural institutions and levels of authority 
were essentially similar to those existing before 
the establishment of the communes. Although the 
communes remained as paper organizations, they 
had been stripped of their significant functions 
(Perkins 1964, p. 234), 

Marketing and prices. In essence, communist 
economies are guided by central authorities toward 
centrally determined goals. However, managers of 
enterprises must necessarily have a degree of au- 
tonomy, especially in day-to-day operations. Al- 
though the managers are generally not free to react 
to prices to maximize profits, prices are expected 
to provide incentive and thus to assist in guiding 
the performance of the economy. But the function 
of prices is severely restricted, and, with the excep- 
tion of limited free markets, prices are not free to 
respond quickly to supply and demand conditions. 

The agricultural price systems of the communist 
countries are complex and have gone through nu- 
merous modifications. The analyst constructing even 
a simple index of output is forced to make a num- 
ber of quite arbitrary choices of prices for weights. 
The economist attempting to evaluate the perform- 
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ance of communist economies must constantly 
keep in mind the limited functions of prices and 
the specific peculiarities of the price system rele- 
vant to the problem with which he is concerned. 
For agriculture, one peculiarity of the price system 
arises from the communists’ refusal to permit land 
rent to perform its allocative function. There have 
at times been attempts to vary the requirements for 
compulsory deliveries to the state as a substitution 
for differenial rents, but these variations have been 
too crude to be effective. One of the effects of the 
elimination of rents is to penalize the farms with 
poorer land. 

The use of a system of compulsory deliveries at 
very low prices and a system of multiple prices for 
the same product, with little logical relation be- 
tween the prices of various products, greatly dimin- 
ishes the value of prices as a measure of economic 
performance. The shortcomings of the agricultural 
price system have prevented the managers from 
developing useful cost-of-production estimates as 
guides for the allocation of resources. Since a large 
proportion of payments to collective farmers have 
been made by distributing commodities, it is ex- 
tremely difficult, if not impossible, to measure labor 
costs. In addition, the costs of farm machinery and 
other inputs often have little relation to their value 
on the farm. 

After a few years of free prices in the NEP (New 
Economic Policy) period, the Soviet Union estab- 
lished a system of compulsory deliveries of farm 
products to the state at very low prices, constituting 
in effect heavy taxes on the farm sector. Quantities 
over the compulsory quota were sold for higher 
prices, Prices paid by the state for most food com- 
modities changed very little until 1953, although 
the prices of some industrial crops were raised dur- 
ing the period and were much more favorable to 
the farmers. During the period 1953-1955 the 
prices paid farmers were greatly increased, but the 
system of compulsory deliveries at low prices and 
over-quota sales at higher prices was continued. In 
addition, the farms paid the Machine Tractor Sta- 
tions (MTS) in commodities for the performance 
of machinery operations. 

In 1958, in connection with the abolition of the 
MTS, the entire agricultural price system in the 
Soviet Union was overhauled. The state now pays 
each farm a single price for each product, with 
regional variations in the prices. The regional vari- 
ations in prices are intended to reflect, in part, 
differences in average production costs, and thus 
they are, to some degree, land-rent differentials. 
The tax element in the prices for a number of 
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products was reduced or eliminated, that is, there 
remained little difference between the price paid 
the farmers and the retail price, after deducting 
handling costs. However, grain and sugar beets 
were still purchased at below value prices. In 1962 
and 1963 further increases were made in the prices 
of grain, potatoes, sugar beets, livestock products, 
and cotton. 

The agricultural price system in eastern Europe 
has generally been patterned after that of the 
Soviet Union, although there are variations from 
country to country, and the Soviet Union’s restruc- 
turing of farm prices in 1958 has not been copied 
uniformly. The state pays low prices, in many in- 
stances below cost, for compulsory deliveries of 
fixed quantities of agricultural products. The state 
also contracts with producers for the delivery of 
additional products at fixed, but sometimes higher, 
prices. After the compulsory quotas and contracted 
quantities are delivered, the producer sells his re- 
maining available products to the state, at prices 
which are sometimes higher, or on the free market, 
at prices determined by supply and demand. In 
Yugoslavia compulsory deliveries of farm products 
to the state were abolished in 1953. Although the 
state and the purchasing agencies agree on the 
maximum prices which they will pay, the ceilings 
are sometimes ineffective when supplies are short 
(U.S. Department of Agriculture . . . 1964, p. 113), 

In Communist China from 1950 to 1953 the 
state bought agricultural products from the peas- 
ants at centrally determined prices and collected 
an agricultural tax in kind, 90 per cent of which 
was paid in grain. Compulsory quotas were later 
introduced for most of the major crops. It was nec- 
essary to reduce quotas in 1956 in order to leave 
enough to the peasants for seed and feed as well 
as for their own consumption. In 1956 and 1957 
the purchase prices were adjusted in an attempt to 
control the relative production of major crops, but 
the results were disappointing. 

Because the cooperatives generally had to pay 
their own way they had a considerable incentive to 
adjust production in response to changes in pur- 
chase prices, although other factors often severely 
limited the adjustment. However, when the com- 
munes were established allocations were made 
directly to the communes from state budgetary 
funds and the credit available to agriculture was 
greatly expanded. The increased availability of 
funds, together with the campaign atmosphere 
of the commune drive, encouraged the local com- 
mune authorities to ignore economic criteria and 
concentrate on the production of a few key com- 
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modities emphasized by the campaign. For these 
and other reasons the production of subsidiary 
products fell sharply. 

Trends in output. The collectivization drive in 
the Soviet Union started in 1928. The peasants re- 
sisted by slaughtering their livestock as well as in 
other ways, and, after adjustment for territorial 
changes, agricultural output exceeded the 1928 
level in only two of the years between 1929 and 
1951. In 1952 agricultural output was only six per 
cent greater than in 1928 on comparable territory. 

After Stalin's death in 1953, concern over the 
impact of the stagnation in agriculture on future 
industrial growth stimulated a series of measures, 
including the development of the new lands and 
an expansion of the area planted to corn to support 
a desired expansion of the livestock industry. The 
expansion of acreage together with a good harvest 
of grain in the Ukraine in 1955 and a good harvest 
of wheat in the new lands in 1956 brought about 
substantially increased output. In 1958 excellent 
weather and increased output of livestock products 
resulted in the largest total agricultural production 
up to that time. 

The seven-year plan, initiated in 1958, called for 
a 70 per cent increase in agricultural output by 
1965, but the years after 1958 were disappointing, 
bringing little increase, The failure to increase pro- 
duction, while demand continued to rise, apparently 
caused a decline in grain stocks. Poor weather in 
1962 and 1963, together with the perennial short- 
comings of Soviet agriculture, caused agricultural 
output to fall sharply in 1963. Grains, especially 
wheat, were the hardest hit. The Soviet Union in 
the latter part of 1963 and the early part of 1964 
contracted to buy more than 10 million tons of 
wheat and flour, mostly from Western countries. 
Though some of the imported grain was to be re- 
exported to eastern Europe, this trade situation was 
in sharp contrast to the 1959-1962 grain-exporting 
position when between 4.8 and 6 million tons were 
exported annually, 

In East Germany and Czechoslovakia agricul- 
tural output had still not regained prewar levels 
by the early 1960s. However, output in the rest of 
the eastern European satellites was somewhat above 
the prewar level. Per capita agricultural output in 
all of east Europe other than the Soviet Union 
barely exceeded the prewar level in 1961 and 1962 
and fell below it in the following two years. 

Communist China apparently had a record har- 
vest in 1958, although the early extravagant claims 
were later admitted by the regime to have been 
exaggerated. The authorities have released no offi- 
cial data on agricultural production since 1959, but 


production has apparently failed to keep pace with 
population growth. Mainland China has had chronic 
food shortages since 1960 and has been importing 
5 to 6 million tons of grain annually. 

Labor productivity. Although lack of informa- 
tion makes it impossible to calculate reliable meas- 
ures of agricultural labor productivity for most 
communist countries, it is clear that the produc- 
tivity is quite low. It has been estimated that before 
World War 11 the net agricultural production per 
person depending on agriculture was about 4.5 
times as large in the United States as in the Soviet 
Union (Jasny 1949, p, 442). By the early 1960s 
this ratio of comparative labor productivity had 
widened to perhaps 7 or 9 to 1 (Walters 1963, pp. 
4, 13). The Soviet Union's attempt to reduce the 
labor committed to agriculture has met with only 
moderate success. The number of people defined 
as the agricultural labor force declined slowly be- 
tween 1940 and 1953 but rose for a period after 
1953 in connection with settlement of the new 
lands. It has resumed its decline since then. 

One of the reasons for the low labor productivity 
in Soviet agriculture as compared with the United 
States is the relative scarcity of capital. Although 
investment in agriculture has increased during the 
last decade, the stock of capital equipment has 
remained relatively small. For example, despite the 
fact that the Soviet Union has about 60 per cent 
more land in crops than does the United States, in 
1962 Soviet agriculture had only one-fourth as 
many tractors, one-half as many grain combines, 
and slightly more than one-fourth as many trucks. 
Soviet agriculture is especially wasteful of labor in 
animal husbandry, in part because of poor manage- 
ment and labor-wasting practices and in part be- 
cause of a lack of labor-saving equipment. 

Although its labor force is very large, Soviet 
agriculture suffers from seasonal labor shortages. 
Especially during the harvest, peaks of demand for 
labor create shortages, and the Soviet press often 
stresses the need for young farmworkers, espe- 
cially for skilled operators of equipment. A large 
part of the agricultural labor force consists of older 
persons and women with families—people who 
cannot readily be adapted to other kinds of work or 
transported to distant jobs. 

The agricultural labor situation varies widely 
among the other east European countries. In East 
Germany the loss of agricultural labor to the cities 
and to West Germany has been at too great a rate 
to be adequately balanced by the increases in labor- 
saving machinery, although the number of tractors 
tripled between 1950 and 1961. The shortage of 
labor has been so great that members of the armed 
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forces and urban workers have been required to 
help with the harvest. 

In Czechoslovakia the agricultural labor force de- 
clined about one-third from 1950 to 1961, while 
agricultural output rose about 10 per cent. Agri- 
culture occupied about 38 per cent of the total 
labor force in 1950 and 23 per cent in 1961. 

In Rumania, up to the early 1960s, the agricul- 
tural labor force remained at about the level of 
1938. However, during the harvest season it is 
necessary to obtain additional labor from factories, 
schools, and the armed services. 

In Communist China the rural-to-urban shift in 
population has had little impact on the farm labor 
force, although between 1950 and 1956 the popula- 
tion of the cities rose by more than 30 million, or 
55 per cent. After the collapse of the “great leap 
forward,” the lack of jobs in the cities and the des- 
perate need for food impelled the regime to insti- 
tute a campaign to return people to the countryside. 

In their pursuit of rapid industrialization, China’s 
leaders attempted to make use of their relative 
abundance of rural labor by embarking on a cam- 
paign to establish large numbers of industrial en- 
terprises in rural areas. The program was intended 
not only to make use of surplus rural labor but also 
to compel the people to work harder than ever 
before. The most spectacular enterprises in rural 
industrialization were the back-yard furnaces built 
during 1958-1959 to produce pig iron. But the de- 
mands of this program on the health and strength 
of the people were excessive, and most of the prod- 
uct was worthless. The program was largely aban- 
doned in 1960. 

Problems of analysis. The study of the eco- 
nomics of agriculture in communist countries is 
complicated by several serious difficulties [see Eco- 
NOMIC DATA]. There are large gaps in the available 
statistics, which often are of questionable reliabil- 
ity. The limited functions of prices or their function 
as taxing devices—as in obligatory deliveries at low 
prices—often results in their being of little use as 
criteria of economic rationality or performance or 
as weights for the construction of indexes. The ideo- 
logically determined refusal of the communists to 
charge explicit land rent complicates the analysis 
of most agricultural economic problems. Finally, 
because centralized decision making is dominant 
in communist countries and because the decision 
makers are often much more concerned with politi- 
cal rather than strictly economic goals, an eco- 
nomic analysis sometimes may be largely irrelevant 
or at least peripheral to the heart of the problem. 

The interpretation of official data has always 
presented major difficulties for students of Soviet 
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agriculture. A number of prominent statisticians 
were purged in the Soviet Union in the late 1920s, 
and agricultural statistics became largely subordi- 
nated to the goals of the central policy makers. 
From 1933 to 1953 the official statistics on agri- 
cultural output were generally seriously biased up- 
ward, being based on estimates of the crop in the 
field, which did not, as is normally done, take 
account of losses in harvesting. Indexes were cus- 
tomarily published without any indication of what 
prices were used as weights, thus making the in- 
dexes impossible to reconstruct and extremely dif- 
ficult to evaluate. Important data often were not 
published at all. For example, although the publi- 
cation of statistics on agricultural production in- 
creased somewhat after the death of Stalin in 1953, 
the statistics on the size of grain harvests continued 
to be restricted as a state secret until 1958. The 
official estimate of the disastrous grain harvest of 
1963 has not been released to date. 

Although the present official estimates of crop 
production in the Soviet Union apparently are in- 
tended to be in terms of “barn yields,” there are 
serious questions as to the comparability and valid- 
ity of some of the data, These questions arise in 
part from a lack of precise information about the 
procedure and concepts behind the statistics. An- 
other reason for considerable skepticism about some 
of the statistics on the agricultural output of the 
Soviet Union in recent years is the amply reported 
incidence after 1957 of increased statistical mal- 
practices at lower levels. Greed and the pressure 
of unfulfillable goals apparently motivated many 
officials to falsify production data reported to higher 
levels. Such practices are especially prevalent in the 
agricultural sector because it is impossible for cen- 
tral authorities to verify many of the statistics on 
agricultural production (Willett 1962, p. 97). 

While the agricultural data for the other east 
European countries present problems similar to 
those of Soviet data, the data for Communist China 
are generally even less satisfactory in quality and 
much more limited in quantity. The problem of 
measuring the performance of Communist Chinese 
agriculture with any degree of precision is made 
impossible by the lack of reliable data, both for the 
communist and the precommunist period. In fact, 
none of the major statistical elements required for 
evaluation, such as population, food consumption, 
acreages, or yields, can be considered satisfactory. 
The data for the years before 1950 are meager and, 
especially the national aggregates, can be used only 
with major reservations, From October 1949, when 
the communist government was set up, through 
1952 the agricultural-output statistics were pro- 
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duced by extremely unreliable procedures. Although 
the establishment of the State Statistical Bureau in 
1952 probably brought about some improvement in 
procedures, the “great leap forward” of 1958 and 
the accompanying decentralization of statistical 
control caused a virtual breakdown of the state 
statistical system. The decentralized local opera- 
tions produced highly exaggerated statistics on na- 
tional agricultural production. Although the State 
Statistical Bureau’s system was re-established in 
1959 and 1960 and agricultural output surveys 
were introduced on a national scale in 1959, the 
operations of the system were placed under party 
committees at the local levels. The system was to 
operate under the “partisanship” principle, which 
required that the statistical work be programmed 
and developed to serve the political objectives of 
the party. There are no official data available on 
agricultural production since 1959. 


JoserH W. WILLETT 


[See also AGRICULTURE and ECONOMIC GROWTH.] 
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ur 
PUBLIC FINANCE 


The fiscal systems of the communist countries 
are in some respects similar to those of comparable 
capitalist nations, while quite different in other 
respects. The similarities and differences stem 
largely from similarities and differences in eco- 
nomic and social institutions and in economic and 
political goals. The communist countries as a group, 
however, have many institutions and goals in com- 
mon and have, therefore, very similar fiscal sys- 
tems. These similarities are, furthermore, probably 
even greater than they would otherwise have been, 
because the budgets of the smaller communist na- 
tions were originally patterned after that of the 
Soviet Union. The discussion which follows will 
concentrate, therefore, on the Soviet budget as 
prototypical, 

General characteristics. A major difference from 
the capitalist systems derives from the fact that 
the central governments of the communist nations 
are, relative to state and local governments, so 
much more powerful than the central governments 
of many Western nations, Furthermore, the gov- 
ernments of the communist nations use this power 
to centrally plan most of the economic activity 
which takes place within their borders. Correspond- 
ingly, the fiscal emphasis in these nations is also 
centralized. The Soviet state budget, which is rati- 
fied by the Supreme Soviet each year, is not just 
the budget of the “union,” or central government, 
but rather represents a consolidation of the budgets 
of all political units in the Soviet Union: the union, 
republican, and numerous levels of local budgets. 
Whereas in the United States, for example, the 
federal, state, and local governments each operates 
its budget in substantial independence of one an- 
other, in the Soviet Union the consolidated over-all 
budget is the majcr instrument of central govern- 
ment policy. 

The budgets of the communist nations tend to 
be larger and broader in coverage than those of 
capitalist nations. The Soviet budget is almost one- 
half as large as Soviet gross national product 
(GNP), or roughly double the proportion to GNP 
of the sum of U.S. budgets at all levels of govern- 
ment. This is explained by the fact that almost all 
investment, education, health, and research in the 
Soviet Union are controlled by the state and fi- 
nanced through the budget, whereas in capitalist 
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nations a large part is privately controlled and 
financed. Furthermore, each of these activities in- 
volves a claim on national resources which is rela- 
tively larger in the Soviet Union than in most 
capitalist nations. 

Because all industry in the Soviet Union is na- 
tionalized and its activity planned by the state, 
one might be led to assume, by analogy with capi- 
talist practice, that the Soviet budget would be 
larger than it is. In capitalist countries, the gross 
receipts and expenditures of government enter- 
prises (such as the post office) are often included 
in the budget. This was true also in the Soviet 
Union in the period of “war communism,” from 
1918 to 1920. Since then, however, most state 
enterprises have operated on an independent finan- 
cial basis (a so-called khozraschet basis) in which 
they sell their products for money and use the 
resulting receipts to finance normal operating ex- 
penditures, neither the receipts nor expenditures 
ever appearing in the budget. 

The nationalization of industry does change, in 
a fundamental way, the significance of taxes which 
are collected from industrial enterprises and which 
are of overwhelming importance in the communist 
countries, In capitalist nations, the fiscal process 
can be viewed as one in which the state divides the 
tax burden between households and private busi- 
nesses (and their owners). Where industry is na- 
tionalized, taxes paid by state-owned enterprise 
cannot be viewed realistically as taxes on an inde- 
pendent sector. It appears most appropriate to view 
such taxes, when they are levied on enterprises 
producing consumers’ goods, as sales taxes on the 
population, since their entire incidence is to raise 
the prices of such goods and to reduce the real 
purchasing power of household earnings. Taxes on 
capitalist enterprise are also partly on the consumer 
(to the extent that they are shifted) but fall on 
businesses (as legal entities ) and their owners (a 
special class of households) as well. It is perhaps 
worth noting that Soviet economists do not view 
taxes paid by nationalized industry as taxes on the 
household at all but rather as a transfer to the state 
of surplus value created by rising industrial pro- 
ductivity. 

One final important difference between the fiscal 
systems of planned and unplanned economies 
should be indicated. In the communist countries, 
the state economic plan sets forth the level and 
distribution of economic resources necessary for 
the fulfillment of national objectives. The financial 
plan, including the budgets, is a reflection of the 
economic plan and is designed to ensure that the 
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economic plan is implemented without inflation or 
other undesirable consequences. Financial factors 
do not, however, in any substantial way constrain 
the operation of the plan. In capitalist countries, 
on the other hand, and particularly in peacetime, 
budgetary factors must be taken into account in 
the formation of economic policies. Insufficiency 
of funds, because it is not possible to increase 
either taxes or the public debt, is an important vari- 
able in the policy makers’ considerations. 

Budget revenues. Revenues of the consolidated 
state budget of the U.S.S.R. for the years 1955 and 
1960 are presented in Table 1. 


Table 1 — Revenues of the consolidated state budget of 
the U.S.S.R. (billions of post-1961 rubles*) 


1955 1960 
Indirect or commodity taxes 37.1 53.6 
Turnover tax 24.2 31.3 
Deductions from profits of state enterprises 10.3 18.6 
Payroll tax 2.6 3.7 
Direct or income taxes 4.8 5.6 
Sales of state bonds to population 3.1 b 
Other revenues® 11.4 17.9 
Total revenues 56.4 774 


a. In 1961 the Russians revalued the ruble to reduce all prices, wages, and 
other financial magnitudes to one- h their previous values, Following 
Soviet practice, these figures are presented in the new 1961 rubles. 

b. Negligible. 

€. Described in text. 


The outstanding feature of the Soviet tax struc- 
ture is the predominance of indirect or price-increas- 
ing taxes. These taxes amount to more than 60 
per cent of the budget revenues of every eastern 
European nation. 

The Soviet turnover tax has the impact of an 
excise or sales tax, as it applies at present only to 
consumers’ goods (with a few minor exceptions 
like petroleum and petroleum products where the 
turnover tax substitutes for explicit rent payments). 
The tax is, however, usually levied at early stages 
in the production process of consumers’ goods and 
is therefore largely hidden from the consumers’ 
view. The rates are highly differentiated, ranging 
from 1 per cent of selling price on some commodi- 
ties to as much as 90 per cent on others. Since 
1955 the average rate of turnover tax in the Soviet 
Union has varied from 40 to 50 per cent of price. 
The turnover tax is the major source of budget 
revenue in every European communist country. 

The deduction from profits is not a tax on state 
enterprise for reasons noted above. However, the 
deduction from profits, since it is part of profits 
over and above costs, adds to the price paid by the 
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consumer. In this respect, it does not differ from 
the turnover tax and can properly be considered a 
tax on the household. Profits are used to make pay- 
ments into the Directors’ Fund (a profit-sharing in- 
centive-type fund used for bonuses, workers’ hous- 
ing, and so forth) and to finance part of the 
investment in fixed and working capital planned 
for the enterprise. Any surplus of profits above 
these needs is deducted into the budget, In recent 
years, about two-thirds of total profits have been 
paid into the budget. 

The payroll tax, or “social insurance markup” 
as it is called by the communist nations, is derived 
by adding to the cost of commodities some 5 to 10 
per cent of the wage fund of each enterprise. The 
receipts from this tax are presumably earmarked 
for sickness and old-age insurance. 

The current extensive use of these indirect taxes 
by communist nations is surprising in view of the 
fact that Marxist and early Soviet writings excori- 
ated the use of such taxes as regressive, hence 
socially inequitable. The fact that such taxes are 
nevertheless relied upon attests to their superiority 
for the purpose of the planned economies. At least 
two important advantages of indirect taxes over 
direct taxes should be mentioned. First, the indirect 
taxes are cheaper to administer and more difficult 
for the population to evade. They are collected from 
thousands of enterprises rather than millions of 
households. This was particularly important in the 
early stages of development of these nations before 
the administrative apparatuses of the government 
were adequately developed and before literacy was 
widespread. 

A second factor has to do with the so-called 
“money illusion” hypothesis which argues that 
workers are more conscious of the impact on their 
economic position of changes in wages than of the 
impact produced by changes in prices. A corollary 
to this is the hypothesis that workers are more 
sensitive to changes in direct taxes (take-home 
pay) than to changes in indirect taxes (reflected 
in higher prices), The money illusion, therefore, 
would cause an income tax of given size to have 
a greater negative impact on work incentives than 
an indirect tax of equal revenue. This is important 
for two related reasons. First, the communist na- 
tions, like Western nations, rely very heavily on 
differential wages to allocate labor. Second, as in- 
dicated above, the direct role of government in 
communist nations is so much greater than in capi- 
talist nations that the rate of taxation is necessarily 
much higher. In the U.S.S.R., for example, taxes 
presently take about one-half of household income; 
in earlier years, taxes were even higher. If the 


U.S.S.R. relied largely on income taxes to maintain 
monetary stability, and these taxes were progres- 
sive as income taxes almost always are, it seems 
clear that the monetary incentives created by the 
differential wage structure would be largely blunted 
—take-home pay would be substantially equalized, 
On the other hand, the indirect taxes probably have 
a much lesser disincentive effect. They are not 
progressive and are largely invisible. 

Direct taxes on the population have never been 
very significant revenue earners and are, in fact, 
presently scheduled to be abolished in the U.S.S.R. 
gradually over the next few years. The major direct 
tax is the income tax on the urban population. At 
one time this tax was considered important as an 
instrument of class policy, in the sense that it was 
differentiated to discriminate very heavily against 
occupations the government wished to discourage. 
Thus, while workers and salaried employees paid 
a very low rate of tax, which progressed to a peak 
of only 13 per cent, doctors and lawyers with pri- 
vate practices, priests, artisans not belonging to 
cooperatives, and petty entrepreneurs paid at higher 
rates which reached peaks of from 55 to 65 per 
cent. 

The rural counterpart to the income tax, a S0- 
called agricultural tax, is levied on collective farm- 
ers. The tax is progressive, reaching a peak rate 
of 48 per cent. That its major purpose is to dis- 
courage collective farmers from spending too much 
time on their private plots of land at the expense 
of their work on the collective is suggested by the 
fact that income from working on the collective 
farm is exempted from the tax. 

Until 1957 sales of so-called mass-subscription 
bonds to the population brought in almost as much 
revenue to the budget as did direct taxes. In fact, 
these bonds may well be considered to have been 
another form of direct levy on the population, Con- 
siderable social pressure was brought to bear on 
the population to subscribe from two to four weeks’ 
wages a year, the appropriate amount being de- 
ducted, like the income tax, every month from 
wages at the place of work. Social pressure was 
necessary to get subscriptions because the standard 
of living was low, prices were rising rapidly (until 
1948), the interest rate on bonds has been only 
3 to 4 per cent in the postwar period, and the bonds 
were basically inconvertible until maturity. In later 
years, in an effort to increase the attractiveness of 
the bonds, bond holders did not receive interest as 
such, but instead, lotteries were held in which the 
few winners received all the interest in the form of 
large prizes. In 1957 sales of mass-subscription 
bonds were discontinued; at the same time, a mora- 
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torium on repayment of 20 to 25 years was de- 
clared. At present, small amounts of lottery bonds 
are sold annually on a purely voluntary basis. Until 
their discontinuation, the mass-subscription bonds 
were looked upon by the state as a regular source 
of revenue with none of the deficit-covering impli- 
cations of government borrowing in the West. 

The budget receives revenue from a number of 
other minor sources, many of which are unspeci- 
fied. Profits taxes are levied on industrial and agri- 
cultural cooperatives (the collective farms). In 
1960 receipts from these organizations amounted 
to 2 per cent of total revenue. Customs and other 
receipts from foreign trade operations are still an- 
other source of revenue. Other income sources are 
a fee for inheritance certificates (there is no ex- 
plicit inheritance tax), sales of state property, fees 
for commercial forestry and fishing, income from 
revaluation of inventories, various small local fees 
and taxes, and gross receipts of organizations, like 
the post office, which are not on an independent 
financial basis. 

No mention has been made, so far, of taxation 
in kind of agriculture, because it never directly 
provided the state with funds nor appeared explic- 
itly in the budget. The tax in kind takes the form 
of compulsory deliveries of agricultural products 
by collective farmers and peasant farmers to gov- 
ernment procurement agencies. These deliveries are 
based on units of tillable land and numbers of ani- 
mals. While the farms and farmers are not uncom- 
pensated for their deliveries, until recently the 
prices paid by the state were usually far below costs 
of production. At the same time they were only a 
fraction of the retail price at which the state resold 
agricultural products to the urban population. The 
high retail price was achieved by superimposing a 
turnover tax on the procurement price plus costs 
of processing and distribution. The portion of the 
turnover tax collected by virtue of below-cost pro- 
curement price can be viewed as the monetary 
equivalent of the tax in kind on agriculture rather 
than as a sales tax on the urban household. In 
recent years the tax in kind equivalent of compul- 
sory deliveries (now contractual deliveries) has 
been sharply reduced as a result of the much higher 
procurement prices which the state now pays for 
agricultural products. 

Budget expenditures. Expenditures of the con- 
solidated State Budget of the U.S.S.R for 1955 and 
1960 are presented in Table 2. The largest item of 
budget expenditure is “Financing the national econ- 
omy,” to be expected where most economic activity 
is nationalized. The major expenditure (roughly 
half) in each of the listed subsectors is for invest- 
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Table 2 — Expenditures of the consolidated state budget 
of the U.S.S.R. (billions of post-1961 rubles) 


1955 1960 
Financing the national economy 23.5 34.1 
Industry and construction 11.0 15.6 
State agriculture 5.1 4.4 
Agricultural procurement 8 5 
Trade (domestic and foreign) TI 3.6 
Transportation and communications 2.0 2.8 
Municipal economy and housing Cd 3.2 
Other 2.6 4.0 
Social—cultural measures 14,7 24.9 
Education and science 6.9 10.3 
Health and physical culture 3.1 48 
Social welfare 47 9.8 
Defense 10.7 9.3 
Administration 1.3 1.1 
Loan service 1.4 7 
Other 2.5 3.0 
Total expenditures 54.1 73.1 


ment in and repair of fixed capital. Other important 
expenditures are for the expansion of working capi- 
tal, subsidies to cover losses, training of workers, 
experimentation and design, geological prospecting, 
and stockpiling of material reserves. 

Expenditures for “Social-cultural measures” are 
next in importance to “Financing the national econ- 
omy.” Outlays for scientific research have risen 
rapidly in recent years and in 1960 amounted to 
almost one-third of the expenditures listed under 
“Education and science.” “Social welfare” includes 
social security and insurance, aid to mothers, and 
other such transfer payments. These have been 
increasing rapidly of late as a result of the recent 
increase in size of pensions and because of the 
forthcoming extension of social security and in- 
surance to the collective farm sector. 

The “Defense” category is defined by Russian 
textbooks to include the expenditures of the Min- 
istry of Defense for wages and material allowances 
of personnel, purchase of supplies and repair of 
combat equipment, support of military institutions 
and schools, military construction, and so forth. 
Presumably some of what might legitimately be 
considered part of the Soviet defense effort is not 
included in the explicit “Defense” item. For ex- 
ample, defense-connected research is probably fi- 
nanced under the science part of “Education and 
science,” and it has been suggested that expendi- 
tures for atomic energy are included under “Financ- 
ing the national economy.” Finally, it is possible 
that some unspecified military expenditures are 
included in the “Other,” or residual items, two of 
which appear in Table 2. 

“Administration” includes, as in most countries, 
the support of the traditional political, judicial, and 
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other such departments of the government. The 
final “Other” category is known to include internal 
security outlays, loan service, and some reserve 
funds. 

The pattern of Soviet budget expenditures indi- 
cated above has been followed fairly closely by al- 
most all the communist nations. There were some 
deviations, however. Almost all the nations devoted 
about 10 per cent more of their budgets to “Financ- 
ing the national economy” and from 5 to 10 per 
cent less to “Defense” than the Soviet Union. An 
exception was Outer Mongolia, whose budget ex- 
penditures on the national economy amounted to 
only about one-third of the budget and whose de- 
fense expenditures were about as large a proportion 
as that of the Soviets. The importance of “Social— 
cultural measures” expenditures in the budget was 
relatively low in China (12 per cent) and Bulgaria 
(21 per cent) and relatively high in Czechoslovakia 
(39 per cent) and in the German Democratic Re- 
public (38 per cent), As in the Soviet Union, “De- 
fense” was much larger than “Administration” in 
Czechoslovakia, Rumania, Bulgaria, Albania, and 
China but much lower in Hungary and the German 
Democratic Republic. The significance of these 
comparisons is reduced to the extent that they may 
result in part from differences in expenditure clas- 
sification. 

Budgetary imbalance and monetary stability. 
Success or failure of the budgetary policies of the 
communist nations cannot be judged by observing 
the budget accounts in isolation, Although the 
budget is undoubtedly the most important financial 
institution, it is not the only one, and its policies 
must be viewed in the context of over-all financial 
policy. There seems little doubt that the budgets of 
the communist countries are designed to have a 
deflationary impact on the economy. With the ex- 
ception of the years 1941-1943 for the Soviet 
Union, and 1950 and 1955-1956 for China, the 
budgets of the communist nations have returned 
surpluses for all years for which data are available. 
In part, these surpluses may be explained by the 
practice of including income from sales of govern- 
ment bonds as a regular item of budget revenue. 
However, as noted above, this is appropriate in 
circumstances where sales of bonds are compul- 
sory, the bonds are not convertible until maturity 
and are not purchased by banks through printing 
of money. 

Most of the communist nations have had strong 
inflationary pressures which, despite the existence 
of deflationary budgets, have not always been under 
control. This suggests that in the other financial 


institutions of the communist nations, offsetting 
inflationary policies have been pursued. The major 
culprit in the Soviet economy has been the Gos- 
bank, or state bank. The state bank finances through 
short-term loans the expansion of the temporary 
working capital needs of state enterprises. One 
major source of its funds is the surplus deposited 
with it by the state budget. To the extent that 
short-term loans outstanding increase by more than 
the budget surplus, the state bank creates money 
(abstracting from other items in the bank balance 
sheet). If the amount of money created is large 
relative to the increase in output and if other (real) 
factors are conducive to inflation, then upward 
wage and price pressures tend to develop. 

In recent years, the price levels of the commu- 
nist nations have substantially stabilized relative to 
the earlier years. In the Soviet Union, two decades 
of inflation ended with the currency reform of 1947 
and prices have been stable or declining ever since. 
One cause of this relative stability has to do with 
budgetary policy: it is the fact that the budget 
surplus has risen relative to the expansion of short- 
term bank credit. Second and third causes, perhaps 
as important, if not more so, are the facts that the 
Soviet government now has better direct control 
over rising wage levels than it had in the past and 
that the pressure on resources of overfull-employ- 
ment planning has apparently been eased in the 
postwar period. 

FRANKLYN D, HOLZMAN 


[See also TAXATION.] 
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Iv 
INTERNATIONAL TRADE 


Until communist regimes established themselves 
or were established in a number of countries in the 
aftermath of World War 1, the Soviet Union was 
the only country claiming to be “socialist” (com- 
munist), and naturally, its trade was wholly with 
the “capitalist” world. However, the principles upon 
which it based its foreign trade and the organiza- 
tional structure of its trade have changed little and 
have greatly influenced the procedures of other 
communist countries. 

Historical development. After the seizure of 
power in 1917, the communist regime took all 
foreign economic transactions into its own hands, 
declaring a state monopoly of foreign trade. This 
principle meant at first simply that no private firm 
or individual could trade with foreign countries 
without specific authorization by state organs. It 
was a necessary reaction to the chaos of war and 
civil war, a means of preserving the pitifully small 
reserves of gold and foreign exchange and of com- 
bating black-marketeering and speculation. During 
the “war communism” period, from 1918 to 1921, 
the state nationalized all industry and trade, but 
there followed a period of relative economic liberal- 
ization, the New Economic Policy (NEP), during 
which some private industrial and internal trading 
activities were again allowed. However, foreign 
trade was deemed to be part of those “commanding 
heights” which it was essential to preserve in the 
hands of the communist state. 

In 1920 a people’s commissariat for foreign trade 
was created, and foreign trade deals were almost 
wholly the responsibility of this ministry and of the 
specialized trading departments which were created 
under it. It is true that so-called cooperatives were 
allowed to engage in trade and to maintain repre- 
sentatives in foreign countries (e.g., Amtorg in the 
United States and Arcos in Great Britain). How- 
ever, these were mere devices to cope with such 
problems as nonrecognition of the Soviet govern- 
ment by the United States or prejudice against 
dealing with Soviet trade officials. It made no prac- 
tical difference to the planning of foreign trade. 
Whenever possible the Soviet authorities estab- 
lished trade delegations with diplomatic status 
which were associated with Soviet embassies in 
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foreign countries, These included representatives 
of the principal specialized trading departments 
most concerned in trade with that country. 

This pattern has continued. The people’s com- 
missariat of trade, like other commissariats, was 
renamed “ministry” in 1946. There is such a min- 
istry in each communist country; each has trading 
departments or corporations specializing in imports 
or exports (or both) of particular commodities, 
ship chartering, or other dealings abroad. With few 
exceptions this ministry and its organs exercise 
the monopoly of foreign trade as the sole inter- 
mediary between state enterprises and foreign 
countries. In other words, a doctrine which was 
designed to prevent private entrepreneurs or pri- 
vate traders from buying and selling abroad is now 
interpreted to mean something rather different: 
that state-salaried directors of domestic economic 
enterprises may not engage in deals with any for- 
eign country, including, as we shall see, any com- 
munist country. 

To explain why this has come about, let us re- 
turn to the experience of the Soviet Union. By 
1929, private enterprise in industry and trade had 
been virtually wiped out, and the Soviet Union was 
launched upon its industrialization and collectiviza- 
tion programs. There developed a planning system 
which, in essentials, survives to this day. All state 
enterprises are told by the planning organs what 
to produce, to whom to deliver and at what prices, 
and from whom to buy and at what prices. Material 
balances are drawn up of all principal commodi- 
ties, both for the current year and for use in plan- 
ning future growth. It is then determined whether, 
now or in the future, the country’s needs for par- 
ticular items can be met out of domestic production 
or whether imports are necessary, Similarly, mate- 
rial balances show export availabilities. Naturally, 
in the process of planning constant adjustments 
are made in the internal plans as well as in the 
foreign-trade plans. Sometimes domestic users have 
been deprived of much-needed commodities, which 
were then used to pay for urgent imports. A well- 
known example is the export of wheat to pay for 
capital equipment in the early 1930s, a time when 
many were hungry or even starving. This was a 
matter of political decision about priorities, which 
was reminiscent of trade planning in “capitalist” 
countries in wartime. 

In such circumstances it is natural that all trade 
deals are centralized under a specialized ministry, 
which works closely with the planning organs. As 
will be shown this version of the principle of the 
“monopoly of foreign trade” leads to many practical 
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Table 1 — Growth of foreign trade and national income in the eastern 
European countries and the Soviet Union, 1960 to 1965 
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VALUE IN 1962 
(millions of 
current dollars’) 1960 
Albania 
Exports e 42.8 
Imports e 49 
National income c 0.4 
Bulgaria 
Exports 770 22.3 
Imports 780 9.2 
National income © 7.0 
Czechoslovakia 
Exports 2,193 n7 
Imports 2,070 13.3 
National income g 8.2 
East Germany 
Exports 2,353 3.3 
Imports 2,372 8.9 
National income e 46 
Hungary 
Exports 1,100 13.6 
Imports 1,148 23.1 
National income e 10.1 
Poland 
Exports 1,646 15.7 
Imports 1,886 5.3 
National income € 4.5 
Rumania 
Exports 818 37.2 
Imports 941 29.1 
National income e 10.8 
Seven eastern European countries 
Exports 8,800° 12.8 
Imports 9,197 12.1 
National income € 67 
Soviet Union 
Exports 7,035 2.2 
Imports 6,450 11.0 
National income « 77 
Eastern Europe and Soviet Union? 
Exports 15,9159 8.1 
Imports 15,6472 11.6 
National income e 7# 


PERCENTAGE INCREASE IN VALUE FROM CORRESPONDING PERIOD OF PREVIOUS YEAR 


Plonned annual 


1961 1962 First holf 1963 1963 plan rote to 1965° 
7 A ; 26.4 
105-11: 
~109 : i a pee 
58 8 z 11.0 93 
159 16.3 10 10.6 12.5 
53 172 20 10.8 F 
28 “ k j 99 
61 72 -r P 94 
n.5 23 -8.4 3 67 
64 05 A " 73 
32 4a 13 15 93 
21 7.0 ~6.5" a 72 
37 42 s 2 68 
177 69 3.5 6 10.0 
51 ne 73 10° 59 
58 5.0 : 7-8 63 
13.4 9.5 15 6.0 92 
128 11.8 48 3.2 49 
81 1P y 5 70 
10.5 3.2 s si 
25.8 15.5 h sa bis 
10.1 65 t, 13" 9.3-10.3 
9.0 70 F ial s 
8.8 9.1 z; ay e 
6.3" 3) x 5 7? 
78 17.3 ali ; 
35 mot 2 And Cob 
68 6.0 x 7 7.2-7.4 
8.5 1.3 7 ofa i 
6.6 9.79 c e J 94 
oy sp e e 7” 


a. Plan data for trade and all data for national income are at constant prices, except those for East Germany. 
b. The periods covered are 1959-1965 for the Soviet Union and East Germany, 1960-1965 for Rumania, 1961-1965 for other countries. Some of these plans were 


in process of revision when table was constructed, 
. Not available. 
. Estimate by the Secretariat of the Economic Commission for Europe. 
. First quarter. 
Approximate. 
. Excluding Albania. 


oye mean 


Foreign Trade (Pravda, February 18, 1959). 


. National aggregates converted to a common basis of valuation by applying official exchange rates; they should be regarded os no more than indicators of the trend. 
The lower figure was written into the plan as a target for trade with socialist countries. The higher figure represents on aspiration expressed by the Minister for 


Source: Adapted from Economic Bulletin for Europe, vol. 15, no. 1, p. 24. 


difficulties. However, it is a logical and integral 
part of the system of internal planning established 
under Stalin, and it has not proved easy to reform 
the system after his death. 

In the Stalin period Soviet trade policy was 
greatly influenced by the need to achieve self- 


sufficiency, especially in capital goods. This could 
be explained by military security considerations, 
especially in the 1930s. This tendency toward 
autarchy, which continued, though not to so marked 
an extent, into the postwar period, affects in vary- 
ing degree all communist countries. It has two 
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Table 2 — External trade of six eastern European countries, by main regions, 1960 to 1962 


Bulgaria 
Total” 
Soviet Union 
Other eastern European countries 
Asian planned economies 


Czechoslovakia 
Total 

Soviet Union 
Other eastern European countries 
Asian planned economies 
Western Europe 

of which, EEC 
Rest of the world 

of which, less-developed regions® 


Hungary 
Total” 
Soviet Union 
Other eastern European countries 
EEC 
Less-developed regions® 


Poland 
Total 

Soviet Union 
Other eastern European countries 
Asian planned economies 
Western Europe 

of which, EEC 
Rest of the world 

of which, less-developed regions“ 


Rumania 
Total 

Soviet Union 
Other eastern European countries 
Asian planned economies 
Western Europe 

of which, EEC 
Rest of the world 

of which, less-developed regions" 


Soviet Union 
Total? 

Other eastern European countries 
Asian planned economies 
Western Europe 

of which, EEC 
Rest of the world 

of which, less-developed regions* 


Value in 1962, 
millions of 


current dollars 1960 1961 


770 
387 
213 


1,491 
(1,357) 


EXPORTS IMPORTS 
Annual percentage Percentage Valuein 1962, Annual percentage Percentage 
increase share millions of increase share 
1962 1962 current dollars 1960 1961 1962 1962 
22.3 15.9 16.3 100.0 780 9.2 5.3 17.2 100.0 
18.5 9.6 15.0 50.3 440 16.0 7.0 23.6 56.3 
178 257 10.0 27.6 183 18.0 54 —1.4 23.4 
8.9 5.7 —50° 0.7 6 6.2 46.2 —20.0° 0.8 
11.7 6.1 ES 100.0 2,070 13.3 11.5 2.3 100.0 
12.1 83 16.1 37.7 781 54 41 19.1 37.7 
12.9 11.3 13.0 32.6 648 21.4 20.0 1.6 31.3 
—16 —63.3 —46.6 1.2 45 —1.8 —47.4 —20.3 2.2 
63 10.3 —5.2 15.5 359 21.1 80 —2.5 173 
(14.8) (14.4) (—1.8) (6.9) (157) (18.1) (17.8) (—1.8) (7.6) 
25.0 19.9 —2.9 13.0 237 16.7 48.1 —22.5 11:5. 
(24.6) (23.9) (—2.8) (11.6) (202) (13.5) (44.5) (—14.1) (9.8) 
13.6 17.7 69 100.0 1,148 23.1 5,1 11.9 100.0 
22.9 294 18.4 35.7 417 18.5 176 169 36.3 
10.7 26.1 4,0° 33.4" 366° 26.1 —4.2 19,2» S12 
5.5 7.3 74 10.1 133 37.9 21 —1.1 11.6 
14.0 34.0 9.5 7.0 68 16.4 34.5 33.7 6.0 
15.7 13.4 9.5 100.0 1,886 5.3 12.8 11.8 100.0 
24.7 24.3 17.2 34.5 578 2.9 5.3 18.0 30.6 
15.3 9.2 5.6 23.7 579 5.5 17.4 22.1 30.7 
10.8 —41.6 —36.0 1.4 32 —121 —51.0 16.0 1.7 
9.6 8.2 12.8 28.8 435 1.0 14.5 5.7 23,1 
(4.4) (11,8) (11.4) (10.4) (134) (11,3) (—9.0) (—2.1) (7.1) 
14,1 310 —14 11.6 262 27.4 34.9 —7.7 13.9 
(17.5) (35.8) (—7.1) (8.6) (129) (40.4) (4.4) (13.6) (6.8) 
37.2 10.5 3.2 100.0 941 29.1 25.8 15.5 100.0 
12.5 248 —2.2 42.0 371 13.3 124 24.1 39.4 
54.9 —10.9 15.0 23.8 233 28.3 21.2 10.5 24.7 
18.6 —65.6 —29.1 1.3 15 —9.5 —11.6 —40.2 1.6 
89.1 23.0 —2.7 23.4 262 95.4 56.6 12.0 27.9 
(87.0) (22.0) (9.8) (15.4) (157) (119.0) (30.3) (28.5) (16.7) 
48.1 43.3 29.4 9.5 60 27.4 53.3 32.2 6.4 
(49.7) (31.3) (33.5) (8.8) (52) (4.6) (75.8) (33.7) (5.5) 
2.2 7.8 17.3 100.0 6,450 11.0 100.0 
S 9.6 165 56.5" 3,588" 11.9 55.6" 
—14.5 —40.0 —14.4 7.0 695 —I17.6 10.8 
10.8 21 8.8 15.3 1,143 27.4 17.8 
(6.9) (9.6) (7.4) (6.2) (537) (56.2) (8.3) 
43 98.6 47.8 21.2 1,024 46.1 15.9 
(—5.2) (115.6) (56.7) (19.3) (817) (29.5) (12.7) 


a. Total is not sum of subitems, because of exclusion of the rest-of-the-world category. 


iz 


In general, “less-developed regions’ 


Estimate by the Secretariat of the Economic Commission for Europe. 
1" includes all trade for which origin and destination are not specified. The spectacular rise in exports to that category in 


1961 and 1962 was largely due to the inclusion of Cuba (e.g., Soviet exports to Cuba multiplied by four in 1961 alone). 


d. Details may not add to total because of rounding. 


e. Excluding Albania, 


Source: Adapted from Economic Bulletin for Europe, vol. 15, no. 1, p. 25. 


explanations. The first is fear that the foreign 
country may upset the plan by acting in some un- 
predictable way; thus a communist partner may 
alter its internal plan and cancel trade deals (this 


has happened several times) or a Western country 
may impose some unexpected embargo, or import 
duty, or add a desired item to a list of banned 
“strategic” goods. All this makes it seem easier to 
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rely on supplies from within the country. Second, 
the same tendency is encouraged by the bureau- 
cratic complexities encountered by attempts to 
import. These, in turn, are contributed to by the 
chronic and sometimes critical shortage of foreign 
exchange, which persistently plagues all communist 
countries. 

After World War 11 trade between the Soviet-bloc 
countries grew rapidly, stimulated particularly by 
Western restrictions imposed at the time of the 
Korean War, but trade with China fell very sharply 
after 1960, After Stalin’s death trade with most 
Western countries and also with the underdevel- 
oped countries rose very fast from very low levels. 
An important exception is the United States, which, 
unlike its European allies, has continued a severely 
restrictive policy. Tables 1 and 2 show the pattern 
as it has developed. 

Analysis of policies. In analyzing trade policies 
and problems, it is best to consider separately three 
kinds of transactions: those with the industrialized 
West, with underdeveloped countries, and with 
other communist states. While there are some 
questions common to them all, there are some 
peculiarities which it is necessary to dwell on. In 
addition it is important to bear in mind one further 
distinction, that between the U.S.S.R. and China 
on the one hand and the other communist coun- 
tries on the other. The U.S.S.R. and China cover 
immense territories and are to a considerable ex- 
tent self-sufficient. By this is not meant that foreign 
trade is unimportant. It is undeniably a source of 
essential materials, machinery, and know-how. 
However, the total volume of imports or exports 
represents a very small proportion of their gross 
national product, in the case of the U.S.S.R. about 
3 per cent, Consequently, foreign trade is not among 
the most important factors governing growth, and 
domestic policy and economic organization are un- 
likely to be decisively influenced by foreign-trade 
considerations. For example, it is hardly likely that 
Soviet economic reforms would occur merely be- 
cause the existing arrangements are inconvenient 
for foreign trade, By contrast, the smaller coun- 
tries—notably Czechoslovakia, Poland, Hungary, 
East Germany, and increasingly also Rumania— 
depend greatly on foreign trade, which for them is 
the bottleneck in any plan of economic growth. 
Therefore the inconveniences of the “traditional” 
Soviet trading system matter very much more for 
them, and the pressures for change are greater. 

Trade with Western industrialized countries. 
The U.S.S.R. and its allies buy machinery and 
equipment and also a number of minerals and 
semimanufactures from Western countries. In addi- 


tion heavy imports of grain have been necessary, 
especially in years in which weather adversely af- 
fected the harvest. There is reluctance to spend 
scarce foreign exchange on manufactured con- 
sumer goods, Exports to Western countries have, 
of course, increased, but there have been serious 
problems in the way of finding more of the right 
goods to export. This has been partly a matter of 
availability of goods of the right type or quality 
but is also due to Western restrictions. These affect 
a wide range of agricultural products which these 
countries could export, as well as Soviet and Ru- 
manian oil and many other products, Some of these 
restrictions are of a discriminatory character, that 
is, they apply only to goods from the Soviet bloc, 
and this is the subject of bitter complaint, under- 
standably enough. Discrimination against trade 
with communist countries has been introduced on 
the following grounds. First, it was feared that the 
communist traders would resort to “dumping” and 
so disorganize the market. This fear does not, as a 
rule, go with the belief that the dumping and dis- 
organization are intended. On the contrary, it is 
appreciated by most serious analysts that the Soviet, 
Czech, and other traders are under instructions to 
sell as dear as possible to maximize foreign cur- 
rency earnings. However, they are also under in- 
Structions to sell, and, if faced with the need to 
cut prices, would undercut Western commercial 
competitors, A second reason for Western caution 
is fear of excessive dependence on supplies which 
might be cut off in the event of political trouble. 
This fear also affects, for equally valid reasons, the 
trading policies of communist countries. The can- 
celing by the Germans, in 1963, of contracts for 
steel pipe, although this item was not on the list 
of strategically restricted Western exports, must 
have sharply reminded Soviet planners of this risk. 
A third cause of difficulty is the Western strategic 
list, which, though considerably pruned, still in- 
cludes some items of no direct military significance 
which the Soviet bloc wishes to buy. Since the 
American interpretation of what is “strategic” is 
more stringent than western Europe's, snags arise 
when British or German chemical machinery hap- 
pens to include some American-patented part or 
process. Fourth, discriminatory measures in West- 
ern countries are due to the belief that they are 
being discriminated against, Since state organs are 
the only buyers, it is open to the communist author- 
ities, by unpublished interoffice memoranda, to 
tell their traders to buy in one market rather than 
another. The result is that Western negotiators 
press for quotas for their goods, especially “less 
essentials,” in elaborate bilateral negotiations while 
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imposing quotas on a range of “communist” goods 
of types which enter from elsewhere without quan- 
titative restriction. 

Trade with developing countries. Soviet-bloc 
trade with developing countries may be divided into 
two parts. The first, traditional in character, con- 
sists in the purchase of raw materials (e.g., rub- 
ber, cocoa, tin), often with sterling obtained from 
a favorable trade balance with Great Britain. The 
second, which became significant in and after 1956, 
is closely connected with aid programs and credits. 
Manufactures, and especially capital goods, are 
mainly involved, although the U.S.S.R. has also 
exported grain, timber, and oil to Egypt and a num- 
ber of other countries. The credits are, as a rule, 
repayable in the goods normally exported by the 
recipient country, and so the Soviet bloc has been 
greatly increasing its imports of Egyptian cotton, 
Indian tea, and so on. The limiting factor on the 
expansion of this trade-with-aid has been, and is, 
the availability of resources within Soviet-bloc coun- 
tries, which are heavily committed to these coun- 
tries’ own plans of economic growth. So far the 
bulk of trade-with-aid has been concentrated in a 
few countries, India, Indonesia, and Egypt have 
had the lion’s share. 

Trade within the Soviet bloc, In 1949 the Coun- 
cil for Mutual Economic Assistance (COMECON, 
or CMEA) was set up to coordinate the Soviet and 
European communist economies. China and other 
Asian countries sent observers, though since 1961 
China and Albania have absented themselves, while 
Mongolia has become a member. COMECON virtu- 
ally did nothing until 1955. Bilateral arrangements 
were the rule, with Soviet influence predominant. 
Unequal trade treaties were imposed, as may be 
surmised from the execution of officials in Bul- 
garia and Czechoslovakia on a charge of trying to 
strike too hard a bargain with the Soviet Union. 
Ex-enemy countries (Rumania, East Germany, Hun- 
gary) had to make large reparation deliveries. A 
number of joint companies were set up, under 
virtual Soviet control. After Stalin’s death a gradual 
evolution began toward a less subordinate role of 
the Soviet Union’s European allies. This process 
was greatly speeded up by the shock of the 1956 
events in Poland and Hungary. Paradoxically, the 
period of direct political subordination to the 
U.S.S.R. was also one in which there was very little 
systematic effort at economic coordination. It was 
increasingly recognized that by failing to plan their 
investments and their trade so as to achieve ra- 
tional economic specialization, the countries of the 
Soviet bloc, and particularly the smaller countries, 
were suffering unnecessary economic loss. There- 
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fore, questions of joint planning came to the fore, 
and COMECON began to be more active, Since 
1956 a number of agreements have been concluded 
under the aegis of COMECON. These include spe- 
cialization agreements, especially those covering 
some branches of the machinery and vehicles in- 
dustries, some joint investment projects, an effort 
to standardize components and trade nomenclature, 
and arrangements for the joint use of railway 
wagons and for international electricity transmis- 
sion lines and oil pipes. A COMECON bank came 
into existence in January 1964, The point was not 
lost on Khrushchev that the Common Market in 
western Europe showed that, contrary to doctrine, 
economic integration between capitalist states 
seemed to be advancing more rapidly than between 
socialist ones. Nonetheless there has still been rel- 
atively little joint planning, the COMECON organs 
have no executive authority, and actual trading 
arrangements remain almost wholly bilateral. 

The special problems of trade between com- 
munist countries are the following: first, since in 
each country all major investment and resource 
allocation decisions are taken by the central plan- 
ners, all matters relating to international special- 
ization and therefore to trade become a matter for 
decision at the highest government and party lev- 
els. In the West most of the specialization is a 
consequence of a large number of decentralized 
arrangements between firms. This is impossible in 
the communist countries so long as their internal 
planning system is what it is, Effective joint plan- 
ning under the circumstances requires a supra- 
national authority which could oversee the require- 
ments of the entire bloc and which could take 
appropriate and enforceable decisions, There is no 
such authority. In its absence the concentration of 
trading decisions in the hands of specialized trad- 
ing organizations cuts off the enterprises from con- 
tacts with their markets and their customers and 
makes it difficult to study the requirements of users 
or to undertake market research. 

Second, since prices in the various countries are 
fixed by government order and bear little relation- 
ship to relative scarcities or to need, they cannot 
be used as a guide as to whether anything should 
be imported or exported, nor as a basis of compar- 
ing costs. In practice the prices used in trade be- 
tween Soviet-bloc countries are those of the capital- 
ist world market (very roughly, since they are 
sometimes frozen for several years and seem on 
average to be rather higher than capitalist prices). 
These artificial foreign-trade prices have little or no 
connection with internal costs or with intrabloc 
exchange rates. This makes it difficult to find mutu- 
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ally acceptable objective criteria for specialization. 
The less-developed countries of the bloc, notably 
Rumania, have been strongly pressing their own 
ambitious industrialization plans, and it has proved 
virtually impossible to advance effective counter- 
arguments in the name of rational specialization. 

Third, while it is realized that bilateralism causes 
economic loss, it has so far proved incurable. Trade 
negotiations have amounted to a kind of barter: 
each country exchanges with the other country 
goods worth the same amount, and the quantities 
in question are then incorporated in the plans and 
material balances of each country. Any consider- 
able surplus convertible into a claim on a third 
country may not enable the holder of the surplus 
to convert it into the sort of goods that it wants, 
since there has been no provision in the plans of 
that third country to make these goods available. 
Theoretically it is possible to envisage multilateral 
barter, but in practice there has been very little 
of this. Since planning is essentially in terms of 
physical quantities, the multilateral facilities of the 
new COMECON bank are hardly relevant in this 
situation. 

So long as each country continues to plan its 
own investments and resource allocation and so 
long as COMECON plays a purely advisory or con- 
sultative role, it is hard to see how the situation 
could improve. On political grounds the conversion 
of COMECON into a supranational organ is not 
practicable, The theoretical alternative solution 
would be to so decentralize the planning of each 
country that enterprises would be free to purchase 
their inputs from suppliers both within and outside 
the borders of the given country instead of having 
them allocated administratively. This may be the 
long-term solution but would plainly require some 
major changes in the internal arrangements of 
each country. 

Meanwhile the volume of exchanges, bilateral 
it is true, between the communist countries in 
Europe has been steadily increasing. While the 
difficulties mentioned above are very real and are 
the cause of much worry to the planners of the 
countries concerned, one should recall the fact that 
the trade practices of many Western countries are 
far removed from the traditional principles of eco- 
nomic rationality and that developing countries 
such as India have been compelled to impose trade 
restrictions based on a political determination of 
priorities and not on a necessarily static analysis 
of comparative costs, Yet when all this has been 
allowed for, the special weaknesses of foreign trade 
of communist countries remain. Not the least of 


them is the almost total failure to develop any the- 
ory of foreign trade or to devise a pricing policy or 
a method of assessing costs which could be fol- 
lowed in intercountry comparisons. It is not by 
any means always necessary to determine the pat- 
tern of trade by reference to such comparisons, but 
one should surely start by knowing these elemen- 
tary economic facts of life. 

A. NovE 


[See also INTERNATIONAL TRADE CONTROLS, article on 
STATE TRADING. | 
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COMMUNITY 


The articles under this heading discuss three 
aspects of the community as a geographical and 
social unit. Other aspects of the community are 
discussed in many other articles in the encyclo- 
pedia. Major theoretical positions are reviewed in 
COMMUNITY-SOCIETY CONTINUA and ECOLOGY, ar- 
ticle on HUMAN ECOLOGY. Communities of different 
sizes are described in Crry; NEIGHBORHOOD; RURAL 
SOCIETY; VILLAGE. The major institutions that en- 
able communities to function are described in CITY, 
article on METROPOLITAN GOVERNMENT; LOCAL 
FINANCE; LOCAL GOVERNMENT; LOCAL POLITICS; 
VOLUNTARY ASSOCIATIONS. The fate of community 
in modern society is discussed in Mass SOCIETY. 
PLANNING, SOCIAL, article on REGIONAL AND URBAN 
PLANNING, and UTOPIANISM, article on THE DESIGN 
OF EXPERIMENTAL COMMUNITIES, describe sharply 
contrasting approaches to the problem of planning 
better communities. Loss or renunciation of com- 
munity is described in HOMELESSNESS. Methods of 
research into community life are reviewed in AN- 
THROPOLOGY, article on THE ANTHROPOLOGICAL 
STUDY OF MODERN SOCIETY; ETHNOGRAPHY; FIELD 
WORK; OBSERVATION. For material on social scien- 
tists who have particularly advanced the study of 
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community, see the biographies of BooTH; BURGESS; 
GEDDES; PARK; REDFIELD; TOCQUEVILLE; TONNIES; 
WEBB, SIDNEY AND BEATRICE; WIRTH. 


1. THE STUDY or COMMUNITY 
POWER 


11, COMMUNITY DISORGANIZATION 
II. COMMUNITY DEVELOPMENT 


Nelson W. Polsby 
Jessie Bernard 
Irwin T. Sanders 


I 
THE STUDY OF COMMUNITY POWER 


Contemporary research on community power is 
distinguished by: (1) a concern with characteriz- 
ing as a whole the political order of an entire com- 
munity (generally an American local community ); 
(2) radical disagreement on methods of going 
about this task; (3) both agreement and disagree- 
ment on specific findings; and (4) conflict over 
the proper interpretation of findings, At points of 
disagreement there has been a tendency for the 
literature to break roughly into two schools of 
thought—one based on a theory of social strati- 
fication, the other less systematically derived from 
theories of pluralist democracy, While contention 
between advocates of these schools has occupied 
much space in the literature, the problems, possi- 
bilities, and accomplishments of community power 
research neither begin nor end there. 

There is a sense in which the beginnings of com- 
munity power research are identical with the ori- 
gins of Western thought, since the early specula- 
tions and investigations of Plato and Aristotle on 
the nature of man and his government referred to 
the polities and constitutions of Greek city-states 
(Friedrich 1959; Long 1962). The marked resur- 
gence of interest in community power in recent 
years, however, has several distinctive features and 
immediate antecedents. Among sociologists, inter- 
est in American social problems created by indus- 
trialization, mass immigration, and urbanization 
reached a kind of crescendo in the late 1920s and 
during and after the great depression, notably 
under the leadership of Robert Park and W. I. 
Thomas at the University of Chicago. Investiga- 
tions of Negro and Jewish ghettos by Zorbaugh 
(1929) and Wirth (1928), of the Polish immigrant 
by Thomas and Znaniecki (1918-1920), and of 
skid row bums, criminals, and prostitutes by others 
—while contributing significantly to the growth of 
sociological thinking—also reflected the policy- 
oriented, ameliorative interests of sociologists of 
this period. Later, the depression led sociologists to 
re-examine their equalitarian assumptions about 
American society and reactivated a long-dormant 
concern among them for theories of social strati- 
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fication. These theories had never gone out of 
fashion in Europe but had previously been thought 
to be of only limited applicability to the American 
experience. One illustration of a shift toward the 
use of stratification theory is contained in the con- 
trast between the early classic study of Middletown 
(1929), by Robert S. Lynd and Helen M. Lynd, 
and its equally renowned successor, Middletown in 
Transition (1937), which emphasized community 
power much more than did the earlier book. 

When American political science emerged as a 
formal discipline around the turn of the twentieth 
century, its members regarded the local community 
as a massive political problem requiring prescrip- 
tive rather than strictly descriptive activity. The 
adverse judgments on American city politics ad- 
vanced by Bryce (1888), Ostrogorskii (1902), and 
others led to agitation for the professionalization 
of city management and to other suggestions for 
reform, but not to systematic attempts to compre- 
hend local politics descriptively (see Sayre & Polsby 
1965). In the 1920s and 1930s an energetic group 
of political scientists at Chicago, led by Charles E. 
Merriam, undertook to redress the balance; but 
most of their efforts were incorporated into ongo- 
ing research in political parties and voting behavior 
rather than into the field of local politics (see, for 
example, Merriam & Gosnell 1924; Key 1936). 

World War u, with its large-scale displacement 
of academic talent into national and international 
arenas, no doubt served to delay further the dis- 
covery of local communities as suitable and attrac- 
tive laboratories for the examination of traditional 
problems of empirical political theory. At any rate, 
it was not until the 1950s that this literature began 
to emerge in full flower, although its precursors 
and roots can be found in earlier writings. 


Characterizing a whole community 


Little attention has been devoted in contempo- 
rary community power research to the problem of 
defining a community—a question that has recently 
engaged, among others, students of international 
politics and organization (e.g., Deutsch 1953; Haas 
1958; Russett 1963). For the most part, a conven- 
tional perspective has been adopted and a “com- 
munity” has been defined as a population living 
within legally established city limits. Only rarely is 
the term used in this literature to describe a stand- 
ard metropolitan area, marketing area, or some 
entity defined by functions other than political. The 
problem of setting boundaries on the community is, 
perhaps, ultimately insoluble except by arbitrary 
means, because it is freely conceded that externally 
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made decisions may have a significant impact on 
the allocation of values and on important private 
and public decisions within the community, how- 
ever defined. And it is the description of the shap- 
ing and sharing of these values and decisions that 
is the central concern of the community power 
literature. 


Methods of research 


A serious problem in characterizing the political 
order of a whole community concerns sampling 
procedures. Clearly not all decisions, or even a 
meaningful fraction of them, are normally studied 
in order to arrive at a description of power dis- 
tribution within a particular community. The in- 
evitable and necessary compromises that students 
have made have led to much disagreement on 
methods of research. To a certain extent each stu- 
dent of community power has had to fashion his 
own compromise with the fuzzy universe of de- 
cisions in the community in which he has worked— 
a compromise dictated in part by the time, help, 
and funds at his disposal, and in part by the theo- 
retical presuppositions he has brought to the 
research, 

“Total immersion.” One method of study has 
consisted of what might be termed “total immer- 
sion” in the community by a researcher or research 
team for a lengthy period of time—sometimes as 
long as a year or more (see, for example, Lynd & 
Lynd 1929; 1937; Warner & Lunt 1941; Warner 
et al. 1949; Hollingshead 1949; Sayre & Kaufman 
1960; Scoble 1961; Dahl 1961). The researcher 
thus has an opportunity to learn the perspectives 
of local residents, to identify issues and decisions 
they regard as significant, to absorb some of the 
history and background of the community, and to 
see its various groups and leaders in the round of 
daily life. More formal interviewing, participant 
observation, clipping of newspapers, and examina- 
tions of official records, census materials, and so 
on can also be employed during the period of resi- 
dence. A variant of this is the method of gathering 
and generalizing from numerous case studies de- 
scribing decision making in detail (see, e.g., Ban- 
field 1961; Wildavsky 1964; Decisions 1961). 

Clearly this method—insofar as it can be re- 
garded as a method of research rather than as an 
eclectic combination of methods—is not open to 
much criticism. Most of the criticism of community 
power studies using this method has attacked the 
analysis of data and the appropriateness of con- 
clusions drawn from the data, rather than the data- 
gathering methods of the studies. 


“Reputational” studies. A drawback of the total 
immersion method of research is its expense in 
time and money. It also smacks to some of casual 
empiricism of a kind that renders either replica- 
tion or comparative study difficult. And so it is not 
surprising that an attempt, however crude, to de- 
velop a more mechanical method of research in this 
field was warmly welcomed and quickly imitated. 
Such an attempt was made by Floyd Hunter (1953), 
who in a study of Atlanta, Georgia, introduced 
what later came to be called the “reputational” 
approach to discovering patterns of community 
power. 

Although this method is not compactly described 
in the original study and modifications have been 
introduced by others (e.g., Miller 1958; D'Antonio 
et al. 1961; Klapp & Padgett 1960), we can pre- 
sent it in general outline. Briefly, the central goal 
of the method is to arrive at a list of people who 
can be called “top influentials” in the community. 
Often, lists of nominees are solicited from people 
who are presumed to be knowledgeable in various 
sectors of community life. These names are then 
winnowed (to 40 in Atlanta) by a panel of judges, 
so that only those on whom a number of people 
agree appear on the final list. Members of this list 
are customarily interviewed and asked such ques- 
tions as: “If a project were before the community 
that required decision by a group of leaders—lead- 
ers that nearly everyone would accept—which ten 
on the list of 40 would you choose?” and “Who is 
the biggest man in town?” (Hunter 1953, pp. 61 ff., 
268-271). Hunter's book consists largely of an 
account of his methods of identifying the top 
leaders, a discussion of their involvement in a few 
community decisions, and a demonstration that 
they were by and large well known to one another. 

At first Hunter's book was favorably received, 
and a number of studies were planned and exe- 
cuted in other communities using this or a similar 
method (e.g., Form & Sauer 1960; Klapp & Padgett 
1960; Hunter et al. 1956; D'Antonio & Erickson 
1962; Pellegrin & Coates 1956; Schulze 1961; 
Thometz 1963). The reputational method has also 
been vigorously defended by admirers of its econo- 
my and compactness (Rossi 1960; Herson 1957; 
Ehrlich 1961; D'Antonio & Erickson 1962; and 
others). But it has also been attacked for a number 
of reasons. 

Perhaps the most fundamental objection has 
been that the identification of presumed “influen- 
tials” is a very crude method of sampling the under- 
lying shape of decision-making processes. Instead of 
focusing on how valued outcomes are distributed 
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by those who actually distribute them in specific 
instances, it focuses on those who some people say 
distribute them under hypothetical circumstances. 
The identifiers, about whom we are told little in 
these studies, may be misguided, prejudiced, or ill- 
informed (Wolfinger 1960; LaPalombara 1964; 
Bauer et al. 1963). In fact, some people who have 
appeared on top leadership lists have been found 
not to belong there by criteria more closely tied to 
actual behavior; others not on the lists have been 
found to belong on them (Fanelli 1956; Scoble 
1961; Wildavsky 1964; and others). Different de- 
cisions have been found to involve different influen- 
tial people. In some issue areas there are persistent 
patterns of decision making; in others the patterns 
change rapidly. Critics charge that all these dis- 
tinctions, crucial to an understanding of com- 
munity power, are disregarded by the short cuts of 
the reputational method (see McKee 1953; Kauf- 
man & Jones 1954; Dahl 1961; Scoble 1961; Polsby 
1959; Wolfinger 1960; Polsby 1963). 

Users of this method have often a priori picked 
among the alternatives posed by each of these dis- 
tinctions. They have held, at least implicitly, that 
all those on the reputational list are the top leaders 
in the community (whether they actually are or 
not) by suggesting that there is a general top lead- 
ership group, influential in more than one issue 
area (whether it is or not) and persisting over time 
(whether it does or not). Even where researchers, 
conscious of latter-day criticisms, refrained from 
picking these particular alternatives a priori, the 
reputational method, because of its removal from 
real decision-making events, has rendered them no 
assistance in picking any other alternative. Thus, 
for example, attempts to demonstrate persistence 
of leadership groups by the reputational method 
fail because regardless of the similarities or differ- 
ences in lists at times t, and t,, the reputational 
method unaided fails to show the relations be- 
tween the lists and actual behavior at either time 
(D'Antonio & Erickson 1962). 

Attempts to improve the reputational method 
have concentrated mostly on reducing vagueness 
and ambiguity in questioning; thus, instead of a 
hypothetical “project,” one or more actual projects 
or issue areas can be the subject of an interview 
(e.g., Thometz 1963). Instead of asking merely for 
a list of who might or should be involved in a de- 
cision, descriptions of specific events, actions, and 
reactions can be elicited. But there comes a 
point along this road at which the reputational 
method becomes indistinguishable from ordinary 
interviewing. 
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The central methodological issue is thus one of 
appropriateness. Are respondents being asked ques- 
tions to which they can give answers capable of 
unambiguous interpretation? Are respondents able 
to give the best available testimony on the ques- 
tions they are asked? Are they able to give compe- 
tent testimony? Are their responses, if correctly 
interpreted, capable of answering underlying ques- 
tions about community decision-making processes? 
If so, then the reputational method is clearly an 
appropriate tool of community power research. If 
not, as the weight of evidence and argument seems 
to suggest, then the method can be discarded. 


Findings 

The heated debate over the reputational method 
has obscured the fact that this method is not solely 
or even chiefly responsible for the promulgation of 
findings about the distribution of power in local 
communities that, on the basis of internal evidence, 
are quite likely incorrect. 

Stratification theory. A number of studies— 
some reputational, some not—have set forth all or 
several of the following conclusions, which for sev- 
eral decades were widely accepted as scientifically 
established and accurate descriptions of American 
community politics: (1) the upper class rules; 
(2) political and civic leaders are subordinate to 
this class; (3) there is a single power elite dealing 
with a wide variety of community issues; (4) this 
upper-class power elite rules in its own interests; 
and (5) social conflict takes place chiefly between 
the upper and lower classes. A common thread 
running through these propositions is the depend- 
ency of political power on the class and status 
structure of the community; thus, for convenience, 
we may refer to these propositions as a “stratifica- 
tion theory” of community power. 

Each of these findings has been questioned. In 
many instances the evidence supporting them has 
been weak, ambiguous, vague, or even contradic- 
tory. In some cases these propositions have not 
been tested directly or properly, and in others find: 
ings contrary to the propositions have been ex- 
plained away unsuccessfully (see Polsby 1963). 
And so the extent to which any of the propositions 
actually holds and the circumstances in which one 
or more of them is applicable are now problematical. 

“Pluralist” findings. Alternative findings, sup- 
ported by a number of sources, can also be dis- 
tilled from this literature. They include the appar- 
ent fact that participation in the making of most 
community decisions is concentrated in the hands 
of a few, but that different small groups normally 
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make decisions on different community problems. 
Government officials are the most likely of all par- 
ticipants to overlap issue areas. Over time, the 
membership of decision-making groups appears to 
turn over at variable rates of speed. 

These small groups have frequently been found 
to contain representatives of more than one inter- 
est; indeed, the finding is often made that partici- 
pants in decision making disagree sharply among 
themselves. The powers of these groups also seem 
to be restricted in other ways—by expectations that 
they will not innovate, by threats that counterelites 
will “make an issue” and defeat them by enlarging 
the number of participants, and by the necessity to 
seek outside support to undertake nonroutine com- 
mitments. Finally, many members of these groups 
seem to be recruited by self-selection rather than 
co-optation, and many are answerable to the local 
electorate. 

Valued outcomes are, in any event, only par- 
tially under the control of local participants in de- 
cision making. Decisions significant to local politics 
may be made outside the community—as, for ex- 
ample, the decision by a national corporation to 
move a plant in or out of a given locality—or out- 
comes may occur as a by-product of other activities, 
without explicit decisions being made by anyone. 
There is no sure connection between the intentions 
of community leaders and the actual distribution 
of values, Because of these and other considera- 
tions, knowing simply who gains and who loses in 
community politics is not sufficient to give a re- 
liable picture of local decision-making processes. 

Success in community politics evidently does not 
come automatically to possessors of great amounts 
of any one of the many possible resources available 
to actors in community life. Many resources in com- 
bination—time, knowledge, money, official posi- 
tion, energy, popularity, social status, and so on— 
must be applied with skill and diligence for actors 
to succeed in influencing decisions in desired di- 
rections. These resources must be disposable for 
political purposes, and it is helpful if they are con- 
vertible into other resources that may be in short 
supply. Skill and diligence in the application of 
resources to political ends are themselves often 
scarce. Successful actors in the community usually 
must also learn to choose goals that minimize costs 
to others in the system, that effect incremental 
rather than sweeping changes, and that distribute 
payoffs widely. 

Findings such as these are often referred to as 
“pluralist” in character. They emphasize the di- 
versity of decision-making processes and focus less 
on the social backgrounds and identities of par- 


ticipants and more on their actual roles and activi- 
ties in local decision making. Whereas a stratifica- 
tion theory of community life by definition involves 
everyone in a hierarchical power relationship which 
can be ascertained by reference to his class and 
status position, the pluralist theory speaks more 
readily of “groups” whose size, cohesiveness, state 
of mobilization, range of interests, and durability 
are all subject to empirical examination. 


Interpretations 


Fecent defenders of the stratification approach 
to the study of community power have been willing 
to concede that the original case for this approach 
rested on shaky empirical ground; yet, at the same 
time they have suggested that a full-blown pluralist 
theory accounting satisfactorily for all the facts 
has by no means emerged in the literature. They 
have also raised a number of specific points at 
which their interpretations of agreed-on facts di- 
verge from pluralist interpretations, Of these, only 
three can be discussed here. 

Middle-class leadership. Consider first the find- 
ing that leaders are drawn from a very small pro- 
portion of the total community and are much more 
likely to be middle class than lower class. To 
stratification analysts this suggests that there is 
an unspoken but very real “price of admission” to 
the political stratum, which can be paid in coin 
that is, for many participants, inherited and thus 
attributable to the stratification system of the 
community. 

A pluralist response might be that nevertheless 
inherited membership in the middle class is prob- 
ably not a necessary condition of participation. 
Participants may find it possible to enter the polit- 
ical stratum by other means. They may enter the 
middle class from below and acquire its values and 
social and economic resources by means other than 
inheritance. Or, while retaining low economic and 
social standing, they may become intensely in- 
volved in the disposition of a particular issue. Or 
they may come to community leadership through 
activity in a labor union, ethnic association, neigh- 
borhood group, or local party organization. The 
extent to which any of these routes to political 
participation is taken in a given community is, of 
course, always a matter to be determined by 
investigation. 

Would it be proper to infer, then, that although 
membership in the middle class is not a necessary 
condition, it may be a sufficient condition for par- 
ticipation in decision making? Hardly. At most, 
only a small percentage of the middle class partici- 
pates, and the rest do not. 
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In what circumstances are the class identities of 
decision makers crucial in determining the shap- 
ing and sharing of political outcomes? Certainly 
when local decision makers are faced with alterna- 
tive courses that will distribute values differentially 
to the different classes, and also in all other in- 
stances where class membership is a meaningful 
determinant of political interests. Such circum- 
stances must be assumed to exist in Middletown, 
where members of X family—identified as a major 
component of the upper class—were found to be- 
long to both the Democratic and Republican parties. 
This report could be interpreted to mean that both 
parties were dominated by the cohesive, class-based 
interests of the X family (see Lynd & Lynd 1937, 
pp. 87, 91-93). However, an alternative interpreta- 
tion might be that the X family was politically 
split (see Polsby 1960). 

Pluralists often remark on conflict between de- 
cision makers and on the diversity of interests they 
represent. To them, an agreement between Walter 
Reuther and Henry Ford can be interpreted as 
something other than collusion between two rela- 
tively wealthy residents of the Detroit suburbs. In 
this case, as in many others, knowing the number 
of decision makers present and their class member- 
ships would be less important to a pluralist than 
knowing what the subject of their discussion was 
and what interests were involved. 

Inequality of influence. Different constructions 
are also put on the palpable fact that equality of 
influence over decisions by all members of the 
community is nowhere approximated. Inequality 
in this sense is not a new discovery; the question 
is whether such inequality can somehow be recon- 
ciled with the democratic pretensions of American 
society. On the whole, stratification theorists have 
doubted that it can; pluralists have on the other 
hand called attention to the varieties of inequality, 
to the possibility that inequalities along various 
dimensions could be dispersed rather than cumu- 
lative, and to the ways in which dispersed inequal- 
ities lead to checks, balances, and standoffs in the 
political system, and consequently to bargaining 
and the spread of benefits. 

The status quo. Finally, there is disagreement 
among students of community power in their orien- 
tations toward the status quo. On the one hand, 
an ongoing political system can be regarded as a 
stacked deck of cards, in which some people are 
much better situated than others, with possible 
alternatives or correctives never brought into pub- 
lic discussion and never raised as issues. Thus 
aspects of social structure that are never contro- 
versial and hence never brought fully into focus 
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by the activity-oriented pluralist approach may 
exercise a profound influence on community de- 
cision making (see, in particular, Bachrach & 
Baratz 1962; Agger et al. 1964, e.g., p. 325). In- 
sofar as pluralist researchers confine their attention 
to controversial, extraordinary, and highly conflict- 
ful events, this criticism has great merit. But when 
pluralists attempt to explain issue areas in which 
routine and structured decision making also takes 
place, they are less vulnerable to this charge. 

When critics of the pluralist approach specify in 
some detail precisely which aspects—controversial 
or not—of the status quo must be investigated, and 
in the light of which unconsidered alternatives, 
then it is possible in particular cases to examine 
the justifications for describing local decision mak- 
ing in terms of alternatives suggested by outside 
observers rather than in terms of alternatives per- 
ceived and acted upon by local residents. 

The status quo can be viewed as passively but 
powerfully restraining community residents from 
better possibilities and levying costs inequitably. 
But it can also be considered a synonym for the 
stability that is the sine qua non of organized soci- 
ety. Out of a universe of possibilities, and at un- 
deniable but unavoidable costs, communities evolve 
habits and patterns of activity in response to a vari- 
ety of internal and external pressures. The alterna- 
tive to a deck stacked in some way, at persistent 
cost to somebody, is, in this conception, a kind of 
random world in which costs are levied unex- 
pectedly and unpredictably. In situations of rapid 
flux and uncertainty, one may also suggest that the 
dependence of the many on the few for decision 
making is accentuated rather than diminished, and 
the forms of due process that are the bulwark of 
the rights of ordinary citizens in a democracy give 
way before the exigent demands of the powerful. 

Obviously, not the mere fact of status quo but 
the undesirable aspects of status quo in particular 
communities give rise to the impulse to search for 
alternatives, even when local residents are not 
vocally among the search party. It is nevertheless 
a common mistake to view stability as uniquely 
costly in and of itself, when with equal cogency one 
can argue against change for the sake of change. 

In any event, researchers interested in these 
problems should try to discover the consequences 
of change or of status quo empirically, rather than 
deducing them from a priori notions. 


The major conflicts in this field over method 
and the interpretations of findings seem to have 
drawn scholarly attention and energy away from 
the discussion of local community politics. One 
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result, perhaps, is that the literature of community 
power has as yet provided no firm basis for differ- 
entiating among communities by means of em- 
pirically verified propositions that state the varying 
conditions under which different findings hold. For 
it is manifestly the case that communities do differ 
in their politics and in other ways. Apparent as this 
is to the naked eye, the literature on community 
power has not yet effectively exposed and illumi- 
nated community differences as they affect local 
decision-making processes. The main contributions 
of this literature are to be found less in discussions 
of community politics than in discussions of the 
study of local politics as a problem in research 
methodology, philosophy of social science, and 
speculative social theory. 

NELSON W. POLsBy 


[See also Loca po.itics; PoweEr.] 
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1 
COMMUNITY DISORGANIZATION 


A community, in the sense in which the term 
will be used here, is a territorially bounded social 
system or set of interlocking or integrated func- 
tional subsystems (economic, political, religious, 
ethical, educational, legal, socializing, reproduc- 
tive, etc.) serving a resident population, plus the 
material culture or physical plant through which 
the subsystems operate. The community concept 
does not include such characteristics as harmony, 
love, “we-feeling,” or intimacy, which are some- 
times nostalgically imputed to idealized preindus- 
trial communities (Foster 1960-1961), but it does 
include a minimum of consensus. A normative 
structure is either inherited from the past or self- 
consciously instituted in each subsystem, and con- 
formity to its demands is usually sufficient to 
guarantee that the above-mentioned functions will 
be performed. 

Community disorganization may be defined as a 
state in which any one or more of the several sub- 
systems, for whatever reason, fail to function at 
some specified expected level of effectiveness, or 
it may be defined as the processes that lead to such 
a state, or it may refer both to the processes and to 
the state. Failure of a subsystem may take the form 
of performance that is either better or worse than 
the expected one; the processes leading to such 
failure may be “natural” or purposive. Since com- 
munity disorganization as a state may occur in a 
wide variety of subsystems and in a wide variety 
of circumstances and, as a process, may result 
from a wide variety of causes, it is not a unidimen- 
sional sociological phenomenon susceptible to sim- 
ple theoretical conceptualization or definition. 

The ideal-type of a community that is not disor- 
ganized would be one in which: (1) the physical 
plant is in good running order, capable of serving 
the needs of the people; (2) the people are in good 
physical and mental health, that is, able to perform 
at least at minimal levels of efficiency; (3) there 
is at least a tolerable fit between community needs 
(“functional requisites”) and functional subsys- 
tems (institutions and groups) to serve them; (4) 
there is consensus with respect to norms, so that 
everyone knows what to expect of everyone else, 
and hence there is no confusion; and (5) these 
expectations are fulfilled. Change would not be pre- 
cluded, but it would be change for which the com- 
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munity is prepared and change that is synchronous 
and compatible among all the subsystems. Change, 
in fact, may be in the direction of organizing rather 
than in the direction of disorganizing the com- 
munity, as when a new traffic control system is 
introduced to overcome the disorganizing effect of 
traffic jams. Still, there is a nice theoretical ques- 
tion about just where to draw the line between 
organizing and disorganizing change. 

Community disorganization as a state is not an 
all-or-nothing phenomenon; it is a matter of degree. 
It is currently believed that complete absence of 
any disorganization in a community is highly im- 
probable, A certain amount of stress and strain 
(Moore 1963; Moore & Feldman 1960; 1962) or 
instability is seen as probably intrinsic in any social 
system. It may be taken for granted, therefore, that 
at least a certain degree of disorganization, how- 
ever slight, is present in all communities. The ex- 
tent may be wide, including the entire community 
system, or limited, affecting some parts more than 
others. Communities characterized by “organic” 
solidarity will be more susceptible to disorganiza- 
tion than those characterized by “mechanical” 
solidarity [see COMMUNITY—SOCIETY CONTINUA]. 

Community disorganization as a process may 
range in extent from a temporary tie-up in rush- 
hour traffic to a total collapse of all subsystems. It 
may be sudden and rapid, as in a cataclysmic “act 
of God,” or slow and of long duration. It may be 
immediately and even spectacularly visible, or it 
may be almost imperceptible. There may even be 
controversy as to its very existence. (It has been 
asked, for example, whether the people of Rome 
ever knew that their empire was “falling.” ) More- 
over, it may be reversible or irreversible, “natural” 
or purposive. 

Community disorganization as a state. At what 
point in the process of community disorganization 
should we say that a community is in a state of 
disorganization? How much malfunctioning of any 
subsystem constitutes a state of community dis- 
organization? Crisis disorganization, as in the case 
of a disaster, presents no problems of identification. 
But if a certain amount of nonconformity is en- 
demic or chronic, in fact, intrinsic in the operation 
of communities, it is not at all clear at what point 
we should speak of a state of disorganization. 

There is by no means always consensus among 
observers with respect to the existence, let alone 
the extent, of disorganization in any specific com- 
munity. A classic example is the controversy among 
researchers with respect to family structure in the 
Caribbean. Some students view the family there as 
in a state of disorganization (Blake 1962; Goode 


1960); others, however, view it as an institution- 
alized adaptation to a difficult set of circumstances 
(Rodman 1961). This difference of viewpoint illus- 
trates a contrast between the sociological and the 
cultural conceptualization of disorganization. The 
anthropologist tends to underplay it; his emphasis 
is on the fact that if the community is surviving, 
the basic functions are at least being minimally 
performed. In the Caribbean, therefore, he does not 
see “friending” and “living” as nonconformity to 
community norms for marriage but rather as a 
cultural innovation for adapting to circumstances 
that make conformity to the community norms im- 
possible [Rodman 1961; see also CARIBBEAN SOCI- 
ETY]. Similarly, one social anthropologist has de- 
tected a “culture of poverty,” rather than a state 
of disorganization, in Mexican slum communities 
(Lewis 1959). 

In answer, then, to the question raised above, all 
we can say is that there are no consensual stand- 
ards that define the limits of community d 
zation. For example, at one time and pl 
economic subsystem is judged to be performing its 
function if it provides subsistence for everyone; at 
another, it is judged not to be performing its func- 
tion if anyone is at a merely subsistence level. And 
such is the case for all of the subsystems. 


Disorganizing processes 

Competition and conflict. The subsystems in a 
community are not operated by self-programmed 
computers. They are operated by human beings 
who are socialized into subsystems and who come 
to identify with them, Thus, although they are all 
contributing to the “commonweal,” they see them- 
selves as competing and in conflict with one an- 
other. For example, the police system competes 
with the highway system for tax funds; the public 
school system competes with the private school 
system; and the public sector competes with the 
private sector for personnel and support. The con- 
flict between political and economic institutions for 
power, classic in the nineteenth century, continues 
in the twentieth; even President Eisenhower, before 
he retired, warned against the encroachment of the 
military—industrial complex on political prerog- 
atives. 

But conflict between or among groups and sub- 
systems, in and of itself, need not disorganize a 
community. Some kinds of conflict are highly in- 
stitutionalized; that is, they are bound by con- 
sensual or legislated norms. Indeed, the philosophy 
of checks and balances in the United States con- 
stitution is itself an institutionalized form of 
conflict, as is the concept of a “loyal opposition” in 
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the British constitution; so, too, is the system of 
parliamentary debate. The whole legal system also 
is designed to institutionalize conflict. The process 
of “countervailing power” in the economic sphere 
(Galbraith 1952) is another example. There is, in 
fact, one school of thought which holds that con- 
flict in this institutionalized sense is an organizing 
rather than a disorganizing factor, a sine qua non 
of stability in our society; it is seen as a method for 
preventing too great a concentration of power in 
any one position, thus making for new equilibria 
to accommodate new statuses among the interest 
groups in the community. Since such conflict is 
institutionalized and provided for, either explicitly 
or implicitly, in the system itself, it does not dis- 
organize the community; it is, rather, part of the 
organization of the system. 

This does not mean that conflict cannot be a 
disorganizing process. Sometimes mass hysterias 
become hostile and lead to widespread violence, 
resulting in destruction of property and loss of life. 
If conflict takes the form of hatemongering, rabble- 
rousing, or ressentiment, it may disorganize the 
community by preventing the normal expectations 
of behavior, In the Middle East, street rioting has 
become a recognized technique of conflict that may 
be at least temporarily disorganizing. Rioting in 
some countries can be turned on and off almost at 
will by the dissident leadership (Rummel 1963). 

Maximizing performance of function. The divi- 
sion of labor implies common interests. It is thus 
clear that if any one subsystem under-performs its 
function, community disorganization may result. 
If the economic subsystem cannot produce and dis- 
tribute goods and services, all the other subsystems 
are impeded in their functioning, and the total 
community system suffers some degree of disor- 
ganization; if the governing subsystem cannot pre- 
serve order, again, the other subsystems cannot 
perform their functions. These conclusions are 
obvious. 

But precisely because it is part of a total system 
and not an isolated autonomous entity, the over- 
performance as well as the underperformance of a 
subsystem may also be disorganizing. Assigned a 
function to perform, a subsystem may do it so well 
that other subsystems suffer. Any one subsystem, 
performing at a high level and thus presumably 
contributing to the welfare of the total system, may 
by that very fact be interfering with the operation 
of some other subsystem performing a comple- 
mentary function. 

The economic subsystem, for example, has the 
function of creating and distributing goods and 
services. Institutional norms for achieving this goal 
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are present, and conformity to them presumably 
leads to successful performance. Accordingly, such 
obstacles to successful performance as inefficiency, 
technical obsoleteness, and (if employers choose 
to define it as an obstacle) unionization, are re- 
garded as contrary to the norms and tend to be 
suppressed whenever possible. The “natural” result 
of a high degree of conformity to the norms of the 
economic subsystem in its productive aspect, may, 
however, interfere with the adequate performance 
of its distributing function, resulting in widespread 
unemployment, or a high level of competitive stress 
and strain, demoralization, family trauma, and so 
on. In other words, the very norms of efficiency 
that maximize productivity may interfere with the 
distribution of the goods and services; the better 
the productive subsystem works, the worse other 
subsystems may work, Or, to take another example, 
the better the system of reproduction works for 
increasing the population, the worse the economic 
production system may work, as land becomes in- 
creasingly fragmented among heirs. If a school 
system attempts to make maximum use of its plant 
by sending children from crowded slum schools to 
uncrowded schools in better neighborhoods, fam- 
ilies may withdraw their children from the better 
schools and parental interest in public schools may 
decline, to the detriment of the whole system. If 
the welfare and charitable agencies wish to max- 
imize their services by assuring a decent level of 
living for their clients, unions may accuse them of 
disorganizing the labor market by subsidizing in- 
dustry’s low wages and weakening organized efforts 
to improve wages. 

Overconformity. Overconformity to community 
norms within any subsystem constitutes a disor- 
ganizing process that is analogous to overcon- 
formity to subsystem norms. The performance of 
community functions by any subsystem is usually 
mediated by way of institutionalized norms, and 
conformity to these norms presumably guarantees 
adequate performance of the functions by the sub- 
system. However, some community norms are, in 
effect, on a stand-by basis; they are not enforced 
most of the time but are there in case they are 
needed. It has been said, for example, that if every 
single norm were rigidly enforced, almost any sys- 
tem would collapse. For example, unwilling mili- 
tary conscripts report that resorting to punctilious 
conformity to every military norm serves admirably 
as a form of sabotage. It has been said that com- 
plete conformity to the political norms in the 
U.S.S.R. would bring the economic system to a halt 
and that only the presence of blat or fixing—itself 
illegal—keeps it performing at all (Crankshaw 
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1959). Complete and rigorous enforcement of all 
the laws, mores, conventions, and administrative 
rulings could throw a community into chaos. A 
large proportion of the population would end in 
jail; court dockets would run years behind schedule; 
and interpersonal ties would be shattered as the 
conventional white lies of the amenities that con- 
flict with the mores were discarded. 

The disorganizing effects of too-rigorous con- 
formity to norms are counteracted by the emer- 
gence of informal, unarticulated, unofficial, 
crescive norms or understandings that optimize 
rather than maximize conformity to the formal, 
articulated, official norms. Only as much conform- 
ity as is needed is demanded; as much noncon- 
formity is permitted as is needed to keep the 
community operating. There is, in effect, a new 
form of organization of nonconformity that per- 
forms a latent function. It constitutes “institu- 
tionalized evasion of the norms” (Merton [1949] 
1957, pp. 318, 343-344; Bernard 1949). 


A typology of disorganization processes 

Community disorganization as a process may be 
classified according to any one of several criteria. 
For our purposes here, a simple twofold typology 
is used, namely, “natural” community disorganiza- 
tion and purposive or strategic community disor- 
ganization, each with several subtypes. 

“Natural” processes. The term “natural” is 
placed in quotation marks because anything that 
human beings do is natural and the contrasting 
rubric, “purposive,” refers to behavior as natural 
as any other. The term is used only for the sake of 
convenience. 

“Natural” community disorganizing processes 
may be extrinsic or intrinsic in origin. There is 
little of theoretical interest in the state of com- 
munity disorganization that results from the im- 
pact of outside forces. What theoretical interest 
there is lies in the effects, in tracing the process by 
which the impact finds its way through the system 
and in determining the relative susceptibility or 
resistance of the several subsystems to the impact 
of differing kinds of extrinsic forces (Coleman 
1957). Is there a hierarchical order among the sub- 
systems, some being more, some less, important for 
the total community system? The processes by 
which the impact of such extrinsic forces ramifies 
throughout the community will not be discussed 
here, except as the various subsystems constitute 
extrinsic influences on one another. [See DISAS- 
TERS.] 

Intrinsic community disorganization as a process 
does present some interesting theoretical problems. 


Is there, for example, anything inherent in the 
functioning of a community, even when protected 
from extrinsic influences, that leads to a state of 
disorganization? Are there, as Marx said of capi- 
talism as a system, internal contradictions that 
make a state of disorganization inevitable? Or, con- 
versely, if there were no extrinsic influences at 
work, would a community achieve a kind of “per- 
petual motion”? 

Theoretical models. At least three theoretical 
models may be proposed to describe the “natural” 
processes of intrinsic community disorganization, 
namely, an escalation model, a deceleration model, 
and a stability or “natural limits” model. 

The escalation model begins with the instituting 
of a subsystem in the community to perform a 
function perceived as needed. It may be, for ex- 
ample, a police system, a school system, or a traffic- 
control system. Although there are “bugs” in the 
system at first, those who instituted it hope that 
once it gets into operation they can be eliminated 
and the operation improved as it learns to dovetail 
its contributions into the over-all community sys- 
tem. If these hopes are justified, the end result 
will be a stable situation. But it is possible that the 
improved efficiency of the new subsystem may 
disorganize the over-all system, as noted above, by 
overperforming its function in terms of the ex- 
pectations of other subsystems. The police find too 
many violators of the law; the schools find too 
many illiterate high-school students; and so on. 
This phenomenon is sometimes viewed in terms of 
the unanticipated consequences of purposive social 
action (Merton 1936); it is always disillusioning to 
reformers. 

A second type of model, a deceleration one, is 
sometimes summarized in such aphorisms as “a 
new broom sweeps clean” or such mottoes as “turn 
the rascals out.” It was more emphasized in the 
past than recently, under the rubric of “ossification” 
or “formalism” (Cooley 1909). In this model, the 
new subsystem operates very well at the beginning. 
But instead of improving with time, it deteriorates. 
In the process of “de-bugging” the system, new 
rules are established, and conformity to the rules 
becomes an end in itself. After the original goal for 
which the subsystem was instituted has been 
achieved or outlived, survival becomes an end in 
itself. The subsystem accumulates “dead wood” and 
red tape and finally spends all its time performing 
its own survival tasks rather than contributing to 
the total community system. Or a subsystem origi- 
nally instituted to control another (e.g., a public 
utility regulating board) comes in time to identify 
with the subsystem it is supposed to regulate, so 
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that its contribution to the total community system 
is subverted rather than implemented. 

Finally, the stability or “natural limits’ model 
posits an instituted system that escalates and de- 
celerates in performance within certain limits. 
Early research on the business cycle, for example, 
viewed periodic stalling of the productive me- 
chanism as part of the system, that is, as a neces- 
sary, normal, and even desirable correction. 
According to this view, there are built-in controls 
that stop the system if it is operating too fast and 
start it again when adjustment has been achieved. 
The automatic check might come from some other 
subsystem rather than from within the subsystem 
itself. As soon as the performance of any subsystem 
shows signs of adversely affecting the performance 
of other subsystems, an automatic set of checks 
goes into operation. 

Purposive or strategic processes. Purposive com- 
munity disorganization as a process may be viewed 
either as benign or as revolutionary and subversive. 
Benign community disorganization usually seeks 
reform rather than revolution, although sometimes, 
as in Gandhi's use of it, it is revolutionary in its 
effects. However, benign community disorganiza- 
tion is ordinarily nonviolent and sometimes ostensi- 
bly carried on with altruistic intentions (Sibley 
1963). But if it does have to injure others, the 
theory is that the persons who start the disorganiza- 
tion must be willing to suffer also. 

Destructive community disorganization that is 
strategic rather than irrational in character is often 
a technique for seizing political power. It is a rec- 
ognized subversive technique in modern revolu- 
tionary movements, codified as the so-called “Phase 
One” of the revolutionary process in “wars of na- 
tional liberation”: “The greater part of any Phase 
One operation is covert and marked by a subtle, 
gradual deterioration of the existing sociopolitical 
system” (Pustay 1965, p. 59). Destruction of con- 
sensus or loyalty to a given status quo oF set of 
incumbents is a major goal. All possible community 
conflicts are exploited to foment instability or to 
prevent stability in the system. The theory of com- 
munity disorganization as subversion has been 
highly developed by communist leaders from Lenin 
(1905) to Mao Tsé-tung (1937). 

The technique of community disorganization by 
an outside enemy is as old as war itself. It includes 
not only destruction of physical plant and people 
by violence but also the nonviolent destruction of 
morale, As the value of the physical plant increases, 
nonviolent techniques tend to be preferred in order 
to save the physical structure for the time when 
the community is taken over. In modern wars drug 
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addiction, black marketing, counterfeit money, and 
alcohol have been used as subversive techniques to 
prevent the adequate functioning of the subsystems 
in the community and thus to weaken the ability 
to wage war. 

Benign community disorganization—an attempt 
by those within the community to coerce or deter 
an opponent by nonviolent means—is fairly new, 
at least on any extensive scale. It did not develop 
until industrialization had transformed the me- 
chanical solidarity of peasant society to an organic 
solidarity based on the division of labor. When, for 
example, families made their own candles, owned 
their own team of horses, and made their own 
bread, it was not possible to disorganize a com- 
munity by threatening to turn off power, to go on 
strike, or to tie up traffic. Modern communities, 
highly specialized and functionally integrated, are 
peculiarly vulnerable to the use of strategic dis- 
organization. 

Early analyses of the strategic use of civil dis- 
obedience and nonviolence were in terms of ethical 
principles, as, for example, in the writings of 
Henry Thoreau and Theodore Parker. Marx and 
Engels saw, though they did not admit, that the 
division of labor, greatly elaborated with indus- 
trialization, made the class war a mixed-motive 
rather than a “zero-sum” game. Though the power 
struggle is a “zero-sum” game, the economic strug- 
gle is a mixed-motive game, since both parties have 
a common interest in not destroying the means of 
production, that is, the physical plant of the com- 
munity. [See GAME THEORY, article on THEORETICAL 
ASPECTS.] Georges Sorel (1908) developed the con- 
cept of the proletarian general strike (as contrasted 
with the concept of the political general strike) to 
keep labor—capital cleavages alive and to serve as 
a strategic threat to those in power. Thus he sought 
to promulgate the idea of the transition to socialism 
as a catastrophe that the power elite would do any- 
thing to avoid. It was a myth that simultaneously 
organized the proletariat against the power elite 
and threatened the community with disorganiza- 
tion. 

In the middle of the twentieth century, Negroes 
in American communities disorganized communi- 
ties by civil disobedience, passive resistance, and 
demonstrations. Their purpose was not primarily 
to disorganize the community as an end in itself 
but rather to disorganize it just enough to call at- 
tention to the lack of consensus. Consensus implies 
“consent of the governed.” It need not imply active 
or positive agreement or acceptance, but so long 
as people conform to any set of norms it will be 
assumed that these norms have their passive con- 
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sent, if not their active approval. In order, there- 
fore, to demonstrate the withdrawal of consent and 
the lack of consensus, there must be nonconform- 
ity. And this nonconformity must be conspicuous 
enough for the message to reach a wide audience. 
The target may be the political subsystem (with 
nonconformity taking the form of civil disobedience 
or passive resistance), the economic system (sabo- 
tage, slowdown, boycott, strike), the school system 
(boycott, picketing), or the system regulating race 
relations (sit-ins, marches, etc.). Whatever the 
target subsystem may be, the purpose of the dis- 
organizing effort is to demonstrate that the consent 
of the governed is being withheld. 

Those who use community disorganization as a 
process of communication have to walk a fine line 
between making the disorganization serious enough 
to get across their message but not serious enough 
to alienate a total community. Negro leaders in the 
1960s developed a high degree of skill in locating 
strategic points where nonconformity would be 
conspicuously disorganizing but not too destruc- 
tively so. 


The “death” of communities 


There is good archeological and historical evi- 
dence that communities “die,” that is, pass from 
the historical scene altogether. But there is no evi- 
dence that “natural” disorganizing processes in- 
trinsic to the operation of the community as a 
social system were the cause of their “death.” Cli- 
matic change, exhaustion of natural resources, 
plagues, wars, technological change—these and a 
host of other explanations may be offered to ex- 
plain the communities’ demise. 

On a macrosociological level, some of the best 
minds have for centuries grappled with explaining 
the fall of Rome, the breakup of the feudal com- 
munities, the disintegration of anciens régimes, the 
passing of colonial systems, and the demise of a 
host of other systems. Much of the work is de- 
scriptive; and although some attempts to explain 
and interpret have been made, there is little una- 
nimity in the results. Carefully documented, micro- 
sociological (as distinguished from archeological ) 
studies of the disappearance or decline of specific 
communities are rare, and those we have deal 
primarily with such extrinsic factors as the exhaus- 
tion of natural resources (Collins 1941; Schreiber 
& Schreiber 1955), technological change (Caudill 
1963) or the impact of Western cultures on under- 
developed countries (Balandier 1955; Frazier 1955). 

Is disorganization per se a threat to community 
survival? If it is almost universal, to some degree, 
among industrialized societies, then we must con- 


clude that it is probably not fatal. However, there 
may be some level beyond which disorganization 
does become a threat to survival, Are there auto- 
matic corrections for disorganization when it 
reaches these limits? Are there determinable toler- 
ance limits? If so, what are they? If not, how does 
disorganization endanger or threaten the survival 
of the community? Some processes of disorganiza- 
tion may be more lethal than others; some sub- 
systems may be more critical than others in the 
lethal process, so that their malfunctioning is espe- 
cially destructive; and some subsystems may be 
more susceptible to breakdown in the first place. 
As yet, we know little about these processes. 

Any discussion of the “death” of communities 
should also take account of immigrant communi- 
ties, which tend over time to disappear as distinct 
entities. In such communities boundary mainte- 
nance becomes difficult, and the subsystems suffer 
attrition or transformation, even if they do not 
actually disappear. 

Since most sociological research is, of necessity, 
on surviving communities, in which some degree 
of disorganization is almost always present, we 
have no way of answering the question of the rela- 
tionship between disorganization processes and the 
“death” of communities. We know that communities 
change, almost beyond recognition, but, in one 
form or other, they seem to survive, Disorganiza- 
tion, either as a state or as a process, does not 
appear to be lethal. 

JESSIE BERNARD 


[Directly related are the entries CoNFLICT; DEVIANT 
BEHAVIOR; DISASTERS; SOCIAL CHANGE. Other rele- 
vant material may be found in INTERNAL WARFARE, 
article on GUERRILLA WARFARE; MENTAL HEALTH; 
Norms, article on THE STUDY OF NORMS; and in the 
biography of SorEt.] 
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it 
COMMUNITY DEVELOPMENT 


Community development embodies two major 
ideas. The first is that of conscious acceleration of 
economic, technological, and social change (devel- 
opment). The second, that of locality, refers to 
planned social change in a village, town, or city; 
it relates to projects that have obvious local signifi- 
cance and that can be initiated and carried out by 
local people. According to the widely accepted 
United Nations definitions, communities as units 
of action combine outside assistance with organized 
local self-determination and effort. They achieve 
goals that are both material (a new schoolhouse ) 
and nonmaterial (literacy, lowered infant mortality). 

The accent on development suggests that the 
emerging nations move through a series of stages 
on their way toward modernization. Leaders realize 
that to achieve economic goals more quickly, large 
numbers of people, especially in the rural areas, 
have to be mobilized, In the communist countries 
the collectivization of agriculture was the method 
chosen; in several noncommunist countries, such 
as India and the Philippines, community develop- 
ment or its equivalent is being used. Neither ap- 
proach has proven a panacea, and each has had 
pronounced and often unanticipated social effects. 

During the colonial period of many developing 
countries the central government concentrated 
upon communications and material resources, 
using the local settlements or groups of them for 
administration of justice and for tax purposes. 
Nevertheless, the colonial powers did initiate some 
activities resembling contemporary community de- 
velopment. In Africa there were the mass education 
programs in Kenya, Uganda, Gold Coast (now 
Ghana), sociétés mutuelles de développement rural 
and sociétés indigènes de prévoyance in French- 
administered areas, and foyers sociaux and cooper- 
atives in Belgian-administered territories. In Asia 
the rural reconstruction in India in the 1930s and 
the rural development in Ceylon and mass educa- 
tion in Burma in the 1940s were programs that 
predated the current large-scale community devel- 
opment efforts throughout much of Asia today. The 
growing interest in community development in 
Latin America is relatively recent. In the Middle 
East, programs of rural reconstruction were carried 
out in the 1930s by the Near East Foundation, but 
the post-World War 11 pioneering efforts were with 
the rural social centers set up in the United Arab 
Republic (then Egypt) by the Ministry of Social 
Affairs. 

The term community development now enjoys 
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wide usage in the West, even though community 
organization still best describes the mobilization of 
local resources for social welfare purposes. In 1948 
the United Nations organization had one com- 
munity development adviser in one country; in 
1962 it had 47 such experts in 31 countries. 


Intellectual and social origins 


The mixed ancestry of community development 
gives a clue to the problems of defining the term. 
Social workers, adult educators, local government 
officials, economic planners, city planners, and 
agricultural extension workers consider their re- 
spective professional fields to have been forerun- 
ners of community development, a fact which 
supposedly gives them each the right to speak 
authoritatively about its content and methods, 

In their descriptions of the current scene various 
writers not unexpectedly stress different themes. 
Arthur Dunham (1960) tends to view community 
development as organized efforts to improve the 
conditions of community life and the capacity for 
community integration and self-direction. Four 
basic elements ordinarily found are (1) a planned 
program; (2) encouragement of self-help; (3) tech- 
nical assistance, which may include personnel, 
equipment, and supplies; and (4) integrating vari- 
ous specialties for the help of the community. 

Governmental and national programs. T. R. 
Batten (1957) considers the field of community 
development “to include any action taken by any 
agency and primarily designed to benefit the com- 
munity.” He observes that one of the principal 
problems in using democratic methods in com- 
munity development is that the central govern- 
ments put pressure on village-level workers to 
achieve national goals within given time periods. 
As a result, the village workers attempt to speed up 
the programs with less democratic methods. When 
the program is highly formalized, as in many five- 
year plans, the focus sometimes tends to be upon 
the program rather than upon what is happening 
to the people involved in the program. The em- 
phasis is upon accomplishing sets of activities in 
health, welfare, agriculture, industry, recreation, 
and the like that can be quantified and reported. 

As these statements imply, many community de- 
velopment programs are national in scope and are 
geared to over-all governmental plans, be they 
three-year, five-year, or ten-year, for improving liv- 
ing and economic conditions. In this connection 
community development may be said to be a meth- 
od through which national goals are to be achieved. 

The government of India (India 1958) has been 
quite specific in treating community development 


as a method designed to initiate a process of trans- 
formation in the social and economic life of the 
Indian village. As a method it was supposed to do 
three things: achieve unity of thinking and action 
between all development agencies of government 
and between the official agency, the people’s agency, 
and the people; transform the social and economic 
outlook of the people, chiefly through village organ- 
ization; and conduct intensive area development 
based on a multipurpose approach. Recognition by 
the Indian government that agricultural develop- 
ment presented special problems too great to be 
borne by the Ministry of Community Development 
has led to abolition of this ministry. But there has 
been no decrease in the number of community 
development projects, and community development 
will no doubt continue to be regarded as the single 
most important method available to the government 
for coordinating social with economic planning. 

Fredrick G. Friedmann, who has studied UNLA 
—the Association for the Fight Against Illiteracy— 
in southern Italy as a form of community develop- 
ment, observes that “many Western leaders inter- 
ested in the subject look at community development 
as an attempt at extending, in the vein of the 
applied social sciences, proven techniques of ‘han- 
dling’ situations to the limited and relatively man- 
ageable proportions of a village community and at 
substituting ‘projects’ on a community level for 
large-scale government ‘planning’ ” (1960, preface, 
p. xiv). 

In the United Arab Republic “community devel- 
opment” refers to the organization of rural welfare. 
Unpublished research carried out during the early 
1960s by Doreen Warriner in Egypt showed that 
“self-help” by rural communities and “participation” 
in solving community problems have been little 
more than government slogans, since the emphasis 
has not been upon setting up organizations for the 
rural community. 

Programs based on local initiative. It must not 
be assumed that all community development pro- 
grams are governmental in nature. In the United 
States, for example, the stress has been upon local 
initiative, usually sponsored by private groups or 
organizations (women’s clubs, men’s civic clubs, 
junior chambers of commerce, welfare councils), 
with only an occasional assist from some govern- 
mental agency, Thus, community development is 
viewed as a method of carrying out specific proj- 
ects, each worthy in its own right, rather than as 
part of some detailed national plan or program. 
The nonmaterial benefits to the people are thought 
to be as valuable as the material goals achieved, 
since it is assumed that the more local residents 
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are involved in planning and decision making, the 
more they will rely upon their own community 
resources and less upon the government. This re- 
flects the strong individualistic and ameliorative 
strains in American culture as well as the value 
placed on local self-reliance and democratic par- 
ticipation. An OEEC (Organization for European 
Economic Cooperation) study team from Europe 
found one of the most surprising features of com- 
munity development in the United States to be the 
importance of private efforts in community im- 
provement and the small role played by local gov- 
ernment officials in these matters (1960). 

Various definitions incorporate this emphasis. 
J. D. Mezirow, for instance, has stated on several 
occasions that community development is an or- 
ganized effort to make possible, through training 
and education, a wide range of individual partici- 
pation in the democratic solution of community 
problems. Richard W. Poston (1958, p. 24) defines 
community development as “an organized body of 
knowledge which deals comprehensively with the 
community in its entirety, and with all of the vari- 
ous functions of community life as integrated parts 
of the whole.” He suggests that “the ultimate goal 
of community development is to help evolve through 
a process of organized study, planning, and action 
a physical and social environment that is best 
suited to the maximum growth, development and 
happiness of human beings as individuals and as 
productive members of their society.” 

Lowry Nelson, Charles E. Ramsey, and Coolie 
Verner (1960) see community development as 
an “education-for-action process.” It helps people 
achieve group goals democratically; the leader be- 
comes an agent constructing learning experience 
rather than the proponent of a program for com- 
munity improvement; primary importance is at- 
tached to the individual. Furthermore, it is 
problem-oriented at the community level; the 
means employed in the solution are more impor- 
tant than the solution itself; and it is one of several 
types of purposive change. 

Noncoercion and the problem of incentives. 
The definitions just cited show that community 
development is noncoercive. Totalitarian regimes 
do not view it as politically realistic or as ideo- 
logically desirable, since community development 
stresses the voluntary rather than regimented par- 
ticipation of the individual. But for nontotalitarian 
regimes there are real dilemmas: How long can 
authorities desiring rapid social change wait for 
positive results from a slow educational process? 
Or in working out national economic and social 
plans, how much weight can they give to the pri- 
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orities of villagers throughout the land? To what 
extent can those sponsoring community develop- 
ment rely on local leaders to initiate projects in 
terms of the “felt needs” of their fellow citizens, or 
must this initiative come from outside professional 
workers who stimulate and help local people bring 
into being some form of local action? Although a 
community program is noncoercive it operates 
within certain social controls: pressure brought by 
neighbors upon a villager who fails to carry out 
his part of the program; positive incentives (e.g., 
in the form of wheat) provided to those working 
on community projects; or national recognition 
given to the community and its leaders for a well- 
conducted program. 


Types of functionaries 

Of course, the resolution of these dilemmas and 
the types of social control employed will depend 
upon the leaders of the program and their under- 
lying assumptions, At least four types of func- 
tionaries are found in community development 
programs. First, local leaders are essential if there 
is to be genuine involvement of the people of the 
community. In some countries where community 
development is tied in with local government these 
leaders may be officials; elsewhere, they may be 
lay, voluntary leaders acting out of a spirit of pub- 
lic service. In developing countries great care is 
often taken to teach these leaders the importance 
of using democratic rather than authoritarian pro- 
cedures in efforts to involve their fellow community 
members. Special courses, such as those in the 
American Farm School in Greece, are held for these 
leaders, because ultimate success depends upon 
them. 

A second type of functionary is the community 
organizer, often called the village-level worker. He 
is the new element added by community develop- 
ment to earlier, traditional programs of rural 
change. He is trained in human-relations skills 
rather than in any subject-matter field, such as 
agriculture, health, or recreation. As a generalist, 
he is supposed to know how to relate these fields 
to the problem areas that the local people identify, 
but he does not claim high technical competence 
in them. By working with local leaders, he initiates, 
motivates, guides, and educates, supposedly taking 
into account the goals of the local people as well 
as the goals of the larger program that he is pro- 
moting. Once action has been decided upon by the 
community, the village-level worker becomes the 
expediter, the communication link, the one who 
marshals outside resources appropriate to the local 
needs at the time. 
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But he is relatively ineffective without a third 
type—the subject-matter specialist: agriculturist, 
sanitarian, literacy expert, and the like. Even be- 
fore the village-level worker appeared on the scene 
after World War 11, these specialists were trying to 
carry out changes in their special fields in rural 
areas. But they were doing it in a piecemeal fash- 
ion. One month a health worker would appear 
suggesting some line of action; later a livestock 
specialist would show up demanding vaccination 
for hog cholera, Under community development 
the local people are expected to prepare an over-all 
plan involving several of these fields, beginning 
with the project for which they have the greatest 
enthusiasm and sufficient means at their disposal. 
Thus, the subject-matter specialists, as in the case 
of some recent programs in the Philippines, are 
asked to go to a village as a team, not as com- 
peting professionals. A community development 
worker is also part of the team. Elsewhere, as in 
Ethiopia, the specialists may be assigned to com- 
munity development authorities for full-time work 
under these authorities rather than as part of the 
old-line ministries to which they eventually expect 
to return. In most countries getting the specialists 
to work together effectively presents more of a prob- 
lem than persuading the villagers to agree to a 
program of change. 

This problem is related to the fourth function- 
ary: the person responsible for keeping the admin- 
istrative machinery of a national program in 
running order. In Venezuela, for instance, horizon- 
tal and vertical coordination is being achieved at 
the state level through “regional community devel- 
opment bodies.” The larger the program, the greater 
the proportion of energy and money that goes into 
the mechanics of operation. Vast training pro- 
grams, long hours (even years) of negotiation with 
other government agencies, detailed plans and 
budgets for three or five years are required. In addi- 
tion, some officials must maintain a flow of tech- 
nical materials (pesticides, schoolbooks, drugs, 
etc.) so that they are on hand for ready use when 
the local communities need them. One conclusion 
seems clear: unless the head of the government or 
some influential official with a charismatic, popu- 
lar appeal makes community development a major 
concern, the bureaucratic pressures against its suc- 
cess on a national scale are so great that it is apt 
to bog down. But where people like Ramén Mag- 
saysay in the Philippines or Jawaharlal Nehru in 
India have referred to it often in public and given 
it their full support, the program has enjoyed a 
measure of success. 


Goals and achievements 


In Asian countries, which for more than ten 
years have had the widest experience with com- 
munity development, certain discernible trends are 
under way. First, the programs are stressing eco- 
nomic (including agricultural) goals more heavily 
than heretofore. Second, they are making greater 
use of local governments as the need for decentral- 
ization becomes more apparent. Third, the training 
of village-level workers is stressing the practical 
aspects (for instance, the actual grafting of fruit 
trees) as well as the theoretical aspects—a new 
educational departure for these countries. Fourth, 
new administrative arrangements are being devised 
to assure the coordination of subject-matter spe- 
cialists in accomplishment of program aims. Un- 
doubtedly, community development programs do 
give village people a feeling of political involve- 
ment which they would otherwise not have; they 
have substantial material accomplishmenis (roads, 
school buildings, new plant varieties, etc.) to their 
credit. But the biggest gain, perhaps, lies not at 
the village level but in the better understanding of 
village problems by higher government officials. 
The programs in Africa and Latin America are still 
too new to provide useful generalizations, except 
to indicate that they exhibit wide variety in organ- 
izational procedures and types of problems attacked. 
Land reform tends to be associated with com- 
munity development more than it was in Asia. In 
the developed countries community development is 
increasingly being relied upon as a method for 
local improvements. Social workers, agricultural 
extension workers, public health educators are be- 
ing trained in its techniques, and a few institutes 
have been set up to prepare specialists in com- 
munity development as a special professional field. 


The future of community development 

Community development is still too young to 
justify any long-range predictions about its iden- 
tity as a separate profession or its combination with 
some other field, such as public administration, 
agricultural extension, social work—to name but 
three. 

Like any emerging profession; it has begun to 
develop its applied theory, set forth chiefly in the 
form of principles of action that should be followed 
for effective practice. Since community develop- 
ment is so new, each mature practitioner must 
perforce come out of some previously existing dis- 
cipline or profession. This shows up in the lists of 
principles he sets down or passes on to his associ- 
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ates. Some stress the psychological overtones of 
motivation and group dynamics; others, the socio- 
logical caveats of recognizing social values and 
social structure; others, the administrative aspects 
of sound programming; and still others, the anthro- 
pological investment in cultural change, the edu- 
cators’ concern with learning, or the specialists’ 
concern with appropriate technology. 

There is not at the present time a body of tested 
theory on development. Nor do we know in any 
systematic way why some programs succeed, by 
the developers’ standards, while other programs 
fail. To date, and this is a crucial test, existing 
training programs do not draw upon any identifi- 
able community development theory as such but 
rely almost entirely upon social-science generaliza- 
tions developed quite apart from community 
development activities. 

IRWIN T. SANDERS 


[See also AGRICULTURE, article on DEVELOPING COUN- 
TRIES; FOREIGN AID; PLANNING, ECONOMIC, article 
On DEVELOPMENT PLANNING; PLANNING, SOCIAL.] 
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COMMUNITY-SOCIETY CONTINUA 


Since the emergence of the cities of antiquity, 
social philosophers have attempted to characterize 
and account for the contrast between tribal or 
rural life and the ways of city men. During the 
past century, the age-old contrast has been con- 
ceived in terms of pairs of societal types, such as 
rural-urban or primitive—civilized. The poles of 
such a pair are characterized by the opposite ex- 
tremes of type-defining variables, such as the 
degree of importance of kinship or the extent of 
division of labor in a society. The range of variation 
of such characteristics creates a conceptual “con- 
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tinuum” between the opposed polar types. Because 
the types can refer to either communities or socie- 
ties, the term “community-—society continua” is 
employed to cover these constructs. 

Polar types are one kind of “ideal type.” This 
term runs the risk of being misconstrued, for in 
this context “ideal” has no connotation of excel- 
lence. Here it means that the type is an “idea”—a 
mental construct. It is a concept derived from ob- 
servable reality but not conforming to it in detail. 
Some aspects of that reality are selected and ac- 
centuated in defining the type, because of their 
apparent interdependence and theoretical impor- 
tance. However, ideal types are not classifications, 
No actual society will conform completely to such 
a type. When used as a basis of comparison with 
life situations, however, the type suggests possible 
hypotheses and lines of investigation. Thus, ideal 
types are tools to be used in the analysis of em- 
pirical reality [see TYPOLOGIES]. 


Status and contract 

Modern use of polar types in social science stems 
from the evolutionary concerns initiated by Charles 
Darwin’s Origin of Species (1859) and Henry 
Maine’s Ancient Law (1861). Maine's objective in 
this study was to discover how the institutions of 
his day had evolved from those of antiquity. In con- 
trast to Johann Bachofen, John F. McLennan, and 
Lewis Morgan, who were his contemporaries, Maine 
avoided the pitfalls of flimsy historical reconstruc- 
tions. He stated that much of the study could not 
have been undertaken had it not been for the long 
record of Roman law, linking evidence of remote 
antiquity to the rules and institutions of modern 
society. For lack of other evidence, he was forced 
to use Roman law as a “typical system.” The 
propositions drawn from that law and other avail- 
able evidence were held to be typical of the social 
changes experienced in the emergency of western 
European societies, The validity of most of Maine's 
interpretations rests not only upon the quality of 
his evidence but also upon the logic of the relation- 
ships which he discovered among the changes. The 
“typical” features of ancient and modern society, 
as he described them, have the elements of an ideal 
typology, although Maine did not represent them in 
any such terms. 

On the basis of early writings of the Hindus, 
Hebrews, Greeks, and Romans, Maine concluded 
that primeval society was based on the patriarchal 
family, at least in those societies, There, the eldest 
ascendant male held absolute supremacy over the 
whole family, and the larger society consisted of 


an aggregation of such families. Maine then noted 
that the units of modern society were not families, 
but individuals. His study pursued the conse- 
quences of this difference through human history. 
Primitive law dealt with family groups as corporate 
entities defined by kinship. Crime, for instance, 
was a corporate act; land was jointly held; new- 
comers to such a system had to be incorporated 
into it through the creation of fictitious kinship. 

As societies expanded, a new sort of definition of 
social membership emerged, informed by the prin- 
ciple of contiguity. Locality, rather than kinship, 
thus became the basis of state organization. As 
the powers and privileges once resident in the 
family shifted to the growing state, Maine argued, 
the nature of men’s interrelations shifted from that 
of kin-based status to that of individually agreed- 
upon engagement or contract. Maine found vivid 
evidence of these changes in the decline of patria 
potestas (paternal authority), as dependents gradu- 
ally achieved separate rights under the Roman 
state, Thus, Maine depicts an ideal-typical sort of 
change from society based on kinship, status, ag- 
gregations of families, and joint property rights to 
a kind of society based on territory, contract, and 
individual rights, including those to real property. 
These ideas had a wide influence on his contem- 
poraries and on subsequent scholars. 


Gemeinschaft and Gesellschaft 


Ferdinand Ténnies was the first to make explicit 
the nature and use of ideal types, or “normal types,” 
as he called them. Stimulated by Maine as well as 
by Marx and Hobbes, he developed two such types 
which gave their names to his book Gemeinschaft 
und Gesellschaft (1887). Tönnies’ theory and 
typology rest on his view of the nature of human 
volition, of which he distinguished two types— 
Wesenwille (“natural will” or “essential will”) and 
Kiirwille (“rational will”). The former refers to 
volition that springs from an individual's tempera- 
ment, character, and habits. With rational will, 
however, the distinction between means and ends 
becomes important, and volition is dominated by 
thinking. 

Gemeinschaft cannot be accurately translated. 
It refers to the “community of feeling” (a kind of 
associative unity of ideas and emotions) that re- 
sults from likeness and from shared life-experience. 
Natural will predominates in Gemeinschaft rela- 
tionships, which are best illustrated by the links 
between mother and child, husband and wife, and 
brothers and sisters. Differences in power and au- 
thority do not in themselves destroy Gemeinschaft, 
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which may characterize the relationship of a bene- 
ficent father to a dutiful child. Nor is Gemein- 
schaft limited to formal kinship, for neighborhood 
and collective proprietorship produce a similar 
unity, and friendship expresses a kind of Gemein- 
schaft that is tied to neither blood nor locality. 

The predominance of rational will characterizes 
Gesellschaft. In Gesellschaft relationships, Tonnies 
says, “everybody is by himself and isolated, and 
there exists a condition of tension against all 
others . . . intrusions are regarded as hostile acts 

. . nobody wants to grant and produce anything 
for another individual . . . all goods are conceived 
to be separate, as are their owners” (1887, p. 65 in 
1963 paperback edition ). In such a society, rational 
will operates in terms of the logic of the market 
place. Relationships are contractual; values are 
monetary. Profit is the sole end of trade, and one 
man’s gain is another’s loss. As the merchant tries 
to free himself from all relationships that might 
conflict with commerce, he becomes, as Adam 
Smith pointed out, an individual who is not bound 
to any particular country; indeed, every man be- 
comes, in some measure, a merchant. 

Since real social groups vary in the degree to 
which they resemble Gemeinschaft and Gesell- 
schaft, the types provide a comparative basis for 
discovering order in that variation. Ténnies de- 
scribed one such application of the typology: “In 
the history of the great systems of culture,” he 
wrote, “a period of Gesellschaft follows a period of 
Gemeinschaft” (Ténnies 1887, p. 231 in 1963 
paperback edition). The latter period begins with 
social relations based on family life and on domes- 
tic economy; later, with the development of agri- 
culture and rural village life, there is a shift to 
cooperative patterns based on locality. Then fol- 
lows the growth of town life and the mental com- 
munity of religious faith and artistry. The Gesell- 
schaft period of history opens with the growth of 
city life based on trade and contractual relation- 
ships. Industrialization and the rational manipula- 
tion of capital and labor are accompanied by the 
growth of the state and of national life. Cosmo- 
politan life, toward which Tönnies thought society 
was moving, would be based on the ultimate ex- 
pressions of rational will—science, informed public 
opinion, and control by “the republic of scholars.” 

Tönnies has been criticized for assigning the 
names of his ideal types to actual periods of his- 
tory, for this makes it seem as though he is using 
them for simple classification. Actually, his discus- 
sion of this sequence stresses the progressive shift 
from a predominantly Gemeinschaft sort of society 
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to one dominated by Gesellschaft. In later and little- 
known works, he used the typology to analyze other 
sorts of phenomena, such as morals and public 
opinion, but the types did not figure in his really 
empirical investigations. 

The most specific attempt to operationalize the 
typology has been made by Charles Loomis and 
J. Allan Beegle (1950). They argue that Gemein- 
schaft and Gesellschaft have only limited scientific 
utility because they are “sponge” concepts, absorb- 
ing many vague meanings. To overcome this limita- 
tion, they break down the types into a dozen or so 
variables, largely derived from Talcott Parsons, 
Pitirim A. Sorokin, and Howard Becker and more 
or less directly related to Tönnies’ typology. The 
methodology they have devised to illustrate the 
typology expresses each of these variables in terms 
of an 11-point linear scale extending between 
Gemeinschaft and Gesellschaft poles. Actual social 
systems, such as the Amish family, are then allo- 
cated to positions on these scales by averaging 
the scale assignments made by a panel of judges. 
When a number of such scales are juxtaposed, a 
“profile” of each social system can be created by 
drawing a line connecting its ratings on the vari- 
ous scales. Such profiles permit visual comparison 
of ratings of different aspects of the same system 
or of the same variable in different systems. 

No claim was made that these ratings were 
based on anything more rigorous than the opinions 
of judges who had some personal experience with 
the social systems being judged. Nor was any the- 
ory tested by the procedure, although Loomis and 
McKinney (1956) have since used it to explore 
the nature of the difference between two Costa 
Rican communities selected to represent distinctive 
types of rural social organization in Latin America. 
In this investigation, only six scaled components 
of the ideal types were employed, including all of 
Parsons’ “pattern variables.” 


Mechanical and organic solidarity 


In his book The Division of Labor in Society 
(1893), Emile Durkheim examined the relation- 
ship between two facts that had already been noted 
by Auguste Comte and Herbert Spencer—that the 
division of labor in society was a source of social 
solidarity and that primitive society was relatively 
homogeneous in character. Noting the impossibility 
of observing social solidarity directly, Durkheim 
took variation in types of law as a symbol, or reflec- 
tion, of types of solidarity. Maine afforded him the 
basis for further analysis in the observation that 
law in ancient societies was characteristically penal 
or criminal law, while civil law predominates in 


modern society. Durkheim called the first type of 
law “repressive” and identified it with mechanical 
solidarity, or social unity based on likeness. He held 
that the moral sentiments common to all members 
of a society constitute a “collective conscience”; 
criminal acts are those that violate the common 
conscience and call forth passionate reactions of 
vengeance. Violent punishment of the offender can 
expiate the act because the punishment protects 
the collective conscience of the society from further 
violation. 

Durkheim pointed out that, from a logical point 
of view, the extreme case of an ideal-type society 
defined in terms of homogeneity would be an un- 
organized inchoate horde. In reality, however, al- 
though the mechanical solidarity of the most 
primitive known societies does indeed rest on 
homogeneity, these societies also show some dif- 
ferentiation into unspecialized aggregates such as 
clans. This simple repetition of segments caused 
Durkheim to suggest the term “segmental society” 
for this type. He recognized, however, that mechan- 
ical solidarity plays some role in all societies; as an 
index of its relative importance in a society, he 
took the proportion of law that was penal in 
character. 

In contrast to repressive law, which deals with 
criminal acts against society, civil law deals with 
relationships between special parties in society. 
The sanctions of civil law involve restitution rather 
than punishment, and such “restitutive” law pre- 
supposes cooperation derived from the division of 
labor. The associated type of social solidarity is 
based on the interdependence of specialized parts; 
using the biological analogy, Durkheim called it 
organic solidarity. To the type of society it charac- 
terized he gave no specific name, although he re- 
ferred to it as “occupationally organized.” 

Finally, Durkheim used the legal indices of sol- 
idarity to demonstrate that as one basis of solidarity 
develops the other regresses. It is always organic 
solidarity that wins out over the mechanical, he 
contended, The evolution of society can therefore 
be seen in terms of the passage from mechanical 
solidarity to organic solidarity, with “mechanical” 
or “organic” referring to the dominant type of sol- 
idary relationship at each evolutionary stage. Durk- 
heim saw a connection between this evolutionary 
process and such factors as increasing population 
size and density, the growth of cities, and improve- 
ments in communication. In the increasing com- 
petition resulting from rising population density 
and increasing social interaction, he found the 
cause of the increase in division of labor. 

Durkheim has been criticized by Robert K. Mer- 


ton (1934) for his overemphasis on division of 
labor as the source of solidarity in modern society 
and his use of legal forms as indices of solidarity. 
In addition, anthropological research has shown 
that primitive law has important civil aspects that 
neither Maine nor Durkheim recognized. Despite 
such objections, Merton found the societal ideal 
types to be useful constructs. 


The folk—urban continuum 


Recognizing some commonalty among peoples 
possessing folklore, folk songs, and folkways, Rob- 
ert Redfield sought other distinguishing features 
of the primitive and peasant peoples who constitute 
such “folk.” On the basis of his study (1930) of 
the Mexican peasants of Tepoztlán, he concluded 
that they were like primitives in their self-suffi- 
ciency, nonliteracy, and the local, traditional, and 
sacred orientation of their lives. However, he also 
found that they were part of the modern urban 
world and were gradually being assimilated into it. 

To pursue this process further, Redfield and two 
assistants studied a series of four communities in 
Yucatan: the modern capital city, a town, a village, 
and a tribal settlement of Indians. By comparing 
these communities in what had once been an en- 
tirely Mayan area, Redfield hoped to discover what 
was happening in the folk culture as it was being 
influenced by the modern city. In a more general 
sense, however, he was interested in the shift from 
the type of society represented by the more isolated 
village toward the type illustrated by the capital 
city. 

In The Folk Culture of Yucatan (1941), Redfield 
stated that in drawing up his list of folk charac- 
teristics he had been influenced by Maine, Durk- 
heim, and Tönnies. The list of characteristics con- 
tinued to grow, but only five of them were explicitly 
studied in Yucatan. Evidence was sought to test 
the hypothesis that progressive loss of isolation, 
when associated with an increase in heterogeneity, 
produces social disorganization, secularization, and 
individualization. Intercommunity variation along 
these dimensions could be adequately assessed be- 
cause of the original cultural homogeneity of the 
area. The total evidence, as well as the histories of 
individual communities, tended to confirm the 
hypothesis, but Redfield pointed out that it might 
not hold under all circumstances. 

An opportunity to refine Redfield’s hypothesis 
was presented when Sol Tax (1939) reported that 
in Guatemala, although Indian society was both 
small and homogeneous, it was also secularized 
and individualistic, with weak familial organiza- 
tion. This apparent conflict with the Yucatan find- 
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ings now seems to have been due to the difficulty 
of defining the units of analysis. Actually, Tax had 
shown that although each Indian community was 
homogeneous, Guatemalan society as a whole was 
large and heterogeneous, with an indigenous com- 
mercialized economy based on agricultural and 
craft specialization among the Indian villages. 
Thus, the villages were removed from isolation by 
trade; and this loss of isolation, according to Red- 
field’s hypothesis, would lead to the very character- 
istics observed by Tax. Where Tax and Redfield 
differed was on how marked such nonfolk char- 
acteristics were in Guatemala—which illustrates 
the difficulty of making such judgments without a 
clear basis of comparison. Nevertheless, the Guate- 
malan research highlights the fact that life can 
become less folklike and more urban as a result of 
participation in a highly commercial economy, even 
if that economy is not dependent upon modern 
cities. 

Redfield never defined the urban type of society 
beyond indicating that it was characterized by the 
antitheses of the folk traits. The latter were set 
forth most explicitly in “The Folk Society” and 
summarized as follows. 


Such a society is small, isolated, nonliterate, and 
homogeneous, with a strong sense of group solidarity. 
The ways of living are conventionalized into that co- 
herent system which we call “a culture.” Behavior is 
traditional, spontaneous, uncritical, and personal; there 
is no legislation or habit of experiment and reflection 
for intellectual ends. Kinship, its relationships and in- 
stitutions, are the type categories of experience and the 
familial group is the unit of action. The sacred pre- 
vails over the secular; the economy is one of status 
rather than of the market. (Redfield 1947, p. 293) 


The folk type has been criticized for its obvious 
lack of specificity and focus. This lack is really 
both a strength and a weakness. The advantage of 
the multiplicity of definitive traits lies in the variety 
of causal relationships that are suggested. The dis- 
advantage lies in the lack of suggestion as to which 
traits should be considered as causes, which as 
effects, and under what circumstances. If we had 
the answer to all of these questions, however, there 
would be no need for the ideal types. 

Further tests of the typology. Twenty years 
after Redfield studied Tepoztlan, Oscar Lewis con- 
ducted a restudy of the village. Some of his ob- 
servations were sufficiently different from those of 
his predecessor to require explanation, and Lewis 
concluded his report (1951) with a detailed criti- 
cism of both the original study and the folk—urban 
continuum. The latter criticism was considered by 
Horace Miner (1952) in a general evaluation of 
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the utility of the continuum. He concluded that 
Lewis’ critique reflected a legitimate concern for 
detailed structural, psychological, and historical 
analysis, which differed from, but did not conflict 
with, Redfield’s interest in developing general prop- 
ositions based on formal concepts. Redfield himself 
concluded that the apparent substantive differences 
between the two reports were really differences in 
emphasis resulting from the subjective “hidden 
questions” with which he and Lewis had approached 
the community. 

In the case of one of Miner's studies, The Primi- 
tive City of Timbuctoo (1953), emphasis was ex- 
plicitly placed on the discovery of evidence of 
disorganization, secularization, and impersonaliza- 
tion. This was done to ascertain whether Redfield’s 
“urban characteristics” would be found in a non- 
Western nonindustrial city. Timbuctoo was an 
African commercial center long before European 
contact, and it meets the requirements of Louis 
Wirth’s minimal definition of the city as “a rel- 
atively large, dense, and permanent settlement of 
socially heterogeneous individuals” (1938, p. 8). 
Miner did find cheating, prostitution, usury, and 
theft in this city, but “urban” behavior was largely 
limited to commercial and other relations among 
the major ethnic elements of the population; within 
each of these groups, life was found to be remark- 
ably folklike. Thus Timbuctoo, unlike Western 
cities, presented a mixture of folk and urban char- 
acteristics. 

These observations were placed in a broader con- 
text by Gideon Sjoberg (1952; 1960), who recog- 
nized the similarly mixed character of all preindus- 
trial cities. He also saw such cities and their 
associated peasant villages as the constituent parts 
of an ideal-typical “feudal society,” which con- 
trasted with both of Redfield’s polar types. Sjoberg 
characterized feudal society as being static and 
having strongly sacred value orientations. Unlike 
the folk type, however, it is more heterogeneous 
and is commercially oriented, A primary element in 
this heterogeneity is the small, hereditary, urban 
elite, which controls the political, military, and 
religious institutions. These institutions, in turn, 
provide the principal links with the rural peasants. 
The elite incorporates a literate element of scholar- 
priests and is served by urban craftsmen, organized 
along guild lines. In commerce, standardization of 
weights, measures, and media of exchange are 
generally lacking, and caveat emptor is the rule. 
While cities and societies resembling these types 
have often been developmental stages in the cumu- 
lative process of urbanization, there is no implica- 
tion that they are necessary stages in such devel- 
opment. 


Social complexity. Of the many studies that 
have drawn stimulation from the folk—urban con- 
tinuum, one other should be mentioned here be- 
cause it employed an unusual analytic method. It 
also used cross-cultural data from the Human Rela- 
tions Area Files. Linton Freeman and Robert Winch 
(1957) tested the single dimensionality of social 
complexity—a variable they found to be inherent 
in all of the continua discussed above. Various 
measures of complexity were applied to 48 societies 
and the resultant data subjected to Guttman scaling 
analysis; six of these variables scaled almost per- 
fectly, indicating their unidimensionality. 

Even more interesting were the hypotheses sug- 
gested by the scale relationships among the vari- 
ables. For example, the appearance of a money 
economy seems to be the initial step toward the 
other forms of cultural complexity studied. On the 
other hand, written languages seem to appear at a 
much later stage, for they were found only in soci- 
eties which also had complex systems of exchange, 
law, religion, education, and government. 


Parsons’ “pattern variables” 


Four pattern variables constitute the basic com- 
ponents of Talcott Parsons’ system of analysis of 
social action (1951, pp. 58-67; 1960). The vari- 
ables are “ranges which, in their simplest form, can 
be defined as polar alternatives.” In Parsonian anal- 
ysis, the pattern variables figure as dichotomies: 
each expresses a “dilemma” of choice between two 
distinct alternatives that are faced by the “actor” 
in every social situation. The nature of the variables 
is suggested by the way each relates to the defini- 
tion of behavioral expectations. Affectivity versus 
affective neutrality refers to whether immediate 
self-gratification or its deferment is expected. Speci- 
ficity versus diffuseness is concerned with whether 
the scope of the relationship is seen as narrow, like 
that between a clerk and customer, or broad and 
inclusive, as between spouses. Universalism versus 
particularism has to do with whether action is gov- 
erned in terms of generalized standards or in terms 
of a reference scheme peculiar to the actors in the 
relationship. Finally, the quality versus perform- 
ance dichotomy (also called ascription versus 
achievement) is concerned with whether the char- 
acterization of each actor by the other is based on 
who or what the person is or on what he can do— 
for example, on whether he has royal blood (ascrip- 
tion), or on whether he is a college graduate 
(achievement). 

Classification of societies. Parsons used com- 
binations of two of these dichotomies—universal- 
ism-particularism and ascription—achievement— 
to define four principal types of society. He pointed 


out that all societies organized around kinship fall 
under the particularistic—ascriptive pattern. The 
general emphasis of the fourfold typology thus re- 
flects the focus of the whole theoretical scheme on 
“variability among ‘civilizations’ rather than among 
primitive cultures.” The scheme, therefore, has 
evolutionary implications, as it distinguishes three 
social structural “types which tend to emerge when 
major types of cultural development in the literate 
cultures have occurred” (Parsons 1951, p. 182). 
The emergent type characterized by the universal- 
istic-achievement pattern is exemplified by the 
most industrialized societies. Division of labor in 
such societies emphasizes the specificity and affec- 
tive neutrality of occupational roles, which contrast 
with the diffuse and affective character of kin roles 
in primitive societies. 

If one disregards the two other possible types in 
favor of the universalistic-achievement pattern 
and the particularistic—ascriptive pattern, there is 
obviously a continuum between these ideal-typical 
poles. In empirical terms, one extreme character- 
izes all primitive societies and even feudal ones, 
while heavily industrialized societies resemble the 
other pole. There is a clear tendency for the four 
pattern variables to covary between these extremes, 
although all actual societies show mixtures of the 
two sets of characteristics. It is only the relative 
emphasis that differs. As Parsons says, these two 
clusters of pattern variables “very closely charac- 
terize what in much sociological literature have 
been thought of as polar types of institutional struc- 
ture, the best known version of which perhaps has 
been the Gemeinschaft-Gesellschaft dichotomy of 
Tönnies” (Parsons et al. 1953, pp. 207-208). 

Continua of economic development. It was 
Marion Levy who first used the pattern variables 
to define a continuum of economic development 
(1952). He suggested that societies could be placed 
on a scale of industrialization in terms of the effi- 
ciency of their use of energy and the sorts of energy 
employed. Furthermore, he contended that the 
social systems at the two extremes of such a scale 
differed with regard to certain basic qualities of 
social relationships—the degree to which they were 
based on nonrational traditionalism, whether they 
emphasized universalistic or particularistic criteria, 
and the degree to which they were functionally 
specific or diffuse. The economic systems of highly 
industrialized societies are more rational, univer- 
salistic, and functionally specific, and the problems 
of economic development can be seen in terms of 
the implied changes in these dimensions of non- 
industrialized systems. 

Bert Hoselitz (1953) followed this lead. Defining 
economic development in terms of per capita real 
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income, he used Parsons’ newly stated pattern vari- 
ables in differentiating “underdeveloped” economic 
systems from “advanced” systems. With the excep- 
tion of affectivity—neutrality, the other pattern vari- 
ables seemed to Hoselitz to be related to degree 
of economic development. George A. Theodorson 
(1953) has pointed out, however, that affectivity 
is evident in the absenteeism and labor turnover 
in underdeveloped areas, where the workers are 
reluctant to accept the unfamiliar discipline of the 
factory. Theodorson’s analysis also showed how 
the successful introduction of the factory system 
in nonindustrialized societies ultimately necessi- 
tated social changes which could be fruitfully con- 
ceptualized in terms of the pattern variables. He 
concluded that the pattern of universalism, achieve- 
ment, specificity, and affective neutrality was in- 
evitably linked to industrialization. 

It will be recalled that Parsons delineated four 
societal types and that a continuum of economic 
development is achieved only by ignoring the uni- 
versalism—ascription pattern that is evident in 
authoritarian collectivistic states, such as Nazi 
Germany, and the particularism—achievement pat- 
tern, such as that of imperial China. William Kolb 
(1954) brought these other alternatives into the 
discussion of industrialization and economic devel- 
opment. His focus was on the relation of cities to 
these processes, and his argument emphasized the 
danger of predicting the form of urban industrial 
development from the Western experience. He be- 
gan by pointing out that population density and 
ethnic heterogeneity do not, in themselves, produce 
a universalism—achievement orientation. The need 
for government intervention in the industrialization 
effort of new nations lends itself to the univer- 
salistic—ascriptive orientation of authoritarianism. 
There is also the possibility that even with moder- 
ate development of emphasis on achievement and 
universalism in the industrial sector of life, other 
significant areas of life may remain under the 
older ascriptive—particularistic values. Reacting to 
these suggestions, McKimm Marriott (1954) ex- 
pressed doubts concerning the feasibility of cate- 
gorizing whole societies in terms of Parsons’ vari- 
ables; however, he ventured the opinion that urban 
India seems to live in terms of a particularism— 
achievement type of system of marked vitality. 


At the North American Conference on the Social 
Implications of Industrialization and Technological 
Change in 1960, Hoselitz pointed out that the 
typologies of Ténnies and Redfield had been little 
used in studies of economic change. He contended 
that the Parsonian variables permitted sharper 
focus on the strategic mechanisms of change asso- 
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ciated with industrialization. The conclusions of 
the conference rapporteur général, Wilbert Moore, 
can well stand as a final observation on this whole 
approach. Polar ideal types have heuristic value in 
the development of theory, he says, but “such 
modes of analysis present problems of operational 
identification in research, and of mensuration when 
mixed situations of empirical reality are ap- 
proached” (1963, p. 361). 

Horace M. MINER 


[Directly related are the entries on AGRICULTURE, arti- 
cle on SOCIAL ORGANIZATION; INDUSTRIALIZATION, 
article on SOCIAL ASPECTS. Other relevant material 
may be found in SYSTEMS ANALYSIS, article on so- 
CIAL SYSTEMS; VILLAGE; and in the biographies of 
Darwin; DURKHEIM; MAINE; REDFIELD; TONNIES.| 
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See WAGES. 


COMPETITION 


Competition may be the spice of life, but in 
economics it has been more nearly the main dish. 
Competition has been a major force in the organi- 
zation of production and the determination of 
prices and incomes: economic theory has accorded 
commensurate importance to the concept. 

Competition enters all major areas of man’s life 
and generally connotes rivalry between two or more 
men or groups for a given prize. Competition is 
often an end in itself. Sporting events are clear 
illustrations: we should be shocked if two teams 
called off the event or arranged a tie and divided 
the prize. Indeed, the prize is a minor goal in a true 
sporting event. 

In economic life competition is not a goal: it is 
a means of organizing economic activity to achieve 
a goal. The economic role of competition is to dis- 
cipline the various participants in economic life to 
provide their goods and services skillfully and 
cheaply. 


Perfect competition 

Market competition. When one asks (as Cour- 
not was the first to do in a precise way in 1838) 
whether the competition of three merchants will 
serve better than two, or why two (or three) do not 
combine into a monopoly, the answers prove to be 
elusive. But one can partially evade such questions 
by posing a very extreme degree of competition, 
which the economist calls perfect competition. 

A main requirement of perfect competition is 
that the largest firm in an industry make a trifling 
fraction of the industry's sales (or purchases) and 
therefore that there be many firms in the industry. 
No definite number has been found for the maxi- 
mum share of one firm that is compatible with 
competition; presumably the permissible share can 
be larger the more elastic the industry demand and 
the easier the conditions of entry by new firms. 

These many firms, no one or few of which ac- 
count for an appreciable share of the industry's 
output, are assumed to act independently. This can 
be viewed as a second condition for perfect com- 
petition, or as an inevitable corollary of the large 
numbers in the absence of legal controls over the 
industry. For it is a fact that there are insuperable 
difficulties in organizing an effective combination 
of many persons when it is profitable for each per- 
son secretly to depart from the agreement, as is 
generally the case in economic life. 

Such large numbers suggest what is true: that 
(perfect) economic competition is impersonal. In 
the economic race there are 1,000 or 100,000 run- 
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ners, and each gets a prize proportional to his 
efforts. The fortunes of any one firm are inde- 
pendent of what happens to any other firm: one 
farmer is not benefited if his neighbor’s crop is 
destroyed. The essence of perfect competition, 
therefore, is not strong rivalry but rather the utter 
dispersion of power to influence market behavior. 
The power, for example, to restrict quantities sold 
and raise prices is effectively annihilated when it is 
divided among a thousand men, just as a gallon of 
water is effectively annihilated if it is spread over 
a thousand acres. 

A third condition of perfect competition is com- 
plete knowledge of offers to buy and sell by the 
participants in the market. This condition serves 
just the opposite purpose of the preceding condi- 
tion. The assumption that traders act independently 
serves to keep them apart and, hence, numerous; 
the assumption that each seller knows what various 
buyers will pay, and vice versa, is necessary to keep 
the parties together—in the same market. If seller 
S and buyer B dealt only with one another in 
ignorance of all other traders, and similarly for 
every other pair of buyers and sellers, each trans- 
action would represent an exchange under bilateral 
monopoly. 

These conditions of perfect competition are 
enough to ensure that a single price will rule in a 
market (in fact, perfect knowledge is enough for 
this purpose) and that this price is affected only 
negligibly by the actions of any one or few buyers 
or sellers. (It is sometimes additionally assumed 
that the product of all sellers be homogeneous, but 
this can also be viewed as part of the definition of 
the market or industry.) The definition of perfect 
competition is therefore sometimes expressed in 
the equivalent form: the demand curve facing each 
seller is infinitely elastic; and the supply curve fac- 
ing each buyer is infinitely elastic. (This definition 
also applies to the individual firm, which accord- 
ingly may be competitive even though the market 
in which it trades is not competitive. ) 

To these basic conditions of perfect competition 
—numerous traders on each side of the market, 
independence of action, and perfect knowledge— 
it is necessary to add divisibility of the commodity 
or service being traded. If the units are large, it is 
possible that minor discontinuities will emerge that 
allow some small market power to individuals. The 
point is sufficiently minor to be left to references 
(Edgeworth [1881] 1953, p. 46; Stigler 1957, pp. 
8-9). Š 

These conditions of perfect competition pertain 
to a single market, whether of shoes or bonds or 
carpenter's services. So far as the presence or ab- 
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sence of monopoly power is concerned, it is not 
necessary to look at any other market. For this 
reason these conditions pertain to what may be 
called market competition. 

It is traditional, however, to enlarge the con- 
ditions of competition, so they will ensure an op- 
timal allocation of resources, by specifying the 
nature of the movement of resources among mar- 
kets and industries. This enlarged concept, which 
may be termed industrial competition, is our next 
subject. 

Industrial competition. If a productive resource 
is to be utilized efficiently, it must be equally pro- 
ductive in all of its uses—clearly if its (marginal ) 
product is less in one use than another, output is 
not being maximized. Hence, two additional con- 
ditions commonly have been made a part of perfect 
competition: resources are mobile among uses; and 
their owners are informed as to yields in these 
various uses, 

A vast galaxy of private and public barriers to 
the mobility of resources has been erected in vari- 
ous times and places: boycotts, certificates of con- 
venience and necessity, patent licenses, settlements 
laws, franchises, licensing of occupations. Such 
barriers are all actually or potentially incompatible 
with competition. But it is not necessary for com- 
petition that the movement of resources be free: 
the retraining of a worker, or the transportation of 
a tool, may be costly without interfering with com- 
petition. We must enlarge our earlier condition of 
full information to include knowledge of the yields 
on resources in alternative employments. From an- 
other viewpoint, we may say that ignorance is a 
barrier to profitable movement of resources. 

If these conditions are fulfilled, the maximum 
possible output (measured by value) will be ob- 
tained from a productive resource. If this be true 
for each resource, the output of the economy is at 
a maximum. This famous theorem (labeled “on 
maximum satisfaction” by Walras and Marshall) 
is subject to a qualification, as all interesting prop- 
ositions are: the private marginal product of a pro- 
ductive resource (the amount its owner receives 
and hence what governs its allocation) must equal 
the social marginal product (private marginal prod- 
uct plus or minus the effects on others). Of course, 
the maximum value output is dependent upon the 
distribution of income, which affects the demands 
for goods and hence their prices. 

Time and competition. What we have termed 
industrial competition—competition including mo- 
bility of resources—obviously has an implicit time 
dimension. It takes time to move resources out of 
unprofitable fields, especially if the resources are 


specialized and durable, so that only through dis- 
entangling depreciation funds can the resources be 
withdrawn. It takes time, too, to construct a new 
factory or shop when one wishes to enter an in- 
dustry. Comparable statements can be made about 
the geographical and occupational mobility of labor. 
Similarly, time is a factor in the completeness of 
knowledge. It takes time to learn which industries 
or jobs are most remunerative, or to learn the prices 
quoted by various sellers, or the quality of service 
and product; and one’s knowledge is more complete 
and reliable, the more thorough the search for in- 
formation and the larger the experience on which 
it is based. 

Capital embodied in specialized and durable 
equipment will not be transferred to other uses in 
the short run except at extreme price differentials, 
even though in the long run the slightest differen- 
tial in returns may be sufficient to move capital 
funds. Conversely, only under extreme incentives 
will new establishments be created virtually over- 
night, as we sometimes observe in wartime. 

This fact that it is more expensive to do things 
very quickly than at a slower pace does not qualify 
the proposition that resources will tend to be put 
where they earn the most, but we are reminded 
of the implicit proviso: allowance must be made 
for the cost of moving the resources. 

The differences in returns to a resource in vari- 
ous uses can be very great in the short run, but will 
decline in the long run to a minimum level set by 
the cost of the most efficient method of moving 
resources. There is implicit in economic literature 
the belief that these minimum costs of movement 
of resources are very small relative to their returns, 
so little imprecision arises from neglecting them 
entirely. This may be true but has not been dem- 
onstrated. The belief, nevertheless, led economists 
(for example, J. B. Clark) to postulate instantane- 
ous and costless mobility as the pure case of per- 
fect industrial competition. It seems preferable to 
say that minimum differentials in returns to re- 
sources are achieved only in the long run. Market 
competition is not so intimately related to time. 
One’s information about price bids and offers im- 
proves somewhat as one searches the market more 
thoroughly—itself a time-consuming process—but 
the changing conditions of supply and demand 
lead to changes in prices that make the old in- 
formation obsolete. 


The theory of competitive prices 

The competitive structure of industry leads to 
the establishment of competitive prices. Competi- 
tive prices are characterized by two main proper- 


ties. The property of clearing markets is that of 
distributing existing supplies efficiently; the prop- 
erty of equalizing returns to resources is that of 
directing production efficiently. 

The clearing of markets. A competitive price 
is one that is not perceptibly influenced by any one 
buyer or seller. When we say that such prices are 
fixed by “supply and demand” we mean that the 
ensemble of all buyers and sellers determines price. 

Since every buyer can purchase all he wishes of 
the good or service at the market price, there are 
no queues or unsatisfied demands, given the price. 
Since every seller can sell all he wishes at this 
market price, there are no undisposable stocks, 
other than inventories that are voluntarily held for 
future periods. The competitive price, then, clears 
the market—it equates the quantities offered by 
sellers and sought by buyers. 

Whenever we find a persistent queue among 
buyers, we know that the price is being held below 
the level that clears the market, which we nat- 
urally call the equilibrium price. For example, 
when housing is unavailable under rent controls, 
we know that rents are below the equilibrium level. 
Whenever we find stocks held by sellers to be in 
excess of inventory needs, we know price is above 
the equilibrium level. The vast stocks of agricul- 
tural products held by the U.S. government are 
evidence that the prices of these products (more 
precisely, the amounts the government will lend 
on the products) are above the equilibrium level. 

The importance of prices that clear markets is 
that they put goods and services in the hands of the 
people who most urgently wish them. If a price is 
held too low, some buyers who set a lower value on 
the commodity will get it while others in the queue 
who set a higher value get none. If the price is set 
too high, goods that buyers would be glad to pur- 
chase at a lower price go unsold even though (if a 
minimum price is imposed on a competitive in- 
dustry) sellers would prefer to sell at this lower 
price. 

The equalization of returns. It is part of the 
definition of industrial competition that every re- 
source in an industry earn as much as it would 
earn in other industries, but no more. The self- 
interest of the owners of productive resources (in- 
cluding, of course, that most important resource, 
the laborer) leads them to apply their resources 
where they yield the most and thus to enter un- 
usually attractive fields and abandon unattractive 
fields. 

This equalization of returns, however, can be 
shown to imply that the prices of goods and serv- 
ices equal their (marginal) costs of production. 
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The cost of a productive service to an industry is 
the amount that must be paid to attract it away 
from other uses—its foregone alternatives. (This 
most basic concept of cost is the essence of the 
alternative or opportunity cost theory.) If the 
amount the productive resource earns in an indus- 
try is in excess of this cost, clearly other units of 
the resource presently outside the industry could 
earn more if they entered. Conversely, if the pro- 
ductive resource is earning less than its cost or 
alternative product, it will leave the industry. 
Hence, if price exceeds cost, resources will flow 
into the industry and lower the price (and perhaps 
raise cost by raising the prices of the resources); 
if price is less than cost, resources will flow out 
and increase the price (and perhaps reduce cost). 

The equality of the marginal products of a re- 
source in all its uses is the condition for efficient 
production. The equality of average products has 
often been substituted, with a regrettable loss of 
logic: consider the catastrophic waste (of capital) 
in having equal output per worker in two industries 
when the capital equipment per worker is ten times 
as large in one industry as in the other. But if the 
marginal product of a resource is equal in its vari- 
ous uses, it follows that marginal cost must equal 
price. The resources necessary to produce one more 
unit of product A could produce an equal value of 
B, so the marginal cost of A—which is the fore- 
gone alternative of producing B—is equal to the 
value of A that it produces. Marginal cost, formally 
defined as an increment of cost divided by the 
increment of product associated with the incre- 
ment of cost, and not the more easily measured 
average cost (total cost divided by output), is the 
economist’s fundamental criterion of competitive 
price—and of optimum price. 

Marshall’s period analysis. The alternative uses 
open to a resource depend upon the time available 
for its redeployment (or more fundamentally, how 
much one is willing to spend on its movement). 
This principle, joined to an empirical observation 
that one can alter the rate of operation of a plant 
much sooner than one can build a new plant or 
wear out an existing one, provides the basis for 
the standard (Marshallian) theory of long-run and 
short-run competitive prices (Marshall 1890). 

In the short run, defined as the period within 
which one cannot appreciably alter the number of 
plants (physical production units ), the only method 
of varying output is to work a given plant more or 
Jess intensively. The so-called variable productive 
factors (labor, materials, fuel) are the only re- 
sources with effective alternative uses in this period 
and therefore the only services whose returns enter 
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into marginal costs. The returns to the productive 
factors embodied in the plant are called quasi rents. 
So long as quasi rents are greater than zero it will 
be more profitable to operate a plant than to close 
it down. 

The long run is defined as the period within 
which the entrepreneur can make any desired de- 
cision—including the decision to leave one industry 
and enter another. In this period all resources are 
variable in quantity, and therefore the returns to 
all factors enter into marginal cost. 

The Marshallian apparatus permits very useful 
simplifications in price theory, but only if its under- 
lying empirical assumption is fulfilled: the long- 
run adjustments of the firm are of negligible 
magnitude in the short run (and hence can be 
neglected), and the short-run adjustments do not 
appreciably affect the long-run costs. When these 
conditions are not met (they fail, for example, if 
discharge of workers in this period will lead to 
higher wage rates in the next period), the full 
analysis of the short run will still require explicit 
analysis of the long-run repercussions of the short- 
run decisions. 


Alternative concepts 


The austerity and abstractness of the concept of 
perfect competition have led many economists to 
seek a more “realistic” concept. This search has 
been reinforced by the need for a concept of com- 
petition usable in the enforcement of the U.S. anti- 
trust statutes. A variety of concepts have accordingly 
been proposed, but because they were deliberately 
contrived to fit the infinitely varied circumstances 
of a vast economy they lack the analytical clarity 
of perfect competition. 

Workable competition. The most popular of 
these variant concepts is that of J. M. Clark, which 
he labeled workable competition (1940). The phi- 
losophy of this concept is clear enough: actual 
industries will seldom have thousands of independ- 
ent firms, and never will the entrepreneurs have 
complete knowledge. It is not useful to characterize 
all of these industries as imperfectly competitive, 
for some will be near-monopolies and others will 
have prices, outputs, and rates of progress that 
deviate in only minor respects from what perfectly 
competitive industries would experience. In partic- 
ular, many industries do not depart sufficiently 
from perfect competition (which is, of course, un- 
attainable) to create any need for antitrust actions 
or public regulation. 

Workable competition has been a very popular 
concept since its formalization in 1940, but its seri- 
ous ambiguity has not yet been reduced. How com- 
petitive an industry should be (using observable 


criteria we shall discuss below) to be workably 
competitive has never been settled. Indeed the cri- 
teria (prices, service, product innovation, rates of 
return) that deserve most weight in any applica- 
tion of the concept have not been agreed upon. 
Two competent persons who study a particular 
industry can disagree on its workable competitive- 
ness, and there exists no analytical basis for elimi- 
nating the disagreement. 

Monopolistic competition. The other leading 
concept, monopolistic competition, was formulated 
by E. H. Chamberlin (1933) and is directed to a 
different purpose. Chamberlin emphasized the di- 
versity in the products of firms that are normally 
considered members of a single industry; they dif- 
fer in details of quality, in repute, in locational 
convenience, in the religion of their producer, and 
a hundred other details that may influence their 
desirability to various buyers. He emphasized also 
the substitutability of products made by what are 
viewed as different industries; one may use alumi- 
num or steel or wood to build a chair, and osten- 
tatiously display one’s wealth with jewels, servants, 
or trips abroad. Each firm, in this view, has some 
elements of uniqueness (monopoly power) and yet 
many rivals, and the admixture gives rise to the 
title of the concept. The theory of monopolistic 
competition has led to a much more thorough ex- 
amination of the problems of defining commodities 
and industries, It has not been found useful in the 
analysis of concrete economic problems, 

Competitive equilibrium. The lack of conscious 
coordination of the behavior of individuals in a 
competitive market has led many writers to assert 
the impossibility of any stable equilibrium. Some 
have denied that any order is observable: the Con- 
tinental cartel literature usually uses the word 
“chaotic” as a prefix to competition, and most pro- 
posals for an “orderly” policy assume that a com- 
petitive system is disorderly. Others have found 
cumulative tendencies in competition: for example, 
W. T. Thornton said that “if a single employer 
succeed in screwing down wages . . . his fellow- 
employers may have no alternative but to follow 
suit” ([1869] 1870, p. 105). Sidney and Beatrice 
Webb elaborated this view into their famous theory 
of “higgling in the market” ([1897] 1920, part 3, 
chapter 2). 

Modern economic analysis, on the other hand, 
makes competitive equilibrium the central part of 
the theory of prices and allocation of resources. 
The presence of order and continuity in markets 
composed of many independently acting buyers and 
sellers has been established beyond serious ques- 
tion, on both theoretical and empirical grounds. 

The main stumbling block in the layman’s ac- 


ceptance of competitive equilibrium is the belief 
that many individuals acting independently will 
necessarily either undershoot or overshoot every 
appropriate change in output, prices, investment, 
and the like. If, for example, increasing demand 
calls for a 10 per cent increase in industry capacity, 
how can this precise total be achieved when a vast 
number of firms are individually and independently 
changing their plants in a hundred different pro- 
portions? In a sense this is a false question: no one 
can know that the following year’s demand will be 
exactly 10 per cent larger, and neither a public 
body nor a private monopolist can guarantee to 
have the “right” amount of capacity the next year. 
But let us put this complication aside. 

The answer, then, is that there is much informa- 
tion available to guide the decisions of the numer- 
ous independent firms. In part this is current 
information: every trade is abreast of the invest- 
ment decisions of its various firms, of the develop- 
ments in products and production methods, and so 
on. This information comes from salesmen, trade 
journals, customers and suppliers, and a host of 
other sources. The firm is guided also by past be- 
havior in the industry: if previous increases of 
output were supplied in some part by new firms, 
this becomes a factor in current decisions. 


Empirical evidence of competition 


A variety of statistical tests of the existence of 
competition have been proposed at various times, 
and at least three deserve some attention. 

The presence of numerous firms, none dominant 
in size, is directly observable and is usually de- 
scribed by a low concentration ratio. The main dif- 
ficulty with this structural test of competition is 
that the maximum concentration compatible with 
competition has not been determined, so the test 
is clear only when concentration is low. The prob- 
lem is complicated by the fact that we have had 
no theoretical guide in summarizing the frequency 
distribution of firm sizes, which can, of course, be 
done in many ways. 

Since a single price will rule under perfect com- 
petition, price homogeneity has often been proposed 
as a test of competition. We have already remarked 
that perfect knowledge is enough to ensure a single 
price, whether the market is competitive or monop- 
olistic. Indeed, in a market of numerous sellers and 
buyers it is improbable that all prices in a given 
short interval of time will be uniform. It is im- 
probable for two reinforcing reasons: the trans- 
actions will seldom be in completely homogeneous 
goods (quantity discounts, promptness of payment, 
and a dozen other characteristics vary almost in- 
finitely among transactions); and the cost of learn- 
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ing market prices, given numerous traders, is such 
that complete information is not worth it. As a 
result, strict uniformity of prices has properly been 
viewed by the courts as a phenomenon more sug- 
gestive of collusion than of competition. 

A related evidence of competition is more power- 
ful: the absence of systematic price discrimination. 
If sellers are persistently obtaining higher net re- 
ceipts (which need not be the same as prices) from 
some buyers than from others, we may be confident 
that they are acting in concert—a truly independ- 
ent firm would concentrate its sales on the buyers 
who yielded higher net receipts. 

A fourth, and perhaps the most traditional, test 
of the absence of competition is a high rate of 
return on investment. It has lost much popularity 
because of the difficulty of measuring profitability 
(in particular, the valuation of durable assets can 
conceal monopoly profits or create fictitiously high 
rates of return) and because an absence of high 
profits is compatible with various cartel arrange- 
ments. Yet it is true that unusually high or low 
rates of return will not persist for long periods in 
a competitive industry. More specifically, a recent 
study suggests that in unconcentrated manufactur- 
ing industries the rates of return of one year will 
provide no useful clue to the rates earned, say, 
five years hence (Stigler 1963, chapter 3). 


Public policies 

Laws, both statutory and common, have sought 
to protect competition for centuries. The Statute 
of Monopolies, which was passed in 1623 to re- 
strain the crown’s use of grants of monopoly for 
revenue, was a famous example, as were the stat- 
utes (which Adam Smith compared in rationality 
to laws against witchcraft) against forestalling, en- 
grossing, and regrating grain. 

The Sherman Act of 1890 was pathbreaking, 
therefore, not in its prohibition of restraints of 
trade but in the implementation of this policy by 
an administrative force charged with ferreting out 
and prosecuting such acts. This most basic of all 
antimonopoly laws forbade not only conspiracies 
in restraint of trade but also attempts to monopo- 
lize—and in such broad terms as almost to defy 
conflicts of spirit and letter. Criminal penalties 
were supplemented by the incentive of triple dam- 
ages to private parties who were injured by the 
forbidden acts. 

The complaint that the Sherman Act came into 
force only after competitive markets had been de- 
stroyed (which was neither true nor wholly false), 
the belief that a group of specialists could deal with 
industrial problems more effectively than the judi- 
ciary, and the general impatience of reformers— 
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all combined to bring about in 1914 the Clayton 
Act, which prohibited a set of practices that (it was 
believed) often led to monopoly, and the act creat- 
ing the Federal Trade Commission to enforce the 
Clayton Act. With amendments—the most impor- 
tant being the Robinson—Patman Act of 1936 and 
the Celler-Kefauver Merger Act of 1950—the legis- 
lative basis of U.S. policy had been developed. This 
policy includes certain discordant anticompetitive 
elements (the Robinson—Patman Act, with its goal 
of rigid uniformity of prices, and the legalization 
of resale price maintenance), as general policies 
have a habit of doing. 

That this policy has contributed to the competi- 
tiveness of the U.S. economy is difficult to deny or 
to document. Yet international comparisons—in 
particular, of the same industry (often composed 
of the same firms) in Canada and the United States 
—suggest that the policy has had substantial 
effects. So, too, does the fact that the favorite prac- 
tices of the formal cartel—a joint sales agency or 
division of customers—are quite uncommon in the 
United States. 

The policy of restricting agreements among com- 
petitors (but not the policy of seeking to prevent 
monopolies) has spread to numerous other nations 
since its introduction in the United States. The 
most common form is to require registration of 
agreements among firms in an industry, and the 
subsequent approval or disapproval of the agree- 
ment by a specially constituted body. This is the 
practice of England, Germany, and several other 
nations, as well as of the European Common 
Market. 

GEORGE J. STIGLER 


[See also ANTITRUST LEGISLATION; CARTELS AND TRADE 
ASSOCIATIONS; ĪNDUSTRIAL CONCENTRATION; MAR- 
KETS AND INDUSTRIES; MERGERS; MONOPOLY; OLI- 
GOPOLY.] 
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COMPONENTIAL ANALYSIS 


Componential analysis is a method of describing 
the subject matter of a language, It aims at con- 
structing verifiable models of how specific bodies 
of cultural (or ideational) content are coherently 
organized, insofar as such content is represented 
by words and expressions in a people's language. A 
method in both semantic and cultural description, 
componential analysis is perhaps best character- 
ized as a method of ideography. 

History. In Coral Gardens and Their Magic 
(1935), Malinowski demonstrated the immediate 
relevance of descriptive semantics for ethnography. 
But the application of rigorous method in this area 
began only after World War n, inspired by the 
methodology of structural linguistics and devel- 
oped and utilized by anthropologists trained in this 
discipline (Goodenough 1957). Indeed, the term 
“componential analysis” is taken from linguistics, 
where it is used to refer to the criteria by which 
distinctive categories of sound in a language are 
distinguished and, subsequently, to refer to the 
analysis of semantic distinctions encountered in 
grammatical paradigms (Harris 1948). 

The first ethnographic use of the method illus- 
trated the categorization of kinship relations in 
Truk (Goodenough 1951). In 1955 Conklin pub- 
lished the conclusions of a similar analysis of 
Hanunóo color categories. Then Lounsbury (1956) 
and Goodenough (1956), working independently, 
simultaneously published extended statements of 
the method, again utilizing kinship materials. Since 
then several such kinship studies have been pub- 
lished (see Bibliography), Wallace and Atkins 
(1960) were concerned with how far the results 
of such analysis reflected the actual cognitive 
organization of phenomena in the minds of the 
people being studied. How ideational models con- 
structed by componential analysis actually re- 
lated to “psychological reality’ became a subject 
of debate (Frake 1962; Burling 1964). A con- 


ference to discuss this and other matters relating 
to componential analysis was held in June 1964 
under the sponsorship of the Wenner-Gren Foun- 
dation for Anthropological Research (Hammel 
1965). Conklin (1962) observed that the method 
is applicable to the analysis and description of folk 
taxonomies generally and to a wide range of prob- 
lems in lexicography and ethnography. Frake began 
to explore its use in the analysis of such aspects of 
culture as the classification and diagnosis of dis- 
eases (1960) and the organization of religious 
ideas (1964). 

As of 1964, however, the clearest expositions of 
the method, of the theoretical issues it raises, and 
of its limitations by comparison with other methods 
for accomplishing similar descriptive objectives 
have been presented in connection with the analy- 
sis of kinship terminologies (Lounsbury 1964a; 
1964b). Because kinship has been studied more in- 
tensively and extensively by anthropologists than 
any other single aspect of culture, it is one of the 
few cultural domains for which anthropologists 
are readily able to meet the data requirements of 
the method. Minimally adequate notations for re- 
cording the data are available, and there is a fairly 
well-developed metalanguage for talking about the 
properties of genealogical space. To go beyond kin- 
ship is to face the arduous but scientifically im- 
portant task of developing suitable notations and 
metalanguages. 

Specific aims of the method. In the terminology 
of semiotics (Morris 1938), a linguistic expression 
designates a class of images or concepts, it denotes 
a specific image or subclass of images within the 
class on any one occasion of its use, and it signi- 
fies the criteria by which specific images or con- 
cepts are included or excluded from the class of 
images or concepts that the expression designates. 
Thus, what is signified are the definitive attributes 
of the class, An expression connotes other images 
or concepts that people associate with the expres- 
sion’s designatum but which are not themselves 
definitive attributes of the designated class. To say 
that the sky is cloudy is to designate a class or type 
of meteorological condition, to denote a specific 
image of such condition, to signify the definitive 
attributes of the class, and to connote such things 
as chill and rain. Componential analysis is con- 
cerned solely with the significational aspect of 
meaning. Thus, it differs sharply from most word- 
association approaches to semantics, which deal 
almost entirely with connotations (e.g., the “se- 
mantic differential” technique described in Osgood 
et al. 1957). 

In its concern with signification and definitive 
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attributes, componential analysis starts with ex- 
tensional definitions (listings of denotata) and 
seeks to reduce them to intensional definitions. For 
example, an extensional definition of the English 
kinship term uncle would list such denotata after 
it as mother’s brother, father’s brother, mother’s 
half-brother, father’s half-brother, mother’s sister's 
husband, mother’s half-sister’s husband, father’s 
sister’s husband, and father’s half-sister’s husband. 
An intensional definition might be as follows: An 
uncle is any kinsman by blood or marriage who is 
simultaneously (a) male, (b) two degrees of 
genealogical distance from ego, (c) not lineal, (d) 
in a senior generation, and (e) not connected by 
a marital tie in other than the senior generation of 
the relationship. 

As this example shows, an intensional definition 
is conjunctive, seeking to reduce the disjunctive 
extensional definition to a unitary class described 
as a product of the combination of several defini- 
tive attributes. That these are definitive attributes 
in this case is evident from the fact that to vary 
any one of them results in the judgment that uncle 
is impermissible as a term of reference. Aunt be- 
comes the appropriate term if we vary (a) sex; 
great-uncle if we vary (b); grandfather if we vary 
(c); nephew if we vary (d); and wife's uncle or 
husband's uncle if we vary (e). This, incidentally, 
illustrates a way in which the results of compo- 
nential analysis are capable of verification. 

General procedure in the method. We arrive at 
definitive attributes by a combination of two opera- 
tions: inspecting the set of a term’s denotata for 
common attributes and contrasting the set of its 
denotata with the sets of denotata of other terms. 
The latter operation is the crucial one. It leads us, 
among other things, to recognize hierarchies of 
subject matter or of semantic domains. 

To contrast the denotata of English dog and 
mackerel is to result in a bundle of discriminating 
features that also discriminate between cat and 
pickerel, dog and pickerel, and cat and mackerel, 
This provides a basis for distinguishing two seman- 
tic domains, the one to which cat and dog belong 
and the one to which mackerel and pickerel belong. 
That mammal and fish are cover terms designating 
these two domains shows us one kind of structural 
relationship between the significata of linguistic 
forms. The significatum of dog contains all the 
definitive attributes in the significatum of mammal 
plus some additional ones that discriminate its 
denotata from those of cat, horse, etc. 

Hierarchical relationships among the significata 
of words and expressions seem obvious enough to 
speakers of English in an English example, but 
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they appear to be characteristic of at least large 
portions of vocabulary in all languages. It is thus 
possible to sort vocabularies into distinct sets per- 
taining to different domains of experience. Com- 
ponential analysis helps us to determine what “goes 
together” in unfamiliar languages so that we do 
not arbitrarily sort them on the basis of rough 
translations or glosses into the domains of English 
(or other language of description ). In the language 
of Truk, raaw (“whale”) belongs to the domain 
designated by the cover term iik, along with most 
things that we would subsume under the cover 
term fish in English. There seems to be no semantic 
domain in Trukese that corresponds to the domain 
designated by English animal. 

Componential analysis has been directed pri- 
marily at systematically contrasting the sets of 
denotata of the several expressions within single 


domains or subdomains. Analysis has shown that 
the several expressions within a domain can be 
sorted into sets so that all the expressions in a set 
have mutually exclusive denotata at a given hier- 
archical level and differ from one another with 
respect to one or several dimensions of discrimina- 
tion (such as the several dimensions used to dis- 
criminate uncle from nephew, aunt, etc., in the 
example given above). Such a set of expressions 
constitutes a terminological system. The method 
of componential analysis has been applied almost 
entirely to delimiting and depicting the ideational 
structure of terminological systems. 

Illustration of the method—Moala kinship ter- 
minology. Moala is an island in Fiji whose social 
organization has been described by Sahlins (1962), 
His published data, which appear in Table 1, pro- 
vide the material for analysis here. The definitions 


Table 1 — Moala kinship data 


KINTERM EGO 
1. famaqu male or female 


ILLUSTRATIVE KINTYPES* 


Fo, FaBr, Fa ¢ // Co, FaFa ¢ // CoSo, FaMo 9 // CoSo, Mo 9 // CoHu, MoMo 9° // CoDaHu, 


Hu of any tinaqu, Mo 4 XCo, WiMoBr, WiMo 2 // Co 


2. tinaqu male or female Mo, MoSi, Mo 9 // Co, MoMo 9 // CoDa, MoFa ¢ // CoDa, Fa 9 XCo, Fa g // CoWi, 
FaFa 4 // CoSoWi, Wi of any tamaqu, HuFaSi, HuFa 9 // Co 
3. luvequ male Ch, BrCh, 9 XCoCh, 3 // CoCh, Ch of any á Ch of any tamaqu or tinaqu, 
Fa 9 // CoDaCh, Mo g XCoSoCh,* WiSiCh, Wi 9 // CoCh, BrWiSiCh, BrWi 9 // CoCh 
female Ch, SiCh, 9 // CoCh, ¢ XCoCh, Ch of 9 Ch of tamaqu or tinaqu, Mo 9 // CoDaCh, 
Mo & XCoDaCh, Fa 9 XCoDaCh, Fa ¢ XCoSoCh, Fa ¢ // CoDaCh”, Fa 9 // CoSoCh, 
HuBrCh, Hu 3 // CoCh, SiHu 3 // CoCh 
4, taciqu male Br, $// Co, Fa g // CoSo, Mo 9 // CoSo, all So of all tamaqu or tinaqu, all Fa of luvequ 
female Si, 9// Co, Mo 9 // CoDa, Mo 3 XCoDa, Fa 9 XCoDa, Da of tinaqu or tamaqu, Mo of luvequ 
5, wekaqu male Si, 9 // Co, Mo 2 // CoDa, Fa 3 // CoDa, Fa 9 XCoDa, all Da of tamaqu or tinaqu 
ate od female Br, 3 //Co, Fag // CoSo, Fa 9 XCoSo, Mo 2 // CoSo, Mo § XCoSo," 
all So of tamaqu or tinaqu 
6, vugoqu male or female  MoBr, FaSi, Mo 3 // Co, Fa 9 // Co, Br of any tinaqu, Si of any tamaqu, g XCo of any 
tamaqu, 9 XCo of any tinaqu, Ch of any wekaqu, SoWi, DaHu, So of any luvequ 
male SiCh, 9 // CoCh 
female BrCh, g // CoCh 
7. tavalequ male MoBrSo, FaSiSo, Mo å // CoSo, g // CoSo of any tinaqu, Fa 9 // CoSo, 9 // CoSo of any 
tamaqu, WiBr, Wi ¢ // Co, any wekaqu of Wi, SiHu, SiHuBr, SiHu 3 // Co, any taciqu of SiHu 
8. dauvaqu female MoBrDa, FaSiDa, Mo 4 // CoDa, g // CoDa of any tinaqu, Fag //CoDa, 9 // CoDa of any 
fomaqu, HuSi, Hu 9 // Co, any wekaqu of Hu, BrWi, BrWiSi, any faciqu of BrWi 
9. davolaqu male MoBrDa, FaSiDa, Wi, Mo 3 //CoDa, 3 //CoDa of any tinaqu, Fa 9 // CoDa, 
or watiqu 9 //CoDa of any tamaqu, WiSi, Wi 2 // Co, any taciqu of Wi, BrWi, BrWiSi, 
BrWi 9 // Co, any faciqu of BrWi 
female MoBrSo, FaSiSo, Hu, Mo 3 // CoSo, $ // CoSo of any finaqu, Fa 9 // CoSo, 
2 // CoSo of any tamaqu, SiHu, HuBr, SiHuBr, SiHu g // Co, Hu 3 // Co, 
any taciqu of Hu or SiHu 
10. tukaqu male or female FaFa, MoFa, Fa of any tamaqu or tinaqu, Fa of any relative of parents’ generation 
11. tubuqu male or female MoMo, FaMo, Mo of any famaqu or tinaqu, Mo of any relative of parents’ generation 


12, makabuqu 


male or female 


SoCh, DaCh, BrChCh, SiChCh, Ch of any luvequ, Ch of any relative of ego's child’s generation 


a. Abbreviations used: Fo, father; Mo, mother; Br, brother; Si, sister; So, son; Da, daughter; Hu, husband; Wi, wife; Sp, spouse; Co, cousin; // Co, parallel cousin: 
XCo, cross cousin; and Ch, child. 
b. Correction of error in source. 


Source: Adapted from Sahlins 1962. 


| 
f 
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in Table 1 are extensional, listing after each term 
a number, but not all, of the conceivable kin rela- 
tionships or kintypes that can be denoted by it. 
Any kin relationship, no matter how remote, can 
be denoted by one of the terms given. Because 
kinship terms are infinitely extendable to the kins- 
men of kinsmen, it is not possible to exhaust the 
universe of kintypes that may be denoted by any 
one of them, Analysis must work with a sample of 
kintypes such as that provided in the data. 

It seems evident from direct inspection of the 
data that 6 of the 12 kinship terms are used recip- 
rocally, in the sense that if A is taciqu to B then B 
is taciqu to A. The remaining 6 terms fall into 2 
reciprocating sets of 3 terms each: 1 and 2 recip- 
rocate 3; and 10 and 11 reciprocate 12. This ob- 
servation enables us to expand the data of analysis; 
e.g., we can assume that SiDaHu (man speaking) 
is a denotatum of 3 (Iuvequ) because the reciprocal 
WiMoBr is given as a denotatum of 1 (tamaqu). 

Direct inspection of the data also allows us to 
conclude that the sets of denotata of the several 
terms are differentiated, among other things, ac- 
cording to 

1. Degree of generation distance between ego 
and alter 

1.1 when ego and alter are in the same gen- 
eration (including categorical terms 4, 5, 7, 8, 9), 

1.2 when ego and alter are one generation 
distant (including categorical terms 1, 2, 3, 6), 

1.3 when ego and alter are two generations 
distant (including categorical terms 10, 11, 12). 

It is also evident that the difference between the 
denotata of term 4 and the denotata of term 5 and 
the difference between the denotata of terms 7 
and 8 together and the denotata of term 9 are a 
matter of 

2. Similarity of sex of alter and ego, with 

2.1 ego and alter of same sex (4, 7, 8), 
2.2 ego and alter of different sex (5, 9). 

Furthermore, we readily observe that the differ- 
ence between terms 1 and 2 and the difference 
between terms 7 and 8 are accounted for by 

3. Sex of alter. Thus: 

3.1 alter male (1, 7), 
3.2 alter female (2, 8). 
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Finally, cursory inspection reveals that the dif- 
ference between terms 1 and 2 and their reciprocal 
3 and the difference between terms 10 and 11 and 
their reciprocal 12 are a matter of 

4. Seniority of the alter’s generation. Thus: 

4.1 alter’s generation senior (1, 2, 10, 11 NS 
4.2 alter’s generation junior (3, 12). 

Leaving aside distinctions involving variables 2, 
3, and 4 above, we show the distribution of terms 
according to variable 1 (degree of generation dif- 
ference) in Table 2. What remains to be accounted 
for is the distinction between the reciprocal rela- 
tionships involving terms 1, 2, and 3 and that 
involving term 6 and also the distinction between 
the reciprocal relationships involving terms 4 and 5 
and those involving terms 7, 8, and 9. 


Table 2 — Moala kinship terms by distance in genera- 
tions 


GENERATIONS DISTANT KINSHIP TERMS 


Two 10, 11, 12 
One 1,2,3 6 
Zero 4,5 7,8,9 


If Moala society were divided into two inter- 
marrying patrilineal or matrilineal moieties, there 
would be no problem. All of these relationships 
would be readily distinguishable according to 
whether ego and alter were in the same or differ- 
ent moieties. But there are no moieties in Moala, 
and depending on how ego and alter choose to trace 
their relationship they may find themselves in a 
category covered by terms 1-5 or in a category 
covered by terms 6-9. Casual inspection of the 
data provided will not give us a clue as to what 
the discriminating factors are. But if we diagram 
every one of the denotata given—or that can be 
inferred by virtue of reciprocation—for terms 1, 2, 
3, and 6, as briefly illustrated in tables 3, 4, and 5, 
patterns inherent in the data are much easier to 
discern. 

In tables 3 and 4, every consanguineally related 
pair in the same generation in every chain of gene- 
alogical connection between ego and alter are (a) 
of the same sex, (b) of different sex an even num- 


Table 3 — Structure of relationships denoted by 1 (tamaqu) and 3 (luvequ)* 


uy INN 
X3 X3 
a Te ie 
1M ‘i 1M i 1M i 
M3 X3 X3 


* M, male; F, female; X, person of either sex. 


1M=F-F 


1 M—M = F—F 1M—F 
x3 X3 F=M3 
ian tl iL 
| 
1M F IM=F F uM 
X3 x3 F=M3 
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Table 4 — Structure of relationships denoted by 2 (tinaqu) and 3 (luvequ)* 


2F 2F—F 2F=M—M 2F—M 2F—M 
ka x3 x3 A=rs bons 
FF M—M FF FF 
tt A A Me Ma rr 
2l t 2l t atm 2F=A A 2F=A A 
ka ko ks ka ka 


* M, male; F, female; X, person of either sex. 


ber of times, or (c) of different sex an odd number sulting from this analysis is shown in Table 6. The 
of times when there is a marital tie in the most variables by which the sets of denotata were dis- 
junior generation in the relationship. In Table 5 criminated have a definite structural relationship 
we find the complementary pattern exemplified. to one another. Variable 1 partitions the entire 
Every consanguineally related pair in the same universe of kinship, being applicable to all of the 
generation in the chain of genealogical connection terms, and is the only variable that is so applicable, 
between ego and alter are (a) of different sex an The other variables subdivide the universe further. 
odd number of times or (b) of different sex an Thus the variables relate to one another as in a 
even number of times (or not at all) when there taxonomic hierarchy, but the terminology lacks 
is a marital tie in the most junior generation in the cover terms for such implicit entities as “kinsman 
relationship. of my own generation.” 

This pattern also appears in the denotata of Taxonomies and paradigms. The arrangement 
terms 4, 5, 7, 8, and 9, if we diagram them in a of variables in Table 6 reflects the hierarchical or 
similar way. It is even more clearly evident that implicit taxonomic structure, Since variable 1 (gen- 
sex differences of consanguineal pairs in the same eration distance) ranks highest in the structure, it 
generation in the genealogical chain do not affect is in the extreme left-hand column, Variable 5 (odd 
the most junior generation in the relationship, or even number of sex differences and/or marital 
whereas the number of intervening marital ties ties in the genealogical chain) is the next highest. 


count only if they occur in the most junior genera- Variables 2 and 4 are in complementary distribu- 

tion. Therefore, the discriminant variable may be tion in the-matrix and occupy equivalent positions 

described as in the taxonomic structure. Variable 3 ranks lowest 
5. Number of sex differences among the con- of all. 

sanguineally related pairs in the same generation If all the variables applied equally to the dis- 

in the genealogical chain between ego and alter, crimination of all terms, the arrangement of vari- 

excluding the most junior generation in the rela- ables in columns in a matrix would be entirely 


tionship, and/or the number of marital ties in the optional. The matrix would have the structure of 
genealogical chain that are in the most junior a paradigm instead of a taxonomy. 


generation. This is represented as The kinship systems that have been analyzed so 
5.1, when the number of sex differences and/ far produce matrices that are partially taxonomic 
or marital ties is an even one (1, 2, 3, 4, 5), and partially paradigmatic in structure. When sev- 
5.2, when the number of sex differences and/ eral alternative models (involving a choice among 
or marital ties is an odd one (6, 7, 8, 9). several discriminant variables) can be constructed 


The matrix of variables and kinship terms re- for a terminological system, they are usually alike 


Table 5 — Structure of relationships denoted by 6 (vugoqu)* 


SET SEN 6M OF 6M=F—M STENA 
Xó Xó X=X6 X=X6 ke X=X6 
[es rt ay it 
6M 6M | OF i: 6F ji ii 6F=M—F 
Xó xó Xó Xó X=X6 Xó 


* M, male; F, female; X, person of either sex. 


in this respect. Most of them differ at the middle 
and lower levels of the taxonomic structure rather 
than at the higher levels. This has obvious signifi- 
cance for students of cognition, quite apart from 
the question of whether any particular model is the 
one from which the participants derive their “feel” 
of how things work, 


Table 6 

DISCRIMINANT VARIABLES KINSHIP 
1 5 2 4 3 TERMS 
w iaoa $ 5 4. taciqu 
11 5.1 2.2 . . 5. wekaqu or ganequ 
11 5.2 2.1 . 3.1 7. tavalequ 
1.1 5.2 2.1 . 3.2 8. dauvaqu 
TI 5.2 2.2 9. davolaqu or watiqu 
1.2 5,1 . 4.1 3.1 1. tamaqu 
1.2 5.1 . 4.1 3.2 2. tinaqu 
1.2 5.1 . 4.2 ` 3. luvequ 
1.2 5.2 D : . 6. vugoqu 
1.3 ` . PEE) 10. tukaqu 
1.3 . . 41 3.2 11. tubuqu 
1.3 c š 4.2 x 12. makabuqu 


Some terminological systems in the domain of 
kinship contain fifty or more terms and a dozen or 
more discriminant variables. Such systems produce 
large, complicated componential matrices. The pro- 
cedural rules for grouping terms in rows and vari- 
ables in columns in such matrices are an important 
feature of the method of analysis. 

Componential analysis and anthropological the- 
ory. The Moala kinship terminology illustrates 
how componential analysis clarifies traditional 
problems in social anthropology. Anthropologists 
have regularly sought to explain kinship terminol- 
ogies like the Moalan one—and there are a number 
of generally similar ones—in terms of a dual divi- 
sion of society into exogamous moieties or in terms 
of preferred or mandatory cross-cousin marriage. 
Moieties are entirely absent from Moalan society. 
Nor can cross-cousin marriage explain the termi- 
nology, for although one must marry a davolaqu, 
it is explicitly reported that by tracing different 
routes people can convert a wekaqu into a davolaqu. 
And they manage things so that they have many 
more davolaqu than wekaqu. Furthermore, mar- 
riage with a close davolaqu, a real cross-cousin, is 
prohibited. Neither moieties nor cross-cousin mar- 
riage can be said, therefore, to provide the criteria 
by which people sort one another into different 
kinds of kin. What maintains the terminological 
system conceptually is the simple odd—even pattern 
in variable 5, For it is not necessary to count the 
actual number of marriages or sex differences in 
the genealogical chain in distant relationships. If 
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A and B have a relative in common, they can im- 
mediately relate to one another by the rule that an 
even of my even or an odd of my odd is my even, 
while an even of my odd or an odd of my even is 
my odd. Aside from this, ego's and alter’s sex and 
generation are the only things to take note of. This 
is but one example of how componential analysis 
calls into question some of the explanations of kin- 
ship terminology that have had wide currency 
among anthropologists (for another example, see 
Lounsbury 1964b), 

Of perhaps greater theoretical interest is the at- 
tention componential analysis directs to the rela- 
tionship between the ethnographer and what it is 
he seeks to describe. As advances are made in the 
rigor with which ethnographic data are analyzed 
and the coherency with which they are presented— 
a goal toward which componential analysis will 
have been, in retrospect, but one early step—cul- 
tural theory will of necessity be considerably 


transformed. 
Warp H. GooDENOUGH 


(Directly related are the entries COGNITIVE THEORY; 
Kinsuip; LINGUISTICS.] 
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COMPUTATION 


In recent years there has occurred enormous 
technological development and widespread applica- 
tion of automatic information-handling techniques. 
These are relevant to three broad classes of tasks 
with which the scientist frequently finds himself 
faced: (a) data processing, the sorting and sum- 
marization of large files of information, usually 
representing the results of empirical observation; 
(b) mathematical computation, the derivation of 
numerical values associated with mathematically 
defined constructs (for example, matrix and dif- 
ferential equation systems); and (c) simulation, 
the generation of data supposed to represent the 


“behavior” of some temporal process (for example, 
a physical or economic system), Of course, a given 
task may have more than one of these components; 
the widely used technique of regression analysis, 
for example, may be thought of as consisting of a 
data-processing phase, for the selection and reduc- 
tion of input data, followed by a computational 
phase, for the calculation of regression coefficients 
and related quantities. 

Historically, various special-purpose mechanical 
aids have been used to facilitate operations in one 
or another of the areas cited. Most recently, the 
general-purpose automatic digital computer has 
been developed as an effective and flexible tool for 
handling all three types of application. Accord- 
ingly, after a brief survey of older methods, the 
main body of the present discussion will be con- 
cerned with this device. 


Older methods 


Methods for data processing are strongly condi- 
tioned by the characteristics of the medium in 
which the data are recorded, A file in which each 
data entry is recorded on a physically separate car- 
rier, such as a card, has obvious manipulatory ad- 
vantages. Thus traditional techniques for data 
processing have centered around the concept of a 
unit record, Various schemes for aiding the manual 
manipulation of files of cards have been devised; 
a useful one depends on notching the edges of 
cards in coded patterns to permit mechanization 
of selection and sorting. The most significant line 
of development here, however, has been that of the 
punched card system, in which special-purpose 
machines are used to perform sorting, collating, 
reproducing, and tabulating operations on decks 
of cards carrying information entirely in the form 
of coded patterns of punched holes. Such systems, 
the basic principles of which were developed by 
Herman Hollerith in the closing years of the last 
century, are in widespread use today. Their effi- 
cient use depends on the fact that file manipulation 
procedures can often be specified as a sequence 
of elementary operations repetitively performed. 
Processing of any complexity, however, may re- 
quire many runs of the data cards through the 
same or different machines, and this is a factor 
that limits the magnitude of the tasks that can be 
attempted by the use of these techniques. 

Mathematical computation can be mechanized 
in two basic ways, digital and analog. In digital 
computation numbers are represented and manip- 
ulated as symbolic entities; the abacus, in which 
the configuration of beads on a rod is taken as a 
coded representation of a decimal digit, is an ele- 


mentary example of a computational aid based on 
the digital principle. In analog computation, by 
contrast, numbers are represented by the measure 
of physical aspects of some system; thus the slide 
rule, on which numerical magnitudes are marked 
off as lengths on a series of scales, is a basic ex- 
ample of a device for computing by the analog 
method, 

The desk calculator is a special-purpose digital 
device that serves to mechanize the most onerous 
part of pencil-and-paper computing; since it per- 
forms only the fundamental arithmetic operations 
on numbers manually entered into its keyboard, 
however, elaborate computations still require a 
great deal of routine human labor. This sort of 
process can be mechanized one step further by 
incorporating elementary arithmetic capability into 
punched card equipment, but the inability of such 
machines to do more than a repetitive sequence of 
quite simple operations on a given run is still a 
limitation. 

Manual analog methods include graphic tech- 
niques, such as the plotting of lines representing 
equations in order to determine their solution as 
an intersection point, and the use of nomograms, 
which may depend on predrawn forms suited to 
particular problems. In addition, a number of 
mechanical devices more advanced than the slide 
rule have been developed, such as the mechanical 
integrator. The evolutionary end product of this 
development is the modern analog computer, in 
which numerical magnitudes are represented as 
electrical voltages and circuit units interconnected 
by means of plug wires correspond to mathemati- 
cal systems with various parameters. Analog tech- 
niques are subject to fundamental limitations 
regarding accuracy, since they depend on the pre- 
cision with which physical quantities can be meas- 
ured, and are restricted as to the types of problems 
that can be handled, since they must be capable 
of being formulated in physical terms. 

Analog techniques also can be applied success- 
fully in the area of simulation; in fact, here the 
distinction between mathematical computation and 
simulation becomes blurred. For, if an analog com- 
puter is set up to “solve” the differential equation 
that governs the behavior of a physical system 
through time, it can be thought of in a very real 
sense as a simulator of that system. This has led 
to the exploitation of such techniques in experi- 
mental work, where there is to be interaction, say, 
between an actual physical system and a simulated 
one, 

Simulation by analog techniques is often useful 
but is limited; by contrast, any system with a 
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mathematical description can be simulated by digi- 
tal techniques. In fact, however, before the advent 
of the general-purpose computer, digital simulation 
of systems of any complexity was so inefficient as 
to be of little practical utility. 

Further reading on the material of this section 
may be found in Brooks and Iverson (1963, pp. 
52-144), Calingaert (1965, pp. 87-126), and 
Borko (1962, pp. 22-57). 


General-purpose digital computer 

The modern automatic digital computer, at the 
center of a system incorporating appropriate input, 
output, and auxiliary information storage devices, 
can be applied to data processing, mathematical 
computation, and simulation tasks in an integrated 
and flexible manner. This versatility is partly due 
to the physical speed and capacity made possible 
by contemporary technology, and partly due to a 
feature of the computer that was largely missing 
in earlier mechanisms—the internally stored pro- 
gram. Once a program or schedule of operations 
has been prepared, it can be introduced into the 
computer and used to control a long complicated 
sequence of steps, which are thus executed auto- 
matically at an extremely rapid rate without hu- 
man intervention. The possibility of preparing pro- 
grams of a general nature, which can then be 
tailored to particular cases by the specification of 
parameter values, and the fact that programs, once 
prepared, may be run by different users on differ- 
ent computers make the digital computer a tool of 
such power as to permit consideration of projects 
that would not formerly have been contemplated 
at all; this in turn often leads to the development of 
more advanced models and hence acts to influence 
the substance of the discipline to which the com- 
puter is applied. Thus, for example, the availability 
of programs incorporating provision for prelimi- 
nary data editing, extensive computation of resid- 
uals, and graphic display of results has markedly 
influenced the manner in which statistical prob- 
lems may be attacked. 

This potential, however, is limited by the capac- 
ity of people to plan and implement effective pro- 
grams; furthermore, the nature of computer opera- 
tion is such as to make it most desirable for the 
user to have some comprehension of its inner 
workings rather than to depend completely on a 
professional programmer to interpret his needs. 
The following sections therefore contain a com- 
pact introduction to the subject, which, it is hoped, 
will at least enable the reader to become aware of 
some of the more obvious pitfalls that the com- 
puter user must anticipate. 
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Computer structure. In order to avoid too nar- 
row a conception of function, it is appropriate to 
think of the general task for which an automatic 
digital computer is designed to be “symbol process- 
ing,” rather than “computation” in the strictest 
(numerical) sense. The organization and operation 
of a computer may be basically described in terms 
of five functional units, as depicted schematically 
in the block diagram of Figure 1. 


Storage 


Figure 1 — Computer block diagram 


The storage unit (usually picturesquely termed 
the “memory”) acts as a repository for symbolic 
information and also as an intermediary for all 
interaction among other units. Internal processing 
of information is implemented by the transmission 
of symbolic operands from storage to the opera- 
tions unit, where they are subjected to arithmetic 
or other transformations, the results of which are 
then transmitted back to storage. In order to set 
this process up initially an input unit is provided 
to introduce information from the external world 
into the storage unit, and in order that final results 
may be provided for the user there must also be 
an output unit for the opposite function. 

Thus far, the functions described are roughly 
analogous to those performed by a man doing a 
hand calculation: he maintains a worksheet (stor- 
age) from which he derives numbers to be entered 
into the keyboard of his desk calculator (opera- 
tions) and on which he copies the results; before 
the calculation proper starts some initial values 
may have to be entered (input), and when it is 
finished a separate sheet containing only the rele- 
vant results is prepared (output). The word “auto- 
matic” applied to a computer, however, implies 
mechanization not only of these physical processes 
but of the directing function supplied by the man, 
who causes a sequence of actions to be performed 
on the basis (presumably) of a predetermined 
schedule. In the computer the actuating function 
is performed by the control unit, and what dis- 


tinguishes the computer from earlier devices is 
that the sequence of actions called for by this 
control can be in accordance with a pattern of 
arbitrary complexity, as specified by a symbolic 
program recorded in storage. Before processing can 
begin, therefore, a program must be introduced 
(via the input unit) into storage; the fact that the 
computer can be switched from one task to an- 
other by merely introducing a different program 
is what gives it its “general purpose” designation, 
Note also that the program, because it consists of 
symbolic information recorded in storage, can be 
altered by internal operations; this gives rise to 
powerful and flexible procedural techniques in 
which programs systematically modify themselves 
in order to achieve maximum processing efficiency. 

Technological considerations dictate the use of 
bistable, or two-state, devices as the basic com- 
ponents of digital systems; this leads to the con- 
ception of all symbolic information ultimately 
coded in terms of just two symbols, the bits 0, 1, 
Particular patterns of several bits may then be 
interpreted as coded characters (letters, digits, 
etc.), which play the role of the elementary units 
of symbol processing. This notion is familiar from 
consideration of punched card or teletype tape 
codes, in which specified patterns of no-punch, 
punch represent characters drawn from a given 
set. In a computer, as in these contexts, the basic 
character set may include in addition to numeric 
digits the letters of the alphabet and selected spe- 
cial symbols such as mathematical signs and 
punctuation marks. 

The unit of transmission of information within 
the computer may be the character, but on the 
more powerful computers efficiency dictates that 
characters be transmitted and operated upon in 
larger aggregates called for convenience words. 
Data-processing applications generally involve 
much manipulation of alphanumeric words, con- 
sidered simply as arbitrary strings of characters; 
arithmetic operations, however, are meaningful 
only when applied to numeric words, which repre- 
sent numbers in a more or less conventional man- 
ner. Finally, words formed according to a particular 
format are interpretable by the control unit as 
instruction words, and it is in terms of these that 
programs are specified for automatic execution. 

Such special interpretations of computer words 
are applied only by the operations and control 
units; as far as storage goes, there is no distinction. 
Storage is organized into a series of subunits called 
locations, each of which is designated uniquely by 
an integer label, its address. The capacity of each 
addressed location is generally the same, being a 


character or a word of some standard length. 
Computer instructions are then typically designed 
to specify an operation to be performed and one 
or more storage addresses giving the locations at 
which operands are to be found or results stored. 
Computers with storage capacities of the order 
of hundreds of thousands of characters and which 
carry out internal operations at the rate of hun- 
dreds of thousands a second have become available 
in the last few years, and in the more advanced 
systems both capacity and rate may effectively be 
in the millions. Although basic input/output op- 
erations (card or paper tape reading/punching, or 
printing) are generally much slower, delays can 
be reduced to a considerable extent by designing 
these units so that their operation can proceed 
concurrently with the main internal sequencing. 
Finally, the capabilities of the computer can be 
considerably enhanced by imbedding it in a system 
containing additional bulk storage (perhaps mil- 
lions of characters stored in magnetic form on 
drums or disks) and intermediate input/output 
devices (magnetic tape units, for example), all 
communicating via the internal storage unit. 
Further discussion may be found in Gregory and 
Van Horn ([1960] 1963, pp. 58-125), Green 
(1963, pp. 12-29), and Borko (1962, pp. 60-91). 
Computer procedures. The popular conception 
of computer function seems to be that reflected 
in statements such as “Now they have a computer 
that can check everybody's tax return” or “Now 
they've come up with a computer that can simu- 
late world politics.” Although the area of intended 
application in the broad sense may influence de- 
sign, as the previous discussion suggests, com- 
puters are not built to do specific tasks but are 
programmed to do them. This is important to the 
user because it shows the need for focusing atten- 
tion not on computer availability but on program 
availability. And although the uninitiated but rea- 
sonably knowledgeable scientist may be aware of 
this fact, he often fails to anticipate the extent to 
which unqualified descriptions such as “regression 
analysis” fail to characterize a task definitively; 
details of matters such as input/output formats 
and computational methods are what determine 
whether or not a program is appropriate to the 
needs of a given user, and here information may 
be poorly communicated or not provided at all. 
Even worse, it may be erroneously communicated, 
partly due to lack of insight on the part of the 
program designer and partly because the tools for 
precise communication in the area of symbolic 
processing are as yet only imperfectly developed. 
For these reasons it is desirable that the user 
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have at least a rudimentary notion of the nature 
of computer instructions and programming tech- 
niques. This section and the one following outline 
briefly some of the concepts involved. 

The first problem to be faced when planning to 
use a computer is that of precise specification of 
the task to be carried out, in a form sufficiently 
perspicuous so that the procedure can be com- 
municated in human terms. Some form of flow 
chart can generally fulfill this function on several 
levels, from over-all to detailed. The basic idea 
here will be illustrated by an example. 

Imagine that it is required to design an auto- 
matic procedure for computing the sample mean 
and standard deviation, 

K 
Ke —k)*, 


for several sets of x), not all of which necessarily 
contain the same number n of items (in all cases 
it must be that n > 2, however). If these are to be 
processed in a single input file (perhaps a deck 
of punched cards) some means must be provided 
for distinguishing the data sets from each other, 
and one practical way to do this is to suppose each 
file entry (card) contains two values x, m, the 
first of which is normally data (an x; value) and 
the second of which is a “tag” to distinguish legiti- 
mate data items (m=0) from “dummy” items 
used to signal the end of a data set (m= 1) or 
the end of the entire file (m= 2). 

A complete flow chart for this task is given in 
Figure 2; it covers the repetitive reading of data 
sets and the computation and recording (for ex- 
ample, printing) of n, č, and s for each, The flow 
chart is a formal specification of a procedure that 
is described as follows: 


Initially, set the sum r and the index i to 0, in an- 
ticipation of a data run. Then read a pair of values 
x, m and test (in two steps) whether m is 0 or 1. If 
m = 0 add x to the sum r, and also increment the value 
i and enter x as x; in a data list; then return to read 
the next entry. If m = 1 the data set is complete, and 
the current value of i is then n for the set and 7 is 
determined as r/n. Now compute a second sum p by 
adding values (x; — 7)? for i=1,2,-++,m; following 
this compute s as the square root of q = p/(n—1) and 
record the values n, %, s. Repeat this entire process 
until the test gives m #0 and m #1 (so presumably 
m = 2); this indicates that there are no more input 
files and the procedure is finished. 


The stylized formalism of the flow chart em- 
ploys mathematical notation of the conventional 
sort but also makes use of special symbols signify- 
ing procedural action. These are “e”, meaning 
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“set” (to a value); “:”, meaning “compare” (for 
branching); and “___”, meaning “execute” (a 
standard subprocedure). Thus “r-r + x” means 
“set r to its former value plus x”; “i:n” means 
“compare i and n” (which causes the procedure to 
repeat as indicated if i < n and go ahead if i > n); 
and “SQRT(s;q)” means “execute a subprocedure 
SQRT(s;q)” (which is designed to have the effect 
se vq). 

The flow chart shows the structure of the com- 
putation in terms of procedural loops and re-entry 


Start 


Finish 


ici+l1 


p<—pt (x; =z)? 


q<p/(n—1) 
SQRT (s:q) 
RECORD (;n,x,s) 


Figure 2 — Flow chart for mean and standard deviation 
calculations 


points, without concern for details of printing 
formats, etc., which of course must be supplied if 
the procedure is to be actually carried out by 
computer, It is important to realize that these flow 
chart conventions are independent of the charac- 
teristics (and the idiosyncrasies) of any particular 
computer but that they do reflect general aspects 
of computer operation; thus both the notion that 
indices such as i can be manipulated independ- 
ently in dealing with sets of values x; and that 
parametrized subroutines such as SQRT( ; ) can 
be called upon and executed by a main program 
correspond to important features of actual com- 
puter operation. 

The flow chart symbolism given here, for which 
detailed rules can be stated, is suitable primarily 
for expressing procedures of mathematical compu- 
tation. The same basic idea, however, can be used 
to specify data-processing procedures; the main 
additional concept needed is a more versatile way 
of representing external files of given format and 
organization. Also, simulation and other internal 
processing applications that may involve non- 
numeric elements can be handled by an extension 
of the flow charting technique. 

Although no standardization of flow chart con- 
vention exists, and the choice depends to a certain 
extent on the application at hand, it nevertheless 
seems reasonable to ask that a flow chart represent 
an actual computer program accurately enough 
so that the filling in of details is routine and is 
unlikely to lead to serious discrepancies between 
what the user thought was to be done and what 
the programmer in fact specified was to be done. 

For additional discussion of flow charting see 
Calingaert (1965, pp. 19-23). 

Computer programs. To get some insight into 
the way in which a flow chart specification trans- 
lates into a computer program, assume a computer 
with 1,000 storage locations 000, 001, ---, 999, 
each of which holds a 12-character word inter- 
pretable as an alphanumeric datum, a number, or 
an instruction in the format “waßy”, where w is a 
3-character alphabetic operation code and each of 
a, B, y is a 3-digit storage address, 

Then the instruction 


ADD 500 501 500 


might then have the significance “Add the contents 
of storage locations 500 and 501, and record the 
result in storage location 500.” This would be 
appropriate only in case the 12-character words 
initially contained in storage locations 500 and 501 
were in numeric format (say, 11 decimal digits 
preceded by algebraic sign), so that the addition 


operation has meaning. The execution of this in- 
struction by the control unit results in the destruc- 
tion of the original number in storage location 500, 
since it is replaced by the sum; the number in 
storage location 501, however, is not altered by 
the operation. 

In order to be executed by the control unit, the 
ADD instruction itself must exist some place in 
storage, say at location 099; thus one must distin- 
guish between the locations (here 500, 501) to 
which the instruction explicitly refers and the loca- 
tion (here 099) at which the instruction resides. 
Concern with the location of the instruction itself 
is necessary because of the sequential nature of 
computer operation; somehow there must be speci- 
fied the order in which instructions are obtained 
and executed by the control unit, and the most 
straightforward way to do this is to design the 
computer so that instructions are executed in 
sequential order by resident address unless there is 
an explicit indication (in an instruction) to the 
contrary. To see the manner in which such an 
indication is typically set forth, consider a second 
hypothetical instruction 


COM 500 550 099 


which is to be given the interpretation “Compare 
the numbers in storage locations 500 and 550, and 
if the former is less than the latter, take the next 
instruction from storage location 099.” This effects 
a “conditional jump” in the instruction sequence; 
if the stated condition is met, the next instruction 
executed is that residing in location 099, but other- 
wise it is the instruction whose address follows in 
sequence. If the COM instruction itself resides in 
location 100, then the pair of instructions intro- 
duced form a “program loop” schematized as fol- 
lows: 


(099) ADD 500 501 500 
(100) COM 500 550 099. 


The effect of this piece of program is to add the 
contents of 501 repeatedly to the contents of 500, 
which therefore acts as an “accumulator” for the 
sum; this addition process is continued (by virtue 
of the “jump back” to 099) until the accumulated 
sum exceeds the number residing in location 550, 
at which time the program proceeds in the normal 
sequence with the instruction following that num- 
bered 100. 

If one imagines that locations 500, 501, and 550 
contain values, i, 1, and n, respectively, then the 
two-word loop is a computer implementation of the 
flow chart component shown in Figure 3 (which 
happens to perform a rather meaningless function 
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Figure 3 — Loop pattern 


in itself but with appropriate instructions inserted 
between the ADD and COM becomes one of the 
loops of Figure 2), This would serve to suggest 
the manner in which flow charts may be trans- 
lated into computer programs. Two important 
techniques in this connection, indexing of a set 
of values with a computed value (so as to be able 
to reference a general element x;) and linking of 
subprocedures into a program (so as to be able to 
execute a subroutine such as SQRT and jump back 
to the main program at the proper point when 
finished) may be accomplished in a computer by 
programmed modification of instruction addresses. 

Although the example illustrates some facets of 
computer control that are typical, the details of 
coding, format, etc. vary widely in practice. For 
example, it is common to have several different 
coding systems employed in the same computer 
for various purposes, perhaps a character code used 
for general alphanumeric information, a different 
code for number representation (often based on 
the binary radix 2 instead of the decimal radix 10), 
and a special format, not interpretable in either 
alphanumeric or pure numeric terms, for instruc- 
tions. Also, the basic “three address” pattern of 
instructions given here is not typical; usually 
instructions are “one address” and require explicit 
recognition of registers for retaining intermediate 
results in the operations and other units. Thus on 
most computers the addition operation of the ex- 
ample requires three instructions: one that clears 
an accumulator register in the operations unit and 
records the contents of location 500 in it, one that 
adds the contents of 501 to this and retains the 
sum in the accumulator register, and one that 
records the contents of the accumulator in location 
500. One or more special index registers are also 
customarily provided to facilitate indexing and 
linking operations. 

Besides instructions for internal operations such 
as those described, the computer must have in- 
structions for controlling the operation of input/ 
output and other auxiliary devices with which it 
communicates; these would of course be used in 
procedures such as the READ and RECORD 
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subroutines of the earlier example. The repertoire 
of a large computer may number instructions in 
the hundreds, many of these representing minor 
variants of each other to permit processing effi- 
ciency, 

Two characteristic features of computer pro- 
grams may be perceived from the discussion: a 
procedure is formulated for execution in terms of 
minutely small steps, and the symbolic representa- 
tion of these steps is not conducive to quick appre- 
hension of their effect in the aggregate. Not only 
does the preparation of a program for a given task 
require a great amount of routine labor, but the 
bookkeeping necessitated by the numerical ad- 
dressing scheme tends to produce errors. Since 
computers are generally capable of performing 
routine tasks more reliably than humans, the 
question naturally arises of whether some of the 
burden of program preparation cannot be passed 
along to the very devices intended to execute the 
programs. This leads to the concept of program- 
preparation programs, which translate programs 
written in some language more or less “procedure- 
oriented” into direct computer code. 

A basic example is the assembly program, which 
permits the programmer to write instructions in a 
form such as 


GO: ADD,I, ONE, I 
COM, I, N, GO. 


These instructions may be thought of as assembly 
language versions of those given earlier. GO is a 
label corresponding to the instruction address 099, 
and I, ONE, and N are labels corresponding to 
the data addresses 500, 501, and 550, respectively. 
The programmer writes in terms of such mnemonic 
labels, and the assembly program translates these 
symbolic instructions (made up of letters, commas, 
spaces, etc.) into corresponding direct computer 
instructions in appropriate format; generally, the 
programmer does not specify at all the actual 
numerical addresses that will be assigned to the 
various labels by the assembly program. There are 
also other features incorporated into assembly 
programs to simplify the specification of numeric 
constants, etc. 

Assembly language makes a program more per- 
spicuous but does not compress its level of detail. 
Higher order program-preparation programs exist 
for this purpose; for example, an algebraic trans- 
lator, which permits specification of the computer 
action already discussed in a form such as 


GO: I=I+1 
IFI>N,GO. 


This describes the procedure in terms much closer 
to a flow chart specification and also in general 
achieves an over-all symbolic compression. The 
program to translate such a language into com- 
puter code is correspondingly more complicated, 
however, and the resulting computer version may 
be considerably more inefficient than a program 
prepared directly in assembly language. There is 
an added disadvantage, which can only be appre- 
ciated through experience: the programmer is a 
level removed from the actual computer process, 
and this raises problems in the area of precise 
specification of computer procedures. 

Further discussion of computer programming is 
given in Borko (1962, pp. 92-133), Green (1963, 
pp. 30-64, 75-99), and McCracken and Dorn 
(1964, pp. 1-42). 

Program systems. From the previous section it 
may be seen that the user does not in general deal 
with the computer directly but through the media- 
tion of some program system that provides for 
translation from procedure-oriented language into 
actual computer code. On the larger computers, 
where efficiency is at a premium, this function is 
expanded to include the batching of programs for 
sequential execution with a minimum of human 
intervention and the keeping of the computing log 
and accounting records. Such a system makes for 
highly efficient computer utilization but removes 
the user one level more from direct access to the 
computer; “debugging” under such circumstances, 
for example, requires a succession of runs, with 
the user progressively detecting and correcting the 
errors in his program. 

This implies that in addition to a programming 
language the user must learn (or have access to 
someone who knows) the operating techniques 
associated with the particular computer installa- 
tion with which he deals. The knowledge of what 
translators and diagnostic programs are available, 
and of details concerning the formats in which 
control information must be submitted with pro- 
grams in order that they be usable, assumes à 
perhaps disproportionate but nevertheless critical 
importance. Although details vary widely from 
installation to installation, the type of support one 
may look for is indicated by the following categori- 
zation of “system routines”: 

loader—an elementary “job initiation” program 
that controls the reading of a program to be exe- 
cuted, perhaps supplying address modification 
based on the locations the program is to occupy 
in storage, etc. 

assembler—a program that translates a program 
in assembly language (which may consist of sev- 


eral sections, each programmed using symbolic 
addresses and mnemonic operation codes) into a 
single coherent computer code (in a form that 
can be processed by a loader). 

translator—(often called a “compiler”) a pro- 
gram that translates a program in some algebraic 
or other problem-oriented language into some more 
elementary (perhaps assembly language) form. 

monitor—a program that synchronizes translat- 
ing, assembling, and loading functions and, fur- 
ther, has a certain batching, scheduling, and 
accounting capability permitting a whole set of 
jobs to be run with minimal human intervention. 

multiprocessor—a supermonitor based on an 
ability to carry out several functions concurrently 
(for example, those of a central processor and 
several input/output devices) and dynamically 
schedule these for maximal efficiency. 

The order of listing here corresponds to both 
historical evolution and increasing level of com- 
plexity; monitor and multiprocessing facilities, in 
particular, are appropriate only to the more power- 
ful computer installations. At the time of this writ- 
ing there is evolving an even more advanced pro- 
gram system concept, growing out of the idea of 
multiprocessing: an executive system that inte- 
grates one or more central processors, several 
auxiliary input/output processors, and a large 
number of remote user-operated consoles in a way 
so as to make efficient use of the centralized facili- 
ties but at the same time permit each user to 
interact directly, in an “on-line” fashion, with the 
computer complex. In order to achieve this, the 
system must be able to run programs piecemeal 
and respond, in what is to a human a short time, 
to a multiplicity of external demands. The require- 
ments of this sort of “time-sharing” operation have 
led to radical new concepts in both the computer- 
design and programming areas. 

Translators are generally identified by the 
problem-oriented language that they accept as 
input, such as Fortran, Algol (two general alge- 
braic languages), Cobol (a standard file-processing 
language), Simscript (a simulation language), etc. 
A similar function is accomplished less efficiently 
but often more straightforwardly by an interpreter 
program, which both translates and executes si- 
multaneously, thereby carrying out the whole 
computing task in one rather than several phases. 

These system programs, along with various diag- 
nostic programs for analyzing both computer oper- 
ation and program operation, may be categorized 
as service routines; they serve the user by facili- 
tating the execution of the particular procedure 
whose results he desires. This function may be 
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contrasted with that of standard routines and 
standard subroutines, generally available on a 
“library” basis, which actually carry out all or a 
part of the desired task. One may anticipate the 
availability of subroutines for at least some of the 
following: 

radix conversion—if the computer uses a num- 
ber base other than 10, which is often the case. 
(This function will usually be incorporated at the 
assembly level.) 

special arithmetic—“floating-point” (flexible 
scale) and “double-precision” (extended length) 
numeric manipulation if not directly available on 
the computer instruction level; also complex 
arithmetic, etc. 

function evaluation—calculation of square root, 
exponential and logarithm, sine and cosine, etc. 
for arbitrary arguments. (A call for standard func- 
tions may be incorporated into translator specifica- 
tions.) 

polynomial and matrix operations—addition, 
multiplication by scalar, multiplication; evaluation, 
calculation of zeros (for polynomials); transposi- 
tion, inversion, diagonalization (for matrices). 

input/output editing—formating and sequenc- 
ing card and paper tape reading and punching, 
magnetic tape, drum and disk file operations, print- 
ing and other forms of graphic display. (This func- 
tion will sometimes be incorporated at the monitor 
level.) 

By definition, a subroutine is supposed to serye 
as part of a larger program; its usefulness there- 
fore depends not only on its conception with regard 
to generality and comprehensiveness but on its 
being accompanied by an appropriate set of speci- 
fications characterizing its action and the directions 
for “connecting” it into the program that uses it. 
Standard routines, by contrast, are programs for 
tasks that can be regarded as independent in their 
own right; the appropriate form of these depends 
very much on the applications environment. Pro- 
grams for regression, factor analysis, linear pro- 
gramming calculations, and solution of differential 
equations are generally in this category, as well as 
those for standard file-processing operations such 
as sorting and input/output media conversion 
(card-to-tape, tape-to-card, etc. ). 

The importance to successful computer usage of 
a good available set of service routines, standard 
routines, and subroutines cannot be underesti- 
mated, and it is this that leads to “software” costs 
equaling or exceeding “hardware” costs for an 
installation, a fact often unanticipated by the 
administration that set up the budget. Some alle- 
viation of the difficulty is provided by the existence 
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of various “user organizations,” generally associ- 
ated with a particular class or type of computer, 
which maintain a pool of programs available to 
their members, These are only guaranteed up to 
minimum standards of documentation, however, 
and the problem of making effective use of work 
already done by others remains one of the most 
formidable and often exasperating problems facing 
the user who wishes to reap the benefits afforded 
by computers without investing the time to under- 
stand fully the device that he is entrusting with a 
major role in his research or operations. 

The topics in this section are further developed 
in Borko (1962, pp. 140-170), Green (1963, 
pp. 65-74), and Gregory and Van Horn ([1960] 
1963, pp. 439-478). 


Error analysis 


Although computer applications in general are 
subject to the problem that imperfect understand- 
ing of procedures by the user may lead to unsatis- 
factory results, computation related to familiar 
mathematical concepts is especially dangerous in 
this regard. This is because the user often starts 
out with some preconceptions based on his mathe- 
matical understanding and is thus likely to neglect 
or misjudge the effect of computational error, 
which may be considerable. It is customary to 
measure error in either absolute or relative units, 
depending on circumstances; thus if x is a com- 
puted value and it is assumed that it represents 
some “true” value x", then 6 = x — x’, or possibly 
its magnitude | 6 |, is defined as the absolute error 
in x, and 5/x” (which is approximated by 6/x) is 
defined as the relative error. Computational error 
may also be classified according to source as fol- 
lows: inherent error, reflecting inaccuracies in ini- 
tially given data; generated error, reflecting in- 
accuracies due to the necessity of rounding or 
otherwise truncating the numeric results of arith- 
metic operations; and analytic error, reflecting in- 
accuracies due to the use of a computing procedure 
that represents only an approximation to the theo- 
retical result desired. 

In the example of the mean and standard devia- 
tion calculation given earlier, one might wish to 
analyze the effect of inherent error in the input 
x values on the computed č and s values, that is, 
to analyze the manner in which errors ô; attributed 
to the x; induce errors 6; and 6, in < and s, respec- 
tively. In the case of simple functions the effect 
can be estimated using the traditional techniques 
of calculus; in a calculation of complicated struc- 
ture, however, it may be difficult even to get a 
feeling for the manner in which errors in input 


values propagate. Of course, in this type of situa- 
tion, inherent errors in the data may be just what 
the calculation itself is concerned with, and hence 
it might be more appropriate to consider the input 
values x as exact as far as the computation is 
concerned, Even then it would be important, how- 
ever, to have some idea of the “sensitivity” of the 
computed results as a function of input; one might 
want to know, for example, that the closer to zero 
% gets, the more relative effect it will show from 
given relative changes in x values. 

In any actual implementation of the procedure 
diagrammed in Figure 2 there will be generated 
error introduced at each arithmetic step, the exact 
nature of which depends on the rounding rules 
followed by the computer arithmetic unit, which 
are often obscure to one who communicates only 
via a procedure-oriented language. Furthermore, 
analytic error is necessarily introduced at the stage 
where the square root is called for, since functions 
such as this can be computed only approximately. 
The closeness of the approximation, and hence the 
level at which analytic error is introduced, depends 
on properties of the square root procedure, which 
also are frequently not very well documented for 
the user. 

When it is considered that error effects of all three 
types may be interwoven in a calculation, and may 
propagate through all the calculational steps in a 
complicated manner, it is seen that the problem of 
error assessment is nontrivial. Furthermore, it is 
not generally taken into account how catastrophic 
relative error propagation effects can be; the sub- 
traction of two numbers each with a relative error 
of order 10 which happen to have a relative dif- 
ference of order 10® will yield a result with rela- 
tive error of order 10°, and the arithmetic pro- 
cedures used on most contemporary computers are 
such that this loss of significance is not brought 
to the user's attention unless special monitoring 
techniques are programmed into the calculation. 
And while it is generally true that both generated 
and analytic error can be minimized by computa- 
tional techniques (carrying more precision and 
using more refined approximations), an assessment 
is required to know when such measures should 
be resorted to. Poor computational technique can 
magnify the effect of generated and analytic error 
unnecessarily; a typical case is use of the formula 
Dix? — n? instead of Ð (x;— ž)? for calculating 
the standard deviation, since then the result tends 
to depend on subtraction of two large and nearly 
equal numbers. Finally, inherent error, as its name 
implies, is dependent only on the level of input 
error and the functional connection between input 


and output values, neither of which are subject to 
remedial action in the computational context. 
When small relative errors in input induce large 
relative errors in output, a process is said to be 
“ll-conditioned”; if this obtains, the most one can 
hope for is a means of detecting the situation. 

The subject of error analysis, in its several 
facets, has been extensively studied; although no 
comprehensive set of directives can be given, the 
user will be well repaid for becoming familiar with 
at least some of the work already done in this 
area, as indicated in McCracken and Dorn (1964, 
pp. 43-64) and Hamming (1962, pp. 24-40). 
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[See also SIMULATION. ] 
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COMTE, AUGUSTE 


Isidore Auguste Marie François Xavier Comte 
(1798-1857), French philosopher and sociologist, 
came from a Catholic and monarchical family in 
Montpellier. He abandoned the Catholic faith at 
the age of 13. In 1814 he entered the École Poly- 
technique in Paris. This school was then the center 
of political liberalism and of progressive thought in 
France; the mathematician Gaspard Monge had 
been one of its founders, and it remained com- 
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mitted to the development of mathematics and the 
sciences. 

While it is clear that Comte soon shifted away 
from pure liberalism, close scrutiny is required to 
establish his position with respect to conservatism 
and liberalism. His early writings indicate that he 
had many ideas in common with the counterrevo- 
lutionary and conservative movements at the begin- 
ning of the nineteenth century; later, at the be- 
ginning of the fourth volume of his Cours de 
philosophie positive (1830-1842), he explicitly 
acknowledged this affinity, stating that he agreed 
in part with Joseph de Maistre’s philosophy with- 
out, however, accepting his political views. Con- 
trary to some views (Hayek 1952; Salomon 1955; 
Spaemann 1959), this does not mean that Comte 
can be identified with the counterrevolutionary 
thinkers of his time. Indeed, his attitude differed 
from that of the conservatives in the same way as 
did Saint-Simon’s; both tried to repudiate the prin- 
ciples of the French Revolution without at the same 
time relinquishing its achievements. This twofold 
attitude, similar to that which Hegel had expressed 
somewhat earlier, is corroborated by Comte’s posi- 
tion during the revolutions of 1830 and 1848 and 
by his humanitarian philosophy, which eventually 
became a religion of humanity: the worship of the 
Grand Être. 

Comte’s association with Saint-Simon, which be- 
gan as early as 1817, is in itself further evidence 
that he does not belong with the conservatives. De- 
spite Comte’s later protestations that he did not owe 
anything to Saint-Simon, it is certain that he was 
deeply influenced by this remarkably brilliant and 
inspiring aristocrat who devoted himself to build- 
ing a new society based on the principle that it is 
the working class which is, in the broadest sense, 
the class of “producers.” Recent historical research 
(especially by Gouhier 1933-1941 ), has estab- 
lished that their collaboration during the period 
from 1817 to 1824 was much closer than Comte 
ever admitted. The change in Comte’s personal feel- 
ings toward Saint-Simon can best be followed in his 
correspondence with his friend Valat (1870), 
which shows how much he was initially stimulated 
by Saint-Simon and how he later withdrew from 
the relationship. 

Saint-Simon had recorded his essential ideas, be- 
fore he ever met Comte, in the most coherent of his 
booklets, Mémoire sur la science de ľhomme and 
Travail sur la gravitation universelle, both written 
in the year 1813. These contain the famous “law of 
three stages”; even though Saint-Simon was in- 
capable of developing this idea into a compre- 
hensive and consistent philosophy of history, as 
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Comte was to do in the Cours, the idea, neverthe- 
less, was originally his. In general, Saint-Simon dis- 
sipated his ideas in innumerable letters, leaflets, 
prospectuses, and pamphlets, throwing out striking 
formulas and slogans rather than elaborating the 
ideas step by step. He was a great speaker, a vivid 
advocate of new conceptions, always an agitator 
rather than a philosopher or a disciplined scholar. 
Even though Comte’s early work followed the same 
lines as Saint-Simon’s, his mode of thinking was 
not only more consistent but also more profound. 

An important similarity between Comte and 
Saint-Simon is their common negative attitude 
toward economic liberalism and their criticism of 
Adam Smith (Mauduit 1929). Here both came very 
near to Simonde de Sismondi and his Nouveaux 
principes of 1819 and sketched the first historical 
criticism of classical economics. 

While the influence that Saint-Simon had on 
Comte should not be minimized, neither should the 
differences between them; these were differences 
not merely in style of thought but also in the sub- 
stance of that thought. One such difference was 
revealed by Comte himself in his correspondence 
with Valat when he asserted that Saint-Simon’s 
advocacy of political action before the scientific 
system of positivism had been sufficiently devel- 
oped was putting the cart before the horse. An- 
other and more subtle difference appeared when 
Comte, in an article published under Saint-Simon’s 
name and in his journal, Industrie, called for the 
development of a secular system of morals and held 
that society is itself a system of “common moral 
ideas.” This was nothing less than the discovery of 
the normative character of social behavior, that is, 
of a moral system inherent in social relations. 
Comte asserted the relativism of moral ideas—that 
they vary with different cultures and social sys- 
tems—in opposition to the notion that norms are 
rooted either in divine revelation or in a general 
spiritual order, separate from social life and un- 
touched by it. This clear-cut statement of what we 
now call cultural relativism aroused considerable 
turmoil among the subscribers of the journal, and 
in the next issue Saint-Simon formally promised 
that he would “return to his first manner” (Pereire 
1912). Saint-Simon had never had any doubts 
about the origin of moral ideas and was, indeed, 
genuinely convinced that the traditional Christian 
norms could be adapted to the new conditions of 
industrialism by emphasizing the importance of 
philanthropy (Nouveau christianisme 1825). With 
respect to his conception of values, then, Comte 
was more radical than Saint-Simon. 

This is not true of Comte’s conception of the 
proper order of society. As has been judiciously 


stated by T. H. Huxley, Comte and positivism can 
be interpreted as “Catholicism minus Christianity,” 
that is, as an all-embracing hierarchical system of 
society, based on secular values of a mystical, hu- 
manitarian kind. This, taken together with Comte’s 
conception of the role of “consensus” in social proc- 
esses, brings him nearer to the views of the con- 
servative thinkers than to Saint-Simon’s revolution- 
ary demand that the producing class (the workers) 
be protected from exploitation by the idle or leisure 
class and that la classe la plus nombreuse et la plus 
pauvre be emancipated. With this demand Saint- 
Simon became a forerunner of Marx, whereas 
Comte remained a believer in the efficacy of a 
purely spiritual order, provided only that it is 
purged of theological and metaphysical ideas. 

It has never been contested that Comte invented 
the term “sociology,” which he first used in print in 
1838, at the beginning of the fourth volume of his 
Cours. Earlier, in the general introduction to the 
first volume, he had used the term physique sociale 
just as Saint-Simon had used this term, inter- 
changeably with such terms as physiologie sociale 
or système de gravitation sociale. A relatively casual 
reason for Comte’s change of terminology was that 
the Belgian statistician Quetelet published his 
Physique sociale in 1835. There was also, however, 
a deeper reason: Having accepted for some years 
the teachings of the great French astronomer 
Laplace, with whose ideas he had become acquaint- 
ed at the Ecole Polytechnique, Comte began to 
reject the leadership of mathematics in the devel- 
opment of the social sciences. He replaced the 
methodology of the sciences with the historical 
approach, as developed in the Cours, and thus 
largely repudiated the education he had received 
at the Ecole Polytechnique. 

The development of sociology as a science occurs, 
according to Comte, within the framework of a gen- 
eral reorientation of human thought. According to 
the “law of three stages,” that was expounded by 
Comte in the first two lectures of the Cours (after 
previous attempts in his early writings), every 
single branch of human knowledge has to pass 
through three different theoretical (or methodo- 
logical) stages before it reaches maturity: the 
theological or fictitious; the metaphysical or ab- 
stract; and the scientific or positive stage. The func- 
tion of the second stage is to act as an intermediary, 
since the first and the last stages are clearly so dif- 
ferent in their general outlook that it is impossible 
to pass directly from the first to the third. In the 
third stage all phenomena are regarded as subject 
to invariable natural laws that can be investigated 
by observation and experimentation. 

Comte began the second part of this new ap- 


proach to human thought with the statement that 
the different sciences, because of the varying degree 
of complexity of their respective substances, reach 
the stage of maturity at different times, and he then 
proceeded to locate all the different sciences in 
terms of the stage of their development. The order 
followed in this encyclopedic system is the “natural” 
or logical order, beginning with the least complex 
or most general phenomena, which are also most 
remote from humanity, and ending with those more 
relevant to human beings as investigated by the 
social science. The order of the sciences is there- 
fore astronomy, physics, chemistry, physiology, 
and, finally, social physics, or sociology. The sci- 
ence of mathematics is presented as an introduc- 
tion to the encyclopedia and somewhat apart from 
this order; it is the most important of all the sci- 
ences in that it is the most general formal basis of 
both science and philosophy. The science of psy- 
chology does not appear in this enumeration. 
Comte was convinced that psychology as it existed 
in his time had not yet passed the metaphysical 
stage. There is evidence that he thought of psy- 
chology as an appendix to sociology, to be devel- 
oped only after sociology had reached the positive 
stage. 

Interesting as this “law” may initially appear, it 
is hardly more than a set of preliminary and formal- 
istic categories and by no means constitutes the 
essence of Comte’s thought. Indeed, this part of 
Comte’s work has proved to be the least resistant to 
criticism. Comte’s sociology was developed essen- 
tially as a scientific remedy for the long-lasting 
social, political, and cultural crisis in Europe. The 
46th lesson of the Cours was devoted entirely to a 
sociological analysis of the contemporary crisis in 
France, which was the aftermath of the Revolution 
of 1789. What remained of the prerevolutionary 
system was being destroyed by the incessant and 
insoluble controversy between ordre and progrès. 
It was this “miserable oscillatory constitution of 
our social existence” that Comte sought to over- 
come by developing, with the help of scientific 
analysis, a third way that would eliminate the “in- 
tellectual anarchism” of his time. Since conserva- 
tism represented the party of ordre, and Jacobinism 
the party of progrés, Comte, following his own in- 
terpretation of history, divorced himself from both 
parties and tried to establish a third way, without 
succumbing to the eclecticism represented by Victor 
Cousin. 

Comte’s persistent ambivalence toward the French 
Revolution was already at the core of his thought 
while he was dominated by the ideas of Saint- 
Simon (see, for example, his essay of 1822, “Plan 
des travaux scientifiques nécessaires pour réor- 
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ganiser la société”). What was new in the Cours 
was his attempt to resolve it. It has been alleged 
that his panacea for the crisis of his time was a 
rational, scientific approach, based on observation 
and experimentation, and that this is what he 
meant by “positivism.” But although there is evi- 
dence, especially if we limit ourselves to the con- 
sideration of the Cours, that this was an element in 
his thought, it was only one element, and perhaps 
not the most important one. Comte himself stated 
several times that empiricism per se is not equiva- 
lent to positivism (Cours, Lecture 58) and that 
positivism offers a synthesis of our knowledge, 
thereby producing a new system of thought and a 
new dimension of consciousness. This very sig- 
nificant view is made more explicit in A Discourse 
on the Positive Spirit (1844a), as well as in his 
Appeal to Conservatives (1855), where he dealt 
with the different meanings of the term “posi- 
tivism.” 

As it was presented in the Cours, however, posi- 
tivism was misunderstood by contemporary ad- 
mirers and critics alike, the admirers seeing it as 
an unequivocal endorsement of the efficacy of sci- 
ence and the critics deploring it as “scientism.” 
This misinterpretation has been given wide cur- 
rency by the books on Comte by Littré (1863), 
Mill (1865), Ostwald (1914), and others. Having 
thus misconstrued the Cours, it became necessary 
for them to separate the Comte of the Cours from 
the Comte of such later writings as the System of 
Positive Polity (1851-1854), and the Catechism 
of Positive Religion (1852), since the System is 
permeated by reformist ideas and the Catechism 
by a new religion. The most common explanation 
of the “inconsistency” in his ideas stems from the 
vicissitudes of his personal life; since Comte had 
experienced one nervous breakdown in 1826 at the 
age of 28, the “change” in his outlook was attrib- 
uted to the impact of another personal crisis, the 
dissolution of his relationship with Clotilde de 
Vaux in 1846 and her death in 1847. The alleged 
drastic shift from his positivistic and objective 
approach to a new “subjectivism” has been con- 
nected with this unhappy emotional experience. 

The explanation would be plausible had the shift 
actually occurred. However, it is based on a com- 
pletely distorted view of the Cours and of its gen- 
eral meaning, as can easily be demonstrated by 
noting Comte’s repeated emphasis in that work on 
the necessity of an all-embracing moral and intel- 
lectual reform as the prerequisite of social reform. 
The last lectures of the Cours (56-60) are entirely 
devoted to a general appraisal of the achievements 
of science after the stage of positivism has been 
fully realized. Indeed, reformist intentions rather 
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than rational analysis are so basic to his entire phi- 
losophy that he came very near to Marx’s state- 
ment that “the philosophers have only interpreted 
the world in different ways, the point is to change 
it” (Theses on Feuerbach 1845). In fact, “praxis” 
was as important a goal to Comte as it was to Marx, 
the only difference between them being that Comte 
tried to combine this approach with the Cartesian 
heritage, whereas Marx, in his distrust of reason, 
destroyed the very foundation of philosophical 
rationalism and replaced it by revolutionary action 
directe, 

As the British Neo-Hegelian philosopher Edward 
Caird (1885) rather early appreciated, Comte’s hu- 
manism and the real meaning of his “subjective 
synthesis” is based on the fact that “nature becomes 
conscious of itself in man,” so that, as Comte put 
it, “man sums up in himself all the laws of the 
world.” Therefore, “to one who has understood the 
full meaning of the process, this ‘subjective syn- 
thesis’ will also be objective” (see also Marvin 
1936). The different terms used to define the “meta- 
physical stage of history” produce similar evidence. 
This stage was also called by Comte the “negative 
stage,” and its historical importance lies in the 
prevalence of criticism and destruction of the old 
conceptions of the world and of social life as well. 
Thus, the intellectual development of mankind had 
necessarily to pass through ages of anarchism and 
revolutionary restlessness, a true interregnum as 
Comte called it. The interregnum is characterized 
by “negative philosophy,” which is the necessary 
forerunner and prerequisite for the establishment 
of “positive philosophy,” or positivism, as Comte 
understood it, because a new order cannot be at- 
tained before the remains of the old system have 
been completely erased. 

For an adequate understanding of Comte, one 
has to consider the whole of his intellectual career, 
integrating his early writings with the Cours, 
with the subsequent attempts at self-interpretation 
(1844a; 1848), with the System, and finally with 
his sketch of a humanitarian religion. This was his 
particular response to a general humanitarian trend 
in the philosophy, the art, and the literature of his 
time, which began with Honoré de Balzac and pro- 
visionally culminated in a system of humanitarian 
metaphysics as developed by Pierre Leroux, whose 
influence during the early 1840s in France was 
much more important than has been recognized so 
far (König 1931). Comte was neither a “mad” phi- 
losopher—Dumas (1905), Seillière (1924), Maur- 
ras (1905), Sokoloff (1961), all notwithstanding— 
nor a partisan of “pedantocracy.” His main and 
vital interest was rather the systematization of the 


social background of human history into one body 
of knowledge, in preparation for a practical ap- 
proach to social reform based on a lasting order, 
the theoretical and moral principles of which he 
saw in the development of a new science, sociology, 
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[For the historical context of Comte’s work, see Post 
TIVISM; SOCIOLOGY, article on THE DEVELOPMENT OF 
SOCIOLOGICAL THOUGHT; and the biographies of La- 
PLACE and SAINT-SIMON. For discussion of the subse- 
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CONCEPT FORMATION 


Our world is filled with sets of objects, events, 
and ideas that share some common quality while 
differing in other characteristics. An organism that 
learns to respond to the common quality of a given 
set has learned a concept. The child, for example, 


learns the concept of blueness when he is able to 
select a blue balloon or a blue ball or a blue wagon 
from similar objects of different colors, When he 
makes this conceptual response, he is abstracting 
the blueness from other properties of the objects, 
like size, shape, or function. Similarly the child 
learns the concept of triangle when he distinguishes 
triangles of various shapes (right-angle, isosceles, 
scalene, equilateral, and obtuse) from other geo- 
metrical forms. Or he learns the concepts of right 
and wrong when he discriminates between socially 
approved and disapproved behavior. 

In experimental psychology the field of concept 
formation has achieved neither the methodological 
nor the conceptual unity characterizing such areas 
as sensation, perception, learning, and motivation. 
Traditionally, psychologists have not looked upon 
conceptual behavior as reflecting any unique or 
fundamental psychological process. Instead the 
common view has largely been that conceptual be- 
havior is a function of the interaction between 
basic psychological processes. As a result, psychol- 
ogists have for the most part sought to extend 
theories from other areas (e.g., learning, percep- 
tion ) to conceptual behavior instead of formulating 
hypotheses concerned exclusively with the facts of 
concept formation. 

Historical introduction. Experimental efforts to 
understand concepts were made initially by psy- 
chologists who by training or intellectual kinship 
were related to Wilhelm Wundt (1832-1920). For 
Wundt, introspection, the method of observing 
one’s own conscious experience, was the primary 
method of the psychological laboratory. The goal 
in investigating concepts was, first, to identify their 
basic mental elements and, second, to discover the 
principles that govern their combination—a goal 
previously set in investigations of sensory and per- 
ceptual experience. As a step toward understanding 
the conscious correlates of a concept, Titchener 
described it as a “symbol which holds a large num- 
ber of particular ideas together” ([1896] 1921, 
p. 312). 

Although the goals of introspective psychology 
seemed clear, the results were not. Introspection, 
as a general method of psychology and as a specif- 
ic technique for revealing the mysteries of thought, 
possessed severe limitations. Of paramount impor- 
tance was its apparent unreliability. The content 
of the mind seemed to be influenced by how one 
was trained to observe it. Elements of conscious- 
ness observed in one laboratory could not be de- 
tected in another. 

Introspective psychology was also handicapped 
by two related characteristics of phenomenal ex- 
perience that apparently interfered with the anal- 


ysis of the higher mental processes. First, mental 
experience seemed to be in a state of constant flux, 
incapable of being analyzed into static elements. 
William James, in his discussion of the conscious 
experiences associated with conceptual thinking 
(1892), recognized the inadequacy of formal intro- 
spection as a technique for describing subjective 
life. Second, introspective analysis of thought by 
H. J. Watt and N. Ach at Wiirzburg, Germany, 
sometimes revealed an extreme paucity of con- 
scious experience and at other times, none at all. 

Dissatisfaction with the introspective method 
encouraged some psychologists to propose a new 
approach to the science of psychology. Behaviorists 
argued that psychology would remain methodologi- 
cally unreliable, and empirically restricted, if it 
persisted in identifying its subject matter as con- 
scious experience. Instead, it was suggested, psy- 
chology should be conceived as the objective experi- 
mental science of behavior (Watson 1913). 

Although the philosophical issues were more 
complicated than Watson understood, his vigorous 
and convincing arguments redirected experimental 
psychology in the United States from the study of 
consciousness to the study of behavior. The exam- 
ination of the experiential correlates of concepts 
was discarded in favor of the objective investiga- 
tion of their acquisition and utilization. 

Hull (1920), in a doctoral thesis, illustrated how 
behavioristic methodology could be applied to the 
study of concepts. He trained college students to 
associate each of 36 Chinese characters with one 
of six sounds. These characters belonged to six 
different groups, each of which had a common part 
known as a radical, The appropriate sound (e.g., 
o0 yer) was the same for all the characters having 
the same radical. The subjects were under the im- 
pression that their task was to memorize the re- 
sponse (the appropriate sound) to each stimulus 
(the Chinese character). After a subject was able 
to give the correct response for all 36 characters, 
he was tested with an entirely new set of charac- 
ters that had the same radicals. Thus it was 
possible to discover whether the subject learned 
to respond to the common element (the radical) or 
whether he learned the appropriate response to 
each individual character. If he had learned to 
respond to the common element, he should now 
respond appropriately to all the new characters 
which had that common element. If, however, he 
had only learned to respond to the individual char- 
acter, he would be unable to identify the new 
characters. 

The results indicated that the subjects as a group 
behaved in a manner falling somewhere between 
the two extremes. On the very first trial 27 per cent 
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of the responses were correct, indicating that the 
subjects had learned, to some extent, to respond 
conceptually, that is, to react to the common 
element characteristic of all the instances of that 
concept. It is interesting to note that Hull found 
the most effective method of teaching the six con- 
cepts was by presenting each series of characters 
with the common element in red, thus facilitating 
the discrimination between the essential cues. 

Discrimination learning as a model. Hulls 
study set the course of research in conceptual 
behavior in the United States for decades to come. 
First, it offered an experimental method that could 
determine objectively the influence of numerous 
variables on concept formation. Second, the use 
of stimulus—response language in analyzing con- 
ceptual behavior was encouraged. The implication 
of this technical language system, a system that is 
widely used by neobehaviorists (the present-day 
descendants of Watson), is that there are at least 
three fundamental sets of variables in any psy- 
chological event: environmental (S), behavioral 
(R), and the association (—) between them. Within 
this language system a concept becomes “a com- 
mon response to a set of dissimilar stimuli” (see 
T. S. Kendler 1961, p. 447). Finally, the results 
of Hull’s study suggested that concept formation 
could be conceptualized as a case of discrimination 
learning based upon principles of conditioning. 
Pavlov explained the learning of a discrimination 
(conditioned differentiation) in terms of three fun- 
damental principles: conditioning, stimulus gen- 
eralization, extinction. When training a dog to dis- 
criminate between a white and a ray circle, it 
proved necessary not only to condition him to 
respond to the white circle but also to train him 
not to respond to the gray one. A dog who has been 
conditioned to salivate to a white circle will, with- 
out any additional training, salivate to similar 
stimuli, such as gray circles. Differential responses 
to gray and white circles can be learned only if 
the effects of stimulus generalization are neutral- 
ized. This can be done by extinguishing the gener- 
alized response to the gray circle, i.e., by not giving 
the dog food in the presence of the gray circle. 
Thus a discrimination can be established by the 
joint action of conditioning the response to the 
appropriate stimulus and extinguishing the gen- 
eralized response to the inappropriate stimulus 
[see LEARNING, article on DISCRIMINATION LEARN- 
ING; and the biography of Hutt]. 

This discrimination model, based upon the 
mechanisms of conditioning, generalization, and 
extinction, seemed to many psychologists obviously 
applicable to concept formation. The child who is 
learning the concept of red must not only acquire 
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a common response to all red stimuli but must 
also extinguish the response to similar stimuli, 
such as orange objects. 

The S-R discrimination model of concept forma- 
tion instigated a wide variety of research studies, 
far too numerous to summarize here. Of the more 
systematic research programs some were primarily 
concerned with investigating the influence of a 
specific set of variables; others were more inter- 
ested in analyzing the discrimination process un- 
derlying concept formation. 

Order of difficulty. Heidbreder (1948) in a 
classic series of studies, with college students serv- 
ing as subjects, investigated the order of difficulty 
of attaining different kinds of concepts: those rep- 
resenting objects (e.g., tree), form (e.g., circle), 
and number (e.g., five). Under varying conditions, 
the order of difficulty—concrete objects, form, and 
number—remained constant. This order was inter- 
preted as being positively correlated with the “thing 
character,” the concreteness, of the critical feature 
of the concept; a tree possesses more “thing char- 
acter” than does a circle, which in turn is more 
concrete than five (see Vinacke 1952, chapter 7, 
for discussion of concrete—abstract dimension of 
conceptual behavior), The order of difficulty Heid- 
breder initially found was not, however, always 
replicated, By modifying the critical cues on which 
the concepts were based, the hierarchy of domi- 
nance could be changed. In another set of studies 
under somewhat different experimental conditions 
(e.g., Grant & Curran 1952), attaining concepts 
based on color proved to be more difficult than 
identifying either concepts of form or number. This 
simple ordering of difficulty again does not tell the 
whole story. With practice, concepts of number 
become relatively easy to learn. Also number con- 
cepts, once learned, tend to perseverate more than 
do concepts of form or color. The problem of the 
hierarchy of dominance of various concepts be- 
comes even more complicated when the behavior 
of children is considered (see Russell 1956, chap- 
ter 8; Thompson [1952] 1962, chapter 9). Some 
data suggest that children under age three find 
form concepts easier to acquire than color concepts, 
but from three to five the relationship is reversed. 
Perhaps a fundamental order of difficulty will never 
be found, because the basic mechanisms responsi- 
ble for concept formation will favor one order under 
one set of experimental conditions and an entirely 
different order under a different set of conditions. 

Mediating processes. Rats have been reported 
capable of learning to choose a triangular form 
when paired with other forms despite variations 
in the size, shading, position, and amount of outline 


of the triangles. But is this “truly” conceptual be- 
havior? Osgood believes not, since the rat probably 
does not understand the abstract concept of trian- 
gularity and therefore would not respond to “three 
dots in a triangular arrangement .. . or to three 
places on a map, a three cornered block . . .” (1953, 
p. 667). He suggests that there are two kinds of 
classificatory behavior. One kind occurs when an 
organism responds to a common specific element 
in different stimulus patterns (e.g., the specific 
Chinese radicals in Hull's study). The other kind 
of classificatory behavior, the “true” conceptual 
type, occurs when varying stimulus patterns, not 
necessarily containing any features in common, 
elicit a common mediating abstract response that 
serves as a cue for conceptual behavior. Although 
the qualification “not necessarily containing any 
features in common” can be a source of semantic 
confusion (see Kendler 1964, p. 219), Osgood’s 
distinction directs attention to mediating processes 
that many investigators assume are of crucial im- 
portance to the discrimination mechanism underly- 
ing conceptual behavior, Experiments with simple 
concept-formation tasks with children of normal 
intelligence ranging from three to ten years of age 
(Kendler & Kendler 1962) suggest that the behav- 
ior of very young children is consistent with the 
principles of discrimination learning based on 
lower animals in that their responses are directly 
under the control of environmental stimuli. The 
behavior of older children, in contrast, is dependent 
upon responses, usually implicit, that intercede be- 
tween the external stimulus and the overt response 
to provide stimulation that influences the course 
of overt behavior. As children grow older, the pro- 
portion of those who behave in a manner that re- 
flects mediating processes increases in a stable 
fashion. The transition from behavior based on a 
direct connection between external stimulus and 
overt response to mediated behavior enables chil- 
dren to learn concepts more rapidly and utilize 
them under a wider range of conditions. 

What are these mediational events underlying 
conceptual behavior and how can they be con- 
trolled? Instructing children to verbalize the 
relevant cues in a simple concept-learning task 
encourages mediational behavior. But whether 
these instructions to name provide the essential 
cue (e.g., the word “black”) for conceptual behav- 
ior or instead direct attention to the essential cue 
(e.g., the stimulus black) or function in both ways 
simultaneously is not yet known. 

Second signaling system. Closely related to the 
work of the American mediational theorists are the 
efforts of the Soviet psychologists who investigate 


what Pavlov described late in his career as the 
second signaling system. The first signaling system 
dealt with stimulation from the external world. But 
man, Pavlov noted, is a verbal organism that can 
produce words that can function as stimuli. Such 
stimuli, which constitute the second signal system, 
differ qualitatively from others because they can 
represent in an abstract manner the multitude of 
separate stimuli of the first signal system. Accord- 
ing to Liublinskaya (1957) a word becomes a sig- 
nal of the second signaling system only when it 
becomes a concept. 

Natural development—Jean Piaget. There is 
practically no limit to the number of models of 
behavior that have been or can be used to investi- 
gate conceptual behavior. One of the distinguish- 
ing characteristics of both the S-R and the second 
signaling system orientations is the belief that 
classificatory behavior could, at least in large meas- 
ure, be reduced to principles of behavior observed 
in conditioning. Other investigators of conceptual 
processes do not feel compelled to commit them- 
selves to any psychological reductionism, while 
some actually oppose it. Jean Piaget has spent 
over forty years investigating the development of 
thought, or as he sometimes refers to it, “concep- 
tual operations,” in children. The fruit of his ef- 
forts, and those of his co-workers, is a theory that 
seeks to describe intellectual development from 
infancy to adolescence. The formulation is based 
on seminaturalistic observations of behavior of 
children in response to ingeniously designed prob- 
lems and adroit questioning. In general the find- 
ings suggest to Piaget and his co-workers that 
intellectual development can be characterized as 
a series of stages in which each stage lays the 
foundation for the successor. The concepts Piaget 
deals with are quite different from those investi- 
gated by the S-R psychologists, who for the most 
part investigate nominal concepts, e.g., forms, 
shapes, vegetables. Piaget deals with concepts of 
relationships, such as those involved in logical 
operations, which have relevance for the way the 
child perceives and organizes his world. For exam- 
ple, in the first stage of intellectual development 
(from birth to two years of age) the child learns 
the concept of permanency of objects; a doll still 
exists after it is removed from the child’s sight. In 
the second stage, which covers from two to seven 
years, the child can use language, and the inter- 
nalization of actions becomes possible. Symbolic 
function appears, but there is an absence of both 
“reversible operations” and of the concepts of con- 
servation of quantity, size, etc. (e.g., the child is 
apt to say that four checkers placed far apart are 
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more than four checkers placed close together). 
In the third stage (ages 7-11) the conceptual 
operations of the child include the ability to per- 
form concrete operations that belong to the logic 
of classes and relations but do not take into ac- 
count their combinatorial possibilities. The fourth 
stage (11-15), which leads to adult logic, is 
marked by the ability to reason by hypothesis. The 
logic is now concerned with propositions as well 
as objects. Movement from one stage to another 
is determined by a number of experiential and 
maturational variables, related to, but not exclu- 
sively determined by, age. 

Piagets impact on behavioristically oriented 
psychologists was initially not very great because 
of their strong methodological commitments to 
rigorous experimental procedures, In recent years, 
with an increase of interest in cognitive processes 
and the publication in English of a book (Flavell 
1963) that attempts to both streamline and clarify 
Piaget’s conceptions, a meeting of interests, if not 
of minds, has become more probable. Actually the 
interests of Piaget and the S-R psychologists sup- 
plement each other, the former being primarily 
concerned with changes that take place as children 
mature, while the latter are interested in discover- 
ing general principles of behavior that are appli- 
cable to all age groups [see DEVELOPMENTAL PSY- 
CHOLOGY, article on A THEORY OF DEVELOPMENT]. 

Information and decision theories. There are 
psychologists whose orientations and conceptuali- 
zations are at odds with the neobehaviorists. Be- 
ginning with a disenchantment with the capability 
of a model of behavior based upon principles of 
conditioning and learning to represent the richness 
of true-life cognitive processes (e.g., Miller et al. 
1960), these psychologists have turned to the re- 
cent developments of information theory, decision 
theory, and computers for both assistance and in- 
spiration. Hovland (1952) offered a “communica- 
tion analysis” of concept learning in which the 
information transmitted by specific instances of 
concepts could be measured. Such an analysis pro- 
vided a frame of reference against which to eval- 
uate the efficiency of the information-processing 
ability of human subjects. This communication- 
analysis orientation led to the planning of com- 
puter programs designed to simulate human 
behavior as well as experiments aimed at contrib- 
uting information on which programs could be 
based. The task of simulating conceptual behavior 
has proved to be more difficult than initially antic- 
ipated (see Hunt 1962). Success will not be 
achieved by isolated demonstrations of the simi- 
larity between the behavior of computers and 
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humans. What are needed, assuming that the 
primary interest is in the behavior of organisms 
and not the artificial intelligence of computers, are 
general principles that will generate programs 
capable of simulating human conceptual behavior 
exhibited in a wide variety of experimental studies. 
Hovland concluded that computers can function 
as potential aids in understanding concept forma- 
tion by “sharpening our formulations concerning 
mental processes and phenomena,” in encouraging 
“theories that have both descriptive and predictive 
power,” and for coping with the complex problem 
of dealing with a multitude of interacting variables 
(Hovland 1960, pp. 691-692). 

Bruner, Goodnow, and Austin (1956) approach 
the problem of concept attainment in a manner 
similar to the method of computer simulation of 
cognitive processes—but without the use of com- 
puters. Using the term “strategy’—borrowed from 
the mathematical theory of games and functioning 
like a computer program—they analyze the behav- 
ior of subjects categorizing a set of cards contain- 
ing geometrical figures. In different experiments 
the subjects were instructed to form categories that 
were conjunctive, disjunctive, or relational. Inter- 
est was centered on the kind of strategies used to 
attain these concepts; a strategy being defined as 
“a pattern of decisions in the acquisition, retention, 
and utilization of information that serves to meet 
certain objectives . . .” (1956, p. 54). Their objec- 
tives were to maximize the information obtained 
from each instance, to reduce “cognitive strain,” 
and to regulate the risk. The authors analyzed the 
results primarily in terms of four ideal strategies, 
and in terms of the advantages and disadvantages 
of each. Although the term “strategy” is rich in 
connotations, the questions of how they develop 
and influence behavior are far from clear. The 
value of the construct strategy would seem to de- 
pend on additional theoretical refinement and em- 
pirical data [see DECISION THEORY; INFORMATION 
THEORY]. 


In closing this article it may be appropriate to 
offer a brief summary of the crucial problems con- 
fronting investigators of concept formation. There 
is little doubt that the discrimination process is of 
primary importance in concept formation. The best 
method of teaching a concept would be to arrange 
the optimal conditions for discriminating between 
instances that belong to a concept and those that 
do not. Although such a principle would be gen- 
erally accepted, there would be much disagreement 
about its specific interpretation. Whether optimal 
conditions for discrimination could be best ar- 


ranged by reinforcing correct habits and not rein- 
forcing incorrect ones, by encouraging suitable 
mediational responses, by training the organism 
to perceive crucial differences, by developing ap- 
propriate cognitive systems, or by some favorable 
combination of all of these factors—all these issues 
would be open to dispute, Basic to this disagree- 
ment are two related questions: Do these apparent 
differences always represent real differences? If so, 
does their resolution depend upon their being cast 
in precise mathematical language? 

These theoretical issues are influencing specific 
experimental problems and shaping general re- 
search programs. A frank recognition among most 
investigators is that, to some extent, future theo- 
retical progress must await advances in our under- 
standing of the relationship between conceptual 
behavior and verbal and developmental processes 
as well as the design of new experimental tech- 
niques to tap the wide variety of concepts humans 
do learn and use. 

Howarp H. KENDLER AND 
Tracy S. KENDLER 


[Directly related are the entries REASONING AND LOGIC 
and THINKING.] 
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See SURVEY ANALYSIS. 


CONCILIATION 
See INTERNATIONAL CONFLICT RESOLUTION; La- 
BOR RELATIONS; NEGOTIATION; RACE RELATIONS. 


CONDILLAC, ETIENNE BONNOT DE 


Etienne Bonnot de Condillac (1714-1780) was 
a philosopher of the Enlightenment, psychologist, 
economist, and educator; through his work he helped 
to bring about the dominance of the ideas of Locke 
and Newton over the philosophy of Descartes. Born 
at Grenoble into a well-to-do family with a legal 
background, he led the life of a man of letters 
from 1740 to 1758, enjoying the friendship or the 
acquaintance of France’s leading philosophers and 
authors, among them Rousseau, Voltaire, Turgot, 
Diderot, d’Alembert, Morellet, Condorcet, Helvétius, 
Holbach, Grimm, Cabanis, Quesnay, Baudeau, Du- 
pont de Nemours, Le Trosne, and Saint-Peravy. 
From 1758 to 1767 he served as tutor to the young 
grandson of Louis xv, the orphaned duke of Parma, 
for whom he prepared a series of works which re- 
flect his educational theories; then he returned to 
Paris, only to retire in 1770 to a small property 
near Beaugency. There he remained, except for 
occasional visits to Paris, until his death, complet- 
ing La logique ({1780] 1947-1951, vol. 2, pp. 369- 
416)—done at the request of the government of 
Poland for use in Palatinate schools—and other 
works. His reputation as a philosopher, together 
with his writings (renowned for precision, clarity, 
and simplicity), won him membership in the Royal 
Academy of Berlin in 1752 and in the French 
Academy in 1768. 


Condillac’s writings fall into three categories: 
the philosophical, the educational, and the eco- 
nomic. His systematization, exposition, and devel- 
opment of Locke’s philosophy, together with his 
appreciation of Newton’s empiricism, constitute 
four books, the most important of which, Traité 
des sensations, “no student of the history of philos- 
ophy can afford to neglect” ([1754] 1930, p. xv). 
In Essai sur l'origine des connoissances humaines 
({1746] 1947-1951, vol. 1, pp. 1-118), he dealt 
with the origin of the faculties of mind and soul, 
the acquisition of knowledge, and the role of lan- 
guage in transforming sensation into reflection and 
in generating the highest operations found in think- 
ing. In his Traité des systèmes ([1749] 1947- 
1951, vol. 1, pp. 119-217) he criticized in detail 
the doctrines of Descartes, Malebranche, Leibniz, 
and Spinoza, rejected recourse to fixed ideas and 
abstract principles, and urged reliance upon ob- 
servation and experience. In his Traité des sensa- 
tions Condillac made simple sensation the sole 
source of man’s ideas and mental powers as well 
as of reflection and instinct or habit. He imagined 
a statue that has man’s organic structure, whose 
sleeping senses are awakened one at a time, then 
in pairs, and finally all together. With one sense 
awakened, the statue can experience only the sen- 
sations of which that sense is capable; but although 
only touch can judge of externality, other senses 
can learn from touch how to judge external objects. 
The development of understanding, desires, pas- 
sion, ideas, dreams, personality, and the facul- 
ties of the soul grows accordingly. Condillac ex- 
plained the formation of general ideas, the genesis 
of attention, memory, judgment, reasoning, and 
various sensations as originating in the experience 
of pleasure or pain, To the Traité des sensations 
he appended a short Dissertation sur la liberté in 
which he showed that experience, cognition, and 
reflection lead man to apprehend the consequences 
of alternative courses of action and to choose the 
preferable course. In the Traité des animaux ([1755) 
1947-1951, vol. 1, pp. 337-379) he criticized Buf- 
fon’s animal psychology and his neglect of the role 
of language in the differentiation of man from 
animal. 

Condillac’s philosophy of education is repre- 
sented mainly in his Cours d'études ([1775] 1947— 
1951, vol. 1, pp. 395-776; vol. 2, pp. 1-237), pre- 
pared for the duke of Parma but not published 
until 1775. In keeping with Condillac’s earlier psy- 
chological theory that all knowledge arises from 
sensations, he supposed that the mind and its 
capacities develop best when a child’s education 
proceeds by stages. The Cours consists of an intro- 
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duction; sections dealing with grammar, the arts 
of writing, reasoning, and thinking; and extensive 
accounts of ancient and modern history, which also 
include treatments of philosophy, government, lit- 
erature, and post-fifteenth-century scientific prog- 
ress. Condillac also made use of De l'étude de 
Uhistoire, a work prepared for him by his brother 
Gabriel Bonnot de Mably, a well-known author and 
expounder of collectivism. In La logique (1780) 
Condillac developed further his pedagogical views, 
explaining how “nature” teaches man to think and 
analyze, how ideas and faculties of the soul come 
into being, and how language develops and makes 
systematic analytical methods possible. In his in- 
complete and posthumous work, La langue des cal- 
culs ([1798] 1947=1951, vol. 2, pp. 417-529), hav- 
ing remarked the ambiguity and methodological 
inadequacy of verbal language as used in various 
sciences, Condillac noted the need in every science 
for exactitude similar to that found in mathematics 
and indicated how analogy might lead the student 
from the language and method of one science to 
the language and method of another. 

Condillac’s political economy is found almost 
entirely in Le commerce et le gouvernement con- 
sidérés relativement l'un à lautre ({1776] 1947— 
1951, vol. 2, pp. 239-367), only the second part of 
which conforms to the title. Therein he illustrated 
his opinion that economic science, while compli- 
cated and requiring (as does every science) a spe- 
cial language to convey its ideas, can be formulated 
in simple terms and yet serve analytical needs. He 
also showed how commerce can be constrained, as 
it was in France, and how, if it is free of all im- 
pediments, it fosters the growth and spread of 
wealth and prosperity. 

Part 1 is concerned specifically with the depend- 
ence of occupations and classes upon one another, 
to the role of property and national and interna- 
tional markets, and to the beneficial organizing 
role of competition and the price system when not 
impaired by monopoly or other contrived barriers 
to commercial freedom—then price variation can 
keep the supply of each good in balance with the 
demand for it at “le vrai prix.” Condillac’s views 
reflect mainly those of Cantillon and some of those 
of Galiani, Verri, Turgot, and the physiocrats. He 
described value as depending upon utility and as 
varying inversely with supply, as does price; grain 
constitutes the best measure of value over time. 
While his comments on determinants of supply are 
scattered throughout his work, the gist of his argu- 
ment seems to be that these determinants are re- 
ducible to terms of factor scarcity, cost of mate- 
rials, and cost of labor (mainly subsistence). 


Having rejected the nominalist theory of money 
and having shown how gold and silver became the 
dominant forms, he reasoned that their value de- 
pends upon their utility and scarcity, which, in 
turn, are conditioned by type of enterprise and 
access to credit, bills of exchange, and clearing 
arrangements. When unimpeded, gold or silver 
flows from places where its value is low to where 
its value is high, even as does grain, which, how- 
ever, is less mobile; whence the supply of gold or 
grain comes into balance with demand, and “le 
vrai prix” tends to prevail. Condillac defended lend- 
ing at interest, condemning its restriction, and 
noted that interest rates reflect the comparative 
abundance of borrowers and lenders and, in par- 
ticular instances, degree of risk. He discussed the 
structure of demand and its determinants; the 
interaction of increasing wants and progress in the 
arts; the impact of changes in the composition of 
demand upon the employment of land and labor; 
population growth and concentration; class and 
occupational composition; and the adverse effects 
of some types of luxurious expenditure upon a na- 
tion’s welfare and power. He did not appreciate 
the role of the division of labor, but he did identify 
the entrepreneurial role. 

Condillac’s psychological and epistemological the- 
ories exercised great influence in the eighteenth 
century and some influence in the early part of the 
nineteenth century. His views on pedagogy also 
seem to have exercised influence until the early 
nineteenth century; indeed, Joseph Neef’s transla- 
tion of La logique (see Condillac [1780] 1809) was 
used as a guide for education at Neef’s school. 
Condillac’s work on economics, however, exercised 
little influence. It appeared when Turgot and the 
physiocrats (with whose opinions his own were 
incorrectly identified) were in disfavor. Later, Con- 
dillac’s economic ideas were overshadowed by the 
work of Adam Smith and J. B. Say—the latter 
dismissed Condillac’s work as “ingenious trifling.” 
Only in the later nineteenth century did the ex- 
pounders of utility theory appreciate Condillac’s 
merit as an economist. 

JOSEPH J, SPENGLER 


[For the historical context of Condillac’s work, see 
ECONOMIC THOUGHT, article on PHYSIOCRATIC 
THOUGHT; LAISSEZ-FAIRE; and the biographies of 
CANTILLON; GALIANI; QuEsNAY; Turcor. For dis- 
cussion of the subsequent development of his ideas, 
see DEMAND AND SUPPLY; UTILITY.] 
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CONDITIONING 
See IMPRINTING; LEARNING; and the biography 


of PAvLOV. 


CONDORCET 


Marie Jean Antoine Nicolas Caritat, marquis de 
Condorcet (1743-1794), was a French mathema- 
tician, philosopher, and politician, the author of a 
philosophy of progress, of a program for educa- 
tional reform, and one of the first to apply the cal- 
culus of probabilities to the analysis of voting and 
to social phenomena in general. He was among the 
most original thinkers of the revolutionary age. 

Born at Ribemont in Picardy, Condorcet had a 
brilliant scholastic career with the Jesuits at Rheims 
and at the Collège de Navarre in Paris. He attracted 
the attention of such mathematicians as d’Alem- 
bert, Clairaut, Fontaine, and Lagrange, although 
his mathematical publications had no permanent 
importance. At the same time he attended the 
salon of Mlle. de Lespinasse, became a friend of 
Turgot’s, and, with d'Alembert, made a pilgrimage 
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in 1770 to Fernet to see Voltaire. He remained on 
excellent terms with Voltaire. He married Sophie 
de Grouchy in 1787; their salon at the Hôtel des 
Monnaies became one of the most brilliant in Paris. 

Condorcet pursued three careers simultaneously 
and with varying success—an academic one, an 
administrative one, and a political one. As assist- 
ant to the secretary of the Académie des Sciences 
from 1769 on, he wrote a large number of éloges 
and recorded proceedings; he was elected a mem- 
ber of the Académie Française in 1782. As an 
administrator, he was appointed inspecteur des 
monnaies in 1774 and entrusted with various 
scientific missions by Turgot, then the minister of 
finance. Finally, in the context of politics, he drew 
up, in 1789, the petition of the nobility of Nantes 
(one of the cahiers de doléances). As a deputy to 
the Legislative Assembly, he was an active mem- 
ber of the Committee on Public Education. After 
being elected to the Convention, he was chosen to 
prepare the Girondist draft for the constitution, 
but although his proposals were almost always 
passed on the floor, they were very rarely put into 
effect. In 1793 he shared the fate of the Girondins: 
his arrest was ordered in July 1793, but he man- 
aged to remain hidden in Paris until March of the 
following year; then he was arrested, and he died 
under mysterious circumstances in the prison of 
Bourg la Reine. 

An enlightened nobleman. Prominent in scien- 
tific, political, and worldly circles, Condorcet is a 
typical example of the enlightened segment of the 
nobility that supported the Revolution. He was one 
of the first to be converted to the idea of republi- 
canism. His ideology of social progress, his eco- 
nomic liberalism, and his faith in the omnipotence 
of rational knowledge allied him with the rising 
bourgeoisie rather than with his own class. 

Condorcet might well be called the last of the 
Encyclopedists. He took an active part in the pub- 
lication of the Supplément a lencyclopédie, and 
more particularly in recasting its mathematical 
portion, the Encyclopédie méthodique, in 1784- 
1785. His curiosity was universal, and his most 
characteristic effort was his attempt to join and 
coordinate mathematics and philosophy and thus 
“satisfy two passions at once,” as he wrote to 
Frederick the Great. He considered philosophy to 
include everything relating to the knowledge of 
man: “the metaphysical and social sciences, those 
that have man himself as their object . . .” (1847— 
1849, vol. 6, p. 494). His epistemological princi- 
ples were borrowed from Condillac, but Condorcet 
differed greatly from Condillac in his conception 
of man’s destiny and the fate of human societies. 
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Condorcet’s best-known work, the Sketch for a 
Historical Picture of the Progress of the Human 
Mind (1795), was written while he was in hiding 
in Paris in 1793-1794. In it he presented a history 
of the errors and the advances of humanity, in order 
to predict, direct, and accelerate its forward march. 
In Condorcet’s view, historical development coin- 
cides with the spread and triumph of the light of 
reason. 

Condorcet believed that education, more than 
anything else, produced the triumph of the En- 
lightenment. He regarded inequality of education 
as one of the main sources of tyranny and advo- 
cated public education that would offer to all who 
might benefit from it the “aid hitherto confined to 
the children of the rich.” Nonetheless, his interests 
remained those of the bourgeoisie and the enlight- 
ened nobility, and his educational recommenda- 
tions consist, in effect, of two separate programs 
based on class distinctions; one is for the lower 
classes and is essentially technical; the other is for 
the ruling classes and develops the critical facul- 
ties of future citizens. 

“Social mathematics.” An essentially new fea- 
ture of Condorcet’s educational program was the 
importance he gave to science at every level of in- 
struction, in particular to applied science. In addi- 
tion, he believed that the social sciences should be 
taught in institutes and lycées. One of his central 
ideas is the importance to the progress of humanity 
of developing an “art of society” (art social). 

In essence, Condorcet conceived of this art 
social as an “application of mathematics to the 
moral sciences,” a discipline at once empirical and 
deductive, making use of genuine mathematical 
models of human phenomena. Condorcet was con- 
vinced that “the truths of the moral and political 
sciences can be as certain as those that make up 
the system of the physical sciences” (1785, p. 1). 
However, he believed that this certainty generally 
does not apply to causal relationships but to prob- 
able connections. The primary concern of what he 
called “social mathematics” is the application of 
the calculus of probability to the description and 
prediction of human phenomena. 

This mathematics, as it emerges in Condorcet’s 
published writings and unpublished manuscripts, 
comprises a statistical description of societies, an 
economic science (physiocratic in inspiration, but 
oriented toward the more recent idea of collective 
welfare), and a combinatory and probabilistic 
theory of intellectual operations. 

This last theory appears in connection with the 
theory of voting, which is by far the best-developed 
part of social mathematics (1785). Voting was 
viewed by Condorcet as making manifest not so 


much a compromise among a number of conflict- 
ing forces as a true opinion. He assumed, there- 
fore, that the question being voted on has a true 
solution that is independent of the wishes of those 
voting and that these voters express in their indi- 
vidual choices their greater or lesser understanding 
of that truth, The problem of structuring the proc- 
ess of voting is that of producing the maximum 
probability of a collective choice of the true solu- 
tion; this may be done by varying the size of the 
voting body and the kind of majority required, as 
well as by considering the likelihood that each 
voter will make a correct decision. Condorcet was 
thus led to the construction of statistical models 
of voting bodies that permit the appraisal of the 
probability that collective decisions will correspond 
to the true answers to particular problems, This 
is, then, a real problem in operational research— 
how to establish voting procedures such that the 
chances of the emergence of correct decisions are 
maximized, 

Since he conceived of voting as a collective 
search for the truth, Condorcet had to deal with 
the problem of defining precisely what a collective 
decision or judgment is. Given a series of dichot- 
omous questions, the “yes” or “no” responses to 
them may be called “judgments”; how, then, may 
a single collective judgment or a coherent hier- 
archy of judgments be established on the basis of 
a set of individual judgments? Charles de Borda, 
1733-1799, had shown that in the choice of the 
“best” among three candidates the one so desig- 
nated by a simple majority might not be the same 
as the one arrived at when each is compared with 
the two others. Condorcet showed that such a 
weighing of preferences may reveal a circular 
order of preferences among the candidates— 
A>B>C >A—and, consequently, the possibility 
of an inconsistent collective choice even when 
individual choices are consistent. The mathemat- 
ical apparatus available to Condorcet was too crude 
for him to obtain results that could be applied 
empirically, but he did open the way to a highly 
original and important social scientific conception. 

Influence. The ideology of progress of Condor- 
cet’s Esquisse directly influenced Auguste Comte. 
His influence on the application of mathematics to 
human affairs can be found in such nineteenth- 
century works as those of Poisson and Cournot. 
The questions involved in the formation of a col- 
lective opinion have quite recently been taken up 
again by, among others, K. J. Arrow and G. Th. 
Guilbaud, although they have been reformulated 
in the light of welfare economics and the theory 
of games. 

GILLEs-GAsTON GRANGER 


[For discussion of the subsequent development of Con- 
dorcet’s ideas, see ECONOMIC EQUILIBRIUM; GAME 
THEORY; MODELS, MATHEMATICAL; and the biog- 
raphies of COMTE; COURNOT; PoIssoNn.] 
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CONFERENCES 


A conference is a meeting of individuals called 
together to engage in discussion with the aim of 
accomplishing a limited task within a restricted 
period of time. Conferences are of special interest 
to the social sciences as a productive method of 
organizing research, including interdisciplinary re- 
search and research results; as a setting for trans- 
national and international cooperation; as a formal 
means of validating social programs; as a method 
of inducing change in individuals; and as limited, 
observable group situations that are suitable for 
the study of group processes. The conference is 
also an area of action within which the interplay 
of theory, data gathering, and practice has been 
very productive. The increasingly widespread adop- 
tion of the conference form is creating a demand 
for a new set of professionals who are skilled in 
conference management. 


Characteristics 

The conference form may be modified in vari- 
ous ways. A conference may bring together three 
specialists for consultation. It may bring together 
all the personnel who have a permanent working 
relationship to one another in an ongoing organ- 
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ization, such as a psychiatric clinic, a government 
bureau, or an industry. It may bring together, peri- 
odically, individuals whose diverse professional in- 
terests overlap through their concern with a com- 
mon problem. In its most expanded version, it may 
bring together thousands of delegates and ob- 
servers, as, for example, in a White House confer- 
ence or an international gathering of experts, to 
consider a wide range of problems in a very large 
field of interest. In its core form, however, a con- 
ference is a gathering that is small enough so that 
its members, meeting in one room, can take part 
in open-ended discussion. 

Interchange at a conference is intended to be 
less formally structured than are the deliberations 
of a parliamentary body. At a very large conference 
that discharges such multiple functions as the dis- 
cussion of substantive matters, the establishment 
of a new climate of opinion, the orientation of its 
members to a new viewpoint, the validation of a 
proposed program, vu. the preparation of a set of 
resolutions and recommendations, several types of 
sessions are held. There are plenary sessions, in 
which the entire body participates and formal par- 
liamentary procedures are followed. There are also 
smaller sessions, designated as discussion groups, 
working groups, panels, “buzz” sessions, and so on, 
the organization of which is intended to encourage 
a more informal and active interchange among the 
participants. Ideally, the arrangement of a small 
conference is such that all its members sit facing 
one another around a table, and their interchange 
is mediated but not dominated by a chairman. 

Conferences in their many different forms draw 
on the pattern that is characteristic of group de- 
liberations for special or periodic purposes at all 
levels of social complexity. In its more familiar 
modern versions, however, the conference is a 
Euro-American invention, the development of 
which has been vastly accelerated both by the need 
for and by the possibilities of rapid communication 
in the twentieth century. Although large confer- 
ences, governmental and nongovernmental, rely 
particularly heavily on traditional western Euro- 
pean parliamentary procedures, the format and 
style of the modern conference have other deriva- 
tions as well. One important source is the format 
of the scientific meeting, as exemplified by the 
meetings of the Royal Society of London, at which 
substantive matters are presented to a group of 
peers; another is the format of the ecclesiastical 
councils, called together to issue validating pro- 
nouncements on matters of moment. Modern con- 
ference form and style also have been influenced 
by the long tradition of intellectual controversy 
within eastern European Judaism; by the pro- 
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cedures through which humble people, meeting to- 
gether, have attempted to arrive at consensus; and 
by the procedures of the Quaker business meeting, 
with its emphasis on a search for “the sense of the 
meeting.” Residues of these and other historical 
forms still may be discerned in contemporary 
conference organization. 

A conference is formal in certain well-defined 
respects. It must have a scheduled beginning and 
end, identifiable sessions, and a designated mem- 
bership. Its subject matter is specified. The con- 
ference plan must include, as a final step, a pro- 
jected outcome, whether this takes the form of a 
report, a set of recommendations or resolutions, a 
pronouncement about some program, or the formu- 
lation of new insights. 

Modern conference methods have developed in 
response to the need for forms that will permit a 
group to transcend, through face-to-face discus- 
sion, various barriers to good communication. To- 
day, communication may be greatly hindered by 
the divisions that result from bureaucratic arrange- 
ments or the compartmentalization of specialized 
subject matter. It may be reduced to insignificance 
by difficulties arising from differences in nation- 
ality, status, race, caste, class, or ideology. Espe- 
cially in new fields, where the quantity of informa- 
tion and the lag in publication make it difficult for 
research workers and practitioners to keep abreast 
of developments, the conference provides means 
for the rapid dissemination of new knowledge. But 
the conference group atmosphere also may be used 
to thaw stereotyped antagonisms, an aim that may 
conflict with other conference goals. 

The small substantive conference, specifically, 
has made one unique contribution to intradisci- 
plinary, interdisciplinary, domestic, and trans- 
national communication. It is simultaneity of par- 
ticipation, In place of the older linear system of 
publication, written rejoinder, and written rebuttal, 
there is immediacy of challenge, correction, and 
redefinition in the actual presence of all the par- 
ticipants to a discussion. Moreover, everyone pres- 
ent, whether or not he speaks, takes part in the 
conference process. Intrinsic to this type of confer- 
ence is a form of communication that is not one- 
to-one but many-to-many, visible, audible, and 
sensible to all those sitting around a table. Each 
member, not only as he speaks but also as he re- 
mains silent or moves restlessly, is part of what is 
happening. And in their subsequent work, indi- 
vidual members draw on the whole, not merely on 
the conclusions or even the transcribed proceed- 
ings. Participation in a small substantive confer- 
ence resembles the apprenticeship learning experi- 


ence gained by work in a laboratory under a great 
experimentalist or in the atelier of a great painter. 
In its purest form, such a conference is self- 
contained; its members are not required, for ex- 
ample, to meet the demands made on committees 
with definite purposes to serve. At its best, it is the 
source of new ideas that otherwise would have 
developed only much later or, sometimes, not at all. 


The conference as a research procedure 


In the 1930s, the interdisciplinary conference 
became part of the newer paraphernalia of re- 
search procedures in the social sciences and the 
life sciences. Its precursor, the interdisciplinary 
team in the child development clinic, was designed 
to coordinate action among cooperating disciplines, 
i.e., medicine, psychiatry, psychology, and social 
work. Similarly, the interdisciplinary conference 
was deliberately designed to introduce cross-disci- 
plinary correctives into the thinking of compart- 
mentalized and isolated disciplines. The Josiah 
Macy, Jr. Foundation conferences, beginning in 
1936 (Josiah Macy, Jr. Foundation 1955), are an 
outstanding example of this development. 

More recently, since the mid-1940s, a related 
type of conference, more focused and more im- 
mediately utilitarian in its aims, has become an 
integral part of research procedures. Conferences 
of this kind may be convened to generate research 
plans, elaborate upon earlier plans, evaluate prog- 
ress, introduce new ideas into an ongoing research 
program, or present and evaluate research results. 
With various modifications, they are modeled on 
the “Macy conference” style. That is, they are char- 
acterized by free, informal interchange and by 
rapid cross communication from many to many; 
eliminating the transitions and explanatory steps 
essential to written communication, they can make 
use of shortcuts possible only in face-to-face situa- 
tions. Conferences so organized will inevitably 
occupy an increasingly large segment of the time 
of applied social scientists as they deal with prob- 
lems that require a multidisciplinary approach. 

In an academic seminar in which the primary 
emphasis is on intellectual interchange or teach- 
ing, the material is to a certain extent incidental 
(Tannenbaum 1965). In contrast, the research 
conference is substantively concerned with the data 
of a research project. Shared data may be used, 
much as psychiatrists use case materials, as the 
medium of communication to carry preliminary 
and as yet unclarified and sometimes ambiguous 
formulations. Today, a large and complex research 
program may include within a common framework 
a cluster of small research groups whose member- 


ship is interlocking. Here the research conference, 
drawing on the work of all the groups, speeds up 
the development of fruitful hypotheses (Mead & 
Métraux 1953). The research conference also has 
been used to prepare materials for final publication 
through a series of preliminary conference sessions 
with various groups of specialists (Hoagland & 
Burhoe 1961; Luszki 1958; Mandelbaum et al. 
1963; Roe & Simpson 1958). This format com- 
bines the symposium and the conference method. 
In still another format, prepared papers, distrib- 
uted in advance to the participants, are designed 
primarily as a stimulant to informal interchange 
within a multidisciplinary group. The published 
symposia of the Conference on Science, Philosophy 
and Religion, which pioneered in the development 
of this conference style, only formally reflect the 
interchange of the discussions (Conference on 
Science, Philosophy and Religion . . . 1941-1943; 
subsequent volumes in this series). 


Initiating social change 


Conferences have become a recognized method 
of launching a new program, validating a new 
approach, or establishing a climate of opinion 
within which action, often a new type of action, 
can be taken. Most new programs undertaken in 
the United States, the United Kingdom, or inter- 
nationally are inaugurated by conferences of rep- 
resentative and significant people who set the 
stamp of approval on a program’s aims and pro- 
cedures. A conference of this kind is essentially a 
sanctioning device. Subsequently, other confer- 
ences may be convened to reaffirm a program's 
aims or to redirect it toward new goals. The White 
House conferences on children, held decennially, 
and conferences on world mental health (Interna- 
tional Preparatory Commission . . . 1948; World 
Federation for Mental Health 1961) are examples 
of the validating and renewal types of conference. 
The sanctioning and stimulating roles of the con- 
ference may be amplified by setting topics several 
years in advance and, by this means, focusing re- 
search attention on a coordinated set of problems. 
Conferences of this general type may also be spe- 
cifically designed to prevent the premature or- 
ganization of an action program. 

Methods derived from group dynamics and group 
psychotherapy have been used in conference set- 
tings; here individuals may undergo a personality 
transformation somewhat analogous to a personal 
conversion. This method may be very effective in 
reorienting members of an ongoing organization 
when, for example, a bureaucratic institution is 
being transformed from an authoritarian to a demo- 
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cratic, or from a centralized to a decentralized, 
structure (Lewin 1935-1946; Lippitt 1949; Brad- 
ford et al. 1964). 

The conference as the focus of research. Since 
the 1920s there has been a steady growth of in- 
terest in the scientific study of small groups, group 
processes, the management of group situations, 
and the design of experimental groups for research 
purposes. This field, as it has developed, represents 
a fusion, on the one hand, of the procedures of 
social work, psychotherapy, progressive education, 
and industrial relations, and, on the other hand, 
of allied interests within social psychology, so- 
ciology, and anthropology. Under the leadership of 
Elton Mayo, important work originated at the 
Harvard Graduate School of Business Administra- 
tion. In the same period, Moreno developed methods 
for the sociometric analysis of group structure and 
for the simulation of group situations through 
sociodrama, Under the leadership of Kurt Lewin 
research was carried out on group dynamics and 
group decision, and techniques were developed for 
experimental studies of group structure and innova- 
tion (Bavelas 1951; U.S. Office of Naval Research 
1951; Gyr 1951). Others concentrated on problems 
of group structure and interaction. One outcome of 
this research has been the expansion of interest in 
group processes in the policy sciences and among 
those concerned with public opinion formation. 

Parallel work on small groups, carried out in 
England, has been strongly influenced by an in- 
terest in the therapeutic situation, Research there 
has centered particularly in the group associated 
with the Tavistock Clinic and the Tavistock In- 
stitute of Human Relations. 

The conference as a social unit has provided a 
ready-made focus of attention for research workers 
concerned with problems of group process. The 
fact that investigations, whether intradisciplinary 
or interdisciplinary, have tended to be carried on 
by groups of research workers has had important 
consequences for the development of conference 
procedures. Social scientists interested in the study 
of groups formed a number of small societies, 
among them the Society for the Psychological 
Study of Social Issues, the Society for Applied An- 
thropology, and the American Sociometric Asso- 
ciation. As meetings of these societies made 
extensive use of the conference form, the special 
sophistication of those actively engaged in research 
on groups was fed back into ongoing thinking 
about conferences. Modern recording techniques, 
such as stenotyping and tape recording, which 
facilitated the study of conferences, gradually be- 
came standard equipment, and verbatim tran- 
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scripts became available not only for conference 
analysis but also for future conference planning. 
A further step was taken when the conferences 
that were convened to plan, coordinate, or evaluate 
the research on small groups also were subjected 
to the scrutiny of those whose observational and 
experimental work was concentrated on problems 
of group interaction. 

Throughout its development, the scientific study 
of group processes has been characterized by the 
most intense, continuous interplay of theory, data 
gathering, and practice. The methodology of this 
interplay is intrinsic to theories of the relevance of 
research to social action that developed in the late 
1930s and early 1940s. Equally important, how- 
ever, has been the double position of those involved 
in investigations, both as research workers study- 
ing group processes and as participants in situa- 
tions on which such research was being carried 
out. [See Groups; INTERACTION.] 

In the 1950s, a further development took place 
as new disciplines concerned with the analysis of 
nonverbal behavior entered the field. Although the 
published output of those engaged in research in 
kinesics, proxemics, paralinguistics, and metalin- 
guistics is not as yet large (Indiana University . . . 
1964), these several new approaches hold great 
promise for a further clarification of the communi- 
cation process as it evolves in the setting of a 
conference. [See KinEsics.] In the same period, a 
growing interest in ethology among students of 
human social behavior also has stimulated atten- 
tion to nonverbal elements in communication. In 
addition, the development of photographic tech- 
niques for the visual recording of interaction 
promises to add still another dimension to the 
analysis of the communication process. 

In the late 1940s several large-scale studies of 
the conference form were inaugurated. Some of 
these failed in their purpose, as no adequate solu- 
tion had then been found for the problem of the 
relationship between the detached observer of a 
conference and its deeply involved, committed 
membership. Since the conference form had 
become part of the regular procedures of organiza- 
tions having a professional interest in the confer- 
ence process, it seemed reasonable to build research 
plans around their meetings. However, when these 
conferences have been used for purposes of im- 
partial objective research, the professional partici- 
pants sometimes have rebelled against serving, 
simultaneously, as the subjects of observation. 
Their objections, which have seriously impeded re- 
search that might otherwise have been productive, 
have stemmed from the failure—perhaps on both 


sides—to grasp one essential characteristic of every 
successful conference, namely, the complete tem- 
porary commitment of every participant to the 
shared purposes of the group as a whole. In a 
situation in which the observer can share in this 
commitment to content and can channel his ob- 
servations in such a way that they facilitate the 
work of the conference, his insights are welcomed. 
In contrast, where his role is so defined that he 
must stand aside, taking notes that will be useful 
only in some context external to the purposes of 
the conference, his very presence is resented. This 
requirement of involvement has a corollary. A na- 
tural history method of recording and the use of 
artifacts intrinsic to the conference procedure, 
such as preconference letters, drafts of formula- 
tions prepared within the conference, and the con- 
ference transcript, are appropriate for conference 
research, but instruments developed for other pur- 
poses, for example, the questionnaires and self- 
evaluation forms used in measuring changes of 
attitude, are often completely inappropriate. One 
example of a conference that successfully com- 
bined the application of research with self-observa- 
tion was the Eastbourne conference, which had the 
assigned task of organizing international confer- 
ence procedures (Capes 1960). 

Conference management as a profession. Con- 
ference management is developing as a profession 
whose major focus may be problems of organiza- 
tion, the guidance of the group process, transcrip- 
tion, editing and publication, or some combination 
of these. Today, large organizations and institutions 
are likely to employ specialists in all these branches 
of conference management. 

The demand for specialists is related to, and has 
expanded with, the development of conference 
technology, particularly that aspect of it concerned 
with interpretation (Glen 1954). While conference 
management skills must still be acquired through 
informal apprenticeship learning, a more identi- 
fiable career specialization is likely to emerge as 
the number of international conferences, for which 
very special skills are required, continues to in- 
crease. Concurrently, it may be expected that high- 
level skills in conference communication, earlier 
the subject of intense research, will be more widely 
diffused among social scientists and the scientific 
community as a whole. 

MARGARET MEAD 


[Other relevant material may be found in COMMUNI 
CATION; DIFFUSION, article on INTERPERSONAL IN- 
FLUENCE; Groups; INTERACTION; KineEsics; SCF 
ENCE, article on SCIENTIFIC COMMUNICATION. ] 
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I 
PSYCHOLOGICAL ASPECTS 


Conflict refers to a situation in which a person 
is motivated to engage in two or more mutually 
exclusive activities. In a monogamous society a 
man cannot marry two women at the same time, 
no matter how attractive they are to him. A busi- 
nessman may be faced with the choice of hiring a 
lazy relative or an efficient stranger for an impor- 
tant position. The soldier in combat may be torn 
between the desire to run away and the fear of 
losing face with his comrades. Incompatible re- 
sponses cannot occur simultaneously. 

Conflict may occur on many different levels. 
On the overt behavioral level, a tribesman may be 
motivated both to approach and to avoid the taboo 
object. On the verbal level, a person may want to 
speak the truth but fear to offend. On the symbolic 
level, ideas may clash and produce cognitive dis- 
sonance. On the emotional level, the visceral re- 
sponses involved in fear and digestion are incom- 
patible. Motives are important in conflict, and for 
this reason the term motivational conflict is often 
used. However, there is nothing basically incom- 
patible about motives as such. Sigmund Freud 
suggested that opposite instincts exist side by side 
in the unconscious, with no disharmony. Conflict 


occurs only when the overt, verbal, symbolic, or 
emotional responses required to fulfill one motive 
are incompatible with those required to fulfill 
another. 

Social existence involves a great many conflicts, 
The prediction and explanation of social behavior 
require more than knowing that a person has 
learned to make a certain response at a given time 
and place when properly motivated. The situation 
frequently involves other motives that produce 
incompatible response tendencies. The individual 
in society, subject to the pressures of the many 
groups to which he belongs and the demands of 
the many roles he must play, often experiences 
personal conflict. The entire process of the sociali- 
zation of the child has been viewed as a conflict 
between the individual and society. Freud has gone 
so far as to say that civilization itself is a product 
of the clash between the incompatible demands of 
biological urges and social conformity. 

The concept of conflict is particularly significant 
in the areas of personal adjustment and mental 
disorder. Freud maintained that no neurosis exists 
without a conflict. Clinical studies suggest that 
psychological conflicts are of central importance 
not only in neuroses but also in psychosomatic 
disease, sexual deviation, and functional psychosis. 
Furthermore, psychological conflicts appear to 
contribute to various forms of social pathology, 
such as marital, educational, and vocational fail- 
ure; delinquency, crime, and prostitution; and 
alcoholism and drug addiction. Laboratory animals 
with experimentally induced conflicts may exhibit 
such maladaptive behavior as response fixation, 
tantrums, and catatoniclike posturing. They may 
also develop gastrointestinal ulcers, acquire a taste 
for alcoholic beverages, or become viciously ag- 
gressive. 

The psychoanalytic contribution. The first im- 
portant use of the concept of psychological conflict 
was by Joseph Breuer and Sigmund Freud in their 
work on hysterical neuroses (see Freud 1893—1895). 
These authors explained hysterical symptoms as 
the consequence of a conflict between incom- 
patible ideas. For example, Anna O. was a sensi- 
tive young girl much devoted to tending her dying 
father. Her ideas of devotion to her father were 
shaken by the eruption of a wish for him to die 
quickly and end his suffering. The disturbing 
thoughts were repressed, and a series of crippling 
hysterical symptoms developed. 

In his subsequent development of psychoanalytic 
theory, Freud placed a great deal of emphasis on 
a conflict between sexual and self-preservative in- 
stincts. Later, he phrased this as a conflict between 


libidinal wishes and ego anxiety. In order for a 
neurosis to develop, however, the libidinal wish 
must be repressed by anxiety. The unconscious 
libidinal urge then seeks an indirect outlet through 
such things as dreams, slips of the tongue, or 
neurotic symptoms. A neurotic symptom is a com- 
promise between the repressed libidinal wish and 
the inhibiting anxiety. Personality development 
may be viewed as a series of conflict resolutions. 
The most adaptive method of conflict resolution is 
sublimation, by which the libidinal urge is dis- 
charged in socially useful activities, such as science, 
art, and work. 

The relationship between internal and external 
sources of conflict is of some theoretical signifi- 
cance. While Freud speaks of the clash between 
the individual and society, he also says that a 
neurotic conflict must involve an internal source 
of inhibition. Thus, a prisoner who is denied a 
sexual outlet may be frustrated and rebellious but 
not neurotic. On the other hand, a person whose 
own guilt feelings deny him a sexual outlet may 
indeed become neurotic. The guilt feelings derive 
from society, of course, but have become internal- 
ized. It might be best to reserve the term “frustra- 
tion” for the external blocking of a motive and 
the term “conflict” for internal blocking. In both 
cases, the blocking can increase tension in the 
individual. 

Much of the controversy in psychoanalytic the- 
ory has centered on the motives that enter into 
neurotic conflict. Alfred Adler suggested as critical 
in neurosis the will to power and Carl Jung the 
inability to achieve self-realization. Freud, at a 
later stage, added an aggressive motive in the form 
of the death instinct. Many contemporary psycho- 
analysts emphasize conflicts involving achievement, 
affiliation, and dependency motives. Nevertheless, 
the basic Freudian idea of a conflict between some 
motivated response and anxiety as crucial in neu- 
rosis has gained widespread acceptance. 

The psychoanalytic ego theorists have challenged 
the Freudian concept of personality developments 
resulting primarily from conflict resolution. On the 
basis of direct observation of children, they have 
concluded that whole areas of development are 
conflict-free, particularly those that are ego func- 
tions, such as language, physical mastery, and 
curiosity. Thus, social interests in science, art, and 
work may be independent of libidinal development. 
Civilization may be more than a product of conflict. 

Finally, the culturally oriented psychoanalytic 
theorists, such as Karen Horney, Erich Fromm, 
and Harry Stack Sullivan, have seriously chal- 
lenged the Freudian concept of conflict between 
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biological instincts and socially derived inhibitions. 
Karen Horney, for example, suggests that the de- 
veloping personality is concerned only with secur- 
ity. In a threatening social environment, certain 
coping mechanisms become exaggerated and may 
come into conflict. For example, the neurotic per- 
son may vacillate between hostility and help- 
lessness. Thus, the conflict is between coping 
mechanisms, and both arms of the conflict are 
motivated by insecurity deriving from society. 

Lewin’s taxonomy of conflict. Kurt Lewin made 
a comprehensive analysis of personality in terms 
of field theory (see Lewin 1926-1933). The be- 
havior of the individual was seen as determined by 
a field of psychological forces. These forces were 
dependent, in part, on the positive and negative 
valences attached to various goals in the situation. 
Conflict was thought to arise when the forces rele- 
vant to two or more goals were of equal strength 
[see FIELD THEORY and the biography of LEWIN]. 

Lewin distinguished between three types of con- 
flict: (1) where a person is between two goals of 
positive valence; (2) where a person faces a goal 
with both positive and negative valence; and (3) 
where a person is between two goals of negative 
valence. These types of conflict have become known 
as approach-approach, approach—avoidance, and 
avoidance—avoidance, respectively. A fourth type 
has been added, double approach—avoidance con- 
flict. 

The approach—approach type of conflict is per- 
haps the simplest. For example, a child may be 
required to choose between two equally attractive 
desserts, a situation reminiscent of the donkey who 
starved to death when placed equidistant between 
two bales of hay. In fact, however, experimental 
studies with both human and animal subjects show 
that this type of conflict is resolved easily and 
rapidly. The equilibrium is not stable so that any 
slight movement toward one of the positive goals 
results in increasing the positive valence of that 
goal and in rapidly increasing momentum toward 
that goal. 

The approach—avoidance type of conflict, on the 
other hand, is quite stable. The approach and 
avoidance tendencies are pitted against one another 
so that an approach movement results in an in- 
crease in the negative valence and in a compen- 
satory avoidance movement, and vice versa. The 
experimental studies show that approach—avoid- 
ance conflict results in a great deal of vacillating 
behavior. Lewin gives the example of a child re- 
quired to perform some unpleasant task in order 
to get a reward. In addition to vacillation, the child 
may deal with the situation by minimal task per- 
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formance, abandoning the reward, or attempting 
some detour, 

The avoidance—avoidance type of conflict can be 
illustrated by the child who is required to perform 
an unpleasant task under threat of punishment. 
The child’s first tendency is to avoid both altern- 
atives and leave the situation, However, if the 
child is forced to remain in the situation, Lewin 
suggests he may withdraw into himself or engage 
in an emotional outburst. The experimental studies 
show that avoidance—avoidance conflicts are re- 
solved very slowly and with great difficulty. Fre- 
quently, the subject simply freezes or blocks or 
makes a response that keeps him equidistant from 
both alternatives. 

The double approach—avoidance type of conflict 
is seen in the situation in which there are two 
goals with approach and avoidance tendencies to- 
ward each. This type of conflict also seems to 
produce a great deal of blocking. Double approach— 
avoidance conflict is an important paradigm when 
one or more of the components are hidden or un- 
conscious. For example, a man deciding between 
two attractive women appears to be in an easily 
resolved approach—approach conflict. Instead, he 
may be unable to make up his mind because he 
unconsciously fears both women. Thus, he is really 
in a double approach—avoidance conflict. 

Miller's approach-avoidance model. The most 
influential contemporary theory of conflict is that 
proposed by Neal E. Miller. Miller concentrated on 
approach—avoidance conflict on the assumption 
that this type was of most significance in psycho- 
pathology and social behavior. Miller's strategy was 
to establish a theoretical model of conflict on the 
basis of carefully controlled laboratory experiments 
with animal subjects and then to extend these 
principles to complex human behavior. 

The model of conflict developed by Miller is a 
special application of more general principles of 
learning, specifically the stimulus—response learn- 
ing theory of Clark Hull. Approach and avoidance 
tendencies are conceived of as motivated responses 
whose strength is a function of the stimulus situa- 
tion and its relation to past learning, the motiva- 
tion of the subject, the incentive value of the goal, 
and the number of times the response in question 
has been reinforced. In a situation in which there 
are two incompatible responses conflicting with 
each other, the stronger response will tend to occur. 

Miller's analysis of conflict also rests on another 
important learning principle—the goal gradient. 
A white rat in a simple runway will tend to run 
faster the closer he gets to the end containing a 
goal object, such as food. In other words, the ap- 


proach tendency gets stronger the nearer the ani- 
mal is to the positive goal. It has also been shown 
that the further away an animal gets from a nox- 
ious stimulus, such as an electric shock, the slower 
he runs. Therefore, there appears to be a gradient 
with respect to the tendency to avoid a negative 
goal. Miller also made the crucial observation that 
the avoidance tendency fell off more rapidly with 
distance from the goal than did the approach tend- 
ency. Chiefly on the basis of the observation that the 
avoidance gradient is steeper than the approach 
gradient, Miller constructed his theory of conflict. 

Miller’s model is shown graphically in Figure 1. 
The distance from a goal that is both desired and 
feared is shown on the abscissa while the strength 
of the tendency either to approach or to avoid is 
shown on the ordinate. Both gradients fall off in 
response strength as distance from the goal -in- 
creases, but the avoidance tendency drops off more 
rapidly. The approach—avoidance gradients are rep- 
resented most simply as straight lines, but Miller 
emphasizes that curves of many shapes could 
satisfy the requirements of the theory. 

The particular situation shown in Figure 1 in- 
volves an intersection of the approach and avoid- 
ance gradients. The intersection is called the con- 
flict point. To the right of the conflict point the 
net difference between the strengths of the response 
tendencies is in favor of approach, while to the left 
of the conflict point avoidance predominates. There- 
fore, a subject at some distance from a desired and 
feared goal should approach part way until the con- 
flict point and then stop, A subject finding himself 
at the goal should retreat to the conflict point and 
stop. 
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Figure 1 — A simple approach—avoidance conflict 


Source: Neal E. Miller—" Experimental Studies of Con- 
flict’ in PERSONALITY AND THE BEHAVIOR 
DISORDERS, edited by J. McV. Hunt, Copyright 
1944 The Ronald Press Company, New York. 


The above basic prediction about conflict be- 
havior has been confirmed in numerous laboratory 
experiments with animal and human subjects. In 
a classic series of experiments, the response tend- 
ency was measured by the strength with which a 
white rat would pull against a temporary restraint 
either to get to a food goal or to get away from an 
electric shock in a straight alley runway. The 
strength-of-pull measure reflected the greater steep- 
ness of the avoidance gradient. Other animals 
trained both to approach and to avoid the goal were 
observed to go part way down the alley and stop. 
Actually, they did not stop abruptly at the conflict 
point but advanced and retreated alternately in a 
highly tentative manner. This behavior—reminis- 
cent of the vacillation of human subjects in an 
approach—avoidance situation—has been called 
conflict oscillation. It has been suggested that con- 
flict itself is aversive and increases tension level 
above and beyond a summation of the usual mo- 
tives involved. 

The real power of the theory is, however, that 
varying outcomes of conflict situations can be pre- 
dicted on the basis of knowledge about the factors 
assumed to influence the strength of the competing 
response tendencies and their effect on approach 
and avoidance gradients, The evidence shows that 
gradients are influenced by motivation as well as 
the number and size of reinforcements. Figure 2 
shows the effects of increasing the strength of the 
approach tendency. As the approach tendency is 
increased the entire gradient is raised and the 
intersection point with the avoidance gradient is 
much closer to the goal. Therefore, a subject should 
approach much closer to the goal but at the same 
time would experience much greater fear at point 
A than at point B in Figure 2. Conversely, lowering 
the avoidance gradient should also enable the sub- 
ject to advance closer to the goal but with con- 
siderably less increase in fear. 

The avoidance gradient can be lowered by reduc- 
ing fear with such drugs as alcohol, sodium amytal, 
and chlorpromazine. However, caution should be 
used in drawing therapeutic implications from 
these studies because there is some question whether 
the conflict resolution will continue once the drug 
is withdrawn. The simplest way of eliminating 
these experimentally induced conflicts is to allow 
the process of fear extinction to take place. Other 
methods include feeding the animal so as to evoke 
an emotional response incompatible with fear, 
gradually bringing the fear stimulus closer, and 
changing the stimulus conditions. [For discussion 
of treating humans, see MENTAL DISORDERS, TREAT- 
MENT OF, article on BEHAVIOR THERAPY.] 
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Figure 2 — The effects of varying strength of approach 


Source: Neal E. Miller— Experimental Studies of Con- 
flict” in PERSONALITY AND THE BEHAVIOR 
DISORDERS, edited by J. McV. Hunt, Copyright 
1944 The Ronald Press Company, New York, 


An unresolved problem in connection with Mil- 
ler’s model concerns the exact shape of the ap- 
proach and avoidance gradients. The most recent 
evidence suggests that the curves are ogival or S- 
shaped. However, the shape of the curve depends 
on the discriminability of cues along the path to 
the goal, the level of illumination, the species in- 
volved, and other factors. 

A more critical problem has to do with the rela- 
tive steepness of approach and avoidance gradients. 
Miller postulated that the avoidance gradient is 
steeper because it is based on the learned motive of 
fear, which varies greatly with external stimulus 
conditions, while the approach tendency is based 
on hunger, which is more dependent on the sub- 
ject’s internal cues and less influenced by external 
cues. While there is some evidence for this, other 
studies show that a learned approach tendency may 
conflict with a learned avoidance tendency, al- 
though both should be steep and possibly parallel. 
However, it may be that learning has more com- 
plex effects on the steepness of a gradient than has 
been assumed. For example, fear and guilt are both 
learned, but they may have gradients of differing 
steepness because fear is more dependent on ex- 
ternal cues than is guilt. Internal, cue-producing 
responses, such as thoughts and verbal labels, 
would be expected to have a great effect on the 
shape, slope, and height of approach and avoidance 
gradients in most social situations. 

Extension to displacement. The dimension of 
nearness may be viewed as a special case of stimu- 
lus generalization. Therefore, the model should be 
applicable to other situations involving the general- 
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ization of approach and avoidance gradients. This 
would include generalization from one goal object 
to another along a dimension of object similarity. 

Miller saw a connection between goal object gen- 
eralization and the psychoanalytic concept of dis- 
placement. For example, an individual who feels 
hostile toward his father might dream about a 
policeman, since dreaming about his father directly 
would produce anxiety. It has been shown that 
children whose parents severely punish aggression 
in the home are more aggressive outside the home. 
The scapegoat theory of ethnic prejudice is based 
on the idea of displacement and may be extended 
to situations involving aggression to outgroups. 

The same graphic model used for conflict may 
be extended to displacement, with the exception 
that the dimension along the abscissa is one of 
stimulus similarity instead of spatial distance. Mil- 
ler also assumes that the point of maximal dis- 
placement will not be at the intersection of the 
gradients, as in conflict, but to the right, where the 
net difference between the two response strengths 
is maximal in favor of approach. 

Several studies have supported this extension of 
the theory. In one, two rats were placed in a cage 
and taught to attack each other in order to turn 
off an electric shock. During this time they showed 
no tendency to strike a plastic doll in the cage 
with them. Later, when the rats were placed in the 
cage one at a time and the shock turned on, each 
attacked the plastic doll. In this study, the learned 
aggression toward the partner rat was prevented by 
his absence and the aggression was displaced toward 
the previously ignored doll. 

There are a number of situations in which con- 
flict and displacement might operate simultane- 
ously, Murray and Berkun (1955) attempted to 
account for this by developing a three-dimensional 
model combining Miller’s theories of conflict and 
displacement. In this model, the ordinate still rep- 
resented strength of response. The two other orthog- 
onal dimensions were nearness to a goal and 
degree of similarity of various goals to the original 
goal. The approach and avoidance gradients were 
joined to produce intersecting approach and avoid- 
ance planes. 

The Murray—Berkun model was tested in an ap- 
paratus consisting of wide white, medium width 
gray, and narrow black alleys adjacent to one an- 
other. In one alley (for example, wide white) the 
animals were trained first to approach for food and 
then to avoid a shock on the food tray. The alleys 
were connected by small windows so the animals 
could move from one to the other. After the con- 
flict was established in the original alley, but not 


before, the animals moved into the intermediate 
gray alley and approached closer to the goal, then 
entered the most dissimilar alley (for example, 
narrow black) and advanced all the way to the 
goal, as predicted by the model. 

The main results of the Murray—Berkun study 
have been confirmed by other investigators, but 
some controversy has developed over the dimension 
of similarity to the original goal. Some results sug- 
gest that simple distance may play a role, but 
other results suggest that stimulus similarity is still 
operating. The color of the alleys may be less im- 
portant than the width. Furthermore, olfactory cues 
may be important. 

Projective techniques. The conflict and dis- 
placement theories have also been extended, by 
Seymour Epstein (1962) and his associates, to the 
area of projective techniques. Projective pictures 
and word association stimuli can be arranged in 
sequence according to their relevance to a motive, 
such as sex, It is found, frequently, that the motiva- 
tion appears more openly on the pictures of low 
relevance. This can be explained as a displacement 
effect by assuming that the test responses are de- 
termined by intersecting approach and avoidance 
gradients. Epstein and his associates have also de- 
veloped a three-dimensional model, similar to the 
Murray—Berkun one, in which a sequence of pro- 
jective test stimuli form one dimension and time 
another. This model accounts for such findings as 
a parachutist’s anxiety toward association words of 
less and less relevance to parachuting as the time 
for the jump draws nearer. 


Personality, psychotherapy, and culture 

In their learning analysis of neurosis and psy- 
chotherapy, John Dollard and Neal E. Miller (1950) 
have made extensive use of the conflict model. 
These theorists have reinterpreted many of the 
psychoanaiytic observations on psychosexual devel- 
opment in terms of conflicts learned in childhood. 
Punitive parental training methods can establish 
conflicts in the areas of weaning, cleanliness, sex- 
ual behavior, and aggression. More subtle conflicts 
between social motives can be learned at later 
stages of development. 

Dollard and Miller agree with Freud that neu- 
rosis requires that the conflict be at least partly 
repressed. Repression involves the inhibition of 
thoughts, feelings, and behavior and may be viewed 
as the result of an approach—avoidance conflict. 
The emergence of thoughts of a sexual nature may 
be likened to an approach response, and the stop- 
ping of these thoughts is similar to avoidance be- 
havior. A sexually arousing situation increases the 


approach tendency and may also increase anxiety. 
Thus, the person may become anxious without 
knowing why. 

Various neurotic symptoms may be viewed as 
responses reinforced by anxiety reduction. A phobia 
represents an avoidance response to an anxiety- 
arousing stimulus and may also involve displace- 
ment. Conversion symptoms often serve an avoid- 
ance function. Obsessional thinking may function 
to block out anxiety-arousing thoughts. However, 
since the approach tendency is still operating, the 
neurotic symptoms are rarely effective and the 
person keeps finding himself driven back into 
the conflict situation. Therefore, the person shows 
oscillating, inconsistent, and maladaptive behavior. 

Psychotherapy is viewed by Dollard and Miller 
as a way of reducing conflict and eliminating re- 
pression. The permissive, nonpunitive attitude of 
the therapist constitutes a new set of learning con- 
ditions. Anxiety can gradually be extinguished, and 
the need for repression is thereby reduced. The 
patient can discuss his fears and motives. Of great 
importance in therapy is the transference relation- 
ship, or the generalization or displacement to the 
therapist of the conflict originally learned in the 
family situation. As repressions and symptoms drop 
out, the person can use his higher mental processes 
to solve his problems in his social behavior outside 
of the therapy situation. 

An example’of the more detailed use of the con- 
flict model is Dollard and Miller's explanation of 
the negative therapeutic effect. It often happens 
in psychotherapy that, following a period of prog- 
ress in which the patient gains some understand- 
ing, he suddenly becomes more anxious and symp- 
tomatic. This could happen if the therapeutic 
progress resulted in the lowering of an avoidance 
gradient. With the lowering of the avoidance gra- 
dient the patient might advance in thought or 
behavior closer to the goal but then would reach 
a new conflict point involving an increase in anxi- 
ety. Thus, therapeutic progress may have the para- 
doxical effect of making things worse for a time. 

The theories of conflict and displacement have 
been applied to a broader class of cultural phenom- 
ena by several social scientists. For example, in a 
cross-cultural study of primitive societies, John 
Whiting and Irvin Child (1953) have related a 
number of socialization variables to religious be- 
liefs concerning illness. They found that fear of 
others was related to socialization anxiety about 
aggression. However, the actual objects of aggres- 
sion—ghosts of relatives or animal spirits—were 
more difficult to predict. By assuming that these 
aggression objects lie along a continuum of simi- 
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larity to parental socialization agents and by apply- 
ing a modification of Miller's displacement theory, 
they were able to account for different patterns of 
aggression. When aggression anxiety is low, ag- 
gression is displaced more toward ghosts of rela- 
tives than of animal spirits; but, when aggression 
anxiety is high, aggression is displaced toward both 
ghosts of relatives and of animal spirits. 

The application of the theories of conflict and 
displacement to problems of complex human be- 
havior has been promising, but a great deal more 
research needs to be done. A major technical prob- 
lem has to do with the scaling of important social 
dimensions along which generalization may take 
place. A related problem is the role of symbolic 
processes, language, and socially learned discrimi- 
nations in conflict and displacement phenomena. 
Finally, more research is needed on the basic 
mechanisms involved, such as the determinants of 
the shape of the gradients, particularly with human 
subjects in a wide variety of social situations. 


EDWARD J. MURRAY 


[See also AGGRESSION, article on PSYCHOLOGICAL AS- 
PECTS; PHOBIAS. ] 
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IL 
POLITICAL ASPECTS 


In the nineteenth century, Marx and Engels ana- 
lyzed class conflict and the social Darwinists 
examined conflict among societies. But Marxism 
developed essentially as a doctrine of reform and 
revolution, while social Darwinism failed to pro- 
gress beyond oversimplified concepts like “abso- 
lute hatred,” “survival of the fittest,” and “the con- 
quest state.” The first sociologist to focus on conflict 
as a process both internal and external to the 
individual and also to the group was Georg Sim- 
mel, whose ideas have been critically restated by 
Lewis A. Coser (1956). Most sociologists lost inter- 
est in conflict, however, and became increasingly 
engrossed with patterns and processes of social 
integration. It is only in recent years, and especially 
with the threat of nuclear holocaust, that social 
scientists have focused serious attention upon 
problems of conflict. 


Conflict and integration 


It now seems evident that all human relations 
may be viewed as interlaced by two closely related 
processes—the conflictual and the integrative. To 
one degree or another, these two kinds of inter- 
action appear as soon as, and as long as, two or 
more individuals are in contact. They disappear 
only when the parties withdraw and the relationship 
is completely broken. 

Whenever two or more individuals or groups 
come into contact with each other, they may choose 
to make their relationship primarily conflictual or 
primarily integrative (i.e., cooperative, supportive, 
agreed upon). If the initial relationship is primarily 
conflictual, there will nevertheless emerge at least 
a few minimal strands of understanding and reci- 
procity—rules of combat, or perhaps only an agree- 
ment to disagree. If, on the other hand, the initial 
relationship is primarily integrative, it is certain 
that conflict will develop—if for no other reason 
than through the demands of the association itself 
as they compete with the preferences of individuals 
and component groups. 

Some degree of community, organization, or in- 
tegration is inherent in the concept of conflict. If 
the parties in question were not in the same place 
at the same time, or performing two incompatible 
functions at the same time, or cooperating to inflict 


reciprocal injury, there would be no conflict (Ber- 
nard 1957b, p. 112). Almost any aspect of conflict, 
however destructive, requires interaction between 
the antagonists, considerable communication, and 
the establishment and maintenance of many recip- 
rocal ties and subtle understandings. Conflict thus 
functions as a binding element between parties 
who may previously have had no contact at all 
(Coser 1956, p. 121). 

On the other hand, conflict may result in the dis- 
ruption or destruction of all or certain of the bonds 
of unity that may previously have existed between 
the disputants. 

Conflicts take place between individuals, be- 
tween individuals and organizations or groups, 
between distinct organizations or groups, between 
an organization and one or more of its compo- 
nents, or between component parts of a single 
organization or group. A conflict emerges when- 
ever two or more persons (or groups) seek to 
possess the same object, occupy the same space or 
the same exclusive position, play incompatible 
roles, maintain incompatible goals, or undertake 
mutually incompatible means for achieving their 
purposes. 


Inner conflict 


The type of conflict discussed above—essentially 
social conflict—should be distinguished from the 
inner conflict or quandary that emerges when in- 
compatible or mutually exclusive values present 
themselves in the form of an actual or potential 
choice or decision (Bernard 1957b, p. 113). Thus, 
an individual finds himself unable to make a de- 
cision because he is being pushed or pulled in op- 
posite directions. (The same holds true for a 
group.) He wants to be in two places at the same 
time or to perform two mutually exclusive func- 
tions at the same time. Like the proverbial ass be- 
tween two equally attractive bales of hay, he finds 
himself immobilized by the choice confronting him 
(Boulding 1957, p. 132). Or perhaps he finds him- 
self equally immobilized in trying to decide upon 
the “lesser of two evils.” 

In the individual human being the distinction 
between the inner conflict and the outer, or social, 
conflict seems relatively clear: the former involves 
a confrontation by the individual of a difficult 
choice between incompatible values, whereas the 
latter is concerned with an incompatibility between 
that individual and another individual or group. 
Yet the inner conflict can be viewed also as a con- 
flict between internal components of the individual: 
Her heart said Yes, but her common sense said No. 

When groups and organizations are considered, 


the inner conflicts of a given unit are undoubtedly 
social conflicts (conflicts between antagonistic 
actors ) when viewed on one system level; but they 
are also sometimes analogous to the inner conflicts 
or quandaries of the individual, when viewed on 
another level. Among groups and organizations, 
then, the dialogues, debates, and struggles associ- 
ated with decision making may be considered on 
one system level as social conflicts between indi- 
vidual parties; or they may be treated under certain 
circumstances as quandaries—conflicts internal to 
the organism—from the perspective of another 
level in the systems hierarchy. A group can be 
immobilized either because all the members are 
caught between opposing repulsions or attractions 
or because one component of the group is attracted 
(or repelled) in one direction, the other component 
in another direction. Thus, the distinction between 
the quandary and the so-called social, or external, 
conflict is frequently a matter of system perspec- 
tive, which must be determined by requirements of 
the problem, the method of analysis, and the 
theory of the investigator (Lewin 1935-1946). 

An actor who has the capacity for absorbing or 
destroying another actor may be viewed as domi- 
nant, An actor who cannot be absorbed by another 
actor or destroyed as an independent decision 
maker is sovereign or unconditionally viable. Con- 
versely, the actor who survives only through the 
sufferance of another, dominant, party is said to be 
conditionally viable. 

In circumstances where both actors are uncon- 
ditionally sovereign or viable, or where it does not 
pay the dominant party to extinguish the other— 
to secure conditional viability—we have what 
Strausz-Hupé has called “protracted conflict” 
(1959). Here the problem is how to control the 
conflict, i.e., how to maintain limits, rather than to 
resolve it (Boulding 1962, p. 59). 

Each actor or system (individual, group, organ- 
ization, nation-state, and so forth) may be viewed 
as responding to various stimuli, including projec- 
tions of his or its own needs, desires, and expecta- 
tions. Or, one may equally postulate that each sys- 
tem possesses as an inherent part of its structure 
a “view of the universe” or an “image” that includes 
some notion or “plan” of its role and purpose 
(Boulding 1957, pp. 125-126; also Miller et al. 
1960). Within the first view, each choice-point 
offers alternatives to which some combination of 
perceived rewards and penalties is attached. In con- 
nection with the second view, behavior or action 
may be seen as the carrying out of a process that 
moves the system toward the most highly valued 
part of the total image. 


CONFLICT: Political Aspects 227 


In any case, as the actor system carries out its 
means—end process, a certain number of links 
(choices or potential choices) will “snag” or “col- 
lide” with the links of other actors. Conflict gen- 
erally emerges whenever at least one party per- 
ceives that one (or more) of his goals, or purposes, 
or preferences, or means for achieving a goal or 
preference is being threatened or hindered by the 
intentions or activities of one or more other parties. 
The parties may be seeking to move or expand into 
the same field or physical space, or, more abstractly, 
into the same field of influence or behavior 
(Boulding 1957, p. 124). 

Perhaps the most important class of conflict 
processes is the reaction process in which a move- 
ment on the part of one actor “so changes the field 
of the other that it forces a movement of this party, 
which in turn changes the field of the first, forcing 
another move of the second, and so on” (Boulding 
1962, p. 25). Party A perceives—rightly or wrongly 
—that he is being threatened or injured by party 
B. Taking what he considers to be defensive action, 
A behaves in a way that B perceives as injurious or 
threatening. When B responds “defensively,” A, per- 
ceiving now that his initial observations and fears 
have been validated, increases his activities, and 
thus the conflict spirals. 

As the reciprocal threats and injuries rise, the 
parties may find no alternative other than to fight 
it out until one has reduced the other to submis- 
sion. On the other hand, at some point the penalties 
associated with an added increment of hostility 
may appear too great to one or both parties, and 
the conflict may decelerate, In due course, however, 
the anxieties, fears, and discomforts associated 
with their basic relationship are likely to become 
unbearable again, and the spiraling will resume. 
Essentially, the cold war is such a conflict in that 
it vacillates between a plateau of minimal, day-to- 
day conflict and occasional peaks where the hostile 
interchange stops just short of large-scale violence. 

The initial perception of threat or injury may or 
may not be accurate or justified. Many conflicts 
arise from what parties think may happen—from 
their anxieties, prejudices, fears, and uncertainties 
—rather than from any phenomenon that is 
actually threatening. 


Competition 

Conversely, even where actors are aware of in- 
compatibility, there may be no actual conflict if 
there is no strong desire on the part of at least one 
party to carry out the means—or achieve the ends 
—which are, or appear to be, incompatible (Bould- 
ing 1962, p. 5). Whether competitive situations 
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revolution, while social Darwinism failed to pro- 
gress beyond oversimplified concepts like “abso- 
lute hatred,” “survival of the fittest,” and “the con- 
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as a process both internal and external to the 
individual and also to the group was Georg Sim- 
mel, whose ideas have been critically restated by 
Lewis A. Coser (1956). Most sociologists lost inter- 
est in conflict, however, and became increasingly 
engrossed with patterns and processes of social 
integration. It is only in recent years, and especially 
with the threat of nuclear holocaust, that social 
scientists have focused serious attention upon 
problems of conflict. 


Conflict and integration 


It now seems evident that all human relations 
may be viewed as interlaced by two closely related 
processes—the conflictual and the integrative. To 
one degree or another, these two kinds of inter- 
action appear as soon as, and as long as, two or 
more individuals are in contact. They disappear 
only when the parties withdraw and the relationship 
is completely broken. 

Whenever two or more individuals or groups 
come into contact with each other, they may choose 
to make their relationship primarily conflictual or 
primarily integrative (i.e., cooperative, supportive, 
agreed upon), If the initial relationship is primarily 
conflictual, there will nevertheless emerge at least 
a few minimal strands of understanding and reci- 
procity—rules of combat, or perhaps only an agree- 
ment to disagree. If, on the other hand, the initial 
relationship is primarily integrative, it is certain 
that conflict will develop—if for no other reason 
than through the demands of the association itself 
as they compete with the preferences of individuals 
and component groups. 

Some degree of community, organization, or in- 
tegration is inherent in the concept of conflict. If 
the parties in question were not in the same place 
at the same time, or performing two incompatible 
functions at the same time, or cooperating to inflict 


reciprocal injury, there would be no conflict (Ber- 
nard 1957b, p. 112). Almost any aspect of conflict, 
however destructive, requires interaction between 
the antagonists, considerable communication, and 
the establishment and maintenance of many recip- 
rocal ties and subtle understandings. Conflict thus 
functions as a binding element between parties 
who may previously have had no contact at all 
(Coser 1956, p. 121). 

On the other hand, conflict may result in the dis- 
ruption or destruction of all or certain of the bonds 
of unity that may previously have existed between 
the disputants. 

Conflicts take place between individuals, be- 
tween individuals and organizations or groups, 
between distinct organizations or groups, between 
an organization and one or more of its compo- 
nents, or between component parts of a single 
organization or group. A conflict emerges when- 
ever two or more persons (or groups) seek to 
possess the same object, occupy the same space or 
the same exclusive position, play incompatible 
roles, maintain incompatible goals, or undertake 
mutually incompatible means for achieving their 
purposes. 


Inner conflict 


The type of conflict discussed above—essentially 
social conflict—should be distinguished from the 
inner conflict or quandary that emerges when in- 
compatible or mutually exclusive values present 
themselves in the form of an actual or potential 
choice or decision (Bernard 1957b, p. 113). Thus, 
an individual finds himself unable to make a de- 
cision because he is being pushed or pulled in op- 
posite directions. (The same holds true for a 
group.) He wants to be in two places at the same 
time or to perform two mutually exclusive func- 
tions at the same time. Like the proverbial ass be- 
tween two equally attractive bales of hay, he finds 
himself immobilized by the choice confronting him 
(Boulding 1957, p. 132). Or perhaps he finds him- 
self equally immobilized in trying to decide upon 
the “lesser of two evils.” 

In the individual human being the distinction 
between the inner conflict and the outer, or social, 
conflict seems relatively clear: the former involves 
a confrontation by the individual of a difficult 
choice between incompatible values, whereas the 
latter is concerned with an incompatibility between 
that individual and another individual or group. 
Yet the inner conflict can be viewed also as a con- 
flict between internal components of the individual: 
Her heart said Yes, but her common sense said No. 

When groups and organizations are considered, 


the inner conflicts of a given unit are undoubtedly 
social conflicts (conflicts between antagonistic 
actors) when viewed on one system level; but they 
are also sometimes analogous to the inner conflicts 
or quandaries of the individual, when viewed on 
another level, Among groups and organizations, 
then, the dialogues, debates, and struggles associ- 
ated with decision making may be considered on 
one system level as social conflicts between indi- 
vidual parties; or they may be treated under certain 
circumstances as quandaries—conflicts internal to 
the organism—from the perspective of another 
level in the systems hierarchy. A group can be 
immobilized either because all the members are 
caught between opposing repulsions or attractions 
or because one component of the group is attracted 
(or repelled) in one direction, the other component 
in another direction. Thus, the distinction between 
the quandary and the so-called social, or external, 
conflict is frequently a matter of system perspec- 
tive, which must be determined by requirements of 
the problem, the method of analysis, and the 
theory of the investigator (Lewin 1935-1946). 

An actor who has the capacity for absorbing or 
destroying another actor may be viewed as domi- 
nant. An actor who cannot be absorbed by another 
actor or destroyed as an independent decision 
maker is sovereign or unconditionally viable. Con- 
versely, the actor who survives only through the 
sufferance of another, dominant, party is said to be 
conditionally viable. 

In circumstances where both actors are uncon- 
ditionally sovereign or viable, or where it does not 
pay the dominant party to extinguish the other— 
to secure conditional viability—we have what 
Strausz-Hupé has called “protracted conflict” 
(1959). Here the problem is how to control the 
conflict, i.e., how to maintain limits, rather than to 
resolve it (Boulding 1962, p. 59). 

Each actor or system (individual, group, organ- 
ization, nation-state, and so forth) may be viewed 
as responding to various stimuli, including projec- 
tions of his or its own needs, desires, and expecta- 
tions. Or, one may equally postulate that each sys- 
tem possesses as an inherent part of its structure 
a “view of the universe” or an “image” that includes 
some notion or “plan” of its role and purpose 
(Boulding 1957, pp. 125-126; also Miller et al. 
1960). Within the first view, each choice-point 
offers alternatives to which some combination of 
perceived rewards and penalties is attached. In con- 
nection with the second view, behavior or action 
may be seen as the carrying out of a process that 
moves the system toward the most highly valued 
part of the total image. 
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In any case, as the actor system carries out its 
means-—end process, a certain number of links 
(choices or potential choices) will “snag” or “col- 
lide” with the links of other actors. Conflict gen- 
erally emerges whenever at least one party per- 
ceives that one (or more) of his goals, or purposes, 
or preferences, or means for achieving a goal or 
preference is being threatened or hindered by the 
intentions or activities of one or more other parties. 
The parties may be seeking to move or expand into 
the same field or physical space, or, more abstractly, 
into the same field of influence or behavior 
(Boulding 1957, p. 124). 

Perhaps the most important class of conflict 
processes is the reaction process in which a move- 
ment on the part of one actor “so changes the field 
of the other that it forces a movement of this party, 
which in turn changes the field of the first, forcing 
another move of the second, and so on” (Boulding 
1962, p. 25), Party A perceives—rightly or wrongly 
—that he is being threatened or injured by party 
B. Taking what he considers to be defensive action, 
A behaves in a way that B perceives as injurious or 
threatening. When B responds “defensively,” A, per- 
ceiving now that his initial observations and fears 
have been validated, increases his activities, and 
thus the conflict spirals. 

As the reciprocal threats and injuries rise, the 
parties may find no alternative other than to fight 
it out until one has reduced the other to submis- 
sion. On the other hand, at some point the penalties 
associated with an added increment of hostility 
may appear too great to one or both parties, and 
the conflict may decelerate. In due course, however, 
the anxieties, fears, and discomforts associated 
with their basic relationship are likely to become 
unbearable again, and the spiraling will resume. 
Essentially, the cold war is such a conflict in that 
it vacillates between a plateau of minimal, day-to- 
day conflict and occasional peaks where the hostile 
interchange stops just short of large-scale violence. 

The initial perception of threat or injury may or 
may not be accurate or justified. Many conflicts 
arise from what parties think may happen—from 
their anxieties, prejudices, fears, and uncertainties 
—rather than from any phenomenon that is 
actually threatening. 


Competition 

Conversely, even where actors are aware of in- 
compatibility, there may be no actual conflict if 
there is no strong desire on the part of at least one 
party to carry out the means—or achieve the ends 
—which are, or appear to be, incompatible (Bould- 
ing 1962, p. 5). Whether competitive situations 
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become conflictual may depend, then, upon whether 
the incompatibility is perceived and also upon 
whether the issues involved are considered impor- 
tant by the parties, 

Conflict suggests a special situation of competi- 
tion in which both actors are aware of the incom- 
patibility of potential future positions and in which 
each is strongly impelled to occupy a position 
incompatible with the perceived interests of the 
other. 

Whenever two or more systems come into con- 
tact, they may choose either a “conflict set,” where 
an action will benefit one actor at the expense of 
another; or a “trading set” of means and ends, 
where movements benefiting both parties are fea- 
sible (Boulding 1957, p. 131). 

Yet conflict is inherent in the trading set: each 
party is likely to seek a maximum advantage that 
cannot be achieved without reducing the advantage 
of the other party. A satisfactory trade can be 
achieved only if both parties, tacitly or otherwise, 
observe certain rules and limitations. 

Superordinate goals. A special type of competi- 
tion evolves when an otherwise conflictual struggle 
is consciously joined by two or more parties in the 
interest of some more important, mutually sup- 
portive, promotively interdependent goal: two 
traders decide to “dicker”; two friends meet for 
tennis; two teams play a violent, but carefully regu- 
lated, game of football; or two neighborly mer- 
chants try to outdo each other in the same line, but 
on the assumption that each will prosper (Levinger 
1957, p. 337). In each case, neither side is seeking 
to destroy the other nor to inflict more than the 
limited amount of injury that has been more or less 
agreed upon and stipulated by the rules of their 
competitive game. Two or more parties in conflict 
may be drawn into a cooperative relationship 
through their perception of an overriding, superor- 
dinate goal (see Oklahoma, University of . . . 1961 Jr 

In any one of these “institutionalized” arrange- 
ments, the parties accept, consciously or uncon- 
sciously, a contract or compact—elaborate arrange- 
ments or rules of the game for achieving their 
common purposes (fixing and enforcing bound- 
aries, deciding on possession of the ball, identifying 
violations, enforcing penalties, and so forth). 

The consensus may include not only rules but 
also rules for formulating rules; and it may desig- 
nate roles—messengers, referees, judges—to help 
in deciding the issue or to enforce the agreed-upon 
stipulations. The “game” is likely to provide for 
assignments of function, special prerogatives, limi- 
tations of power, and so forth. 

A procedure for resolving conflicts can be hon- 


ored by usage and thus gain status of custom. Or, 
when established or recognized and enforced by 
recognized authority, such a procedure takes force 
as law. A large part of both custom and law func- 
tions to resolve conflicts, or to prevent them, or to 
confine them within designated limits. 

In analyzing relationships between individuals 
or groups, we are not concerned, then, whether the 
relationship contains conflict; we assume that it 
does. The question is, How do they handle their 
conflicts—by unconfined violence and destruction 
and without institutional restraint, or by some 
regulated means within an agreed-upon frame- 
work? Have the two parties achieved sufficient in- 
tegration to channel their conflicts within tolerable 
limits, and how is the basic contract or integration 
maintained? Most organizations, including political 
organizations on all system levels, may be viewed 
in these terms. 

State A and state B, each seeking possession of 
an island, may thus be in conflict. To resolve the 
issue one state may withdraw its claim; or the two 
states may divide the island by mutual agreement; 
or they may both decide to fight for it. If state A 
withdraws without consultation or communication 
with state B, the conflict has been eliminated, but 
there is no integration—no agreement—between 
the two parties. 

If both sides agree to fight, the two states, while 
still in conflict over possession of the island, are 
not in conflict over the means of resolution. Thus, 
the conflict continues, but an element of integra- 
tion has emerged to the extent that the parties have 
agreed upon the method for deciding the issue. 
Later, if both sides tire of combat, they may agree 
upon an equal-sided armistice for achieving reso- 
lution by nonviolent and noncoercive means. On 
the other hand, if the armistice is unequal and it 
is clear that one side is coercing the other, then the 
agreement, although integrative, has been shaped 
by threat of superior force. 


Integrative instruments 


Integrative arrangements between parties can be 
reached in a number of ways. They can be entered 
into consciously and voluntarily, They can be im- 
posed by one party upon another. They can be 
“grown into,” more or less unconsciously, through 
habit and custom. They can be inherited, and indi- 
viduals can be—and, if we refer to states, normally 
are—born into them. 

If, in the course of a conflict situation, one side 
uses force and the other submits, the assumption 
is that the submitting side has “agreed” and that 
the consequence is an integrative but essentially 


violent and coercive relationship. On the other 
hand, if one party uses force and the other refuses 
to submit but eschews force and turns to some 
method of passive resistance, then there is a highly 
disintegrative double conflict involving both incom- 
patible purposes and incompatible rules of warfare. 
A contract, then, may be essentially violent and 
coercive or nonviolent and voluntary in nature. 

Separate parties with a common enemy can ar- 
range an integrative compact against the outside 
threat. A prolonged conflict with the British crown 
had much to do with the unification of the thirteen 
colonies under the Articles of Confederation, at 
first, and later under the constitution. 

The American Civil War illustrates a somewhat 
different process. Here there was no common 
enemy. The contestants were at war against each 
other, and yet, in the long run, the conflict had a 
unifying consequence. The pre-existing ties be- 
tween North and South, severely challenged prior 
to the war, were slowly and painfully repaired and 
greatly strengthened during the decades following 
the war. Many deep incompatibilities remained, but 
the conflicts were contained below the level of 
major violence. 

Depending on the system level, the means, or 
instrument, of a compact may take the form of a 
marriage or other contract, articles of incorpora- 
tion or confederation, a treaty, an elaborate con- 
stitution, and so forth. The instrument need not be 
written, however, or even explicit: two Indian 
tribes may smoke a pipe of peace; or the slave— 
whether through love, or fear, or constraint—may 
“agree” to the slavery “contract,” honoring the ar- 
rangement by choosing not to escape. In its impli- 
cations, then, an integrative arrangement may take 
any form, from the wholly consensual, to the mini- 
mally consensual; from the nonviolent, noncoercive, 
to the overwhelmingly violent and coercive. 

An integrative instrument can be wrung from a 
conqueror or ruler by a group of his subjects whose 
capability has increased sufficiently. In the later 
Middle Ages, European towns achieved their char- 
ters largely in this way. Such an instrument may 
increase the cohesion of the group at the expense 
of its relationship with the ruler or grantor, or, as 
has happened over the course of history in England, 
the instrument may have the effect of strengthen- 
ing bonds between the parties—the king, the par- 
liament, the church, the people. In either case, 
however, the conclusion of the instrument signals 
a change in the relative capabilities of the parties. 

How is the degree of integration determined? 
Clearly, it is revealed through the kind of function 
controlled by the contract rather than the mere 
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number of functions, the power of the contract 
to implement these kinds of functions, and the 
time span of its effectiveness. 

A given category of integrative means may in- 
clude a single action taken to resolve a conflict at a 
particular time, and also a procedure or set of pro- 
cedures designated for the resolution of issues 
relating to a problem over time. If two parties de- 
cide that a conflict over fish caught in disputed 
waters shall be resolved in a single instance by an 
even division of the catch, this means falls in the 
single-action category. If the parties decide, on the 
other hand, that all fish caught in these same dis- 
puted waters will be divided in this fashion for the 
rest of the season, or over a period of five years, or 
in perpetuity, then the procedural “over-time” cate- 
gory is appropriate. Treaties and similar agree- 
ments over time are designated in this way. 

Two distinct units signing a fisheries treaty have 
not integrated significantly. At the point when they 
agree to merge such functions as defense, taxation, 
legislation, and the enforcement of laws, however, 
the parties have gone a long way toward extinguish- 
ing their separate personalities and giving rise to a 
new state entity. When the populations of the two 
primary components transfer their loyalties to the 
new unit, when they internalize its symbols, when 
those who do not belong are viewed as “foreigners,” 
“outsiders,” and even enemies, then the integration 
may be considered virtually complete (Deutsch 
et al. 1957, pp. 5-6). Herein lies the difference be- 
tween a simple treaty and a federative instrument 
or a constitution. 

Constitutions as integrative instruments. Trea- 
ties respect and preserve the distinct identities of 
the individual parties, whereas a constitution joins 
separate contracting entities into a single unit of a 
new order. (Some so-called constitutions are pro- 
grams rather than contracts. In such cases, there 
is a compact somewhere inherent, but it may be 
tacit rather than explicit, or it may reside in some 
document carrying a different label.) Historically, 
many constitutions have developed from treaties. 

It often happened that a state enlarged its terri- 
tory by conquest of adjoining peoples or acquired 
colonies across seas or at other great distance from 
the homeland. Not infrequently, the populace of 
the central unit enjoyed nationality and rights of 
citizenship, whereas the conquered people were 
forced into a tributary or dependent or colonial 
status. The Roman Empire extended its jurisdic- 
tion over much of Europe and into Africa and Asia 
Minor somewhat in this fashion. This far-flung in- 
tegration “dis-integrated” with the Christian era, 
but a millennium later a number of European 
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nation-states began reaching around the globe in a 
not dissimilar pattern, 

Through a royal charter, the British East India 
Company, like overseers of a private estate, ruled 
over many differing peoples of India. By making 
treaties with native princes, moreover, the company 
extended its own jurisdiction—and in the long run, 
British sovereignty—over large reaches of further 
territory. 

Other European chartered companies—the Dutch 
East India Company, the Hudson’s Bay Company, 
the Plymouth Company, the London Company, and 
others—played similar roles over much of the 
earth. In America the companies concluded solemn 
compacts with the Indians, and in Africa they fol- 
lowed the same pattern with native chiefs. 

The charter of the British East India Company 
became the model for the Massachusetts Bay Col- 
ony charter, from which, in turn, a direct lineage 
can be traced to the constitution of the United 
States. Thus, the same kind of integrative instru- 
ment gave form, in the one instance, to British 
imperial control over India and, in the other in- 
stance, to unification of the thirteen colonies into 
the United States of America. 

Not all colonial powers expanded through writ- 
ten compacts, of course, although the compulsion 
to sign a treaty, even with the most impotent up- 
river chieftain, was surprisingly widespread. In 
any case, the relationships that were established, 
whether unwritten or formalized on paper and 
whether reached by free consent or the imposition 
of superior strength, were essentially contractual, 
or became so by observance, custom, and usage. 
Thus the great empires spread. 

An integrative means category may be no more 
than a rule for resolving a common conflict of the 
road: [We agree that] the first vehicle to enter an 
intersection shall have the right of way. A more 
comprehensive set is likely to combine a number 
of related rules into a code, for example, all rules 
for avoiding or resolving conflicts on the streets 
and highways. A still more comprehensive level 
will be characterized by means categories that 
specify by what methods the various rules are to be 
enforced (powers) and by whom (roles). Such an 
instrument combines within itself a complicated 
means-end hierarchy, a role structure, a pattern 
of communications channels, and a wide range of 
procedures for avoiding, containing, and resolving 
highway conflicts. 

By a particular law, then, we mean a rule of be- 
havior established by recognized authority to pre- 
scribe regulations or govern interactions, transac- 
tions, conflicts, and other human relationships. Any 


law can be viewed as a generalized integrative 
instrument to which the authorities who enact and 
enforce it and the individuals and groups who are 
subject to it are all parties, although they may not 
be equal before it. The function of a given law is 
usually to prevent conflict, to resolve conflict, or to 
constrain conflict within agreed-upon limits. 

A law always presupposes the existence of a 
“superior” compact, agreement, constitution, or 
other integrative relationship from which the law 
derives its authority and to which the law is tribu- 
tary. A treaty bridges two integrated, but more or 
less independent, units. Treaties, indeed, belong to 
the same order of integrative instrument as do 
constitutions, that is, they provide the primary ties 
between parties from which further arrangements 
are derived and to which they are tributary. 

A law and its “superior” constitution or other 
integrative instrument can both be viewed as links 
in a means-end chain. One function of the courts 
is to decide precedence when laws are themselves 
in conflict. It should be clear, then, that the relative 
order of links in a means—end chain—the whole 
problem, indeed, of what is a means, what is an 
end, and which links are superior and which are 
tributary—depends upon the chosen perspective, 
the unit that is chosen to be viewed as the behaving 
and value-invoking system. Inevitably, there will 
be conflicts of interest between actors on different 
levels. 

From the organizational, legal, or state view- 
point, the single law is a means toward supporting 
and maintaining the constitution, which, in turn, 
is a means toward the achievement of security, 
justice, and other goals of the total system. 

The individual citizen, by contrast, may under- 
standably view the constitution as a means toward 
enforcement of the law for the protection of his 
personal interests and the achievement of his indi- 
vidual purposes. This view, an essentially Western 
democratic concept, goes a long way (although not 
in any sense the whole way) toward subordinating 
the state means-end structure to the order of a 
subset in the means-end hierarchy of the indi- 
vidual. In practice, however, the individual, even 
in a democratic state, stands alone if he pushes his 
own interests too far at the expense of the funda- 
mental integration and the society it represents. 

Durability of integrative instruments. Are there 
methods whereby the probable durability of an 
integrative instrument can be assessed? 

A party that is dominant and enjoys uncondi- 
tional viability under an integrative arrangement 
will tend to perpetuate the relationship. On the 
other hand, a conditionally viable party to the con- 


tract will not have much choice other than to 
accede, unless he perceives a possibility for increas- 
ing his own viability. 

Reduced to simplest terms, the durability of a 
given compact or other integrative relationship will 
depend upon two main variables: (1) the relative 
capability of each party, that is, his power or rela- 
tive capacity for inflicting his will; and (2) the 
amount of dissatisfaction evoked, or penalty de- 
manded, by the relationship among the participating 
parties. 

The durability of an integrative relationship will 
depend also upon the precedents, that is, upon 
whether or not previous agreements have worked 
to the satisfaction of the parties and have been 
generally long lasting. 

Against this background, it is possible, by ar- 
ranging the capability and satisfaction—dissatisfac- 
tion (also utility-cost, or reward—penalty) varia- 
bles in different combinations, to suggest a number 
of hypotheses, 

If parties A and B are about equally strong and 
equally dissatisfied with their contract relationship, 
but if both also feel that the additional cost of 
breaking it would be undesirably high, then we may 
postulate that the treaty’s durability will be long 
but its effectiveness will be low. Neither party will 
wish to abrogate the compact, and neither will hesi- 
tate to interpret its provisions broadly or even to 
disregard them when there is not much danger of 
detection or reprisal, The levels of conflict and 
tension within the relationship will be high. 

In another circumstance, A may be powerful and 
consider his relationship with B satisfactory, where- 
as B is deeply dissatisfied (but weak). In this pat- 
tern, we might expect high durability and, in view 
of B’s low capacity, a considerable measure of effec- 
tiveness. A, in short, will be in a position to domi- 
nate B and enforce B’s strict adherence to the 
agreement. The relationship will be characterized 
by numerous petty conflicts, a submerged pattern 
of potential conflicts, and considerable tension. 

Over time, B’s stored dissatisfaction may amount 
to a deep and growing sense of frustration. If, then, 
B’s capability should begin to rise for any reason, 
there is a strong possibility of his striking against A 
even before his strength is greater than or even 
equal to A’s. His perceptions of dissatisfaction will 
be so high, in short, that he will strike for the re- 
ward of ending the relationship—even at con- 
siderable risk. The possibilities of this outcome may 
be enhanced, moreover, because the weak and frus- 
trated party with increasing strength will tend to 
overestimate his ability to punish his offending 
partner in the relationship. 
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On the other hand, if we assume that B’s capa- 
bility will remain low and that his dissatisfaction 
will diminish with time, it seems reasonable to pre- 
dict his gradual acceptance—as custom or law or 
simply as “the way things are”—of the once irri- 
tating relationship. Various institutions for han- 
dling conflict will become increasingly accepted. 
This, indeed, is how time frequently functions on 
the side of the invader by gradually resolving or 
subordinating a wide range of conflicts separating 
him from the vanquished. 

We have postulated that the durability of a given 
compact or other integrative instrument will de- 
pend upon the capabilities of each party to the 
relationship and the amount of satisfaction or dis- 
satisfaction that the various parties derive from it. 
The durability of a complex organization—a city- 
state, nation-state, or empire—will depend upon 
similar variables except that there may be more 
issues involved, and the “parties” to the relationship 
are likely to consist of a ruling elite, on the one 
hand, and a body of subjects, citizens, or rank and 
file on the other. 

The willingness of any participant to contribute 
freely to the state will depend upon the satisfac- 
tions that he perceives accruing to him from the 
association. 

All states use some degree of force in order to 
endow their laws with authority and in order to con- 
trol criminal elements. The difference between a 
state that relies primarily upon consent for its co- 
hesion and management of conflict and one that 
depends upon coercion lies, first of all, in whether 
the force that is used has been agreed upon by the 
majority of the people and is exercised for desig- 
nated purposes and within sanctioned limits; and, 
second, whether dissident minorities within the 
state are afforded certain minimal protection. Un- 
less this protection is available, a majority can 
coerce and tyrannize a minority even though the 
officers of the state are applying force by virtue of 


legal mandate. 
ROBERT C. NORTH 


[Other relevant material may be found under INTERNA- 
TIONAL RELATIONS; PEACE; POLITICAL BEHAVIOR; 


WAR.] 
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Creative Experience. 


m 
SOCIAL ASPECTS 


Social conflict may be defined as a struggle over 
values or claims to status, power, and scarce re- 
sources, in which the aims of the conflicting parties 
are not only to gain the desired values but also to 
neutralize, injure, or eliminate their rivals. Such 
conflicts may take place between individuals, be- 
tween collectivities, or between individuals and 
collectivities. Intergroup as well as intragroup con- 
flicts are perennial features of social life, 

Conflict is an important element of social inter- 
action. Far from being always a “negative” factor 
that “tears apart,” social conflict may contribute in 
many ways to the maintenance of groups and col- 
lectivities as well as to the cementing of interper- 
sonal relations. 

Nineteenth-century sociology paid much atten- 
tion to social conflict. In all social thought derived 
from Hegel, particularly in Marxian thought, con- 
flict is the key explanatory variable. The same is 
the case with social thinkers directly or indirectly 
inspired by social Darwinism, such as Herbert 
Spencer, Gustav Ratzenhofer, Ludwig Gumplowicz, 
and William Graham Sumner. The struggle for 
power and influence is one of the themes of Pareto’s 
theories, as well as those of Mosca, Michels, and 
Sorel. Similarly, in the classical tradition in German 
sociology, from Tönnies to Simmel and Weber, 
conflict was considered a major social phenome- 
non. Weber, for example, insisted that “conflict 
cannot be excluded from social life . . . ‘Peace’ is 
nothing more than a change in the form of the 
conflict or in the antagonists or in the objects of 
the conflict, or finally in the chances of selection” 
([1904-1917] 1949, pp. 26-27). Simmel, to whom 
we owe a classical analysis of various forms of 
conflict, insisted that “conflict is a form of socia- 
tion” and that “a certain amount of discord, inner 
divergence and outer controversy, is organically 
tied up with the very elements that ultimately hold 
the group together” ([1908] 1955, pp. 17-18). 
Similarly, the fathers of American sociology saw 
in conflict an inherent and ineradicable component 
of social structures. Most of them agreed with 
Robert Park that “only where there is conflict is 
behavior conscious and self-conscious; only here 
are the conditions for rational conduct” (1924, 
p. 578). 

In a more recent period, the functions of conflict 
and the study of conflict phenomena were neg- 
lected by American sociologists. If conflict was 
discussed at all, attention was paid mainly to its 


dissociative aspects. The stress on the need for 
common values and harmony led a number of 
social theorists, from Lloyd Warner to Talcott 
Parsons, to consider conflict a kind of sickness of 
the body social. Within the last decade, however, 
a number of theorists opposing the prevailing har- 
mony model have endeavored, partly under the 
influence of Marx and Simmel, to develop a conflict 
model of society. The works of Jessie Bernard 
(1957), Lewis Coser (1956), Ralf Dahrendorf 
(1957), and Max Gluckman (1956) illustrate this 
approach. 


The objective bases of conflict 


The objective bases of social conflict must be 
sharply separated from subjective elements. Failure 
to do so results in excessively psychologistic expla- 
nations, which cannot do justice to the structure of 
conflict or to the situations that give rise to it. 
Such objective bases for contentions vary widely. 
Conflicts may break out over the distribution of a 
great variety of scarce values and goods, such as 
income, status, power, dominion over territory, or 
ecological position. Such occasions for conflict be- 
havior need to be analyzed separately from disposi- 
tions and attitudes such as hostility, aggressive- 
ness, ressentiment, hatred, and the like. In certain 
types of conflicts, such as modern management— 
labor conflicts, the antagonists may harbor only a 
minimum of hostile emotions toward each other. 
Conflicts and hostile sentiments, although often 
associated, are, in fact, different phenomena. 

The distinction between realistic and nonrealistic 
conflict has proved valuable in analysis. Realistic 
conflict arises when men clash in the pursuit of 
claims and the expectation of gain. It is viewed by 
the participants as a means toward the achieve- 
ment of specific goals, a means that might be 
abandoned if other means appear to be more effec- 
tive. On the other hand, nonrealistic conflict, aris- 
ing from aggressive impulses that seek expression 
no matter what the object, allows no functional 
alternative of means, since it is not aimed at the 
attainment of a concrete result but at the expres- 
sion of aggressive impulses. Scapegoating provides 
an example; the object of attack is secondary to 
the dispositional need for attack. Thus, in nonreal- 
istic conflict there are functional alternatives for 
the target, in contrast to realistic conflict in which 
there are functional alternatives to the means used. 
In concrete empirical cases, it is likely, of course, 
that mixtures between the pure types of realistic 
and nonrealistic conflict will be found to be present. 

Hostile attitudes do not necessarily result in con- 
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flict; nor need we expect that objective discrepan- 
cies in power, status, income, and the like will 
necessarily lead to the outbreak of conflict, although 
they can be conceived as potential sources of con- 
flict. Here, as elsewhere, the way men define a 
situation, rather than the objective features of the 
situation, must be the focus of analysis. Potential 
claimants for greater income, status, deference, or 
power may be deterred from conflict because of 
fear of consequences or because they consider 
existing discrepancies in the distribution of valued 
objects to be legitimate. 


The structural impact of conflict 


The impact of conflict on social structures varies 
according to the type of such structures. In loosely 
structured groups and in open, pluralistic societies, 
conflict that aims at a resolution of tension between 
antagonists is likely to have stabilizing functions. 
If the direct expression of rival claims is permitted, 
such conflicts may serve to eliminate the causes 
for dissociation and to re-establish unity. In such 
flexible structures, multiple affiliations of individ- 
uals make them participate in a variety of group 
conflicts so that those who are antagonists in one 
conflict are allies in another. Thus, multiple con- 
flicts, although varying in intensity, are likely to 
crisscross one another and thereby prevent cleav- 
ages along one axis. The pluralism of associations 
in such types of societies leads to a plurality of 
fronts of conflict, and the intensity of any one of 
these conflicts is likely to be relatively low. Seg- 
mental participation in a multiplicity of conflicts 
constitutes a balancing mechanism within the 
structure. In this way, conflicts may be said to sew 
pluralistic society together. 

In rigid social structures and in closed groups, 
on the other hand, the impact of conflict is likely 
to be quite different. The closer the group, the 
more intense are conflicts likely to be, that is, the 
more highly involved the parties. Such groups tend 
to inhibit the open acting out of hostility since they 
fear its disruptive effect, Closed groups tend to 
absorb the total personality of their members; they 
are jealous of members’ affiliation with other groups 
and desire to monopolize their loyalty. The result- 
ant deep involvement of the members and the inti- 
mate association among them is likely to lead to a 
great deal of hostility and ambivalence, a hostility, 
however, to which the group denies legitimate out- 
lets. Hence, if conflicts break out in groups that 
have tried to prevent them, they are likely to be 
peculiarly intense. This is so because, first, the 
personality absorption in such groups tends to favor 
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the mobilization of all psychic energies in the 
conduct of the struggle, and, second, because these 
conflicts now are not likely to remain limited to 
the issues at hand but to revive all those grievances 
that were denied expression previously. All the pre- 
viously latent causes for conflict are now super- 
imposed upon one another. 

A similar situation prevails in large social struc- 
tures that are organized in unitary and rigid pat- 
terns. Here also, conflict, if it occurs at all, is 
likely to be intense. The lack of multiplicity of 
crisscrossing associations and multiple allegiances 
between members is likely to have as a conse- 
quence the superimposition of the various latent 
sources of conflicts. In such structure, a basic divi- 
sion of society into two large hostile classes or 
groups (the situation envisaged by Marx) becomes 
a strong probability, 

While closeness of association and structural 
rigidity tend to lead to high intensity of conflict, 
they do not necessarily lead to a high degree of 
violence of conflict. Violence, as distinct from in- 
tensity, refers to the choice of means for carrying 
out the conflict rather than to the degree of involve- 
ment of the participants. Intensity and violence 
may vary independently of each other. The more 
integrated into the society or group are the parties 
to the conflict, the less likely will the conflict be- 
tween them be violent. The greater the degree of 
integration, the higher the likelihood that the con- 
flicting parties will choose weapons that will not 
permanently menace their common bonds. Violent 
class struggles or class wars are likely to give way 
to less militant means, such as institutionalized 
strikes or regularized contests, in those societies 
that permit the integration of lower classes or 
ethnic and other minorities into the social order. 


Ideology and conflict 


Conflicts are likely to be more intense, and more 
violent as well, to the degree that the contenders 
are collectivity-oriented rather than self-oriented 
and hence consider that their struggle is waged for 
the sake of superindividual ends. Ideological strug- 
gles that transcend the merely individual ones 
allow the participants a “good conscience” in the 
choice of their means of struggle. Hence, individ- 
uals who see themselves acting as representatives 
of a cause, fighting not for self but only for the 
ideals of a collectivity they represent, tend to be 
more radical and merciless than those who fight 
for personal advantage. The ideological end may 
justify the means in the eyes of the participants 
and lead them to consider justifiable, in public 


ideological contention, means that they might re- 
ject in private conflict. 

This order of phenomena highlights the impor- 
tance of intellectuals, that is, the makers and 
shapers of ideologies in society. Intellectuals who 
transform conflicts of interests into conflicts of 
ideas help provide public justification of conflicts 
and hence to make them more intense. Conflicts 
may involve the pursuit of personal interests by 
private individuals or they may arise from the 
pursuit of the interests of various types of collec- 
tivities. Intellectuals, when they function as “ideol- 
ogists,” tend to strip such conflicts of their merely 
personal or merely interested aspects and to trans- 
form them into struggles over eternal truths. They 
thereby deepen and intensify them. Yet, lest the 
social role of intellectuals be overrated, it needs to 
be stressed that they can effectively function as 
“ideological agents” only in structures that favor 
the growth of ideologies. In pluralistic societies, 
crisscrossed by manifold conflicts upon a variety 
of axes, the role of ideologists is likely to be much 
less pronounced and their influence considerably 
reduced; they are likely to assume more important 
roles in those structures in which the superimposi- 
tion of conflicts upon one axis favors the emer- 
gence of unified ideological fronts. 

When manifold conflicts are superimposed upon 
each other, many and variegated interests can be 
fused by the adherence to a common ideology. 


Conflict, consensus, and social change 


The distinction between conflicts that exceed 
the limits imposed by societal consensus and those 
taking place within the basic consensus has in- 
formed political thought ever since Aristotle. Con- 
flicts that do not attack the basic consensus and 
are in effect waged upon the very ground of con- 
sensus are likely to lead to adjustments between 
the various parties and hence to contribute to a 
closer integration of the society. In contrast, con- 
flicts that attack basic assumptions of collective 
existence dissociate and split the society into war- 
ring camps. 

Loosely structured groups and open pluralistic 
Societies, by allowing conflicts to be fought out 
among a variety of contenders and on a variety of 
fronts, institute safeguards against types of con- 
flict that might endanger basic consensus. They 
minimize the danger of divergences touching upon 
central consensual values. However, in rigid groups 
or societies the chances are high that if conflict 
occurs despite the effort to repress it, it will reach 
down to the basic layers of consensus, For exam- 


ple, if major strata of a society's population are 
permanently excluded from participation in the 
society's benefits, they will tend to reject the very 
assumptions upon which the society is built, and, 
if the systems of legitimation no longer fully oper- 
ate, they will attempt to attack the social order 
through revolutionary violence. On the other hand, 
where no stratum of the population considers itself 
totally excluded from society's benefits, although 
all may still engage in multifarious struggles to 
increase their respective shares of income, wealth, 
power, or prestige, conflicts will tend to be waged 
within the limits of a consensus. 

Each society contains some elements of strain 
and potential conflict. Analysis of social change 
needs to focus attention on these elements, since 
they provide the dynamics of change. Elements 
that evade and resist the patterned structure of 
norms and the habitual balance of power and inter- 
ests may be considered harbingers of the emer- 
gence of new and alternative patterns emerging 
from an existing structure. Conflict prevents the 
ossification of social systems by exerting pressures 
for innovation and creativity; it prevents habitual 
accommodations from freezing into rigid molds 
and hence progressively impoverishing the ability 
to react creatively to novel circumstances. The clash 
of values and interests, the tension between what 
is and what some groups or individuals feel ought 
to be, the conflict between vested interest groups 
and new strata demanding their share of wealth, 
power, and status are all productive of social 
Vitality. 

Social change can be analyzed only in relation 
to specified structures. This is why it is necessary 
to distinguish between changes that take place 
within particular structures and changes that lead 
to the decay of old structures and the emergence 
of new ones. We may talk of a change of a social 
system as distinct from changes within a social 
system when all major structural relations, basic 
institutions, and prevailing value systems have 
been drastically altered. This theoretical distinc- 
tion needs to be made, even though it is recognized 
that the concrete changes of a system may be the 
consequence of previous changes within the system. 

Whether conflicts within a society will lead to 
adjustments between existing institutions or to 
systemic changes of the institutions depends on 
the degree of flexibility of the social structure. Flex- 
ible systems allow progressive rearrangements in 
their structures in tune with the outcome of various 
types of group conflict within them. Rigid societies, 
refusing to make such adjustments and allowing 
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the accumulation of unresolved latent conflicts, are 
likely to maximize the chances of violent outbreaks 
attacking the consensual structure and leading to 
changes of social systems. 


Ingroup and outgroup 

Sumner’s dictum that the distinction between 
the ingroup (ourselves) and the outgroup (every- 
body else) is established in, and through, conflict 
has found general acceptance. One of the prime 
mechanisms for the strengthening of group bonds 
and for the emergence of new groups has always 
been the creation or strengthening among group 
members of a sense of common values, interests, 
and purposes, all leading to a mobilization of the 
group's energies against outsiders. The distinction 
between “us” and “them,” perhaps the most funda- 
mental social distinction, is established and periodi- 
cally reaffirmed in social conflict between insiders 
and outsiders, between friends and enemies, In 
homogeneous societies, in which individuals partici- 
pate in only small numbers of social circles, this 
distinction is likely to be so encompassing that it 
allows only minimal relations between the mem- 
bers of all but a few social circles. However, in 
heterogeneous societies, that is, in societies in 
which individuals are likely to participate in a 
great number of social circles in which there will 
be high degrees of overlapping memberships in a 
variety of associations and groupings, this sense of 
exclusiveness will be successfully minimized. In 
such societies, with their varieties of functionally 
specific and nongeneralized conflict, men who will 
be friends in one relationship might well be ene- 
mies in another; conflicts with some lead to new 
alliances with others. If everyone is somebody's ally 
in some respects and his opponent in many others, 
such conflicts will draw all into a multifarious and 
varied group life in which the very crisscrossing 
of allegiances among men segmentally involved 
on a variety of fronts prevents exclusiveness and 
withdrawal. 


Peace and feuding, conflict and order, are cor- 
relative. Both the cementing and the breaking of 
the cake of custom constitute part of the dialectic 
of social life. One is hence ill-advised to distinguish 
sharply a sociology of order from a sociology of 
conflict, or a harmony model of society from a con- 
flict model. Such attempts can only result in arti- 
ficial distinctions. The analysis of social conflicts 
brings to awareness aspects of social reality that 
may be obscured if analytical attention focuses too 
exclusively on phenomena of social order; but an 
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exclusive attention to conflict phenomena may ob- 
scure the central importance of social order and 
needs to be corrected by a correlative concern with 
the ordered aspects of social life. We deal here not 
with distinct realities but only with differing aspects 
of the same reality, so that exclusive emphasis on 
one or the other is likely to lead the analyst astray. 
Perhaps we need return now to Charles Horton 
Cooley’s statement: “The more one thinks of it the 
more he will see that conflict and co-operation are 
not separable things, but phases of one process 
which always involves something of both” (1918, 
p. 39). 

Lewis A. COSER 


[Relevant material may be found under PEacE; So- 
CIOLOGY, article on THE DEVELOPMENT OF SOCIO- 
LOGICAL THOUGHT. ] 
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Iv 
ANTHROPOLOGICAL ASPECTS 


Conflict results from competition between at 
least two parties, A party may be a person, a family, 
a lineage, or a whole community; or it may be a 
class of ideas, a political organization, a tribe, or a 
religion. Conflict is occasioned by incompatible 
desires or aims and by its duration may be distin- 
guished from strife or angry disputes arising from 
momentary aggravations. It is safe to assume that 
some of the causes of conflict are to be found in 
the aggressive behavior that is almost universal 
among the vertebrates and is presumably adaptive 
in a broad range of environments. The function of 
much conflict appears to be the control of food and 
reproduction through the control of territory and 
the maintenance of well-organized dominance hier- 
archies that serve to reduce the amount of fighting 
in a group (Ciba Foundation 1966). 

Field studies of nonhuman primates reveal that 
relations between different species of mammals are 
usually peaceful or neutral; between groups of the 
same species, reactions range from avoidance to 
agonistic display and even to violence, Conflict 
between individuals within the local group occurs 
far more frequently than intergroup or interspecies 
conflict. One kind of behavior that appears to be 
limited to primates is the use of objects in agonistic 
display. This use of objects has not yet been ob- 
served in other animal orders. 

The evolutionary frame of reference of most 
primate studies leads to consideration of conflict 
and aggression as part of the species-specific pat- 
tern of adaptive behavior. Since this aggression is 
the result of the long-term evolution of the species, 
it is functionally adaptive in the present only if the 
conditions of the present are the same as those 
under which the behavior evolved. Today man’s 
way of life is radically different from that of the 
past. Yet we still inherit the biology of aggression 
that was adaptive in times past. In nonhuman 
primates, evolution has progressed slowly enough 
to maintain the balance between species-specific 
biological behavior patterns and social behavior. 

We cannot ignore man’s biological heritage as a 
framework within which culture is elaborated and 
developed. We must not, on the other hand, ignore 
the fact that man did develop culture and the 
ability to manipulate symbols. As a result, the evo- 
lution of human behavior needs to be explained by 


the processes of cultural selection as well as of 
biological selection. The study of conflict may be 
further clarified if conflict is not equated with ag- 
gressive behavior, since it is not a type of behavior 
but, rather, a situation resulting from incompatible 
interests or values. The anthropologist studies con- 
flict as a multidimensional social process that oper- 
ates in many different contexts and results in a 
variety of consequences. The simplest as well as the 
most elaborate display of conflict behavior is de- 
fined by its function, by the part it plays within a 
system of human activities, and by the ideas and 
values attached to it. 

Although theoretical attention to social and cul- 
tural conflict is relatively new in anthropology, 
ethnographers have long been recording instances 
of conflict occurring under a variety of guises. Such 
diverse phenomena as witchcraft practices, feuds, 
factionalism, warfare, competitive games, contra- 
dictory values, and discord between spouses have 
been viewed as conflict or as the potential means 
for displacing conflict from one level of social 
grouping to another. Anthropological studies of 
feuds, warfare, and witchcraft have frequently 
concerned themselves with institutionalized or so- 
cially regulated conflict involving groups. Conflict 
not formalized by institutional means has been 
described by ethnographers as part of the case 
material of primitive legal systems or in terms of 
acculturation pressures. 

The major theories of conflict, primarily influ- 
enced by the structural—functional theory of social 
organization and only indirectly by the psychoana- 
lytic theory of personality, have assumed conflict 
to be a valid generic term for such diverse phe- 
nomena as those mentioned above; this is in part 
because all these conflict situations involve opposi- 
tion over scarce goods and resources, either in a 
literal or in a figurative sense. 

Some recent theoretical studies emphasize the 
associative aspects of conflict; others approach 
conflict from the viewpoint of contradictions in 
systems of values. Anthropologists have also at- 
tempted to use theories of frustration—aggression 
displacement in studies of intergroup conflict and 
have applied psychoanalytic concepts such as iden- 
tification and status envy in interpreting frequency 
of intragroup aggression. 

The structural—functional approach to the study 
of conflict or aggression emphasizes the social 
structural sources of conflict, the functions of con- 
flict at various structural levels, the mechanisms 
for resolution or control of conflict, and the various 
behavioral means for communicating conflict. 

While anthropologists insist that conflict is in 
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great part a cultural product, they are also cogni- 
zant of the biological and social aspects of conflict 
among nonhuman primates. Although there may 
be discontinuities in the phenomena under obser- 
vation as they pertain to nonhuman species and 
as they pertain to human beings, field studies of 
primates may be useful in constructing an evolu- 
tionary view of social and cultural conflict that will 
help the anthropologist to determine the universal 
aspects of human conflict. 


The functions of conflict 


Equilibrium model. Confusion of the concepts 
of equilibrium, integration, and stability has led to 
ambiguity in the study of conflict. Conflict is dis- 
cord; its opposite is harmony—which for many 
implies integration. Integration and conflict have 
often been discussed_as opposites, and indeed con- 
flict has been equated with anomie—deviant, ab- 
normal behavior which impedes the successful 
integration of society. Integration and related terms 
frequently have been used with strong value con- 
notations, especially when changes resulting from 
acculturative pressures are described. Cultural in- 
tegration refers to coherence among cultural ele- 
ments and some degree of integration is, of course, 
characteristic of all societies. The absence of con- 
flict and the presence of cooperation and coordi- 
nated action are sometimes used as indexes of 
integration or societal stability. Conflict is more 
readily observable than is integration. As a result, 
much of anthropological discussion of integration 
or stability is implicit rather than explicit, as is 
illustrated both by the configurational view, which 
employs a psychological approach to cultural inte- 
gration, and by the functional view, which attempts 
to study the interrelations of various institutions 
of a culture. The configurationalists are particu- 
larly interested in discovering the themes or values 
that hold a culture together and distinguish it 
from others, and they refer to consistent, harmoni- 
ous, balanced cultures as “genuine” rather than 
“spurious.” There has been a marked trend, how- 
ever, to consider conflict and change as inevitable 
and essential aspects of the social process; this has 
led some to criticize the equilibrium model char- 
acterized by the functionalists, referred to by 
Dahrendorf as utopian. The functionalists’ stress 
on common values and equilibrium may have 
stemmed from a concern with normative behavior 
and the study of those societies not undergoing 
rapid change. 

In recognizing conflict as an inevitable conse- 
quence of the operation of a social system, several 
anthropologists have raised an interesting question: 
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Given the proposition that conflicts are both dis- 
ruptive and inevitable, how can a social system 
persist as a going concern? Gluckman (1956) 
takes the position that conflict need not disrupt a 
social system, that, indeed, it may contribute to- 
ward the maintenance of society. He points out 
that struggles between the Zulu princes of South 
Africa for the throne occur within a continuing 
social system. In this example the society directs 
and controls the quarrels through conflicts of alle- 
giance so that, despite rebellion among the Zulu, 
for example, the same social system is re-estab- 
lished. Multiplicity of conflicts within a social sys- 
tem creates a division of society into a series of 
opposed groups with cross-cutting membership. 
The degree to which conflict is regulated can vary, 
but the dysfunctional aspects of conflict always 
tend to be minimized by the.channeling of behav- 
ior along controllable lines. In a Welsh village, con- 
flicts within the social system may be institutional- 
ized in the activities of its football clubs, carnival 
associations, and brass bands, whereas among the 
Zulu and Swazi of South Africa the fundamental 
conflicts of the social system are acted out in the 
rituals of rebellion, Social cohesion, then, is found 
in the conflicts between differing allegiances or in 
the displacement of conflict onto institutionalized 
forms of entertainment. 

In certain African stateless societies organized 
on the lineage principle, stability was maintained 
by virtue of segmental opposition; equilibrium was 
achieved by a balanced opposition between terri- 
torial segments of the same order. Ethnographic 
study of the Iroquois and the Zuni in America and 
of the Tonga and the Nuer in Africa has led to 
an alternative suggestion, namely, that stability is 
the indirect result of the presence within each local 
group of people of a diverse set of ties cross-linking 
them with other groups. 

The stress-strain theory. While still dealing 
with intragroup dissension, the stress-strain the- 
ory contrasts with the previous theories in holding 
that group solidarity is reduced under certain con- 
ditions of external stress, and, in fact, is maladap- 
tive. Factionalism is likely to appear when there 
are strains within the social structure and when 
certain kinds of external stresses occur. Navajo 
witchcraft is described by Kluckhohn (1944), who 
Suggests that belief in and practice of witchcraft 
permit the expression of direct and displaced ag- 
gression in situations of stress which, Prior to the 
presence of U.S. authority, might have been ex- 
pressed in warfare. Acculturation pressures, domi- 
nation by an alien social system that overrides 
traditional patterns of authority, and a combination 


of environmental and economic threats accentu- 
ated by the impact of a dominant culture may also 
strain weak points in the social structure. Siegel 
and Beals (1960) attempt to isolate types of con- 
flict, each having its own particular explanation. 
Pervasive factionalism, which occurs between un- 
organized and transient groupings, is distinguished 
from schismatic conflicts, which lead to the disso- 
lution of subgroups of the larger society. Institu- 
tionalized conflicts are permanently or periodically 
resolved; examples of this are found in political 
party behavior, sport teams, dual organizations, 
and intermarrying kin groups. Relevant to the ques- 
tion of how conflict may produce instability are 
observations made among the Soga of west Africa. 
Fallers (1957) describes the strain and instability 
produced by a society structured both by corporate 
lineages and by the state; in this example there is 
a conflict of roles that results when incompatible 
institutions apply to the same persons. Conflicts 
which cannot be resolved within the existing social 
system eventually produce changes in the social 
structure. 

The literature allows us to distinguish more gen- 
erally between “elastic” and “rigid” social systems. 
An elastic social system tolerates open and direct 
expression of conflict and allows for organizational 
adjustment; within an elastic system there will be 
institutionalized arrangements for the expression 
of conflict, such as duels, feuds, joking relations, 
and witchcraft. In such societies conflict may serve 
to stabilize and integrate. Rigid societies have a 
tendency to suppress conflict, since they have less 
tolerance for it. In rigid societies, the acting out of 
pent-up conflict may cause changes in the social 
structure. The degree of tolerance of conflict varies 
and quite clearly affects the function of conflict. 


The occurrence of conflict 


Conflict occurs in all human societies but varies 
in degree and form of expression. In some societies 
verbal rather than physical aggression is more fre- 
quent, while in other societies more passive forms 
of expression may predominate. Some peoples in- 
hibit aggression within the local community, only 
to wage war with surrounding groups. Some social 
anthropologists have suggested that environmental 
factors, such as ecology, demography, and econ- 
omy, affect cross-cultural variations in social con- 
flict. For example, warfare may be related to the 
level of economic development and population den- 
sity. Archeological evidence indicates that warfare 
was unknown in Europe and the Orient until the 
late Neolithic period. Organized warfare was un- 
known in aboriginal Australia. Groups like the 


Pueblo Indians of the southwest United States 
rarely engaged in offensive warfare. It appears that 
competition between groups seeking to exploit the 
same territory or resource leads to conflict. 

Demographic factors. Density of population, 
which seems to be associated with greater individ- 
ual restraint and inhibition, may be a factor in the 
elaboration of verbal aggression or witchcraft prac- 
tices. For example, LeVine (1961) suggests that 
among the Gusii of Africa, proximity of co-wives 
is responsible for the great number of witchcraft 
accusations. Observations and experiments on non- 
human animals support the conclusion that density 
of population seems to increase in-group aggres- 
sion, which takes such forms as homicide and 
assault. Human beings sometimes invent institu- 
tions that provide a sort of safety valve whereby 
aggression generated in the in-group may be 
drained off internally instead of being translated 
into out-group hostility and warfare. Thus it is 
possible that some form of dual organization (such 
as the widespread phenomenon of twofold factional 
divisions) may be an instance of a socially invented 
safety valve. Greater population density may also 
call for more centralized and elaborated adjudica- 
tion procedures which, in theory at least, may curb 
the degree of violent conflict. However, although 
environmental factors may have specific influence, 
they cannot provide a general explanation of inter- 
group or intragroup variations. Variation is not 
explained by any general theory of conflict. Rather, 
anthropologists are exploring a number of hypoth- 
eses that rest upon systemic factors focusing on 
one of several structural levels—intrafamily, intra- 
community, intercommunity. 

Conflict within the family. Gluckman (1956), 
in his attempt to relate estrangement in the family 
to the type of extended kinship system, postulated 
that the structure or type of descent system is a 
major determinant of the frequency of divorce. He 
suggests that divorce is rare among societies with 
extreme patriliny and frequent in matrilineal soci- 
eties; bilateral societies would be expected to fall 
between the highest and the lowest rate. He dis- 
tinguishes a fundamental conflict between rights 
centered in a woman as a wife and rights centered 
in her as a child-bearer, arguing that it is the 
rights in woman as child-bearer which establish 
firm husband-wife relations. In a matrilineal soci- 
ety these two rights are separated, for the wife's 
descent group maintains rights in the woman as 
child-bearer, Where the woman bears children for 
her own blood kin, her wifely bond is weak and 
divorce is frequent. In patrilineal societies, how- 
ever, the woman’s kin transfer to the husband their 
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rights in her both as wife and as child-bearer. 
Where the woman bears children for her husband’s 
group, her wifely bond is strong. Fallers (1957) 
later observed that the Soga of east Africa were 
patrilineal in descent but not characterized by 
stable marriage, as Gluckman would have pre- 
dicted. This led Fallers to reformulate the original 
hypothesis, pointing out that the crucial variable 
in patrilineal systems is the degree to which the 
woman is socially absorbed into her husband's 
patrilineage. Where a woman is not absorbed into 
her husband's lineage, patriliny tends to divide 
marriages by dividing the loyalties of the spouses. 

While the single major factor, descent, and its 
concomitant features may be useful in explaining 
marital conflict when comparing societies as 
wholes, it does not predict what correlates are rele- 
vant to the relative incidence of marital conflict 
within a society. Political and economic relations 
established by marriage may be powerful determi- 
nants, and Cohen (1961), in his description of 
variations in marital conflict within a single society, 
the Kanuri of northeastern Nigeria, suggests that 
conflicting allegiances may be only a part of the 
picture. He argues that economic relations between 
husband and wife maintain or disrupt marriage 
relations. Dependency of men on female services 
seems to undermine the expected husband—wife 
authority relation, and where men among the 
Kanuri are dependent on women, the divorce rate 
is high. 

Intergroup conflict. The analysis of conflicting 
loyalties has also been used in explaining inter- 
group conflict. Both Colson (1953) and Gluckman 
(1956) argue that in stateless segmentary socie- 
ties, peace may be achieved through divided loyal- 
ties. Gluckman notes that among the Nuer of east 
Africa the vengeance group is sufficiently scattered 
so that when an intergroup killing occurs, retalia- 
tion may force a person into a position of having 
to fight with his neighbors against his kinsmen or 
of being treated as an enemy by his neighbors. In 
this untenable situation an attempt may be made 
to mitigate conflict through compensation rather 
than blood vengeance. Evans-Pritchard’s earlier 
analysis of the Nuer (1940) emphasized the bal- 
ance of power through segmental opposition, rather 
than cross-cutting ties, as the chief deterrent to 
intergroup violence. 

Murphy (1957) explored the problem of cross- 
cutting group loyalties and their effect on inter- 
group hostility and intragroup cohesiveness among 
the Mundurucu of the Brazilian rain forest. It has 
often been noted by social scientists that severe 
inhibition of aggression within a society encourages 
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the channeling of aggression through external out- 
lets such as warfare; it has also been noted that a 
threat coming from outside the group causes inter- 
nal solidarity. In the case of the Mundurucu, war- 
fare was not inspired by desire for material gain 
but, rather, served to release tensions built up by 
a society rigid in its inhibition of violence. There 
was a contradiction between the norms of behavior 
applicable to members of the in-group on the one 
hand and to members of the out-group on the other. 
Murphy links the form of residence with out-group 
aggression: matrilocal societies, characterized by 
multiple and conflicting loyalties of males sepa- 
rated at marriage from consanguineal kinsmen, 
must repress overt aggression in order to ensure 
cohesion. Hostilities that are built into the structure 
of such societies must be expressed by displaced 
aggression or unrealistic conflict if the societies are 
to continue to exist. The assumption is that there 
is aggression which must be repressed and that it 
is likely to find an outlet by displacement onto an 
outside group. Murphy believes that such societies 
are less likely to exhibit internecine violence than 
are patrilocal societies, in which men tied through 
descent continue living together after marriage. 

Thoden van Velzen and van Wetering (1960) 
tested the hypothesis associating matrilocality with 
the absence of in-group violence by comparing 50 
unstratified societies divided between matrilocal 
and patrilocal. They confirmed the correlation of 
matrilocality with intrasocietal peace, but their ex- 
planation is a significant departure from that of 
conflicting loyalties. They see most matrilocal so- 
cieties as lacking power groups—that is, groups 
which resort to aggression when the interests of 
one of the members are threatened. According to 
their view, aggression develops as a result of patri- 
local structures; it does not develop in matrilocal so- 
cieties, so that there is no need for them to repress 
aggression, as Murphy suggests. These approaches 
are not concerned with psychoanalytic assumptions 
concerning intersocietal aggression or with explain- 
ing warfare; they do attempt to explain why some 
stateless, unstratified societies are more peaceful 
than others. 

Recent studies which combine demographic with 
psychological explanations suggest that there are 
a type of culture and a type of social structure 
which produce an adolescent and adult personality 
predisposed to behave at times in a violent and 
aggressive fashion. Beatrice Whiting ( 1965) in- 
vestigated the frequency of assault and homicide 
in six societies. She considers their higher fre- 
quencies in two of the six societies in the light of 


the concept of “protest masculinity,” the status 
envy hypothesis of identification, and the preva- 
lence of female-based households. 

Briefly, the status envy theory states that an in- 
dividual identifies with the person who seems most 
important to him, the person who is perceived as 
controlling those resources which he wants. If dur- 
ing the first two or three years of life a child is 
constantly with his mother and sees and is handled 
by his father infrequently, he will identify strongly 
with his mother, not with his father. In short, if he 
is a boy, he will have cross-sex identification. If, 
later in life, he is involved in a world obviously 
dominated by men, he will be thrown into conflict 
and will develop a strong need to reject his earlier 
underlying female identification. This identity con- 
flict may lead to an overdetermined attempt to 
prove his masculinity, manifested by a preoccupa- 
tion with physical strength and athletic prowess, 
or attempts to demonstrate daring and valor, or 
behavior which is violent and aggressive, i.e., pro- 
test masculinity. 


Conflict resolution and prevention 


An understanding of the ways in which conflict 
is resolved is particularly relevant to an under- 
standing of the functions of conflict and its ex- 
pression. LeVine (1961) distinguishes five forms 
of conflict-indicating behavior: physical aggres- 
sion, public verbal dispute, covert verbal aggres- 
sion, breach of expectation, and avoidance. All of 
these forms may be found within the same culture: 
for example, Zapotec women rarely make use of 
physical aggression; Zapotec men rarely employ 
forms of verbal aggression. It is important to note, 
however, that those behaviors which proceed from 
and are indicative of conflict may also operate to 
resolve the conflict. If avoidance or physical ag- 
gression does not successfully resolve a conflict, 
the use of a third party to achieve settlement by 
arbitration, mediation, compromise, or adjudica- 
tion is likely. Certain institutionalized forms of 
resolution, such as councils, courts, go-betweens, 
or “crossers,” perform these functions. Anthropolo- 
gists have made significant note of the fact that 
the resolution and control of conflict need not 
necessarily be identified with specialized political 
offices. There are viable, stable societies which lack 
central government and specialized political roles 
but which nonetheless have available other means 
of resolving and regulating conflict. In such state- 
less societies a variety of institutions and person- 
nel, such as diviner and shaman, may function as 
agents of conflict resolution. Among the Bedouin, 


institutions such as the feud function to punish 
serious transgressions and act as a brake to pre- 
vent aggression, for feuds are dreaded. Among the 
Dobu, sorcery is a socialized ritual that operates 
as the medium for a nonviolent adjustment of op- 
posing interests. The style of conflict resolution 
derives from a society's structural principles of 
human association. 

The conditions which define the presence and 
use of specific controlling procedures are various. 
Greater density of population and the dissolution 
of family authority and power that accompanies 
the development of a centralized state system may 
strengthen adjudication procedures in place of 
mediation or arbitration. In general, the procedural 
aspects of legal organization are more developed 
as political organization becomes more specialized, 
but other factors, such as the allocation of au- 
thority, are also relevant. For example, for the reso- 
lution of marital conflict, the differential use of the 
court systems available to each of two Mexican 
villages is closely correlated with the degree of 
authority allocated respectively to the family and 
to the court systems (Nader & Metzger 1963). In- 
deed, the development of courts or councils may 
be inhibited by the dual organization structure of 
a community. 


A more comprehensive approach to the occur- 
rence, function, control, and resolution of conflict 
may be achieved by examining the life cycles of 
particular conflicts. The processes through which 
a conflict may pass may be found to be inherent 
in the type of conflict, Various mechanisms are 
employed within the same society to heal breaches 
of peace. These actions may range from informal 
arbitration to formal legal machinery or to the per- 
formance of public ritual. Mystical beliefs and 
ritual action, rather than judicial machinery, are 
particularly effective in dealing with disturbances 
arising from processes inherent in the life cycle 
of a group. 

Laura NADER 


[See also FEUD; INTEGRATION; POLITICAL ANTHROPOL- 
OGY; SOCIAL STRUCTURE; WAR. Directly related are 
the entries MYTH AND SYMBOL; SOCIAL BEHAVIOR, 
ANIMAL; SOCIAL CONTROL.] 
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CONFLICT OF INTEREST 


How can there be assurance that a government 
official, businessman, or professional person will 
properly perform his duty to the public, to his em- 
ployer, or to his client when that duty affects his 
own private economic interests? In its narrowest 
sense, this is the key question of conflict of interest. 
The question rapidly broadens when the range of 
primary interests attached to duty are considered 
in relation to the dazzling variety of property and 
other interests potentially in conflict. In modern 
industrial society, where services abound, where 
government and business intersect in countless 
ways, the nature of conflicting interests is most 
complex. Government employees retain private eco- 
nomic interests that may be benefited by official 
actions. Officers of one corporation may own stock 
of another, whose value may be affected by actions 
of the first. 

An ethical crisis in the public life of the United 
States after World War 11 was expressed in concern 
over conflicts of interest for upper-echelon civil 
servants. The matter was isolated and widely rec- 
ognized as a moral and legal issue about 1950, and 
has been pondered and studied ever since. Conflict- 
of-interest legislation and regulations eventually 
were adopted for the United States government and 
a number of states and municipalities and by 
governments of other nations. Every sign suggests 
that this movement of concern and control will 
become more widespread in the world and will 
apply beyond the executive branch of government. 

Since its control involves preventive law, conflict 
of interest must be distinguished from other kinds 
of criminal behavior, such as theft or bribery. A 
government official may be found guilty of bribery 
if he agrees to act in response to money or favors. 
The first general federal bribery statute in the 
United States, dating from 1853, declares that a 
payment to influence official action is a clear abuse 
and declares it a criminal act. By contrast, the 
term “conflict of interest” implies the need for pre- 
ventive action to mark the kinds of situations indi- 
viduals should avoid. “Regulation of conflicts of 


interest seeks to prevent situations of temptations 
from arising” (Association of the Bar . . . 1960, 
pp. 3-4). It is regulation of potential harm, of 
evil before it occurs. The added subtlety of pre- 
ventive law gives the control of conflicts of interest 
unusual delicacy. 

In the nineteenth century, if an official resolved 
a clash between his personal advantage and public 
duty in favor of personal advantage, it was called 
corruption. But since then “corruption” has come to 
be associated with only the most odious and obvi- 
ous forms of venality and self-service, and the term 
“conflict of interests” was coined to cover less 
blatant cases. The leading study on the subject 
limited itself to two interests: “, . . one is the inter- 
est of the government official (and of the public) 
in the proper administration of his office; the other 
is the official’s interest in his private economic 
affairs. A conflict of interest exists whenever these 
two interests clash, or appear to clash” (Associ- 
ation of the Bar . . . 1960, p. 3). The present 
article, accepting this usage, will review the signifi- 
cant recent developments in the framing of a 
system of relevant, up-to-date restraints against 
conflicts of interest in federal service in the United 
States. These limitations are being widely copied 
or adapted in other countries and by state and 
local governments in the United States. After sur- 
veying conflicts of interests in government service, 
this article will touch on other arenas in which 
such problems are now identified. 

The legal profession, far more than any other 
professional group or social-science discipline, has 
scrutinized the problem of government officials’ 
conflicts of interest. Leaders of the bar, as well as 
their clients in business and in other professions, 
have long counted public service as part of a viable 
career. Conscientious effort to disentangle public 
duty from private opportunity could not always 
shield such men from criticism. After World War 11, 
American law on the subject of conflict of interest 
was so antiquated that little protection was af- 
forded either the public or the large numbers of 
people entering and leaving public service, and 
many individuals were discouraged from taking 
jobs in the federal government. This was the chief 
impetus that led the New York City Bar Association 
to form, in 1956, a special committee, which has 
developed a corpus of knowledge on the subject 
(Association of the Bar . . . 1960, pp. vii—xii; 
Perkins 1963, p. 1114; Manning 1964). 

Several other factors have been behind a trend 
toward deepening attention to conflicts of interest 
in the public service: (1) the growing reliance of 

* government on the skills of individuals in private 


life, notably in the fields of science and technology; 
(2) the mobility of individuals in industry, busi- 
ness, and the professions, in and out of government 
employment; (3) the intricacy and extent of gov- 
ernment regulatory, promotional, and financially 
supportive programs affecting private sectors of 
the economy; (4) the expansion in peacetime of 
government as a customer of business; (5) grow- 
ing stock ownership among individuals, who have 
a resulting stake in the private economy (Perkins 
1963, p. 1114); (6) the emergence of government 
during the past decade as a major source of wealth. 
“The Wealth of more and more Americans depends 
upon a relationship to government. Increasingly, 
Americans live on government largess . . .” (Reich 
1964, p. 733). These accelerating changes have 
multiplied the number of potential conflicts of 
interests. The newness of these situations has led 
to subtleties difficult to understand, explain, or 
curb, 

Legal analysis has succeeded in isolating some 
of the peculiar problems of conflict of interest in 
modern society. Some relevant skills in coping with 
the issue were noted by the Harvard Law Review 
in this way: “. . . behavior profitable but in itself 
innocent may be outlawed because it tempts abuse 
of power or allows misuse of information and ad- 
vantages gained through government service. The 
resulting tension taxes the lawyer's faculty for in- 
ference drawing, for weighing competing interests, 
and for ingenuity in framing legal solutions” (“With 
the Editors . . .” 1963, p. vii). Lawyers have an- 
alyzed the aims of the conflict-of-interest statutes 
of the past and willingly specified the ethical norms 
upon which new rules of law should be based. Five 
principles have been established as fundamental 
to prudent conflict-of-interest regulations at the 
federal level in the United States (Perkins 1963, 
pp. 1118-1122). 

The first moral imperative to be protected by 
conflict-of-interest rules is that against self-dealing. 
A public official may not participate in government 
action where that specific action might significantly 
affect his private economic interests. 

A second principle, less distinct than the con- 
cept of self-dealing, stops a public official from 
accepting transfers of economic value from private 
sources, It is believed that gifts should be barred, 
even though bribery is absent, on the assumption 
that a danger of subservience remains present. A 
government official acting in his official capacity 
must be free of the relationship created when an 
individual accepts gifts and other valuable trans- 
fers from private sources (Perkins 1963, pp. 1119- 
1120), 
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Third, a public official should not ordinarily be 
allowed to drop his official role to help private in- 
dividuals or organizations in their dealings with 
the government. This is an especially forceful 
conflict-of-interest principle when compensation is 
received for the assistance. 

Fourth, the same principle also applies to a 
former public official in connection with his past 
official responsibilities. This principle is usually 
cushioned by a limitation of time, although objec- 
tion has been made that any definite limit, say 
two years, is both artificial and ineffective. Never- 
theless, the principle remains that a former official 
should not be able to trade against the government 
with influence gained from past association or 
friendship in his official capacity (Perkins 1963, 
pp. 1120-1121). 

The fifth conflict-of-interest principle holds that 
a public official should not be allowed to use for 
private gain confidential information acquired in 
his official capacity. There may be no specific in- 
jury to the government, but it is argued that a 
wrong occurs in the private use of information that 
belongs to the public (Perkins 1963, pp. 1121- 
1122). 

In 1962, following several years of study, educa- 
tion, and agitation, the federal code and regulations 
governing conflicts of interest were overhauled to 
express these principles better. The new act applies 
to all officers and employees “in the executive, legis- 
lative, or judicial branch of the Government, or in 
any agency of the United States” (Public Law 87- 
849, 76 Stat. 1121). Conflict-of-interest principles 
are applied to all forms of modern decisions: 
“ , . any proceeding, application, request for a 
ruling or other determination, contract, claim, con- 
troversy, charge, accusation, arrest, or other par- 
ticular matter in which the United States is a party 
or has a direct and substantial interest.” The offi- 
cial is culpable if he “participates personally and 
substantially” in any of these transactions “through 
decision, approval, disapproval, recommendation, 
the rendering of advice, investigation, or other- 
wise.” The 1962 act thus embodies a creative effort 
to make the chief conflict-of-interest principles ap- 
plicable to today’s circumstances. 

Criminal sanctions imposed by statute play a 
limited role in protecting conflict-of-interest prin- 
ciples. Bayless Manning has named many other 
means of conflict-of-interest regulation in the fed- 
eral government: 


The Senate, in its conduct of confirmation proceed- 
ings, has frequently imposed special conflict of inter- 
est standards not found in the statutes. Congress 
may also bring strong pressures to bear through its 


244 CONFLICT OF INTEREST 


investigatory process. Many relationships that might 
be legally acceptable, and not considered morally 
reprehensible conflicts of interest, may be politically 
very sensitive and politically unacceptable. And, 
finally, many agencies of the government have de- 
veloped their own operating rules and their own in- 
ternal regulations for dealing with the problem of 
ethics as it affects their own operations. No one 
called upon to consider a question of conflict of inter- 
est in the federal service will have done his job unless 
he has weighed and investigated, in addition to the 
statutes in the field, the applicable administrative 
regulations, likely Congressional response, and the 
general political environment. (Manning 1964, pp. 
10-11) 


The American press almost daily reports inci- 
dents that bring wide public disapproval but where 
neither the statutes nor informal sanctions have 
been effective in preventing a conflict of interest in 
the federal service. Resignations occur, but public 
criticism is not always effective. This becomes 
comprehensible when it is recognized that the goal 
of avoiding corruption is balanced against other, 
competing goals, particularly employee recruitment 
and morale. This is a typical statement of the view: 
“In drafting a conflict-of-interests statute it is easy 
to become overzealous and to forget the impact 
which a broad restriction may have. A well-drawn 
statute should prohibit conflicts of interests which 
are most damaging to the standards of good gov- 
ernment and yet not prohibit so much that com- 
petent people will be discouraged from serving” 
(Harvard Student Legislative Research Bureau 
1964, p. 69). 

This view has been carried further by some 
critics of conflict-of-interest regulation. In particu- 
lar, a proposal to establish a commission on ethics 
in government was criticized as irrelevant to the 
problem, which is seen fundamentally as a ques- 
tion of the status of the government employee in 
the community (Davis 1954, p. 915). Thurman 
Arnold said this: “Ethics in any group arise out of 
a sense of tradition and pride in his particular 
calling. Humiliate that group. Subject them to con- 
stant restriction and supervision. Refuse to trust 
them in any of their activities in or out of Govern- 
ment and you destroy any possibility of an effective 
ethical code” (U.S. Congress . . . 1951, p. 372). 

Altogether, several criticisms may be made of 
the approach to conflict of interest of the Bar Asso- 
ciation, the 1962 act, and the federal regulations. 
The code-of-ethics approach has been mentioned. 
Another view is that in a democratic society all 
government employees should be required to dis- 
close their personal assets and economic interests 


to the public or to Congress (Ross D. Davis in 
Chicago, University of . . . 1961, pp. 83-84). More 
often this suggestion is applied only to elected offi- 
cials, on grounds that voters are the appropriate 
judges of improper conflicts of interests. But such 
publicity would unduly violate the private affairs 
of all the employees. Such a requirement for elected 
officials is more pertinent, but whether voters func- 
tion effectively as judges of impropriety is open to 
question. 

Another view is that the kinds of conflicts of 
interest described here are only a small part of the 
pressure and cross fire public officials feel. Against 
this, a concept of the “public interest” is put for- 
ward as the value to be served, and words like 
“neutral,” “objective,” and “disinterested” describe 
the desired stance. Davis remarked that “in an 
ideal state the only influence brought to bear upon 
a public official should of course be his enlightened 
consideration of the public interest as defined by 
law” (Chicago, University of . . . 1961, p. 81). Yet 
the acceptance of interest groups at all levels and 
branches of government as legitimate registers con- 
siderable faith in the capacity of public officials to 
be disinterested. Of course, gross forms of pressure 
have been banned, and disclosure of lobbying and 
campaign expenses is required. The Bar Association 
report (Association of the Bar . 1961, pp. 
20-22) distinguishes conflicts of policy from the 
problem of personal conflicts of interest. The view 
is that when an organization seeks to persuade, 
even though the persuasiveness is backed by re- 
prisal, the acts are political. Accordingly, conflicts 
of policy are altogether proper and desirable. 

Actually, the Bar Association approach recog- 
nizes that public officials cannot put aside long- 
time associations. The legal realists of a past 
generation saw that emotional nuances and en- 
vironmental conditioning inevitably affect the ap- 
proach men take to the decisions they face. Of 
course, values such as affection or rectitude color 
official decisions. Indeed, nepotism is banned in 
government and other institutions, as a kind of 
conflict of interest. But the outlook of officials can- 
not and should not be neutralized or sterilized, and 
the conflict-of-interest laws can never control more 
than a small part of the total possible relationships 
that bear on those who make decisions. From the 
standpoint of the individual, conflicts of interest, 
under this usage, are a small part of political and 
personal ethics; from the standpoint of public af- 
fairs, they are a part of corruption and of the 
factors entering into the formation of public policy. 

Setting tolerable limits to conflicts of interest also 


troubles many nongovernmental institutions. Cor- 
poration officials may have conflicting economic 
interests with competing companies. Law firms and 
advertising agencies may have client relationships 
that impinge on other loyalties. Philanthropic 
foundations and universities encounter conflicts of 
interest. Accountants, physicians, scientists, and 
athletes are among those whose conflicts have been 
reported. 

Every group develops informal and unwritten 
codes of conduct, and it is only after a violation of 
such a standard that discussion begins to shape 
more precise rules. Then problems essentially like 
those in government come to the surface. Consider 
this definition of the attributes of conflicts in a 
corporation: “A conflict of interest exists where an 
employee has a private financial interest or other 
relationship outside the company that has the 
potentiality of being antagonistic to the best inter- 
ests of the company, even though it may result 
in no financial loss to the company and irrespective 
of the motives of the individual concerned” (Adam 
1963, pp. 12-13). The discovery and isolation of 
employee relationships that could bring potential 
harm raise delicate questions of supervision. These 
questions are so difficult that most private organi- 
zations continue to cope with them through infor- 
mal methods rather than by formal rules and 
procedures. 

Social scientists have yet to make studies de- 
signed to answer a number of key assumptions of 
popular writers on the subject. Is morale insurance 
against corruption? How may concepts such as 
morale and corruption best be defined and meas- 
ured? Are salary and tenure protection against 
employee conflicts of interest? To the extent that 
conflicts of interest can be distinguished from 
bribery and from conflicts of policy, what circum- 
stances nourish them? The effectiveness of statu- 
tory and other regulatory controls also needs to be 
measured. The effect of conflict-of-interest restric- 
tions on the recruitment of public officials and on 
the entry of persons into particular businesses 
should also be ascertained with exactitude. Such 
knowledge is needed to understand the subject, and 
it will also be pertinent to future efforts to delineate 
and regulate conflicts of interest. 

If the factors giving rise to these problems have 
universality, then developed societies are no more 
conflict-of-interest prone than developing ones. 
(This is another area where research is much 
needed.) Yet if the subject is not limited either to 
economic conflicts or to the public sector, conflicts 
of interest appear to be inevitable by-products of 
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complex societies, of the development of sophisti- 
cated relationships among business, professional, 
and governmental institutions. 

CLEMENT E. VOSE 


[See also CIVIL SERVICE; CROSS PRESSURE; LOBBYING; 
OFFICE, MISUSE OF; POLITICAL FINANCING. ] 
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CONFLICT OF LAWS 


Conflict of laws, a subject also known as private 
international law, arises from the universal ac- 
knowledgment that not every human transaction 
can be, or ought to be, governed by local law. The 
affairs of men are often conducted in such a way 
that a legal dispute contains a foreign element, and 
the conflict of laws is the systematic study of how 
national courts, in fact and in theory, take account 
of such foreign elements. 


Scope and character 


The main emphasis in the conflict of laws has 
generally been upon the rules used to select foreign 
law, these rules being called choice-of-law rules. It 
is also common to include in the subject matter of 
the conflict of laws various related matters pertain- 
ing to the jurisdiction of courts and to the degree 
of respect due foreign judgments, but in the main 
the classical problems in the field have been created 
by legal transactions in which the private law of 
more than one legal unit is potentially applicable 
and a choice must be made between the competing 
claims. For centuries judges and scholars have 
been baffled in their search for acceptable pro- 
cedures and for the criteria with which to conduct 
the search itself, 

There is a traditional distinction between public 
and private international law related mainly to 
whether the participants involved in legal contro- 
versy are governments or individuals. In this vein, 
public international law is defined as the corpus of 
rules binding governments in their relations with 
one another and the processes available for imple- 
menting these rules; private international law is 
considered as the law applied by domestic courts 
whenever a foreign element is relevant to the reso- 
lution of a legal controversy. Many specialists in 
this latter field of law object to the label private 
international law, suggesting that the law applied 
by domestic courts—even if it leads to the applica- 
tion of foreign law—is a field of national law, Such 
a view is especially prevalent in the United States, 
accounting, in part, for the currency of the term 
“conflict of laws” to describe the subject, although 


its use also reflects the emphasis among American 
scholars, perhaps to an excessive degree, upon the 
conflict of laws as it has evolved from interstate 
(intranational) transactions. In Europe, even in 
federal states, the subject, in contrast, is dominated 
by its international aspects, that is, by the study of 
the rules and processes by which the courts in one 
country give effect to the law of a foreign country 
or show respect for a judgment already reached by 
a foreign court. 

A concrete indication of the character of the sub- 
ject can be given by way of an oversimplified ex- 
ample of a legal situation calling for the application 
of choice-of-law rules. A Dutch sailor employed on 
an American ship is injured in the course of work 
by allegedly faulty equipment while the ship is at 
anchor in a Japanese port. He sues for recovery in 
a Dutch court and seeks application of the Ameri- 
can law on the subject because it leads to the 
largest recovery with the smallest burden of proof. 
The American shipowner, as defendant, contends 
that the negligence of the seaman was the cause 
of the accident and argues for the application of 
Japanese law, which, we shall assume, is least 
favorable to any recovery in these circumstances 
and least generous in its award of damages if some 
recovery is allowed. Which legal system and what 
legal rules would be chosen by a Dutch court as 
providing the governing law? Would the same gov- 
erning law be chosen for this kind of case by a 
court in the United States, the Netherlands, and 
Japan? Why is it so difficult to obtain uniform treat- 
ment of choice-of-law questions? Is uniform treat- 
ment necessarily desirable? What approaches domi- 
nate the analysis of choice-of-law problems in the 
contemporary literature? It is with questions of this 
sort that the field of the conflict of laws is prin- 
cipally concerned. 

The character of the solution given to problems 
of the choice of law has an intense practical rele- 
vance to the lives of people, Increasingly, it is im- 
possible to confine human happenings to a single 
jurisdiction. Whether in matters of personal status 
arising from marriage, divorce, birth, and death or 
in such typical commercial phenomena as the regu- 
lation of anticompetitive business practices, expro- 
priations of alien property, and the sale of goods, 
there is an increasing multinational complexity that 
in the event of controversy may make the choice 
of applicable law at once crucial and far from 
self-evident. 

Of course, the practical interest in the conflict 
of laws emerges as a consequence of the fact that 
the substantive laws of states are diverse, incon- 
sistent, and, occasionally, contradictory. Obviously, 


if all substantive law were the same, it would make 
little difference which system of law was selected 
to govern a controversy. But, in a world of diverse 
national, cultural, and ideological perspectives, 
there has never been a prospect of such substantive 
uniformity. Widespread attention has, therefore, 
been given throughout the history of international 
relations to the method and rationale used by vari- 
ous courts to select the proper system of law gov- 
erning controversies. It appears, at least super- 
ficially, that all states have a strong, common, and 
mutual interest in the adoption of uniform choice- 
of-law rules. For unlike the substantive rules to 
which they refer, choice-of-law rules appear to seek 
nothing more than the ordering of relations in a 
fair and convenient fashion and do not seem to 
express any commitment to policies or values of a 
particular state. Despite this appearance of neu- 
trality, however, efforts over several centuries to 
advance the acceptance of uniform choice-of-law 
rules have yielded few encouraging or tangible 
results. 

The study of conflict of laws has been developed 
by lawyers and has traditionally been neglected by 
social scientists. This neglect can be explained, in 
part, by the fact that so many of the typical prob- 
lems in the conflict of laws are either highly tech- 
nical (for example, dealing with the devolution of 
property located abroad) or too humdrum (for ex- 
ample, concerning the status of a foreign mail-order 
divorce) to be either accessible or interesting to 
those who lack formal legal training. Even lawyers 
find the subject unusually complex, since it pre- 
supposes a familiarity both with the law governing 
a wide body of substantive problems (for example, 
how does the treatment of the foreign element vary 
when one is dealing with matters of personal status, 
inheritance, torts, contracts, real and personal 
property, etc.?) and with the laws and decisions 
operative in a number of foreign states with dif- 
ferent legal systems. 

Despite these obstacles, there are strong reasons 
why social scientists should have at least some 
awareness of the major trends of thought evolving 
in the conflict of laws. For one thing, the conflict 
of laws is a precursor by many centuries of the 
current interest in the study of conflict resolution. 
A vast literature has developed a whole range of 
techniques to adjust the conflicting requirements 
of local law and foreign law, and debate continues 
on the character of justice in the face of such 
conflicts. 

Furthermore, the approach taken to the conflict 
of laws in different subject-matter areas might pro- 
vide students of comparative social and political 
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systems. with an interesting insight into such con- 
cerns as the hierarchy of social values and govern- 
mental policies prevailing in different national 
societies. For certainly the refusal to defer to for- 
eign law in some areas suggests the importance 
accorded domestic policy. The approach used to 
solve internal choice-of-law problems in federal 
states without much experience in international 
affairs (for example, Nigeria, Malaysia) may also 
be a useful way for those studying world order to 
gauge the acceptance of supranational sources of 
legal authority. 

The revived interest of political scientists in the 
study of the role of norms in world politics clearly 
makes the conflict of laws relevant, for the same 
endeavor is at the center of both subjects: to find 
a just and efficient way to allocate legal authority 
among the actors in a multiunit social system. The 
conflict of laws offers the inquirer not only a 
parallel setting in which to study the efforts to in- 
troduce order into a social system that lacks cen- 
tralized institutions, but it also provides a setting 
that tends to be less inflamed by political passions 
and therefore more susceptible to systematic analy- 
sis, concerned as it mainly is with interpersonal as 
distinct from intergovernmental legal disputes. 

Interstate and international conflicts. It is im- 
portant to distinguish the study of interstate from 
international conflict of laws. Rules to resolve inter- 
state conflicts generally arise in a federal social 
structure which shares a traditional language and 
possesses an overriding organic law (constitution ) 
and central legal institutions of interpretation and 
enforcement. These institutions are available to 
resolve serious conflicts that arise at the unit level 
and to set normative limits upon the discretion ofa 
state to adopt “eccentric” or excessively egocentric 
rules of conflict of law. For example, the United 
States constitution imposes obligations on each 
state to give “full Faith and Credit . . . to the public 
Acts, Records, and judicial Proceedings of every 
other state” (Art. Iv, 1); to respect “the privileges 
and immunities of citizens of the United States” 
and to grant every person “within its jurisdiction 
the equal protection of the laws” (x1vth Amend- 
ment, §1). Although the U.S. Supreme Court has 
acknowledged a considerable discretion at the unit 
level that allows the states to adopt diverse rules 
for dealing with foreign facts, there exists a higher 
law that remains potentially available to impart 
unity and coherence. 

In contrast, the international system is relatively 
decentralized, lacks an organic law, and possesses 
only very weak institutions for the adjustment of 
disputes at the internation level. In addition, the 
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differences in ideology, stages of economic develop- 
ment, domestic politics, and attitudes toward indi- 
vidual and governmental liability from nation to 
nation are generally much greater than are the dif- 
ferences among the units in a federal state. Where- 
as the interstate conflict of law can depend upon 
the resources of the national government to pro- 
mote the realization of a fair and reasonable sys- 
tem, the international regime for the conflict of 
laws is dependent to a far greater degree upon the 
highly decentralized ordering techniques of reci- 
procity, self-help, and self-restraint in order to bring 
a fair system of conflict of laws into being. 
Public international law does not exercise any very 
widely accepted “constitutional” function at the 
present time in the stabilization of private inter- 
national law. It does not provide a set of limits upon 
national discretion that might be invoked in diplo- 
matic negotiation or when appearing before a 
“higher” tribunal, for instance, the International 
Court of Justice at The Hague. (A regional form of 
legal integration, intermediate between the central- 
ization of a federal state and the decentralization 
of international society, exists, especially in Europe 
and somewhat in Latin America, and may soon 
develop to the point where it will warrant separate 
attention. ) 

The distinction between the operation of choice- 
of-law rules in a centralized (interstate) and in a 
decentralized (international) multiunit social sys- 
tem has many unexplored consequences. One of the 
most important is the difficulty of supplying jurid- 
ically convincing reasons why the courts of one 
state should apply the law of another under certain 
specified circumstances. True, this is part of the 
wider need in international law to demonstrate how 
legal obligations can be binding on the national 
level in the absence of any supranational source of 
law other than agreement among states or cus- 
tomary usage. However, in the area of the conflict 
of laws neither treaty nor custom provide any gen- 
erally acceptable guidelines, and with the virtual 
disappearance of natural-law thinking the tradi- 
tional mode of explaining the basis of legal obli- 
gation is no longer very persuasive. These circum- 
stances have prompted most commentators in recent 
times to regard the subject of the conflict of laws 
as a part of national law. 

This position is reinforced by the virtually uni- 
versal refusal of courts to apply the public law of 
foreign states to such matters as crime, taxation, 
or the regulation of business activities. The refusal 
stems primarily from a territorial notion of national 
sovereignty in which the public law of the sovereign 
reigns supreme within the boundaries of the state 
but nowhere else. The result of this refusal to 


apply foreign public law is to deny jurisdiction 
rather than to apply local law, except in the area of 
crime, where an elaborate system of extradition 
agreements operates as a partial substitute for the 
direct application of foreign criminal law. 


Historical development 


The systematic exposition of the conflict of laws 
rested originally upon the assumption of a supra- 
nationally ordained system obligatory upon judges 
at the unit level. The objective of the system was 
to assure uniformity of result in a legal dispute re- 
gardless of where the legal action was instituted, 
Naturally, such an objective could be realized, given 
the diversity of substantive law, only if a uniform 
system of either substantive or choice-of-law rules 
could be established. 

The dominant idea in the Roman legal system 
was that Romans were everywhere to be governed 
by the Roman law and that foreigners were to be 
governed by the jus gentium, the law of the peoples, 
a “superlaw” of substantive rights and duties that 
was assumed to be universally applicable. Thus 
there was no occasion to apply foreign law and no 
need for a system of conflict of laws. The city-states 
of Italy legislated on matters not covered by Roman 
law, and it was out of the conflict between these 
legislative acts (statua) that a need for some ap- 
proach to selecting the governing statute was 
initially felt. The need was acknowledged, in part, 
because of the regularity with which commercial 
intercourse among the city-states occurred. The 
technique evolved by the great Roman glossators 
Bartolus and Baldus to resolve the problem was to 
classify every statute as either real (for example, 
land rights), or personal (for example, status of 
person), or mixed. Elsewhere in medieval Europe 
somewhat different approaches to the resolution of 
conflict-of-law situations developed, the character 
of solution depending above all on the insularity of 
the social order and upon its prevailing political 
ideology, especially concerning the proper relation- 
ship between the domestic society, its members, 
and foreign societies. 

Territoriality. In France the feudal system in 
force gave great weight to the coutumes and to 
residues of earlier tribal law. The resolution of con- 
flicts depended on whether the issue had a terri- 
torial cause of action, and came under local law, 
or a transitory locus to which foreign law could 
properly be applied. This notion of giving primacy 
to the customs of each place prepared the intel- 
lectual climate for an era of territorial law, which 
gradually displaced the earlier acquiescence to the 
universal claims of Roman law. The growth of ter- 
ritoriality was abetted by the rise of the ideology of 


national sovereignty, the loss of the authority of 
the Roman church as a unifying influence, and the 
gradual decline of natural law as the basis of legal 
obligation. These nationalizing tendencies came to 
intellectual fruition in the seventeenth century 
when Johannes Voet (1698-1704) and Ulrich 
Huber (1689), scholars from the Dutch provinces, 
fully rationalized the primacy of the territorial law- 
giver by discarding any pretense of a supranational 
basis for deference to foreign law. To the extent 
that such deference was accorded, they explained 
it on the basis of comity, a concept that Joseph 
Story later carried forward in a treatise that exer- 
cised a formative influence upon the Anglo—Ameri- 
can development of the conflict of laws (1834). 
England’s system of common law, as the law of the 
realm, did not give rise to internal conflicts, and 
external conflict situations were so infrequent that 
no real approach to the conflict of laws developed. 

Nationality. The waves of nationalism oc- 
casioned by the French Revolution and later by the 
unification of Italy influenced the growth of con- 
flict of laws, especially in those parts of the world 
where the continental European influence pre- 
vailed. Stress was put upon nationality, at the ex- 
pense of territoriality, as the dominant connecting 
factor, An individual was first and foremost a 
Frenchman or an Italian; recipient of rights and 
responsible for duties under a particular national 
law, and only very secondarily subject to the law 
of the place where he happened to be. In contrast, 
doctrine in the common-law countries came in- 
creasingly to stress territoriality as the key vari- 
able: events were subjected to a particular law in 
terms of where the acts took place or where the 
property was located, and only a very secondary 
stress was given to the nationality of the actors in 
the process of selecting the governing legal system. 

As a consequence of this division between con- 
flicts based on the primacy of nationality and con- 
flicts based on the primacy of territoriality it is 
difficult, if not impossible, to unify the conflict of 
laws by means of the voluntary agreement of na- 
tions through treaty. There has been some success 
on a regional basis on the Continent, where there is 
general agreement on the proper basis for solving 
choice-of-law issues. Several European treaties oper- 
ate partly to codify and harmonize different na- 
tional practices and partly to obtain uniform treat- 
ment of certain troublesome details, especially in 
highly technical areas, for example, the interpre- 
tation of multilateral shipping documents. 


Basic problems of conflict of laws 


Despite the supranational origins of the conflict 
of law, first in the jus gentium of Roman times and 
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later in the natural-law basis of all law, the subject 
has been increasingly regarded as a part of national 
law. This nationalization results from the diversity 
of national practice and doctrine, the absence of 
any accepted supranational doctrine of obligation, 
and the general dominance of positivistic thinking 
that requires evidence of state consent as a pre- 
requisite to the existence of binding rules of inter- 
national law. (To be valid, according to the posi- 
tivists, the law must be posited by an authorized 
lawgiver, and those authorized in international so- 
ciety are the states themselves. Furthermore, their 
acceptance of legal obligation must be indicated 
either explicitly, as in treaty law, or tacitly, as in 
customary law. ) 

At the same time, however, it is widely accepted 
that a stable and just system of conflict of law plays 
a critical role in international life. 

This dilemma is clearly manifest in the work of 
Martin Wolff, who vigorously asserts the national 
character of the conflict of laws: “Today undoubt- 
edly Private International Law is national Law. 
There exists an English private international law 
as distinct from a French, a German, an Italian, 
private international law. The rules on the conflict 
of laws . . . differ nearly as much from each other 
as do those on internal (municipal) law” (Wolff 
[1945] 1950, p. 11). Despite the vigor of this state- 
ment Wolff, along with all but the most national- 
istic of writers, is at pains to point out that justice 
requires that each state consider the merits of the 
claims put forth by potentially applicable legal 
systems relative to the expectations of the parties, 
the stability of transnational social and economic 
intercourse, and the realization of the legitimate 
interests of foreign states. Wolff puts this inter- 
national aspect as follows: “Private International 
Law is not itself international, but it should cer- 
tainly be drawn up in an international frame of 
mind.” One of the real jurisprudential challenges 
of today is to devise a satisfactory juridical account 
of an “international frame of mind” (ibid., p. 16). 

Comity. If international law does not compel 
deference by a domestic court to foreign law, then 
it is difficult to generate a satisfactory legal basis 
for such deference. The problem of finding a basis 
is artificial to some extent, being one of the many 
unfortunate by-products of dichotomizing national 
and international law. Given the almost universal 
adherence to this dichotomy, however, scholars 
have been eager to rest the conflict of laws on some 
extranational legal foundation that stops short of 
claiming legal compulsion. The most influential of 
these attempts has undoubtedly been associated 
with the concept of comity, the traditional means 
by which Anglo-American courts acknowledge the 
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policy of deference to foreign law in domestic 
courts and, thereby, fulfill the injunction to mani- 
fest an international frame of mind. 

Comity is supposed to express the reality of a 
practice that is habitual and yet not clearly or 
formally required as a matter of legal duty. That is, 
a court refusing to apply foreign law would not be 
violating any legal duty and no foreign state would 
have a legal basis for complaint. However, it is so 
widely recognized as desirable to apply foreign 
law with a certain consistency that a need arises to 
fit the practice into a description of the workings 
of the legal system. It has, at the same time, be- 
come commonplace for jurists to criticize this reli- 
ance upon the idea of comity because of its am- 
biguity and vagueness. Since it provides guidance 
for neither courts nor private parties in specific 
cases, and since it seems to identify the process of 
adjudication with some type of “international eti- 
quette,” comity makes the whole subject of the 
conflict of laws appear to rest upon a system of 
obligation no more substantial than the practice of 
international politeness. 

In response, writers have tended in recent dec- 
ades to abandon comity as the explanation for the 
application of foreign law, either affirming the com- 
pletely national character of the conflict of law or 
searching for some substitute to express its inter- 
national aspect. One of the most persuasive efforts 
to evolve a substitute for comity is found in the 
work of Myres McDougal and Nicholas deB. Katzen- 
bach. These writers approach the problems of con- 
flicts as but a special case of the more general task 
in international society to divide up the competence 
to apply law among the territorial sovereigns that 
constitute international society. They argue for a 
form of legal order that can successfully emerge in 
a highly decentralized multiunit social system by 
being truly responsive to the demands of the units. 
The implication of this orientation is to urge upon 
domestic courts the legal duty to defer to foreign 
law whenever, in terms of protecting and realizing 
mutual freedom for national societies, it is appro- 
priate. As Katzenbach puts it: “So long as formal 
authority is organized and administered territori- 
ally, there is a mutual and reciprocal interest—a 
‘sense of the inconveniences which would otherwise 
result’—in extending areas of tolerance” (1956, p. 
1131), McDougal identifies law with the reasonable 
expectations of those participating in international 
life, suggesting, in contradistinction to the tra- 
ditional nationalization of the conflict of laws, that 
it is a matter of international law to defer in appro- 
priate circumstances to foreign law (McDougal & 
Feliciano 1961). 


This sociopolitical approach to the study of law 
in international affairs eliminates such dichotomies 
as those between private and public international 
law and even calls into question the mainstay of 
traditional analysis—namely, the distinction be- 
tween national and international law. Although this 
work seems certain to attract the interest of social 
scientists, it is regarded with great suspicion by 
most international lawyers, especially those in Eu- 
rope, and it has not yet been properly applied to the 
subject matter of the conflict of laws. Instead, most 
theoretical attention is still focused on the problem 
of finding general criteria for the solution of con- 
flict problems. 

The search for general criteria. There is wide 
disagreement, especially in the United States, as to 
whether the search for general criteria is worth- 
while and, if it is, what form it should take, 

The critics. Albert Ehrenzweig, the author of 
the leading contemporary treatise in English on the 
conflict of laws (1962), takes the position that the 
whole search for general rules for the solution of 
conflict-of-laws problems in unsettled areas of the 
law has proceeded on the wrong basis. Ehrenzweig 
contends that all a priori approaches are bound to 
be deceptive because courts in practice will not be 
bound by overly abstract and mechanistic rules of 
reference that direct them to apply a certain legal 
system for a given class of cases (for example, the 
law of the place of making for contracts, the law 
of the place of injury for torts, and the law of the 
situs for real property), but will use manipulative 
devices to promote the just outcome of particular 
disputes or to give preference to local as against 
foreign interests. The real indicators of how a con- 
flict problem will be solved, then, arise not from 
the doctrinal language used by a court to explain 
its decision but from the awareness of the sense 
of justice or bias that underlies the doctrinal ex- 
planation. Ehrenzweig thus sets for himself the 
very ambitious task of finding “true rules”—the 
living law, as distinct from the enunciated law— 
and asserts that only by this search can the pattern 
of judicial decision be made intelligible and pre- 
dictable standards of result be obtained. The most 
fundamental rule Ehrenzweig finds is that a court 
will tend to apply its own law to a controversy in 
those situations in which it is one of several plau- 
sibly applicable laws. Thus all instances of deference 
to foreign law are derogations from this underlying 
“true rule.” Ehrenzweig’s approach is based both 
on a critique of judicial practice (cutting through 
the technical or legalistic explanation to the real 
one) and upon an acceptance of its authoritative 
status (the role of a scholar is not to supply higher 


criteria but to analyze judicial practice so as to dis- 
cover the operative criteria). 

What for Ehrenzweig is a matter of pervasive 
methodology becomes for Brainerd Currie a matter 
of pervasive ideology (Currie 1963). Currie is less 
interested in what courts do than in developing an 
approach leading to what they should do, althoygh 
his jurisprudential strategy is to proceed by way of 
very close analyses of particular cases. He believes, 
to overstate it some, that a court should always 
apply local law when the forum has a governmental 
interest in the outcome of the controversy and that 
foreign law is appropriately applied only when the 
forum is disinterested in the outcome. In this re- 
gard he opposes the recent tendency of courts to 
balance the interests of various potentially applica- 
ble legal systems and to choose the law of that 
legal system which has the greatest interest in the 
particular case. Such balancing is for Currie inap- 
propriately undertaken by courts and is more 
properly a matter for legislative determination. 

Currie and Ehrenzweig repudiate the traditional 
search in the conflict of laws for allocation criteria 
posited in advance, and both affirm the fundamen- 
tal governance of controversies by local law. In 
consequence they renounce the ideal of uniformity 
of result, Currie not only denies the duty to defer 
to foreign law but also argues that courts should 
not defer except when they affirm both jurisdiction 
and disinterest. In the rare cases that satisfy these 
two conditions the courts cannot reach a proper 
decision and might just as well apply local law, or 
flip a coin, or dismiss the cases. 

The balancers. A less extreme approach, but 
one that very likely accords more closely with what 
judges think they should be doing and seems to 
have the strongest following in the United States, 
is best exemplified by the work of Willis Reese and 
Elliot Cheatham (Reese 1963; Cheatham 1960; 
Cheatham & Reese 1952) and by the tentative 
drafts of a second restatement of the conflict of 
laws under the influential auspices of the American 
Law Institute (1953-1965). These authors share 
with Currie and Ehrenzweig a distrust of the tradi- 
tional stress on general solutions to conflicts prob- 
lems, but they also argue for an intermediate 
approach whereby policy factors are grouped in any 
particular case so as to identify the legal system 
with the dominant interest in the outcome and to 
choose the legal result most in accord with sub- 
stantial justice. Ehrenzweig, a harsh critic of this 
enterprise, contends that it produces vague formu- 
las and keeps alive, despite denials, the tradition 
of discerning a priori forum-selecting choice-of-law 
rules, Reese and Cheatham defend their balancing 
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approach on the grounds that both practice and 
doctrine confirm this type of analysis and that 
recent landmark decisions in the conflict of laws 
exhibit the tendency to search for criteria by which 
to select the most interested forum and by this 
means to reach the most just result. 

We find, then, that the very character of the con- 
flict of laws, its methodology, and its governing 
ideology have been seriously questioned in recent 
years. There exists no scholarly or judicial con- 
sensus on choice-of-law rules. In particular, the 
traditional ideals of seeking uniformity of result 
and of establishing equality between domestic and 
foreign laws have come in for heavy criticism on 
the grounds that such ideals are unrealistic and 
inappropriate, since domestic courts actually do 
and, in fact, should accord preference to domestic 
law in a situation where it is one of the applicable 
legal systems. 

Operational problems. No less serious than the 
theoretical problems are the dilemmas facing the 
practitioners of private international law. Judges 
in courts are confronted by the operational neces- 
sity of choosing the legal system that shall govern 
most justly a particular controversy and of giving 
a satisfactory explanation of their solution on a 
particular occasion. Scholars through the centuries 
have sought to find a general solution that com- 
bines just choice-of-law rules with the universality 
of their acceptance. The overriding objectives are 
to get the just decision and to be assured that there 
will not be one just decision in Japan, another in 
Holland, and a third in the United States. 

There is, unfortunately, no broad consensus as 
to the form a general solution should take. The 
problem of legal systems using nationality as the 
criterion versus those using territoriality has al- 
ready been mentioned. (If a Frenchman lives in 
Brazil, is his estate properly governed by French or 
Brazilian law?) Furthermore, courts are less in- 
clined to follow the dogmatic solutions proposed 
by scholars than to seek to do substantial justice 
to the parties in dispute. Therefore, a court will 
tend to manipulate the criteria governing the choice 
of laws to fulfill its view of what justice demands. 
A good illustration is the basic idea that a court 
will always apply domestic procedure even if it 
defers to foreign substance. This allows it to char- 
acterize as “procedure” any device that will produce 
the desired result. 


The promise of conflict of laws 

Despite the critiques of the subject, the dominant 
trend continues to be the pursuit of uniformity and 
equality, especially if the emphasis is put upon the 
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international as distinct from the interstate aspect 
of the conflict of laws. The challenge today is to 
find an acceptable supranational basis for promot- 
ing these ideals in a divided world composed of 
states with different economic, cultural, and politi- 
cal outlooks. This challenge cannot be met by a 
new global ideology but rather by a series of more 
modest and concrete undertakings. One of the more 
promising research developments consists of the 
comparative-law efforts to take specific inventory 
of the differences in both substantive standards and 
in choice-of-law rules and to examine the prospects 
for their harmonization by unilateral or multilateral 
action. 

A sociological extension of this inquiry would be 
to take stock of the diverse social interests that 
account for differences in the conflict of laws from 
one national system to another and to work toward 
a set of solutions on the basis of mutual interests. 
This type of inquiry has been undertaken by Ken- 
neth S. Carlston (1962). It draws heavily upon 
functional sociology, especially organization theory 
and systems theory. Its persuasiveness also de- 
pends on the acceptance of a new image of global 
unity, eloquently summarized by C. Wilfred Jenks 
(1958) as “the common law of mankind,” in which 
the nation-state is supplanted to some extent in 
legal consciousness by new forms of social and 
political order that take greater account of both 
international institutions and of individuals in 
specifying the link between law and human wel- 
fare in transnational phenomena. As might be ex- 
pected, it is the powerful states that seem most 
reluctant to participate in this new attempt to 
supranationalize the conflict of laws. Such states 
as the Soviet Union, the United States, China, and 
France oppose most efforts to diminish sovereign 
prerogatives for the sake of an operative global 
system. 

In conclusion, the stability and fairness of inter- 
national legal undertakings seem to depend upon 
the strengthening of this renewed attempt to supra- 
nationalize the conflict of laws. As a result of the 
increasing interdependence of human activity there 
is a growing need for predictable outcomes in legal 
disputes. These outcomes should have a more sub- 
stantial base than the national affiliation of the 
forum. Given the diversity of contemporary inter- 
national society it is not realistic to seek this end 
by reconciling national policies so as to create a 
single substantive law. There is more reason for 
hope if the ancient quest for order amid diversity 
is pursued through a uniform approach to the allo- 
cation of legal competence among the national 
units that compose the global system. One illumi- 


nating context within which this allocation can be 
studied and realized is the application of choice-of- 
law rules by domestic courts. 

RICHARD A. FALK 


[See also INTERNATIONAL CONFLICT RESOLUTION; IN- 
TERNATIONAL LAW. Other. relevant material may be 
found under CONFLICT; INTERNATIONAL RELATIONS; 
Law.] 
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CONFLICT RESOLUTION 

See CONFLICT; COOPERATION; GAME THEORY; 
INTERNATIONAL CONFLICT RESOLUTION; NEGOTIA- 
TION; PEACE. 


CONFORMITY 


Every aggregation is necessarily characterized 
by some degree of uniformity. This is true of any 
set of objects, events, ideas, or organisms, be they 
members of these sets by virtue of natural determi- 
nants or by virtue of our own conceptions, how- 
ever arbitrary. Sugar and alcohol are members of 
such a set because they are both carbohydrates. But 
my pencil, my headache, and the spelling mistake 
I made three years ago in a letter to my wife are 
also members of a set; they share at least one thing 
in common: they all belong to me. 
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In human and animal aggregations some uni- 
formities are genetically determined, as is the case 
with various morphological attributes of the species, 
while some evolve and are maintained by means of 
a social process. Often, socially evolved and main- 
tained uniformities (i.e. social norms) redefine 
and channel biological uniformities. Sexuality, eat- 
ing, nursing, elimination, etc., are behaviors which 
have in one way or another been subjected to nor- 
mative controls in every society. The regulation of 
these biologically significant functions may be very 
pervasive or quite superficial, In all societies there 
are norms about when, what, how much, and in 
what manner one should eat. But similar regula- 
tions have not evolved with respect to sneezing. It 
is probably the case that those biological functions 
that are important to the survival of society are 
subject to pervasive normative regulation of their 
behavioral manifestations. Because sexuality is a 
significant factor in the survival and welfare of a 
society as a unit, sexual functions are jealously 
regulated by normative pressures. However, another 
biological function, breathing—absolutely neces- 
sary for the individual’s own survival but insig- 
nificant for the society—is not subject to normative 
regulation. Perhaps if air were less abundant and, 
like food, posed problems of distribution, breathing 
too would be normatively regulated. As a matter of 
fact, breathing does become subject to rather strict 
normative controls on submarines in emergencies. 

Although not all social uniformities revolve 
around the expression of biological functions, it 
has generally been recognized that normative regu- 
lation is imposed upon behavior (a) that is not 
under complete control of biological factors and 
(b) whose nonsocial (i.e., psychological or biolog- 
ical) determinants might lead to instrumental be- 
havior that is socially undesirable or harmful. 

The behavioral processes that establish and 
maintain social uniformities are subsumed under 
the concept of conformity. We shall speak of con- 
formity when the behavior of an individual is 
under the control of a group norm. A group (social ) 
norm, as was already noted, is a uniformity of be- 
havior among the members of a given group that 
is not the result of a physiological or biological 
uniformity among them. 


Sherif’s classic study 

While theoretical work on conformity dates from 
the earliest social philosophers, the experimental 
study of conformity is a more recent development. 
In 1935 Muzafer Sherif brought the process of 
conformity into the laboratory for the first time and 
subjected it to systematic observation. He utilized 
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the autokinetic phenomenon, which is a perceptual 
distortion occurring when individuals view a sta- 
tionary pinpoint of light in a completely dark room. 
Normally, the light appears to move about three or 
four inches, The illusion of movement is enhanced 
when the individual does not know the distance 
between himself and the light and when he expects 
that the light will move. But even when he is told 
that the light is stationary, the illusion of movement 
is rather compelling. 

Sherif’s pioneering experiment consisted in the 
collection of judgments about the apparent move- 
ment of the pinpoint of light from individuals 
tested alone and in groups of two and three. Each 
subject made one such judgment on each of four 
consecutive days, and 100 trials were given each 
day. Two conditions were examined, and in both 
the subjects were required on each trial to an- 
nounce aloud how many inches they thought the 
light had moved. In both conditions, too, one day 
was devoted to testing the subjects one at a time, 
and three days to testing them in groups of two 
and three. The conditions differed only with respect 
to the position of the “isolated” block of trials in the 
experimental sequence. In one condition the se- 
quence began with subjects making 100 judgments 
in isolation and continued with 300 group judg- 
ments. In the other condition the block of 100 iso- 
lated judgments followed three blocks in which 
judgments were made in groups. Sherif’s results 
were unequivocal: (a) when individual judgments 
were made prior to the group experience, relatively 
high variability was observed between judgments 
made by different subjects; (b) when subjects 
made judgments in groups of two and three, varia- 
bility between their judgments disappeared, regard- 
less of whether these judgments were made with or 
without prior individual experience; (c) when indi- 
vidual judgments followed group judgments, they 
were characterized by greater uniformity than 
when they preceded them. The group situation is 
sufficient to produce a uniformity among individual 
judgments, and the uniformity resulting in this 
way maintains a measure of stability even after the 
group members have been separated from one an- 
other. More recent work shows that norms reached 
in this manner are maintained for a period of more 
than one year (Rohrer et al. 1954). 

Sherif’s experiment stimulated a vast amount of 
research concentrating on three classes of prob- 
lems that roughly correspond to the essential fea- 
tures of his experimental design. Sherif made ob- 
servations of (a) an individual (b) engaging in an 
activity (c) in the presence of a group also en- 
gaged in that activity. Research questions investi- 


gated in the field of conformity were generally 
formulated in terms of these three features. First, 
what sort of individuals are susceptible to group 
influence? How does the individual’s past experi- 
ence influence the likelihood of his yielding to 
group consensus? Second, what sort of behavior or 
judgment is easy and what sort difficult to manipu- 
late by means of group consensus? How stable are 
these effects? Third, what characteristics of the 
group are related to the influence it can exert over 
individual behavior? Are some types of groups bet- 
ter able to influence a given individual than others? 
Is complete uniformity in the group a necessary 
condition for a successful influence upon its mem- 
bers? More complex questions revolve around the 
interaction of these classes of variables. Are indi- 
viduals who are susceptible to group influence in 
one situation also susceptible to it in other situa- 
tions? Are they susceptible equally for all sorts of 
behavior and to all sorts of groups? Are there, in 
other words, personality predispositions on the 
basis of which tendency to conform can be pre- 
dicted? Research in the area of conformity focused 
on the above three classes of factors and on the 
interaction among these factors as they affect the 
likelihood and degree of yielding to a group con- 
sensus. It should be noted that in all experiments 
situations are examined in which, for obvious rea- 
sons, the group attempts to influence the individual 
to behave in ways in which he would not normally 
behave. The measure of influence accomplished by 
the group consensus depends therefore on the com- 
parison of the individual's likelihood of behaving in 
a given manner when the group influence is absent 
with that likelihood when the group influence is 
present. This review of problems in conformity re- 
search will be divided into three classes of interests: 
those concerned with the task and behavior under 
control of the group, those concerned with the in- 
dividual characteristics of the conformer, and those 
concerned with the nature of the group responsible 
for the changes in the individual's behavior or judg- 
ment or opinion. 


Task variables 


In Sherif's experiment conformity occurred when 
individuals, confronted with an ambiguous stim- 
ulus, made judgments about it in the presence of 
one another. It was possible, however, that the 
autokinetic situation, because of its ambiguity, did 
not involve conformity as it occurs in vivo. The 
judgments of each individual in the group were 
based on two sources of information: (a) the 
announced estimates of his partners and (b) his 
own very vague and unstable impression of the 


autokinetic movement. Clearly, from the subject's 
point of view, the second source of information 
had questionable reliability, There were no solid 
grounds for depending on what one thought one 
saw in this situation. It was, therefore, tempting 
for the subject to simply rely on the judgments of 
the other group members. The most important con- 
tribution to furthering the understanding of the 
conformity process along these lines was made by 
Solomon Asch (1952; 1956). The basic question 
that Asch raised was whether results like those of 
Sherif’s would be obtained from judgment tasks 
that were considerably clearer and easier. Would 
the individual yield to a consensus among a group 
of peers even when it was perfectly obvious to him 
that the consensus was wrong? 

The experimental design employed by Asch in- 
volved two important modifications of Sherif's tech- 
nique. First, nonambiguous stimuli were used: his 
subjects were asked to estimate the length of a 
line by matching it with one of three comparison 
lines. The standard stimulus and the comparison 
stimuli were always presented under optimal view- 
ing conditions. Also, the comparison stimuli always 
included a line identical in length and all other 
respects to the standard line. The differences among 
the lengths of the comparison lines were so large 
that an individual making these psychophysical 
judgments by himself could easily achieve 90 per 
cent accuracy. 

Second, instead of utilizing ad hoc groups, Asch 
employed trained confederates who were previously 
instructed about what estimate to make on each 
trial, Normally, the experimental situation in Asch’s 
experiment consisted of 12 trials; on each trial the 
confederates, one at a time, would announce their 
estimates before the subject announced his. On 
seven of these trials the confederates would unani- 
mously select the wrong comparison line. Some- 
times their choice would differ from the correct one 
by as much as 70 per cent. This procedure removed 
the reciprocal and mutual aspect from the influ- 
ence process present in Sherif's experiment, thus 
rendering the analysis of conformity pressures 
more manageable. 

Asch’s experimental paradigm became the stand- 
ard one in subsequent conformity research. It 
unveiled the opposing forces involved in the con- 
formity process and allowed their systematic 
manipulation. The force of the individual’s own 
conviction could be manipulated by varying the 
difficulty of the psychophysical estimates he was 
asked to make. The force of conformity pressures 
exerted upon him could be manipulated by the size 
of the false majority, by how expert or confident 
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they were made to appear, by the subject’s depend- 
ence upon them for various gratifications, etc. 

Asch’s experiments gave a strong impetus to 
conformity research partly because of their pro- 
vocative results and partly because his experi- 
mental design brought into focus many new prob- 
lems that until then had not been well formulated. 
In one of his first experiments individuals judging 
the length of lines in the absence of a false major- 
ity attained about 93 per cent accuracy. But sub- 
jects who were exposed to the prior judgments 
of the experimenter’s confederates reached only 
67 per cent accuracy—a drop of 26 per cent. 
The difference between the accuracy levels of 
judgments made independently and those made in 
the presence of a false majority has generally been 
taken as the measure of conformity. 

It was Asch’s original suspicion that Sherif’s re- 
sults were obtained mainly because of the ambigu- 
ous stimuli he used. Asch thought that individuals 
would exercise greater independence of judgment 
when given a more definite, clearer, and easier 
task. As we noted above, however, Asch’s first ex- 
periment shattered these expectations. Even under 
optimal conditions his subjects yielded to the false 
majority 26 per cent of the time. In further work 
Asch reduced task difficulty even further. On sev- 
eral critical trials the judgments of the false major- 
ity were quite obviously false. For instance, on one 
of the trials the three comparison lines were three, 
ten, and two inches long. The standard line was 
ten inches long, but the experimenter’s confeder- 
ates unanimously chose the three-inch line as the 
“correct” match. On another trial the line chosen 
by them differed from the standard stimulus by 
four inches. In experiments of this sort the judg- 
ments are quite easy to make, and subjects working 
in isolation reached 98 per cent accuracy. But 
even with judgments extremely easy to make, there 
was as much as 26 per cent yielding, that is, sub- 
jects responding in the presence of a false majority 
reached only 62 per cent accuracy. 

It is generally found that the yielding is not 
completely eliminated by decreasing task difficulty 
(Blake, Helson, & Mouton 1957), but the degree 
of yielding can be somewhat reduced by such a 
decrease (e.g., Coleman et al. 1958). We shall re- 
turn to the problem of task difficulty when we con- 
sider the effect of certainty on yielding to a false 
majority. 

While most experimental work in conformity has 
concentrated on perceptual or psychophysical re- 
sponses of the sort used by Sherif and Asch, there 
are a few experimental studies to show that similar 
effects are obtained with other forms of behavior. 
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For instance, Blake, Rosenbaum, and Duryea 
(1955) report that contributions to a fund for a 
gift for a departing secretary followed patterns 
similar to those obtained by Asch. When presented 
with a list on which the gifts of others were marked, 
the individuals who were asked to make a contri- 
bution tended to match their gifts to those already 
listed. Thus, subjects who were given a list with 
gifts averaging 25 cents pledged an average of 
28.5 cents. Those who were presented with a list 
averaging 75 cents pledged to contribute an aver- 
age of 63.5 cents. But subjects asked to contribute 
a sum of money without being exposed to a pre- 
arranged list averaged 75 cents. 

Blake, Mouton, and Hain (1956) solicited stu- 
dents at the University of Texas to sign a petition 
requesting “. . . University officials [to] place lights 
on Littlefield Fountain to add to the beauty of the 
memorial.” In two conditions an experimenter’s 
confederate was invited to sign the petition in the 
presence of the subject. In one of these the con- 
federate responded by saying, “Sure, I'll sign,” 
adding his signature to the petition. In another he 
said, “No, I'd rather not,” and walked away. In 
a third condition subjects were asked to sign the 
petition in the absence of a confederate. When 
confronted with a positive action of a confederate, 
89 per cent of those asked signed the petition. In 
the absence of a confederate, 58 per cent of the 
subjects complied with the request. However, when 
confronted with a refusal of the confederate, only 
29 per cent signed the petition. 

In a similar attempt Rosenbaum (1956) invited 
students studying in the library to volunteer for a 
psychological experiment. In two conditions the 
subject was asked only after the experimenter ad- 
dressed his confederate, who either accepted or 
rejected the request. In a third condition no con- 
federates were present. With a positive response 
of the confederate, 67 per cent of those asked vol- 
unteered for the “psychological experiment.” In 
the absence of a confederate, 41 per cent volun- 
teered, but with a negative response of the con- 
federate, only 38 per cent of those asked volun- 
teered. 

An interesting variation of such experiments was 
introduced by dealing with violations of existing 
rules or customs. Thus, Freed and others (1955) 
affixed a “no admittance” sign to the main class- 
room building at the University of Texas. As in 
previous experiments, a confederate made either 
a positive or a negative response or no confederates 
were present. Thus, in one condition a confederate 
read the sign and entered the building. In another 
he read the sign and walked away. Given a model 


who himself conformed to the rule, 30 per cent of 
the subjects entered the building. In the absence 
of a confederate, 60 per cent of those observed 
violated the sign. But when preceded by a confed- 
erate who violated the sign, 90 per cent were ob- 
served entering the building. 


Individual variables 


Asch’s dramatic findings and the findings of 
those who further pursued his pioneering work 
clearly demonstrated the powerful influence a 
group can exert on an individual, But these studies 
also showed that conformity did not occur in all 
trials, nor did all individuals yield to the false 
majority. The question immediately arose whether 
individuals who maintained independence of judg- 
ments in the face of a strong, albeit false, majority 
did so because of an underlying personality trait. 
Is the tendency to conform a stable personality 
characteristic of the individual? Can this charac- 
teristic be identified on the basis of other knowl- 
edge about him? 

Consistency of conforming. Evidence indicates 
that tendency to conform is indeed a stable 
individual characteristic showing up consistently 
in different situations. For instance, Ferguson 
(1944) studied responses to different. attitude 
scales (religion, humanitarianism, nationalism). 
After having answered the attitude questionnaires, 
the subjects were informed about the responses 
to these questions made by a reputed majority. 
The attitude questionnaires were again given to 
the subjects. The results showed that individuals 
who shifted their attitudes on one scale also shifted 
attitudes toward the reputed majority on other 
scales. 

Using various tasks (i.e., recall of nonsense 
words, recall of entire paragraphs, the embedded- 
figures test, and Asch’s line-judgment test), Rosner 
(1957) found clear evidence of consistency of the 
yielding response. His findings indicate that within 
a single task and within a single experimental 
session the individuals who yield to the majority 
on early trials are also likely to yield on later trials. 
Second, individuals who yield in performing the 
given task during one experimental session tend 
to yield on that task in a repeated administration 
at a later date. His results also showed consistency 
among the various tasks: those subjects who yield 
to the false majority on one task also yield on 
others. 

Skill and confidence. The experiments reviewed 
above, in which judgment difficulty was manipu- 
lated, reflect in part the operation of task variables 
and in part individual difference factors. Tasks are 


never intrinsically easy or difficult. They are so only 
in relation to the individuals who work on them. 
The clarity and difficulty of judgments reflect at 
the same time a property of the task and a char- 
acteristic of the individual engaged in it. The sub- 
jective aspect of task difficulty is the individual's 
skill at that task, and the subjective aspect of am- 
biguity of judgments is his certainty or confidence. 
Various experiments have tried to show that task 
difficulty or ambiguity of the judged stimuli are 
not in themselves able to enhance conformity. 
Rather, the significant variables were found to be 
the subject's task skill and his confidence (e.g., 
Coleman et al. 1958). 

Individual difference factors that are related to 
task skill and certainty of judgment seem to show 
a similar pattern of results. Thus, for instance, 
Tuddenham (1961) found that children tend to 
conform more readily than adults, girls more than 
boys, and women more than men. Tuddenham 
(1959) reported that intelligence and conformity 
are negatively related. Appley and Moeller (1963) 
found a sizable correlation between conformity and 
the Edwards abasement scale, which in large 
measure reflects the person’s lack of self-confi- 
dence, Crowne and Liverant (1963) reported that 
conformers tend to have low expectations of suc- 
cess in judgmental tasks. When the prior judgment 
experiences of the individual are generally success- 
ful (i.e. accurate) and when he is aware of his 
accuracy, his tendency to conform to a false group 
consensus is reduced in later trials (e.g., Hollander 
1964), 

Strength of commitment. When pressures are 
exerted by a group on the individual in an attempt 
to change his attitudes or opinions, the relative 
success of the group depends, in large measure, 
on how strongly the individual is committed to his 
point of view. Conformity to a false majority is 
considerably reduced when it involves fairly stable 
values (e.g., Gerard 1953). But even when we deal 
with a strong commitment, some measure of con- 
formity is observed. 

Dependency. Among the personality variables 
found to be significantly related to conformity, de- 
pendency is perhaps the most outstanding. Various 
scales exist to differentiate between individuals in 
terms of whether they tend to depend on others 
for the gratification of various needs, for gaining 
understanding about the things and events around 
them, for learning if they behave appropriately or 
are considered worthwhile human beings. We would 
naturally suppose that a dependent individual 
would be more prone to conform than one who 
can function independently and who goes not need 
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to rely on others in judging his own actions. This 
conjecture is borne out by substantial research 
evidence. It was already noted that children tend 
to conform more readily than adults and that fe- 
males conform more readily than males. And it is 
generally found that children are more dependent 
than adults and females more dependent than 
males. Kagan and Mussen (1956) have shown that 
subjects who tend to respond to the Thematic Ap- 
perception Test cards in terms of dependency 
themes conform consistently more than others. 
Strickland and Crowne (1962), on the other hand, 
found that conformity is intimately associated with 
the individual’s need for social approval. Because 
schizophrenics are less responsive to the social 
environment and consequently less psychologically 
dependent upon it, one would expect them to con- 
form less than normal subjects. This hypothesis 
was tested but was not substantiated by Schooler 
and Spohn (1960). Interestingly, they found that 
the mental patients tended to respond neither in 
terms of the false consensus nor in terms of the 
correct stimulus match, but often chose the third 
alternative, i.e., a wrong response not supported 
by the false majority. It would seem that the schizo- 
phrenic is unresponsive not only to the social envi- 
ronment but to the physical one as well. 

Authoritarianism. The picture of the conform- 
ist as a dependent, submissive person with little 
confidence in his own ability leads one immediately 
to suspect that he is also quite sensitive to his rela- 
tionships with authority figures. A considerable 
literature exists to show some not entirely clear 
relationship between authoritarianism, as meas- 
ured by the F scale, and the tendency to conform 
(e.g., Steiner & Johnson 1963). Mussen and Ka- 
gan (1958) reported that the conformist tends to 
perceive his parents as harsh, rejecting, restrictive, 
and punitive figures. These authors suggested that 
the tendency to conform might be acquired in early 
childhood. Their hypothesis was based on an earlier 
work by Hoffman (1953), who pointed out that 
some forms of conformity are motivated by guilt 
caused by the person's hostile feelings toward 
figures of authority and in particular toward his 
parents. 

Affiliation and achievement. Dependency is a 
personality disposition related to the affiliative 
strivings of the individual. Independence, on the 
other hand, is related to his achievement strivings. 
Numerous studies have shown that need for affilia- 
tion and need for achievement fairly reliably pre- 
dict a tendency to conform, the first predicting an 
increase, the second a decrease in conformity 
(e.g., DiVesta & Cox 1960; Samelson 1958; Zajonc 
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& Wahi 1961). Often, however, these predictions 
have been complicated by an interaction between 
the two needs and by an interaction of these needs 
with birth order (e.g., Samelson 1958). 
Evaluation. On the whole, research that con- 
centrated upon the discovery of a unitary person- 
ality predisposition to conform was less than fully 
successful, and some of the research quoted above 
was not always borne out by follow-up studies 
(e.g., Steiner & Johnson 1963), The most reliable 
finding in this field has been that females tend to 
conform more than males. Other results have en- 
joyed considerably less frequent confirmation. 


Group factors 


One of the important factors that enhance the 
individual's resistance to group pressure is his 
confidence. We noted above that experimental 
manipulations that tend to increase his feelings of 
confidence (such as prior experience of success) 
result in an attenuated conformity response. But 
in a typical conformity experiment, before making 
his own judgments, the subject is confronted with 
the responses of a group of people. This response 
is often at odds with what he perceives. Clearly, 
the characteristics of the group, its size, its una- 
nimity, its reputed expertness in the task, are fac- 
tors diminishing the individual's confidence in his 
own judgments, 

Size and perceived accuracy of majority. Early 
work by Asch (1952) has shown that the con- 
formity effect is almost completely eliminated when 
the subject's judgments are preceded by the judg- 
ment of only one confederate. Asch reports that the 
subject's confidence in his own judgments is main- 
tained when contradicted by only one person. There 
is, however, a dramatic increase in conformity 
when the subject’s response is preceded by those 
of two other “judges,” and still a further increase 
when preceded by three. Thereafter an increase in 
the size of the group has little further effect. These 
findings have been in large part substantiated 
(e.g., Rosenberg 1961). 

Stimulus ambiguity has its subjective counter- 
part in the individual's self-confidence, But in the 
conformity situation the individual's confidence in 
his own judgments is always relative to what he 
perceives the accuracy of the group to be. If he 
perceives that others are more accurate, he will 
be more likely to agree with them in the case of 
conflict between stimulus information and group 
consensus (e.g., Deutsch & Gerard 1955), 

Consensus and divergence. Yielding to a false 
majority is reduced somewhat when the majority 
includes an individual making a correct judgment. 


Asch (1952) found that not only did the extent of 
conformity decrease when a “partner” was added 
to the group of confederates but that the subjects 
felt considerably less tense during the experimental 
situation—a condition almost universally observed 
in conformity experiments with a unanimous group. 

The amount of divergence between what the 
subject thinks is the “correct” response and the 
group response is a significant factor in maintain- 
ing the individual’s confidence in his own judg- 
ments. Most findings in this area agree that as this 
divergence departs from zero, conformity increases, 
(At zero divergence, of course, the degree of con- 
formity cannot be evaluated.) Conformity reaches 
a maximum at a moderately small extent of diver- 
gence, depending largely on the task, and with a 
further increase in divergence conformity falls off 
(e.g., Blake, Helson, & Mouton 1957; Whittaker 
1964). There is at least one study, however, in 
which no effects of divergence on conformity were 
found (Birney & Houston 1961). 

Attraction and status, The relationship between 
the group constituting the false majority and the 
conformist has been investigated from various 
points of view. The two main foci of interest among 
these studies have been the attractiveness of the 
group and the status of its members, In general, 
the findings indicate that conformity increases with 
the individual's attraction to the group (e.g., Thi- 
baut & Strickland 1956). Bovard (1953), however, 
found no relationship between conformity to a 
group norm and the attraction of the members for 
one another. On the other hand, Kiesler (1963) 
reports a curvilinear relationship between attrac- 
tion to the group and conformity, with maximum 
conformity obtaining at moderate levels of attrac- 
tion. Status of the group, too, was found to enhance 
conformity. In particular, a difference in status be- 
tween the group and the conformist seems to have 
pronounced effects on the likelihood of conformity 
(e.g., Raven & French 1958). It is interesting to 
note in passing, however, that the status of the 
conformist itself does not bear a simple relation- 
ship to yielding. Harvey and Consalvi (1960), 
working with delinquent cliques, found that the 
second-ranking member is more likely to conform 
to a false consensus among the members of his 
clique than is either the first-ranking or the last- 
ranking member. 

Private and public situations. The operation of 
group factors upon conformity is well illustrated 
in numerous studies that compare responses made 
under public and private conditions. Two im- 
portant findings emerge when individuals, con- 
fronted by a false majority, are observed making 


their judgments privately, convinced that these 
will never be known by the group. The first of these 
is that, in comparison with the public situation, 
conformity decreases (e.g., Deutsch & Gerard 1955; 
Thibaut & Strickland 1956). The second and more 
important finding is that conformity to a false 
majority is not completely eliminated even when 
the judgments are made privately, secretly, and 
anonymously. Since the group is a significant 
source of rewards and punishments for the indi- 
vidual, the difference between conformity under 
public and private conditions depends to some ex- 
tent on what consequences the individual expects 
from the group as a result of his nonconformity. 
Dittes and Kelley (1956) have shown that individ- 
uals who were not fully accepted by the group but 
who perceived their chances of becoming full- 
fledged group members to be high conformed to 
the false majority regardless of whether they were 
asked to make a public or only a private commit- 
ment. Individuals who thought that their chances 
of being fully accepted by the group were low and 
who, as a matter of fact, feared rejection from the 
group showed conformity only in public. The false 
majority had little influence on their private opin- 
ions. These results have been further corroborated 
in a field setting (Menzel 1957). 


While research results on conformity are not 
always perfectly reliable, a good deal of informa- 
tion has been gathered in this field since the pio- 
neering studies of Sherif and Asch. These results, 
which show that even under completely private 
conditions the individual is likely to surrender, at 
least in part, to the pressures of a group making an 
obviously wrong judgment, clearly demonstrate the 
degree to which groups may control individual be- 
havior and perception. The experiments on con- 
formity, moreover, do not deal simply with trivial 
responses and psychologically trivial forces having 
little significance for behavior outside of the psy- 
chological laboratory. From the point of view of 
the subject, the experimental conformity situation 
is quite real and quite serious. His involvement in 
it and its impact have been demonstrated by physi- 
ological activity symptomatic of stress arising dur- 
ing the confrontation of the individual with a false 
consensus (e.g., Back et al. 1963) and declining 
as a result of his agreement with the group (Law- 
son & Stagner 1957). 

ROBERT B. ZAJONC 


[Directly related are the entries COHESION, SOCIAL; 
IMITATION; SUGGESTION. Other relevant material 
may be found in GROUPS, article on GROUP FORMA- 
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TION; REFERENCE GROUPS; SOCIAL CONTROL; SOCIALI- 
ZATION; and in the biographies of LE Bon and 
TARDE.] 
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I 
THE CONCEPT OF CONSENSUS 


Consensus is a particular state of the belief sys- 
tem of a society. It exists when a large proportion 
of the adult members of a society, more particularly 
a large proportion of those concerned with deci- 
sions regarding the allocations of authority, status, 
rights, wealth and income, and other important and 
scarce values about which conflict might occur, are 
in approximate agreement in their beliefs about 
what decisions should be made and have some feel- 
ing of unity with each other and with the society 
as a whole. Consensus may also exist between in- 
dividuals in primordial or personal face-to-face 
relationships as in a family or in friendship; it may 
exist within a charismatic corporate body like a 
church or sect or within a society. Here we shall 
be considering only the consensual state of a soci- 
ety, that is, “macrosocial” consensus. (Dissensus 
exists in situations where potentially divergent 
“interests” confront each other and consensus is 
absent.) 

Three elements crucial to the functioning of 
consensus are (1) common acceptance of laws, 
rules, and norms, (2) attachment to the institu- 
tions which promulgate and apply the laws and 
rules, and (3) a widespread sense of identity or 
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unity, which discloses to individuals who experi- 
ence it, those features in respect to which they are 
identical and therefore equal. The sense of identity 
diminishes the significance of the differences on 
which dissensus and hostile sentiments would 
otherwise focus. Although these elements may vary 
independently, the strength of any one of them 
helps to strengthen the others. 

The pattern of consensual belief. Consensus 
refers to distributions of scarce values among indi- 
viduals who are in face-to-face interaction and who 
are competing for these values. Between workers 
and supervisors and among workers on a shop floor, 
among members of different political parties, or 
members of different factions within a political 
party, consensus operates to restrict the extension 
of dissensus and to limit conflict with regard to the 
allocation of valued objects. In these face-to-face 
situations the consensus is ordinarily, although not 
always, a “derivative” of beliefs which have a wider 
range of reference. The adjudication of conflicts 
with immediate superiors, colleagues, or political 
rivals is often a specific product of norms or max- 
ims which have a more general and a wider rele- 
vance. (The same applies to dissensus. ) Relatively 
few persons are in face-to-face interaction with 
those who are remote from them in such society- 
wide distributions as income, power, and deference, 
but it is to these larger distributions that attention 
is given, and it is about them that “serious” beliefs 
are held. Individuals are capable of perceiving the 
approximate dimensions of these larger distribu- 
tions and of passing judgment on their rightful- 
ness; indeed, the greatest importance is attributed 
to these distributions. Human beings are capable 
of rendering judgments about situations in which 
they do not actually participate, and many are 
strongly inclined to do so. Intellectual and moral 
sensibility impel them to judge the quality of their 
society and to formulate the attitude they should 
take toward it and toward the larger distributions 
which determine it. The fact that beliefs about 
such distributions are beliefs about power—a fact 
which draws attention to itself and which urgently 
demands judgment—makes macrosocial distribu- 
tions into objects of vital beliefs. 

The beliefs with respect to which consensus may 
exist include both cognitive propositions and moral 
standards about the justice or injustice of the dis- 
tribution of roles, facilities, and rewards and about 
the worth of the institutions of authority and order 
by which these distributions are brought about, 
maintained, and changed. These beliefs usually 
concern the rightness and the qualifications of 
those in authority to exercise it and of those re- 
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ceiving different amounts of the valued objects to 
receive what they do. These beliefs also concern the 
legitimacy of the institutions through which the 
bearers of authority and the recipients of the dif- 
ferent shares are selected. In a condition of macro- 
social consensus, the most important beliefs relate 
to the sense of unity which extends from those who 
possess it, whatever their own position, to strata 
of the population very differently situated in the 
distributions of rewards and very different in their 
share in the exercise of authority. Macrosocial con- 
sensus by definition does not include such beliefs 
as those which refer to the right order of personal 
relationships, the proper objects of aesthetic experi- 
ence and judgment, the origin and structure of the 
cosmos, and the nature and powers of divinity. 
(It is of course quite likely that where consensus 
exists with respect to such beliefs, the macrosocial 
consensus in which we are interested is more prob- 
able and more enduring.) 

Until the emergence of the modern liberal state 
with a plurality of religions, diverse political par- 
ties, and institutions for the conduct and control 
of class and other sectional conflicts, rulers tended 
to believe that consensus in religious beliefs was a 
necessary condition of the consensus required for 
social order. In the nineteenth century this view 
ceased to be regarded as valid. Yet it is quite pos- 
sible that consensus of beliefs apart from those 
about the justice or injustice of the distribution of 
scarce and desired objects helps to create and main- 
tain the vague sense of unity which is an essential 
part of consensus. There is perhaps a deeper truth 
hidden in the quip which asserted that England was 
a “country of a hundred religions and one sauce.” 
The approximate uniformity of standards of taste 
might have contributed to the sense of unity which 
enabled a macrosocial consensus to function de- 
spite the divisions concerning questions of theology 
and ecclesiastical organization [see LEGITIMACY]. 

Abstract or general ethical and political beliefs 
(“principles”) can enter into consensus insofar as 
they affect agreement or disagreement on particu- 
lar issues of legitimacy, distribution, selection, and 
so on. A situation in which general principles, ratio- 
cinatively or affectively arrived at, explicitly and 
visibly determine particular orientations is by no 
means impossible. Still, it is relatively rare in any 
society, and only rigorous ideologists insist on a 
very close dependence of particular decisions on 
the explicit principles which are part of an explicit 
and systematic belief system. 

Human beings assess the actions and orienta- 
tions of other persons in general categories, and 
their assessments give some direction to their par- 
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ticular and immediate response to these persons. 
General ethical standards, vague notions of the 
right ordering of life, and conceptions of the vir- 
tues which entitle men to power, deference, and 
other rewards and facilities can have some influ- 
ence on concrete responses to particular issues; 
insofar as they are consensually shared, such gen- 
eral orientations may be said to constitute part of 
the consensus. 

The beliefs which enter into consensus are not 
clearly articulated or systematically ordered. They 
are expressed sometimes in maxims and sometimes 
in ambiguous terms like “fairness.” They are often 
formulated negatively, repudiating particular situ- 
ations but not indicating what is positively right. 
Even the corpus of law, which is the most articu- 
lated precipitate of consensus, is full of gaps and 
is rarely very coherent in a logical sense; what is 
more, the specific imperatives and prohibitions con- 
tained in law are far more differentiated than the 
beliefs actually formulated or articulated by those 
who share in consensus. In societies with written 
constitutions, the body of beliefs and rules and 
images contained therein has some correspondence 
with the prevailing consensus, but the correspond- 
ence is only approximate and uneven. 

Dissensual patterns of belief are often more ex- 
plicit and systematic than are the consensual pat- 
terns which affirm the existing central institutional 
system. The more dissensual it is, the greater the 
likelihood of explicitness and systematic coherence 
in a pattern of belief. Accordingly, consensual be- 
lief patterns tend to be more pluralistic in the sense 
that they espouse a number of beliefs which are not 
wholly consistent with each other and which are 
able to coexist quite easily as long as no one of 
them is carried out in full. Thus, in contemporary 
Western liberal societies both equality and liberty 
are believed to be very important, but neither is 
pressed to complete fulfillment. They remain vague 
indicators of a direction or tendency rather than 
specific imperatives. A greater monism is generally 
but not always characteristic of dissensual patterns 
of belief. There probably is a positive correlation 
between monism in the pattern of belief and the 
degree of dissensuality vis-a-vis the prevailing pat- 
terns of belief. 

The beliefs which enter into consensus (or dis- 
sensus) vary in the intensity with which they are 
adhered to even within a single pattern of belief; 
some component beliefs might be very stringently 
insisted upon, others less so. Correspondingly, 
within a given society some of the adherents of a 
given pattern of belief might have a high average 
level of intensity of adherence, others might have 


a lower average level. The adherents of a dissensual 
(internally consensual) pattern of belief will usu- 
ally have a higher average level of intensity of 
adherence than the adherents of a consensual 
pattern. 

Consensual patterns of belief tend to be affirma- 
tive about the distribution of authority, the legiti- 
macy of its results, and the mechanisms and 
standards by which it is effected. There is nothing 
in the definition of consensus that requires this to 
be so, and there might indeed be occasions when 
the prevailing system of authority becomes the ob- 
ject of a widely shared negative consensus. This 
would be a situation in which the paradox of a 
dissensual consensus is realized. But that would be 
the exception. Insofar as a dominant consensus 
exists in a society, it usually more or less affirms 
the existing system of authoritative institutions and 
the distributions associated with them because 
authority, when it is effective, tends to establish 
its own legitimacy. The coercive powers of authority 
also help it to call forth a conformity which justifies 
itself by an acknowledgment of the legitimacy of 
the powerful. Perhaps even more important is the 
close affinity of outlook of the central cultural sys- 
tem with the central institutional system, and this, 
through teaching and exemplification, tends to dif- 
fuse itself into the outlook and the programs of 
large parts of the society, 

The empirical analysis of the pattern of con- 
sensus confronts the same difficulties as the empiri- 
cal analysis of beliefs actually held by individuals. 
Because of vagueness, ambiguity, unsystematic 
character, and variations in level of abstraction, 
actually held beliefs are difficult to describe, and 
there is a tendency on the part of sociological and 
anthropological analysts to systematize, clarify, 
and specify actually held beliefs to a point which 
makes the holder of such beliefs appear to be a sys- 
tematic philosopher. Premises are rendered explicit 
and particular judgments are generalized; as a 
result of this process, beliefs which are not know- 
ingly held are imputed by the analysts, who thus 
distort the nature of consensus and the mechanisms 
through which it operates. 

The social structure of consensus. In no soci- 
ety, however consensual, is the consensus ever 
universally shared. Nor does it depend upon univer- 
sal participation to be effective. Those adults who 
do share it, do so with very different degrees of 
intensity or concern. Of course no adult member 
of society is outside the system of allocation of 
scarce roles, facilities, and rewards, and as a result 
no adult, unless he is utterly—indeed almost cata- 
tonically—apathetic, can entirely avoid rendering 
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judgments at least about that sector of the distribu- 
tion which he perceives immediately around him. 
Moreover, the judgments which he renders are 
related in a determinate way to those current in 
the culture of his society. Yet his interest in the 
larger distribution and the problems of legitimacy 
associated with it might be very faint and in- 
frequent. 

Since one of the most important foci of consen- 
sus is the system of allocation of the whole society, 
those persons who do not concern themselves much 
or intensely with the total allocation may be said 
to be somewhat marginal in the structure of con- 
sensus. Even where they share general beliefs, 
about what is “fair,” for instance, the fact that they 
apply these beliefs only or primarily to their imme- 
diate environment and not to the larger society 
means that they do not participate in the consensus 
to the same extent and in the same way as those 
who apply the consensually shared norms more 
intensely and frequently to the macrosocial system 
of allocation. 

In societies which are relatively unintegrated 
and where centrally made decisions therefore im- 
pinge only very intermittently and marginally on 
certain sectors of the society (e.g., isolated villages 
which do not participate in a national economy), 
the members of those sectors might be entirely out- 
side the structure of consensus. But in modern 
societies, where government plays such a large part 
in influencing allocations and where country-wide 
communication and transportation systems bring 
at least some features of the central institutional 
system into the field of attention of even the un- 
educated and apathetic, complete abstention from 
the structure of consensus (or dissensus) is very 
rare. 

On some issues and on some occasions most of 
the adult population is consensual, but even within 
such general consensus there are many individual 
differences. Apart from differences in intensity 
among those who share in any particular belief in 
a consensus, there are differences in the number 
or proportion of beliefs adhered to. Every consensus 
is constituted by a plurality of beliefs, such as be- 
liefs about the right principles of remuneration, 
the proper shape of particular distributions (equal- 
ity and inequality), the prerogatives of property 
ownership, the propriety of incumbency by certain 
individuals and classes of individuals in certain 
positions on particular distributions, and the value 
of the total national community. Within the “pub- 
lic” of a consensus, some sectors might be attached 
intensely to all the beliefs, others intensely only to 
some of them and faintly to others, and some might 
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be attached only to a very few and repudiate most 
others. A common acceptance of certain of the 
beliefs might bind the entire “public” into a partial 
consensus; just how partial it is depends on the 
extent to which other beliefs are repudiated and 
various contrary ones espoused by the various 
subsectors of that public, For example, nearly 
everyone in a given country might accept the legiti- 
macy of the electoral system and of the authorities 
selected through it, and they might also accept a 
particular unequal distribution of income; but they 
might disagree sharply about the distribution of 
deference, and about the capacity of particular in- 
cumbents of authoritative roles to act beneficially 
on behalf of the society as a whole. 

Consensus exists in a complex interplay with 
dissensus, Dissensus is the state of disagreement 
of beliefs about allocative decisions and results. 
Those who are dissensual with respect to a par- 
ticular feature of an allocative system or process 
might be consensual with respect to other features 
or about the general properties of the system, Thus 
the dimensions or boundaries of the structure of 
consensus are variable within a given society, not 
only over a longer trend but from situation and 
problem to situation and problem. 

Party politics in modern societies are organized 
efforts to shift the boundaries of the partial dis- 
sensus within the framework of a partial consensus. 
But alongside these deliberate efforts to change the 
structure of consensus, there is a continual shifting 
of boundaries among partially dissensual groups. 
Beliefs about which there was once disagreement 
(e.g., concerning the right of the unemployed to 
public assistance) become consensual, while pre- 
viously consensual beliefs, such as those concern- 
ing the right of the inheritance of property, become 
partially dissensual. These historical shifts in the 
structure of consensus arise in part from the strug- 
gles of the proponents of the partially dissensual 
programs, in part from demographic and techno- 
logical changes, which are accompanied by changes 
in occupational structure, and in part from the un- 
folding of the dynamic potentialities of various 
patterns of belief. 

The variations in the “public” of consensus (ie., 
in the size of the population participating in it) 
testify also to the simultaneous coherence and in- 
coherence of the patterns of belief which individ- 
uals bring into consensus, If each individual had a 
perfectly systematized pattern of belief, those who 
disagreed with him on one particular belief would 
disagree with him on all others. On the other hand, 
if this pattern of belief were totally incoherent, 
there would be no stability in the “publics” formed 
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by the consensus around particular beliefs. Agree- 
ment on one issue would not entail any probability 
of agreement on any other. Yet, as we know, every 
consensual pattern of belief is borne in the first 
instance by a persisting “public” of adherents who 
gather around them, on particular issues and clus- 
ters of issues, extended and somewhat fluctuating 
“publics.” Some who are consensual on most issues 
are dissensual on a few; others who are dissensual 
on most issues are consensual on a few. From this 
determinateness of pattern an approximate stability 
of the “publics” is borne. 

The function of consensus. Consensus main- 
tains public order, that is, it reduces the probability 
of the use of violence in the resolution of disagree- 
ments; it increases the amount of cooperation 
which is not impelled by fear of the coercive power 
of the stronger party. It does so by (1) the reduc- 
tion of the probability of disagreements, (2) the 
restriction of the intensity of affect and strength 
of motivation with which disagreements are con- 
ducted and the softening of the rigidity of the 
attachment to the objectives about which the dis- 
agreements exist, and (3) the fostering of a readi- 
ness to accept peaceful modes of adjudicating dis- 
agreements among those who have a sense of 
mutual affinity or identity. 

There is no natural harmony of interest among 
men in society. Men are diverse in their propen- 
sities, and the material and symbolic objects which 
they seek are scarce in relation to the demand for 
them. Their “interests” are in conflict. Furthermore, 
a society—particularly a large-scale society—dif- 
ferentiated by generations, by occupations, by status 
and culture, will naturally tend toward a differen- 
tiation in beliefs regarding the rightness of the 
actions of authority and the justice of the existing 
social order [see ConFLIctT]. Where it exists, con- 
sensus is a counterforce against the fulfillment of 
the divisive potentialities of these divergent “inter- 
ests” and beliefs. Consensus facilitates collabora- 
tion: it reinforces the cooperation which arises 
from coincidences of interest, limits the range of 
the divergence of interests by defining ends in a 
way which renders them more compatible, and 
circumscribes the actions injurious to cooperation 
which might arise from the divergent interests. 

To be effective consensus depends particularly 
on those persons scattered throughout the society 
in many classes, regions, and occupations who 
have a fairly continuous concern with (1) the 
macrosocial distribution of roles, facilities, and re- 
wards, (2) the particular decisions taken in the 
center of the society inasmuch as they affect these 
distributions or are affected by them, and (3) the 


institutions in which these decisions are taken or 
which influence the distributions. The concern may 
take the form of institutionalized participation in 
the decisions bearing on these three aspects of dis- 
tributive events, or it may take the form of attitu- 
dinal involvement which affects action. The con- 
cerned are elites who, on behalf of the strata and 
collectivities generated by the distributions or on 
behalf of some ideal arrangement, take it upon 
themselves or are institutionally entitled to pass 
judgment and to attempt to influence opinion and 
decision. Their agreement or disagreement on par- 
ticular issues can promote harmony or conflict in 
the working of institutions and in the relationships 
of strata and collectivities. [See ELITEs.] 

Public order and effective cooperation among 
the diverse parts of the society do not, however, 
require complete and continuous consensus even 
among the elites. What is important for the main- 
tenance of order is that their disagreements about 
particular issues should exist within a consensual 
matrix. This consensual matrix is maintained 
largely by the sense of oneness with each other 
and with the whole society; this sense of oneness 
manifests itself in a sense of affinity even with 
persons with whom there might be many particular 
disagreements. Coalitions of interests, the bound- 
aries of which cut across each other, keep the sense 
of oneness from disintegrating. (Cleavages formed 
by the coincidence of boundaries of interest coali- 
tions, on the other hand, do serious damage to the 
sense of oneness.) The power to coerce is another 
important element in maintaining order in society, 
but it never operates alone for any length of time. 
Without a strong consensual reinforcement, coer- 
cion could in itself never be effective. It is a sup- 
plement to consensus, not a mutually exclusive 
alternative. 

The formation and change of consensus. The 
family which inducts the newly born organism into 
society is the first instigator of consensus. Within 
the family the child acquires generalized and affir- 
mative attitudes toward authority, which are the 
preconditions of a subsequent assimilation into a 
consensus based on authoritative institutions at the 
center of society. In school the child acquires some 
of the culture of the larger society, some knowledge 
and appreciation of its heroes, great events, and 
territorial scope; he forms an image of the society. 
From childhood on, great collective rituals repeat- 
edly renew the sense of unity with the larger 
society, and recurrent interaction with like-minded 
persons maintains and reinforces the disposition to 
attribute validity to those who speak and act 
through and for the central institutional system 
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and those who speak on behalf of the central cul- 
tural system. As a result, a substantial proportion 
of the indigenous population grows into a consen- 
sual culture which accepts the rightness and justice 
of existing distributions, the norms for judging 
them, and the institutions for maintaining and 
changing them [see SOCIALIZATION]. 

But the consensus never includes all the popu- 
lation, Some families and sectors of the population 
classes and ethnic groups—are at the margins 
of the dominant consensus or even outside it. Some 
people even reject the pattern of the beliefs which 
inform the dominant consensus as far-reachingly 
as is possible under the pressures of authority and 
the permeative influence of the central cultural 
system. Every large society has a dissensual as 
well as a consensual culture, which is sustained 
by religious traditions and regional and class cul- 
tures as well as by recurrently renewed ethical and 
metaphysical criticism of the prevailing system of 
authority and the allocations of which the incum- 
bents of authoritative and elite roles are the bene- 
ficiaries. Those who experience pain from the exist- 
ing distributions of income, power, and status, 
although often sharing much of the consensual 
pattern of belief, also have contrary inclinations. A 
society which inflicts the distress of a sense of 
exclusion and inferiority cannot wholly succeed in 
assimilating into its affirmative consensus those 
whom it wounds. 

The numerous particular situations of conflicts 
of interest and of norms of what is fair and just 
constitute the occasions for repeated rearrange- 
ments and regroupings of those who are consensual 
and those who are dissensual. The boundaries shift 
and shade off from the one zone into the other. At 
times the consensus might be very inclusive and 
at others it might lose some or much of its following. 

The strength of the central institutional system 
and the deep penetration of the central cultural 
system result, however, in a considerable tenacity 
of the beliefs that make up the consensus. But the 
beliefs themselves undergo gradual changes. In- 
deed, the traditional character of most of the con- 
sensual beliefs renders such changes feasible, and 
this too helps to maintain the consensus. The am- 
biguity which is inherent in traditionally trans- 
mitted beliefs fosters flexibility and permits con- 
tinuity in the face of changing circumstances. 
Moderate changes in the structure and in the in- 
cumbents of the various positions in the distribu- 
tions of power, income, and status can be borne 
without a serious diminution in adherence to the 
main beliefs in a prevailing consensus. Tf the losers 
by these allocative changes are not too drastically 
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affected, their participation in the consensus will 
not be greatly affected. Much depends on the con- 
tinued strength of the central institutions. If these 
continue to maintain the appearance of effective- 
ness, dissensual tendencies will be held in check. 
Strong attachment to the society as a whole, aided 
by the apparent effectiveness of authority, inhibits 
dissensual tendencies. Beliefs about “luck” and a 
belief that results are indicative of qualifications 
render losses even more bearable. 

When, however, drastic changes occur, the con- 
sensus is likely to be weakened by a deeper and 
more comprehensive withdrawal of those who are 
severely hurt by the changes. The diverse poten- 
tialities of interpretation which every consensus 
contains tends, under these circumstances, to be 
subjected to constructions which run in opposed 
directions, and reconciliation becomes more diffi- 
cult. It is also likely that large and rapid increments 
in the quantity of rewards received will disrupt the 
participation of the gainers in the hitherto domi- 
nant consensus. Such increments often raise the 
level of aspiration well beyond the limits permitted 
by the beliefs of the traditionally established 
consensus. 

Changes in technology disclose new possibilities 
of changing the share in valued objects possessed 
by various strata of the population. Those who per- 
ceive these possibilities and who, by virtue of their 
control over resources, are in a position to enhance 
their share will usually seek to do so. If they are 
prevented from doing so by the fixity of attach- 
ment of the incumbent elites to the existing dis- 
tributions and by the rigidity of the patterns of 
belief of those elites about the rightness of the 
existing distributions, a new focus of dissensual 
beliefs will be formed. Technological innovations 
also engender new occupations with new occupa- 
tional cultures. These new cultures might contain 
beliefs which cannot always or easily be accom- 
modated within the existing consensus, and this 
too leads to new foci of dissensual beliefs. 

Yet, despite the strains to which it is constantly 
subjected, the consensus of a society has much 
adaptability and considerable powers of endurance. 
Even in periods of acute civil disorder, when the 
previously legitimated authority has been expelled 
or has shown its weakness, the consensus is not 
entirely in dissolution. Society can never dissolve 
into a Hobbesian state of nature. Even though it 
has ceased to bind the warring groups, the sections 
of the population which are not intensely involved 
in the conflict might be quite consensual in their 
orientations toward each other or, at least, much 
more so than the groups which are violently in 
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conflict with each other. The disruption of society 
in situations of acute civil disorder consists of the 
active contention of violently dissensual elites for 
the machinery and symbols of authority and the 
control of the system of distribution. The persist- 
ence of the more consensual sections of the popu- 
lation in the beliefs they have hitherto held fosters 
the re-establishment of a substantial measure of 
consensus when the crisis passes. Even the two 
warring parties usually contend against each other 
on behalf of divergent interpretations of a com- 
monly shared constellation of beliefs. It is only that 
the sense of unity, of attachment to the whole soci- 
ety, which is an essential constituent of consensus, 
has been so violently ruptured that the affinity of 
substantive beliefs retains no restraining power. 
Once civil order is restored, consensus gradually 
becomes re-established. It will not be exactly the 
same consensual pattern of belief that existed pre- 
viously, The newly established elite, legitimated by 
effective incumbency, will both deliberately and 
unwittingly infiltrate some of its own beliefs into 
the previously operative consensus. However, the 
members of the new elite will, in their turn, become 
assimilated into the basic consensual pattern which 
is held by those they rule and which they too shared 

before coming to power. 
EDWARD SHILS 


[See also AUTHORITY; CONFLICT; COOPERATION; DE- 
MOCRACY; EQUALITY; IDEOLOGY; Norms; VALUES. 
Other relevant material may be found under Law.] 
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LLS 
THE STUDY OF CONSENSUS 


Political consensus involves kinds of agreements 
that are politically relevant. Students of this sub- 
ject have been concerned largely with questions of 
the stability of regimes, particularly democratic 
regimes. 

Four uses of the idea of consensus have been 
developed by writers interested primarily in the 
viability of democratic political processes. One con- 
ception has seen consensus as agreement on the 
“fundamentals” of democratic government. There 
is marked divergence, however, on the questions of 
what it means to agree on fundamentals and just 
how significant such agreement is (Griffith et al. 
1956). Others have argued that the consensus nec- 
essary in a democracy is chiefly a matter of habit- 
ual patterns of behavior which are more important 
than conscious agreement on democratic princi- 
ples (Prothro & Grigg 1960). A third position has 
emphasized the question of the acceptability of 
governmental policy to significant social groups. 
In this view, consensus on emotion-laden issues 
such as those embodied in the welfare state can 
have decisive influence on the stability of the 
political system (Lipset 1964). A fourth view of 
consensus in the democratic context has defined 
consensus as existing where all structurally impor- 
tant social groups contain adherents of major 
political parties. This view of consensus sees it as 
the outcome of “cross-cutting cleavages” which en- 
sure that social divisions and political conflicts do 
not reinforce one another too thoroughly. Here 
consensus is the opposite of acute conflict and is 
hypothesized as leading to political tolerance and 
a low-tension politics (Parsons 1959). It should be 
clear that these four conceptualizations of politi- 
cally relevant consensus are not mutually exclusive 
but in some cases complement each other, while 
in others they conflict. 

Two other views of consensus, although also 


related to the problems of democracy, have been 
developed and employed in different political con- 
texts as well. The first of these is consensus as 
legitimacy: approval of the existing government 
and/or its directives. Understood in this sense, 
consensus is seen to be one of the key elements in 
governmental stability. Others have discussed con- 
sensus in terms of the basic moral and social per- 
spectives that underlie political life. This has been 
a matter of deep concern to students of developing 
nations, In this view, the problem of creating con- 
sensus is closely related to questions of political 
socialization, ideology, political myths and rituals, 
modernization, and personality structure. 

Each of these concepts of politically relevant 
consensus deals with a form of agreement. Al- 
though this notion of agreement appears to be the 
core of the idea of consensus, such a conclusion 
hardly begins to clarify the important problems. 
Of acute concern, and the object of most contro- 
versy among students of consensus, are the ques- 
tions of how various kinds of agreement or lack of 
agreement are related to political stability, democ- 
racy, and the limitation of conflict. 

Consensus and political philosophy. Preoccu- 
pation with problems of social cohesion and legiti- 
macy was characteristic of political philosophy 
from its beginnings. Plato and Aristotle explored 
questions of political consensus. Both, for example, 
emphasized the significance of appropriate political 
socialization through education if the polity was to 
be stable, Several of the books of Aristotle’s Politics 
are concerned with problems of political viability 
and revolution and deal extensively with govern- 
mental legitimacy. 

Concern with political cohesion has been a re- 
current one in the modern period. Contract the- 
orists such as Locke, Hobbes, Rousseau, and Paine 
emphasized the need for a rational, conscious cre- 
ation of political consensus and employed the idea 
of the “contract” as the basis of legitimate au- 
thority. The utilitarians, and to a certain extent 
John Stuart Mill, provided an “interest” justifica- 
tion of government. In their view, a secular con- 
sensus should be based on a recognition that 
government should serve the interests of the great- 
est number. 

Political theorists of a more conservative or aris- 
tocratic hue have been profoundly concerned with 
consensus as a problem. Burke, for example, feared 
the spread of the ideology of the French Revolution 
because, in his view, it called into question the 
legitimacy of all forms of government. He cham- 
pioned political ritualism, a hierarchical social 
order, and close church-state relations as impor- 
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tant sustainers of consensus about governmental 
legitimacy. De Tocqueville emphasized the peculiar 
problems of consensus in a democratic polity and 
argued for the increased significance of religion as 
a binding force in direct proportion to the growth 
of political liberty. Concern continues with the sig- 
nificance of ritualism and religiosity in creating 
consensus. Several authors have emphasized the 
role of political holidays and ceremonies in pro- 
ducing a sense of national unity. 

Nationalism has been discussed as a semi- 
religious unifying motif in Western nations (Key 
1961, p. 43). It has also been argued that the 
American president and the British monarch in- 
spire a semireligious attachment which serves as 
a primordial unifying bond in the political system. 
Similar hypotheses have been offered with regard 
to the role of nationalist ideologies and nationalist 
leaders in developing nations (Apter 1963). 

Modernization and political consensus. There 
is agreement that modernization processes gen- 
erally involve acute crises of political consensus. 

Western societies. Lipset (1960, pp. 76-97) 
has discussed this process as it occurred in West- 
ern societies in terms of three basic crises of legiti- 
macy: first, the crisis involving church-state 
relations; second, the crisis involving the entry of 
the working class into politics; and third, the crisis 
surrounding questions of redistribution of wealth. 
He argues that governmental legitimacy can be 
undermined by these crises in various ways. For 
one, the entry of the working class into the political 
arena can be delayed too long, thus making for a 
revolutionary ideology in that class. Second, crises 
can accumulate through the inability to solve any 
one of them, In such a case, as in France, cleav- 
ages are intensified and governmental stability 
fundamentally endangered. Finally, too radical a 
change in the political system, again as in France, 
can alienate the more conservative groups, thus 
undermining the conditions for mutual tolerance. 

Lipset concludes (1964) that the wide accept- 
ance of a secularized welfare state marks the ends 
of these crises of consensus. Post-World War 1 
developments, especially economic growth, settled 
the intense political conflicts of previous western 
European politics. The older, largely nineteenth- 
century ideologies are outmoded. A new ideology, 
named “conservative socialism” by Lipset, has 
gained acceptance among major social groups and 
the major political parties. Political conflict, then, 
has lost its intensity and no longer revolves around 
what are felt to be fundamental differences of 
view. Europe is described as coming to resemble 
America in its politics, in that political parties offer 
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similar programs and a large middle class plays a 
crucial moderating role. Many have seriously ques- 
tioned this view. LaPalombara (1966a; 1966b), 
for example, has raised important methodological 
issues and also has challenged the substance of the 
argument as it applies to Italian politics. 

Lane (1962) has explored this new “ideology” 
as it appears among upper-working-class and 
lower-middle-class Americans. His data are drawn 
from intensive interviews of 15 men. He maintains 
that the strata these individuals represent are 
largely satisfied with the existing political system, 
as well as with the present patterns of social strati- 
fication. Their political attitudes are characterized 
by pragmatic responses, an absence of moral per- 
spectives on current issues, and the lack of a dis- 
tinct ideological identity. Lane argues that these 
individuals identify with various social groups and 
therefore their ideology, such as it is, is diffuse 
and tolerant. His findings support those of survey 
research concerning the attitudes of Americans 
toward the political parties and their candidates. 
It is unclear whether this “nonideology” is a dis- 
tinctly contemporary phenomenon in America or 
is merely the modern version of older aspects of 
American politics, It is also unclear whether the 
“ideology” Lane has uncovered is confined to a par- 
ticular geographic section of the United States and 
to a particular stratum of the population. 

Developing nations. Political consensus ex- 
pressed in terms of governmental stability is re- 
garded as a fundamental problem in developing 
nations. Many maintain that discontinuous social 
change, creating sharp imbalances between expec- 
tations and opportunities, makes political stability 
a rare and yet overwhelmingly necessary commod- 
ity. This gap between expectations and opportu- 
nities is conceptualized variously as a result of 
excessively rapid urbanization, the absence of 
employment for highly trained personnel, or the 
spread of ideas of equality and economic develop- 
ment. 

One problem of modernization involves the gap 
between mass and elite, This gap prevents effective 
communication in both directions and therefore 
blocks the development of a national normative 
culture and national identity, as well as preventing 
widespread needs and demands from becoming 
known to the elite with sufficient speed (Binder 
1964), 

Intense conflict and the absence of consensus on 
the ends and means of political life are also char- 
acteristic of developing states. Students of many 
countries have observed the severity of political 
party conflict involving groups with sharply dif- 


ferent world views, which often takes on some of 
the aspects of warfare. In these cases, the stakes 
involved in politics are perceived as being so high 
that no political group is willing to concede defeat 
voluntarily. The norms of tolerance cannot develop 
and governmental legitimacy is constantly threat- 
ened. Democratic politics may be able to develop 
only where political activists do not feel the stakes 
of competition are too high. We know little, how- 
ever, about the circumstances in which such per- 
ceptions change, and this knowledge is critical to 
the problems of developing nations. 

Consensus and “fundamentals” of democracy. 
Scholars have maintained that agreement on cer- 
tain fundamentals is a prerequisite of democracy. 
These fundamentals have been variously seen as 
the belief in human equality, the belief in the supe- 
riority of democracy, the belief in tolerance for 
dissenting views, and the belief in majority rule. 
Recent empirical examinations of the actual state 
of agreement on such matters in the United States 
has led to a more complex picture of the relation- 
ship between consensus on fundamentals and 
democratic practices. 

Prothro and Grigg (1960), using 90 per cent 
agreement as a measure of consensus, found it to 
exist on matters of majority rule and minority 
rights in two American cities, as long as statements 
were phrased in very abstract terms. When state- 
ments were made more specific, however, con- 
sensus broke down. Agreement appeared to be 
especially fragile when communists and Negroes 
were involved in the specific statements. The au- 
thors found that higher education and higher social 
status were related to greater agreement on the 
worth of majority rule and minority rights. They 
concluded that democracy can exist without con- 
sensus because many of the undemocratic are 
apathetic, and because agreement on specific pol- 
icies sustains unity despite lack of consensus on 
fundamentals. 

McClosky (1964), building on earlier work, 
found important differences in support of demo- 
cratic norms between party activists and the gen- 
eral electorate. Party activists were more likely to 
support majority rule and minority rights and to 
believe in governmental fairness. But even among 
activists disagreement was widespread, and there 
was frequently failure to achieve consensus on 
specifics. (McClosky defined consensus as 75 per 
cent agreement.) Consensus was highest among 
both activists and electorate on abstract statements 
concerning freedom of speech and of dissent. Con- 
sensus was weakest on questions of political, social, 
and economic equality. McClosky noted the use- 


fulness of apathy when many do not share demo- 
cratic norms, although he maintained that this 
condition has its dangers. In particular he stressed 
that many may mistakenly support undemocratic 
organizations and practices while thinking that 
they are defending freedom. In general, he argued, 
consensus is not necessarily a condition of political 
stability. When conditions are stable, consensus 
may be unnecessary. Only in time of social disor- 
ganization is consensus important as a stabilizing 
force. Finally, he argued that the conditions which 
enhance consensus on democratic norms are be- 
coming more widespread in the United States: 
education, urbanization, social mobility, prolifera- 
tion of the mass media, expansion of the middle 
class, reduced size and isolation of rural groups, 
declining number of groups living on substandard 
incomes, and more complete integration of minor- 
ity groups into social and cultural life. 

Consensus and political symbols. The role of 
national holidays, ceremonies, and rituals in cre- 
ating or sustaining consensus has been little 
explored. It remains questionable that such events 
facilitate consensus. Arguments concerning the 
supposed integrative role of the British coronation 
ceremony have been effectively challenged (Shils 
& Young 1953; Birnbaum 1955). Contrary hypoth- 
eses also exist, It has been observed, for example, 
that the creed of Americanism and the ceremonies 
that accompany it have sometimes served as a focal 
point of conflict (Key 1961, p. 43). A fervent 
belief in this creed has often been associated with 
persecution of those groups regarded as foreigners 
or aliens. At present, one cannot be certain about 
the function of any particular political ritual with- 
out knowing the social circumstances in which it 
is enacted. 

It is fairly clear, however, that myths and sym- 
bols can play a role in diminishing conflict in 
specific policy areas (Edelman 1960). Consumer 
groups, typically poorly organized, often achieve 
symbolic victories through the passage of regula- 
tory legislation covering a particular area of the 
economy. Producer groups, more involved in the 
details of policy making, can often shape the sub- 
stance of this regulation in a direction congenial 
to them, If the public is unaware of this change in 
the original intentions of the legislation, it assumes 
that these intentions are being carried out. In such 
a case, the general public believes policy is made 
in its favor, while those being regulated are also 
content with the arrangements, and a rather 
strange consensus may be said to exist. One must, 
however, ponder the consequences of such a con- 
sensus for the real content of political democracy. 
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Problems and research needs. The empirical 
and speculative work concerning consensus on 
fundamentals in the United States is open to seri- 
ous question on several points ( Willhoite 1963). 

Measuring democracy. Some researchers seem 
to have assumed that the United States is in fact 
a democracy. This assumption has confused the 
picture of the relationship between consensus and 
democracy. For example, when Prothro and Grigg 
discovered that consensus is weakest on questions 
involving Negroes and communists, they went on 
to conclude that democracy can exist without con- 
sensus. But if, in fact, American democratic prac- 
tices are least adequate where Negroes and 
communists are concerned, then lack of consensus 
appears to be far more significant. In order to 
relate attitudinal consensus to democratic prac- 
tices, one needs to measure not only the degree of 
consensus but also the degree of democracy. This 
has not yet been done. Therefore one must retain 
an open mind on the question of the exact connec- 
tions between these two variables. 

Negative consensus. A different line of attack 
flows from the analysis of negative consensus. 
Many authors have pointed out the significance of 
a consensus on behavior that will not be allowed. 
This matter has been of particular interest to 
students of international relations who have con- 
centrated on the problem of “encapsulating” conflict 
(Etzioni 1964). The shared fear of war has been 
cited as a major factor in keeping conflict under 
control. Likewise, negative consensus may play a 
role in domestic politics. For example, the tacit 
agreement, perhaps conscious, not to raise certain 
sorts of issues is of great importance in facilitating 
peaceful adjustment of disputes. If this is the case, 
then negative consensus may take the place in 
domestic politics once credited to a positive con- 
sensus on democratic norms. The gains of the civil 
rights movement in America have flowed, in part, 
from a negative consensus among whites on the 
need to avoid violence. Given such a consensus, 
certain Negro demands have had to be met (Kil- 
lian & Grigg 1964). This interpretation runs 
counter to the assertion that the civil rights move- 
ment has been successful because equality is a 
significant tenet in the American creed, The hy- 
pothesis is that negative consensus operates in 
place of positive belief. These conflicting interpre- 
tations are in need of further investigation. 

Consensus, cooperation, and stability. The con- 
viction that consensus, in any sense, is necessary 
for political life has been vigorously challenged. In 
many cases social cooperation and political co- 
hesion may not depend upon consensus. Further, 
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conflict as well as consensus can lead to social 
integration and successful problem solving. 

There is no direct relation between consensus 
and political equilibrium or integration. Consensus 
can retard political and social adaptation as well 
as facilitate it. For example, as Van den Berghe has 
pointed out (1963), consensus on such norms as 
extreme competition, individualistic laissez-faire, 
treachery, and witchcraft does not necessarily aid 
social solidarity. Moreover, consensus within groups 
can hinder consensus among groups. 

Consensus is not identical with cooperation, and 
the latter can exist without the former. Many fac- 
tors besides consensus may contribute to coopera- 
tive behavior. Williams (1964, p. 383) has observed 
that such factors include mutual advantage, power, 
technical capacities for communication, and social 
mechanisms for settling conflicts, The precise role 
of consensus in political stabilization clearly re- 
mains a matter of dispute. 

Problems of creating consensus. How is con- 
sensus created, and how is it changed? Under 
what circumstances, for example, do previously 
outlawed forms of conflict become legitimate or 
acceptable? Such changes have taken place in the 
domestic politics of many countries, involving, for 
example, issues raised by political pamphleteering, 
strikes, and mass demonstrations. In these cases, 
more democratic norms have been achieved pri- 
marily through serious social conflict. It is clear, 
therefore, that the identification of democracy with 
government by consensus can make sense only in 
that ideal case in which a perfect democracy has 
already been created, 

Under what circumstances will governmental 
initiative be successful in altering a previously 
existing consensus? Research into American race 
relations has demonstrated that attitudes adapt 
over time to overt behavior, but clearly such adap- 
tation has limits and preconditions, Related to this 
question is the problem of determining under what 
circumstances governmental legitimacy itself is 
undermined, particularly in relation to the govern- 
ment's ability to cope with social change. 

On the policy-making level, democracy has often 
been described as politics involving bargaining 
among political leaders through which a consensus 
is created. There has been little systematic study, 
however, of this consensus-creating process. Fi- 
nally, a critical problem for students of interna- 
tional behavior is consensus creation concerning 
the limitation of conflict. Osgood (1962) has de- 
scribed the general principles of a process of 
tension reduction based on the building of trust 
through graduated reciprocal initiatives by the 
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parties to the conflict. Fisher (1964) has advocated 
the “fractionation” of conflict, that is, the breaking 
down of complex, emotion-laden disputes into 
smaller parts which may prove easier to com- 
promise (see also Etzioni 1964). The difficulties 
faced in achieving a negative consensus on con- 
flict are clear, but these highly suggestive strategies 
may help to overcome some of them, 

Lewis Lipsitz 


[See also Democracy; MAJORITY RULE; MODERNIZA- 
TION; SOCIALIZATION. Other relevant material may 
be found in POLITICAL SOCIOLOGY; PUBLIC OPINION; 
SOCIAL CONTRACT; and in the biographies of BURKE; 
Hosses; Locke; MILL; PAINE; ROUSSEAU; TOCQUE- 
VILLE.] 
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1. POLITICAL AND SOCIAL ASPECTS 
11. ECONOMIC ASPECTS 


I 
POLITICAL AND SOCIAL ASPECTS 


Whatever else it may mean, “conservation” 
when used in relation to natural resources is a 
virtuous, a worthy word. Just men do not oppose 
conservation. As U.S. President William Howard 
Taft complained in 1910, “The subject of Conser- 
vation is rather abstruse, but there are a great 
many people in favor of Conservation, no matter 
what it means.” 

In the United States, where this good word has 
been used most, it has been called to the support 
of many policies and programs that on their face 
do not seem terribly consistent. If there is any 
regularity among them, it is that the policies and 
programs have demanded reform, based on certain 
scientific, democratic, and moral objectives. 
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When “conservation” came to be used in the 
United States, at the beginning of the twentieth 
century, it was associated principally with new 
federal programs for forestry; for regulating the 
use of western public lands to protect, in addition 
to timber, the livestock ranges, outstanding scenic 
and recreational sites, and wildlife habitats; and 
for developing water resources for irrigation of 
western deserts and navigation of eastern rivers. 

A half century later, in the 1960s, “conserva- 
tion” was being used to describe and support gov- 
ernment programs relating to cities. The “new 
conservation,” as spokesmen for President John F. 
Kennedy’s New Frontier and President Lyndon B. 
Johnson’s Great Society have called it, is concerned 
with the quality of the urban environment—with 
programs to acquire and protect open land in 
metropolitan areas and to prevent ugliness and 
the pollution of the environment. Between 1900 
and 1960 the nation’s population and its social 
problems and government programs had shifted 
from being dominantly rural to being dominantly 
urban; the use of “conservation” to describe prob- 
lems and programs followed the same course. 

Furthermore, “conservation” had been used in 
the intervening years with reference to pressing 
needs of the moment. The great depression of the 
1930s and the cold war of the 1950s are exam- 
ples. Most of the conservation programs of the 
New Deal were undertaken to pull the nation out 
of a deep depression and to redistribute income to 
disadvantaged groups. The Central Valley project, 
Bonneville Dam, and many other resource-develop- 
ment projects were begun with funds appropriated 
to stimulate the economy by emergency public 
works. Payments to farmers for soil-building prac- 
tices were basically supplements to their low in- 
comes. The public lands were improved by unem- 
ployed urban youth recruited into the Civilian 
Conservation Corps. 

In the 1950s conservation was linked to plans 
and programs to insure the adequacy of raw mate- 
rials to meet the needs of the free world. The 
report of the President's Materials Policy Commis- 
sion, which articulated this approach, was con- 
cerned with how to “avert or overcome materials 
shortages which threaten the long-run economic 
growth and security of the United States and other 
free nations” (U.S. President's Materials . . . 1952, 
vol. 1, p. 2). The public policies recommended by 
the Materials Commission were radically different 
in kind from those that had been associated with 
conservation in earlier years, among them policies 
for influencing the rates of technological develop- 
ment in American industry and for guaranteeing 
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private American investments for the processing 
of materials in foreign countries. 

Notwithstanding the number of policies and 
programs that have been called conservation, cer- 
tain types of objectives have recurred sufficiently 
frequently to give some loose form to the concept. 
The first of these is scientific method. 


Scientific method as an objective 


Science and technology have played so dominant 
a role in conservation policies and programs that 
the scientific method has been used not only as an 
analytical technique for solving resource problems 
but as an objective of public policy as well. The 
authority of science has been used regularly to 
justify conservation programs. Supporters of con- 
servation have understood the public decision- 
making process to be the somewhat automatic one 
of collecting scientific data and applying scientific 
principles to them. Thus, for example, a physical 
inventory is made of the forest resource; scientific 
principles of forestry are applied to the data thus 
collected; and there results a public policy of pre- 
scribed cutting practices on timber lands. The 
word “planning” has often been used to describe 
this process. 

Alongside the great reliance on science and 
technology for solving problems, conservation pro- 
grams have normally included general and specific 
proposals for the support and promotion of sci- 
ence, technology, and data collection on their own 
accounts—to add to knowledge. 

The conservation movement of the 1900s was 
above all a scientific movement, and its role in 
history arises from the implications of science and 
technology for modern society (Hays 1959, p. 2). 
The leaders of the movement came from such 
fields as forestry, hydrology, geology, anthropology, 
and civil engineering—several of which had come 
to be recognized as professions, with societies and 
standards, only in the last part of the nineteenth 
century. A central theme of the movement was 
support for scientific data collection by such agen- 
cies as the then recently established U.S. Geologi- 
cal Survey and for scientific research by the new 
Forest Experiment Stations and the Agricultural 
Experiment Stations. 

The conservation programs of the New Deal 
were also scientifically oriented. The high-level Na- 
tional Resources Planning Board served several 
purposes during its ten-year lifetime, which began 
in 1933; but the one for which it was created 
initially, and which remained a central purpose, 
was to guarantee that public projects undertaken 
with emergency and other public-works funds were 


planned by technicians in accordance with sci- 
entific principles. 

In the report of the Materials Commission, which 
dominated thought on conservation in the 1950s, 
more pages are devoted to technology and its prom- 
ise than to any other subject. Although the com- 
mission found that “the Government, up from al- 
most nothing since the beginning of the century, 
is now the great force behind scientific and tech- 
nical research in this country,” it concluded that 
the effort was not sufficient and recommended 
more funds for basic research on materials and 
more government planning and coordination of 
materials technology (U.S. President's Materials 
. ». 1952, vol. 1, pp. 144-145). 

The “new conservation” of the 1960s, with its 
focus on beauty and quality of the environment, 
continues to emphasize science and technology. 
However, the classes of expert skills called upon 
to develop the technical principles to be applied 
are broader, Thus, the conservationist today may 
be a lawyer working on scenic easements, a land 
planner, or an architect. 

Scientific elitism. The central role of science in 
conservation programs has meant a central role 
in decision making for experts and, concomitantly, 
a reduced role for lay judgment. And since the 
conservation scientists have been largely in gov- 
ernment service, there has developed a scientific 
elitism that emphasizes government regulation, by 
the experts, of private interests; and executive, or 
expert, power at the expense of legislative, or lay, 
power. 

As for the distribution of power within govern- 
ment, scientific elitism has led to the belief that 
the experts of the executive branch are better 
equipped than the laymen of the Congress to make 
decisions relating to conservation, because these 
are considered to be basically technical in nature. 
Foresters should determine the allowable annual 
timber cut; engineers should study the feasibility 
of river-development projects; agronomists should 
determine which ranges to keep open for grazing; 
soil scientists, which lands to retire from agricul- 
ture; planning technicians should select the public- 
works projects to be built. Furthermore, conflicts 
among competing resource users should be dealt 
with by experts and not by the political processes 
that involve the legislature. Land-management ex- 
perts should resolve land-use differences between 
forestry, livestock, wildlife, irrigation, and settler 
groups. Water-resource experts should adjust 
power, navigation, flood-control, recreation, and 
upstream-downstream interests to promote the 
best multipurpose development of river basins. 
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Legislators, being poorly prepared in matters tech- 
nical, fall easy prey to special interests that have 
no concern for scientific truth. Members of Con- 
gress, therefore, when they have tried to reconcile 
conflicting groups, have worked a positively bad 
influence on conservation programs. 

As a consequence of this scientific elitism, pro- 
ponents of conservation programs have held, first, 
that the legislature should grant the executive the 
very broadest discretion to deal with problems, and, 
second, that executive officers have an inherent 
discretionary power and responsibility to do every- 
thing for the public good—as they interpret it— 
that is not prohibited by law. Thus, executives who 
are technically competent should be encouraged 
to do everything that the law will let them do, not 
merely what the law directs them to do. This latter 
doctrine, as far-reaching a statement of the public 
servant's discretion as is to be found in the litera- 
ture of American government, was set forth by 
Pinchot and practiced by him and his professional 
colleagues in the U.S. Forest Service (Pinchot 
1947, chapter 12). A similar doctrine was ex- 
pounded by Pinchot’s superior, Theodore Roose- 
velt, and came to be known as the “stewardship 
theory” of presidential power. Roosevelt said: “I 
declined to adopt the view that what was impera- 
tively necessary for the Nation could not be done 
by the President unless he could find some specific 
authorization to do it. My belief was that it was 
not only his right but his duty to do anything that 
the needs of the Nation demanded unless such was 
forbidden by the Constitution or by the laws” 
(Roosevelt [1913] 1946, p. 357). Of course, it was 
one thing, and at that, quite controversial, for 
Roosevelt to claim such broad authority for the 
president, deriving it from the inherent constitu- 
tional powers of that office; it was quite another 
for Pinchot to claim it for forestry and other con- 
servation technicians. 

The claims for broad bureaucratic discretion in 
conservation programs were more regular and ex- 
plicit early in the century than they have been 
since. Nonetheless, this feature of scientific elitism 
has continued to apply to programs called conser- 
vation. Thus, in the 1960s a major controversy 
developed over the Wilderness Bill, the executive 
departments holding that their technicians should 
be authorized to designate, subject to presidential 
approval, which public lands should be set aside 
for wilderness, excluding thereby all other resource 
users; laymen legislators holding that such desig- 
nations should be authorized by the Congress, with 
full opportunity for legislative hearings. 

Finally, scientific elitism has helped to pin a bad 
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reputation on the nation’s legislature—that of pro- 
ponent of special interests and opponent of scien- 
tific method. 

Physical versus economic objectives. The em- 
phasis on natural science in conservation programs 
has led to analyses of problems and statements of 
goals in physical rather than economic terms. The 
barrels of oil in the ground and the number of 
barrels pumped out each year; the timber inven- 
tory and the board feet of lumber cut each year; 
the inches of topsoil on the land and the tons of 
it that annually wash down the Mississippi River 
—these have been considered the relevant data. 
They have led to simple conclusions: We are in 
danger of running out of oil; the topsoil will be 
entirely gone in another x years—unless vigorous 
government conservation programs are activated. 
The responsive government programs have in- 
cluded most physical solutions—contour plowing 
and the construction of dams are examples—to 
satisfy goals that, also, have been put in physical 
measures. “To each acre according to its needs and 
capabilities,” was the motto of the soil-conservation 
program; and proponents of the federal reclama- 
tion program in California have sought, as their 
goal, to ensure that not a drop of fresh water is 
wasted from the rivers to the sea. 

The close association of conservation in the 
1930s with efforts to redistribute national income 
to disadvantaged groups raised obstacles for those 
who analyzed resource problems in physical terms. 
Nonetheless, the limitations of a heavy reliance 
on physical data for identifying and solving con- 
servation problems were not exposed systematically 
until the 1950s, in the report of the Materials 
Commission, According to this report, the tradi- 
tional view of conservation had perpetuated two 
fallacies. First, it regarded the resource base as a 
fixed inventory that, when used up, would leave 
society with no means of survival. Second, it 
equated physical waste and economic waste—it 
encouraged the feeling that it is wasteful to use 
materials in ways that make them disappear. This 
attitude, said the commission, can lead to devoting 
a dollars worth of work to “save” a few cents’ 
worth of paper and old string. In its own analysis, 
the commission estimated the resource require- 
ments of the United States and the rest of the free 
world and devoted special attention to those re- 
sources that would not be forthcoming in sufficient 
quantity to meet future needs except at signifi- 
cantly higher relative prices. In other words, the 
commission attempted to foresee potential price 
rises and to plan in advance to forestall them or 
adjust to them by various types of measures. 
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Recently scholars have developed new tech- 
niques for combining the physical and economic 
analyses of complex resource systems (McKean 
1958; Maass et al. 1962). The production function 
for a resource, representing relations between 
physical factors, and its benefit function, repre- 
senting relations between economic factors, are 
believed to be so closely interrelated that they 
should be developed and analyzed synchronously. 
The methods of operations research and systems 
analysis make this possible [see OPERATIONS RE- 
SEARCH; PLANNING, SOCIAL, article on RESOURCE 
PLANNING], 

Scientific research and action programs. Princi- 
ples derived from the natural sciences have been 
developed for the conservation of each resource— 
forests, ranges, soil, wildlife—and one principle, 
that of ecological balance, or nature's balance, for 
the several resources in combination. This latter 
principle states that under natural conditions the 
resources of a region tend to be in balance and to 
remain relatively stable over long periods of time. 
Thus, under natural conditions the soil, cover, and 
moisture supply of a watershed are believed to be 
in balance, in the sense that plant growth is ad- 
justed to the amount and distribution of rainfall, 
and stream channels to the runoff they must carry. 
Even under natural conditions this balance is not 
static. It is influenced by long-term natural proc- 
esses, such as climatic forces, and by natural catas- 
trophes. The principle of ecological balance holds 
that the long-term processes are virtually imper- 
ceptible to man and that the natural catastrophes 
are infrequent, their effects usually confined to 
small areas that heal rapidly. 

Man, however, interferes with nature’s balance 
in a destructive way, according to the ecological 
theory that has been popular with supporters of 
conservation programs (Frank & Netboy 1950). 
He tends to set in motion forces that seriously and 
sometimes permanently disrupt the ecological 
balance. For example, his livestock may overgraze 
a watershed. The plants lose their vigor—that is, 
their ability to produce new leaves and roots and 
to deposit litter that covers and nourishes the soil. 
The carpet of litter not only wears thin but is 
destroyed by exposure to the elements and to 
animal hoofs. In time the soil loses its virility, 
including its capacity to absorb water and to resist 
forces of destructive erosion. Concomitantly, the 
flow and quality of water deteriorate. 

The public-policy consequences of the principle 
of ecological balance are clear: control man’s ac- 
tivities so that he cannot permanently or seriously 
disrupt nature’s balance. A great many conserva- 


tion programs have been based on this scientific 
principle. Furthermore, the principle provides its 
own criterion for evaluating the condition of the 
resources of a region: measure it against what 
were believed to be the natural conditions prior 
to man’s settlement. 

In response to this theory, certain scientists, 
frequently men who have not been associated with 
conservation programs, have argued that man is 
only the most recent and most complex of nature’s 
creatures; that he and his activities are a part of, 
not apart from, nature's balance (Tansley 1939; 
Firey 1960). Conservation ecologists have replied 
that man is unique; that he is endowed with 
capacities not found in other animals or found 
there only in concentrations so weak as to make a 
qualitative difference. These capacities are, of 
course, consciousness, intellect, and conscience. 

Quite separate from this continuing and, in part, 
philosophical debate, recent scientific studies have 
challenged the principle of natural ecological bal- 
ance and, therefore, the scientific bases of many 
public programs. Raup (1964) and his colleagues 
at the Harvard Forest have demonstrated that nat- 
ural catastrophes have such persistent and serious 
effects on natural environment that stable natural 
equilibriums may be infrequent. On certain slopes 
of the forest over 50 per cent of the trees are 
growing on blowdown stumps that were razed by 
numerous hurricanes. Malin (1956) has described 
the influences that kept the Great Plains grass- 
lands in a disturbed state for countless years be- 
fore the coming of the fur traders and settlers. 
His studies make an impressive case against the 
theory of nature’s balance and the doctrine that 
man, primarily, destroyed this. Scientists like these 
have modified their search for ecological balance, 
with its resulting emphasis on “Do Not Disturb”— 
on protection. They have instead focused their 
studies on adaptation, with its resulting emphasis 
on a dynamic and flexible system of controls and 
de-emphasis of the destructive character of human 
occupance. Yet the traditional association of many 
conservation programs with ecological balance is 
so strong that supporters of these programs tend 
to disregard or disagree with any contrary or even 
partially nonsupporting scientific theories. 

Conservation programs have been devised and 
then supported on the “authoritative base of sci- 
ence,” but when this scientific base or any part 
of it is challenged by new findings, supporters of 
the program, both in and out of government, are 
ambivalent. Their dedication to science leads them 
to promote the type of research that can turn uP 
new and contrary results. But the fact that their 
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public, and most often their legislative, support is 
based on a previously proclaimed scientific au- 
thority makes them hesitant to give currency to 
these new findings. 

‘Another illustration of this is the U.S. Forest 
Service and fire (Schiff 1962). When the service 
began its program for ridding the nation’s forests 
of what was then considered to be their greatest 
scourge, the doctrine that trees and fires don’t mix 
was generally believed to be a scientific truth. 
Public and legislative support for the now familiar 
Smoky Bear program was built largely on this 
truth, In the mid-1920s, however, Professor H. H. 
Chapman, the noted Yale forester, confirmed a 
new truth, which had been suggested previously 
by other forest scientists; namely, that fire is es- 
sential for the reproduction of longleaf and loblolly 
pine in the southeastern United States. Without 
fire to burn off the brush growth and ground litter, 
pine seedlings are unable to survive. If the Forest 
Service had been fully successful in its fire-protec- 
tion program in the southeast, it would have 
eliminated these valuable species. 

For over six years the Forest Service refused to 
investigate or check the new findings, though 
Chapman insisted they should. When Forest Serv- 
ice scientists did conduct investigations at their 
Southern Experiment Station, these confirmed the 
case for burning; but the service then refused to 
release the results. 

Finally, in 1939, after the incessant insistence 
of Chapman, the service published their scientific 
findings; but not until after World War 11 did they 
make any effort to inform the public that controlled 
burning was in some situations desirable and in 
others essential, Throughout this period of about 
twenty years, the Forest Service supported publicly 
the original scientific doctrine that forests and fire 
should never mix. The leaders of the agency 
feared that to modify or contradict this in any 
way might lead the public to be less vigilant and 
cause the federal and state legislatures to be less 
sympathetic to providing the means for preventing 
forest fires throughout the nation. 

The Forest Service and other conservation agen- 
cies are not the only government organizations 
that have encountered problems in accommodating 
both scientific research and action programs. The 
general problem, a fascinating one in bureaucracy 
and organization, is beyond the scope of this 
article. At the same time, because of their very 
heavy reliance on scientific method as an objective, 
conservation agencies have suffered the problem 
in an aggravated form. 

American conservation programs, with the pos- 
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sible exception of those based on the analysis of 
the 1952 Materials Commission, have been reform 
programs. Like other American reform programs 
of the twentieth century, but much more so, con- 
servation programs have relied on the authority 
of science. Faith in science has had a high stand- 
ing among the beliefs and commitments of the 
American people. Science has come to stand for 
material well-being, soundness, objectivity, and 
truth. A society with a proper respect for science 
and technology can enjoy the liberties of a free 
people. Much the same can be said of democracy, 
as an objective of American reform programs in 
general and of conservation programs in particular. 


Democracy as an objective 

For the American people, democracy has been 
not only their form of government but a faith, an 
ideal, an objective, “our form of patriotism” (Waldo 
1948, pp. 12-13). Every individual is important; 
every man should be master of his own destiny; all 
men are endowed with rights that should not be 
violated; the privacy of individuals should be free 
from unwarranted intrusion—these are the first 
tenets of the American democratic faith, and they 
have been associated directly with conservation 
programs. “Rugged individualism” is a phrase com- 
mon to conservation literature from the beginning 
of the century until today, and a highly evocative 
phrase this has been—whether it calls to mind 
Theodore Roosevelt on horseback or today’s heavily 
knapsacked citizen on foot, solitary in his enjoy- 
ment of nature’s wilderness. Conservation begins 
with the American people, who have been nur- 
tured on “a fierce sense of individualism,” and it 
“rests in the people's hands,” according to the 
school text published by the American Association 
of School Administrators and written by its Com- 
mission on Conservation (1964). 

Conservation programs, again apart from those 
inspired by the Materials Commission, have typi- 
cally defended this rugged individual, “the little 
man,” against monopolies and concentrated wealth, 
special interest and special privilege. Big business 
and financial power have been identified with the 
wasters of resources and the destroyers of beauty. 

The conservation movement of the early twen- 
tieth century was in part a reaction against the 
influence of private corporations, which had been 
growing rapidly since the Civil War. Unless the 
corporations were controlled, said the supporters of 
conservation-reform programs, the basic resources 
of the nation would come to be concentrated in the 
hands of a few, and these few would use up the 
nation’s wealth wastefully and profligately, for 
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quick private profit, with no concern for the long- 
range benefit of the people. Furthermore, the or- 
ganization of industry into combinations threatened 
the independent, self-made man with a faceless, 
ugly, and largely urban materialism. In recent 
years, as conservation has been used more and 
more in relation to urban problems, the real estate 
interests, the large industrial polluters of the en- 
vironment, the billboard lobby, and others have 
been added to the oil trust, the lumber lobby, the 
power interests, and the cattle barons as enemies 
of the common man—selfish despoilers of his 
heritage. 

There runs through all conservation literature 
the notion of a common, or public, good that dif- 
fers from the self-interest, the “selfishness,” of 
private operators, especially those with great finan- 
cial power. To define and enforce this common 
good, governmental action and public education 
are needed. Thus, alongside the basic belief in 
rugged individualism, there has developed a strong 
commitment to positive government as a means 
of effecting the common good. In fact, this pro- 
government attitude has been a principal charac- 
teristic of conservation programs, joining the em- 
phasis on science. At the turn of the century, such 
an attitude represented a drastic departure from 
then dominant values with respect to the role of 
government in society (Wengert 1962), and ever 
since, demands for governmental action to support 
conservation have been more far-reaching than 
those for action to support most other domestic 
policies. 

Commitments to rugged individualism on the 
one hand and to positive government on the other 
are presumably harmonized in a commitment to 
democratic institutions, and the literature on con- 
servation programs is sprinkled with references to 
faith in democracy and to consent of the governed. 
But when the investigator looks carefully, he finds 
that supporters of conservation programs have in 
fact had little confidence in democratic institu- 
tions. They have considered popularly elected as- 
semblies to be agents of special interests; they have 
opposed advisory committees in connection with 
administration of the national forests. Their con- 
fidence has rested with the technicians of the 
executive, who are qualified both to define the 
common good in relation to the subject at hand 
and to enforce government action to realize it. The 
common good, thus, is equated with scientific 
elitism. 

At the same time, conservation men have sought 
direct popular support for their government pro- 
grams through public education and public infor- 


mation. As a result of the sustained effort of gov- 
ernment conservation agencies and their allies 
outside government, conservation is a subject 
taught today in thousands of public schools all 
over the country. The curriculum emphasizes the 
scientific principles of conservation; in the class- 
room and in the woods children are taught ecology, 
soil classification, game management, etc. 

The government conservation agencies have also 
depended heavily on adult education to gain sup- 
port for, and cooperation with, their programs to 
reduce forest fires, to promote soil-erosion control 
and other agricultural practices, to preserve scenic 
areas, etc. 

If the supporters of conservation programs, while 
claiming democracy as an objective, have shown 
little confidence in democratic institutions, they 
have, through their outstanding efforts in public 
education, shown confidence in the common man. 
But one could argue that this has been more to 
gain public support for programs based on the 
scientists’ definitions of the common good than to 
encourage popular participation in defining this 
goal. 

The commitment to positive government has 
led, as might be expected, to the organization of 
a large number of special-interest groups and to an 
intense form of interest-group activity in conserva- 
tion programs (Wengert 1955). Since the con- 
servation technicians are concerned to see that the 
general interest, rather than special interests, pre- 
vails and since the general interest, for them, is 
the result of the application of scientific principles, 
they are ever on guard against the influence of 
these groups. This is in part why they have op- 
posed advisory committees that represent one oF 
more classes of users of the public lands, and why 
they have been suspicious of popularly elected as- 
semblies, which are susceptible to the influence of 
special interests. 

Big and little operators. From the beginning, 
but especially since World War m, the “little-man 
component of the democratic objective of conser- 
vation programs has meant contradictions in these 
programs; for the big operators, even or especially 
the very big ones, frequently practice better “con- 
servation” than the little ones. 

Take forestry, for example. In a massive inven- 
tory report entitled Timber Resources for America’s 
Future (U.S. Forest Service 1958), the Forest Serv- 
ice reported that “the forest condition is best” on 
public forest lands and on those owned by forest 
industries such as Weyerhaeuser. “There is little 
distinction,” said the service, “between the produc- 
tivity of recently cut lands in public ownership as 
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contrasted to those owned by forest industry.” The 
real contrast in quality of forest practices is be- 
tween the public and forest-industry ownerships 
on the one hand and the small private holding on 
the other: “There is conclusive evidence that the 
condition of recently cut lands is poorest on the 
farm and ‘other’ [meaning ‘small’] private owner- 
ships” (ibid., p. 106). 

Reduction in numbers of livestock on public 
ranges that are overgrazed is another example. 
Government range supervisors assert that it is 
easier to effect this conservation measure if the 
range is used by a few permittees, each with a 
large number of livestock, than by a large number 
of permittees, each grazing relatively few head. 
This is because the large operators, with their com- 
mensurately large private holdings, can absorb a 
cut of 10 or 20 per cent in public range use; 
whereas a similar cut for small permittees might 
force them into economic ruin. Thus, the small 
man presents the conservation problem. 


Morality as an objective 


In addition to commitments to science and gov- 
ernment, conservation programs have had a strong 
moral commitment or objective. Historians (Hof- 
stadter 1955) have shown that reform programs 
in America—prohibition, civic improvement, or 
conservation—haye often been a product of the 
“Protestant mind.” Reformers have both found their 
arguments in and made their appeals to the tradi- 
tional biases of American Protestants: individual- 
ism; a tendency to see every issue as a moral issue; 
an emphasis on man’s and society's guilt for abus- 
ing God’s gifts and on the need for missionary 
work to repair this. Conservation reformers, in 
particular, have inherited the moral traditions of 
rural evangelical Protestantism, even though today 
they use them in connection with urban problems. 
The present condition of our natural resources, as 
revealed by scientific inventories, “constitutes the 
gravest indictment that has ever been returned 
against a civilized people,” said a conservation 
magazine in 1909; and pronouncements concern- 
ing today’s conservation problems frequently are 
similar in tone and appeal. 

What precisely have been man’s and society’s 
sins? Of what are we guilty? First, of interfering 
with nature’s balance. The scientific principle of 
ecology is supported by and gives support to re- 
ligious beliefs. All things owe their gift of life to 
God, and nature’s balance is God-given. At the 
same time, man is outside nature’s balance because 
of his unique endowments, endowments that give 
him the power to transgress the balance (i.e., in- 
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tellect) and that make his transgressions sinful 
(i.e., conscience). These transgressions should be 
controlled, for man is in the relationship of steward 
to the resources that surround him. Man “has been 
made responsible for something that belongs to 
God. The good steward acknowledges this respon- 
sibility as a trust. . . . The orderly conservation 
and development of natural resources is man’s 
recognition of his responsibility under God to pro- 
tect and use wisely His precious gifts” (National 
Association of Soil Conservation Districts 1962, 
inside front and back covers). 

One sin is, then, to destroy nature's balance. A 
second one is to waste, and a third is to use our 
natural resources for private benefit rather than 
for common good. The importance of the sin of 
wastefulness in the American ethos is well known. 
It has led us, as the Materials Commission said, 
to saving old string. “In the orderly world of our 
Creator,” warns a recent conservation pamphlet, 
“there are penalties for extravagance.” 

“Our resources are God-given heritages that be- 
long no more to the present generation than to 
generations that are to come.” This is a common 
concern in conservation literature, and it is usually 
accompanied by the idea that single-minded pursuit 
of present profit in the development of resources 
may not protect the interests of unborn children. 
Supporters of conservation programs, in other 
words, have had little confidence that the discount 
rate of the private competitive market will account 
for legitimate long-run interests. Since those who 
develop resources for private benefit can thus fly 
in the face of the common good, they can act 
immorally. 

This third sin is especially likely to occur where 
monopolies, trusts, and conspiracies in restraint of 
trade are present in industrial organization. The 
supporters of conservation almost invariably find 
that such private combinations are operating 
against their programs, so they are typically en- 
gaged in battle against immoral conspiracies. Since 
‘Americans have a propensity to believe in con- 
spiratorial theories of history, a propensity derived 
in part from their Protestant ethic (Hofstadter 
1955), the battles of conservation have been popu- 
lar engagements. 

The political and social consequences of the 
moral objective of conservation programs generally 
have reinforced the consequences of the scientific 
and democratic objectives. The cases for positive 
government and for executive power have been 
enhanced; both are needed to combat evil, as well 
as to promote science and democracy. Reinforced 
with a sense of moral righteousness, the supporters 
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of conservation programs have considered those 
who disagree with them to be not merely misin- 
formed or wrongheaded but wicked and possibly 
vicious. The conservation men are ever fighting for 
their programs in a milieu of recurrent crises. See 
the remarkably readable works of Pinchot (1947) 
and Ickes (1934) and the exquisitely written and 
influential commentaries of DeVoto (1955) for 
examples, 

As in the cases of science and democracy, recent 
findings have questioned some of the bases for the 
moral content of conservation programs. But the 
moral component is so ingrained that the findings 
have frequently been challenged or ignored. Thus, 
Malin’s and Raup's researches have questioned the 
evilness of man’s encroachments on nature's bal- 
ance. The Materials Commission has redefined 
waste in a way that makes the concept more dif- 
ficult to relate to sin. And the good conservation 
practices of large industrial resource users, as 
compared with the poor performance of small op- 
erators, tend to blunt the charges of immorality 
made against corporations and combines. 


Conclusion on American conservation 


This article has sought to explain the social and 
political aspects of conservation programs, not to 
evaluate them. Because of the difficulties of ex- 
plaining frequent contradictions in these programs, 
however, the reader may have gained an erroneous 
impression that the writer's purpose was to evalu- 
ate and even to condemn. The contradictions are 
present because conservation has been used in re- 
lation to so many different programs and because 
the unifying theme, insofar as there is one, is 
simply that these have been programs for reform, 
based on certain attitudes toward science, democ- 
racy, and morality. 

Without trying to evaluate the substantive 
achievements of individual conservation programs 
—for example, forests protected from fire, topsoil 
protected from wind and water erosion, income 
redistributed to tenant farmers, highways protected 
from billboards—we can point to some meritorious 
consequences of scientific elitism and the accept- 
ance of positive government. These factors under- 
mined the Spencerian view, prevalent at the turn 
of the century, that government could not perform 
effectively. Insofar as conservation agencies were 
involved, the executive branch of the government 
was radically reformed, and these agencies—the 
Forest Service and the Geological Survey are ex- 
amples—continued, for almost half a century, to 
be models of what a professionally competent and 
efficient bureaucracy should be. They developed 


professional standards and loyalty to professional 
ideals; they introduced personnel programs em- 
phasizing the selection of officials through com- 
petitive examinations, professional training, and 
career planning; they enjoyed stability of leader- 
ship; they established close relations for research 
and recruitment with the nation’s leading univer- 
sities; they pioneered new methods of program 
planning. Scientific elitism and the acceptance of 
positive government put a new and handsome face 
on the executive; but not so meritoriously, they 
have helped to blacken the eyes of the legislature 
and have contributed little to reforming it. 

Finally, for general evaluation, the highly charged 
moral battle against evil that has characterized 
conservation programs has probably been good 
catharsis for the American people. A balanced 
opinion on this requires a general view of American 
social, economic, and political history that is be- 
yond the scope of this article. 

Now that conservation has come to be associated 
with urban as well as rural problems, it will be 
interesting to observe whether this unique combi- 
nation of science, democracy, and morality, with 
its resulting emphasis on positive government and 
on scientific elitism, will accomplish similar re- 
sults for the government’s urban functions. 


Conservation outside the United States 


Research to date has not shown that similar 
objectives have been combined to produce similar 
reform programs outside the United States, To be 
sure, all nations have some government programs 
like the conservation programs of the United 
States. In many cases these foreign programs were 
adopted long before their American counterparts. 
The scientific principle of sustained-yield forestry 
was brought to the United States from Germany, 
and the idea of protecting and conserving forests 
for national self-sufficiency and national survival 
was practiced in Napoleonic France, In other cases 
United States programs, e.g., those of the Tennes- 
see Valley Authority (TVA), have been models for 
foreign activity. In fact, TVA, meaning multipur- 
pose water development, is now a universal con- 
cept. 

In some countries these programs have not been 
called conservation at all, and in others, where 
the word is used, it has been in a different context. 
Thus, for example, the forestry activities of the 
French in north Africa were not called conserva- 
tion, whereas similar British activities in India 
were described as “forest conservancy.” The Indian 
government, however, in its five-year development 
plans, no longer uses “conservation” with reference 


to timber programs, reserving the word for nature, 
wildlife, and soils; and the British at home have 
always used the word as do the Indians today. In 
any case, outside the United States conservation 
has not meant programs of reform involving a 
combination of scientific elitism, a positive attitude 
toward government, and a religious commitment. 
Neither western European nor developing nations 
have needed a reform rationale to justify the 
power of government to act in forestry and related 
natural-resource areas. 

The United Nations and its specialized agencies 
pursue many programs for resources and related 
activities that superficially are similar to United 
States conservation programs; yet the United Na- 
tions activities are not typically called conserva- 
tion, nor do they, in fact, combine the several 
elements that have given character to the United 
States programs. The student who seeks references 
to the international programs in the indexes of 
United Nations publications should look under 
such entries as “economic development,” “technical 
assistance,” “land reform,” “natural resources,” 
“arid zones”; for he will find few or no references 
under the heading “conservation.” 

In 1949 the United Nations did sponsor the 
Scientific Conference on the Conservation and Uti- 
lization of Resources, but its subsequent confer- 
ences on the same and similar subjects have been 
given titles that emphasize, instead of conserva- 
tion, the application of science and technology to 
various resources for the purpose of economic de- 
velopment. Julian Huxley's UNESCO report on the 
conservation of wildlife in central and east Africa 
(1961 )—with its insistence that “the world is eco- 
logically out of joint,” that man has destroyed 
nature’s balance for shortsighted economic advan- 
tage, that African and world opinion must be 
aroused through popular education to protect 
Africa's wildlife and its habitats from exploitation 
by Africans—approaches the United States model 
that we have elaborated, but this report is atypical 
in the vast documentation of the United Nations. 


ARTHUR MAASS 


[See also PLANNING, SOCIAL, article on RESOURCE PLAN- 
NING. Other relevant material may be found in 
ECOLOGY; LANDSCAPE; SCIENCE, article on SCIENCE— 
GOVERNMENT RELATIONS.] 
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n 
ECONOMIC ASPECTS 


The economist concerns himself with the alloca- 
tion of scarce inputs among different uses and over 
time. The conservationists mission is narrower: 
ignoring the scarcity of labor and capital, he sets 
standards for the use of land or natural resources. 
Originally the conservation movement considered 
chiefly the allocation of natural products over time; 
later, their allocation among competing uses (the 
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“multiple-use” approach); most recently, in the 
adaptation of benefit-cost analysis, the allocation 
of capital to the development or preservation of 
nature. 

Thus, the gulf between conservation and eco- 
nomics has narrowed. But there is still a special 
conservationist approach: nature must be pro- 
tected from man’s ruthlessness, wastefulness, and 
ignorance, both because mankind jeopardizes its 
own present and future gain from the natural en- 
dowment and environment and because the pres- 
ervation of nature is a good thing in itself. The 
liberally trained economist, distrustful of such abso- 
lute value judgments, has also been disturbed by 
the imprecise and demagogic use of such phrases 
as “natural heritage,” “waste,” “wise use,” “debt to 
the future,” and “greatest good of the greatest num- 
ber for the longest period of time.” Consequently, 
after a brief examination of the writings of the 
nineteenth-century economists on similar topics, 
this article will attempt to distinguish and interpret 
the economic problems embedded in the writings of 
the conservationists. 

At least three important classical economists 
anticipated the conservation movement's writings 
about the uses of natural resources, although only 
Malthus is acknowledged by the conservationists to 
have done so, and then only in connection with 
population growth. But Malthus is actually the 
least significant of the three in the economic anal- 
ysis of the conservation question. 

Malthus (1798), in his basic model, for exam- 
ple, envisaged ever-increasing food production (at 
an “arithmetic” rate), a possibility denied by the 
pessimistic conservationists. Furthermore, Malthus’ 
commentators asserted that he had understated the 
ability of improved technology to obtain adequate 
produce from the earth. Both he and his critics, 
therefore, ignored the possibility of depletion. [See 
MALTHUs,] 

Ricardo (1817), on the other hand, taught that 
man must extend production to ever less fertile 
lands until society's growth would be stopped in an 
equilibrium of human and natural fertility charac- 
terized by low standards of living and high land 
values. Furthermore, in some inconclusive passages 
on minerals, Ricardo showed himself aware of the 
possibility of complete depletion. Indeed, Goundrey 
(1960), has argued that the conservationist liter- 
ature approximates a layman's discussion of the 
Ricardian stationary state. [See RICARDO,} 

For fifty years following Malthus and Ricardo, 
economists accepted this view of the relation of 
man to his resources [see RENT]. Although this 
analysis was couched in the sweeping dynamics of 


the early classicists, it actually did not deal with 
rates of growth or depletion, a neglect remedied by 
Jevons in The Coal Question (1865). This vigor- 
ous work, a model of research, forecast the Ricard- 
ian decline of both British mining and the indus- 
trial might dependent upon it. Jevons then posed 
the question, Should exhaustion be delayed by 
conserving coal for the future? His answer was not 
explicit, but his approach was clear: 


The alternatives before us are simple . If we 
lavishly and boldly push forward in the creation of 
our riches, both material and intellectual, it is hard 
to over-estimate the pitch of beneficial influence to 
which we may attain in the present. But the main- 
tenance of such a position is physically impossible. 
We have to make the momentous choice between brief 
but true greatness and longer continued mediocrity. 
([1865] 1906, pp. 459-460; italics in the original) 


Jevons had already considered substitutes for 
coal but concluded that their possible appearance 
would not contribute to Britain’s “superiority.” In 
the above quotation he summarized two important 
principles: first, that natural resources can be 
transformed into man-made riches, including intel- 
lectual capital, that might be of greater future 
value to all mankind than a stock of coal; second, 
that the rate of growth itself and the “fabric of 
varied interests” connected with change should be 
encouraged, rather than a high or a sustained level 
of output. This fine study is still a source of in- 
spiration to economists pondering the terms of 
trade between present and future production. 

Like Malthus and Ricardo, Jevons alarmed his 
contemporaries, his ideas leading, for example, to 
statistical examinations of Britain’s wealth and to 
political debates on her liabilities—the public debt. 
[See JEvons.] 

Conservation goals. But economists had little 
impact on the conservation movement, which in- 
stead developed certain unique social principles 
[see CONSERVATION, article on POLITICAL AND 
SOCIAL ASPECTS]. These principles were applied in 
the advocacy of a set of policies about resource use 
that could be traced back through European eco- 
nomic history to suggest that conservation had 
long been inherent in enlightened official action. 
Four distinct policies, in fact, were frequently cited 
as anticipations of what was to become the con- 
servation movement. 

Preservation of wildlife and forests. The first 
of the four policies was the preservation of wildlife 
and forests. Feudal Europe had set aside “forests,” 
or hunting preserves, for the monarch and had 
guarded them against farming and urban encroach- 
ment by stern laws, savagely enforced. The result 


was that by the nineteenth century many of these 
large areas were still wooded and comparatively 
undeveloped, some being used for timber, others 
as parks and game preserves. Although the un- 
democratic reasons for creating these reservations 
and the bloodthirsty enforcement of encroachment 
laws were sometimes glossed over, the fact that 
these areas existed, had survived commercial op- 
portunities, were often highly valued by the public, 
and were well managed by the state was frequently 
cited by conservationists. These reserves were un- 
doubtedly an important model for the national and 
state parks and forests set up in the United States 
and in some of the British countries. Also an im- 
portant forerunner of the nature sanctuaries that 
public and private agencies have recently pro- 
moted, they can be said, in retrospect, to have in- 
spired what has probably been the most successful 
aspect of conservation policy: the setting aside of 
natural areas as a source both of productive re- 
sources, chiefly timber and fish, and consumption 
resources for the leisure and recreation of an in- 
creasingly urban and educated public. 

Access to natural resources. The second policy 
goal was the achievement of a widespread ownership 
of and access to natural resources. Conservationists 
pointed out that in feudal times the supposedly ab- 
solute rights of the monarch over resources were 
actually limited by his duties to his tenants and by 
the fact that he was forced to delegate land man- 
agement to those who were using the land. This 
system, it was argued, had eventually degenerated 
and succumbed to the age of liberalism, when land 
was enclosed by capitalists and accumulated into 
large holdings; the countryside was heavily de- 
populated; and resource wealth was “seized,” 
“grabbed,” and “wasted.” The conservationist reac- 
tion to this trend was widespread. 

In Scandinavia, for example, especially in Swe- 
den, the liberal policy of alienating royal land to 
timber and iron companies was suddenly reversed; 
the remaining area was set up as government 
forest, managed by public enterprise. In other parts 
of Europe too, as the great American conservation- 
ist Pinchot noted during his period of training in 
France, the government had resolved to hold forests 
“for the people.” In many newly settled countries, 
large landholdings were disfavored, and it became 
official policy to encourage homesteads instead of 
ranches and plantations. Similarly, the American 
alienation of mineral and water rights was organ- 
ized so as to maximize the access of small operators 
to these resources. Thus, both in the United States 
and elsewhere, the conservationist had goals in 
common with latter-day physiocrats, single taxers, 
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socialists, muckrakers, and trust busters: the pre- 
vention of resource accumulation by a small num- 
ber of “land capitalists,” and the widespread dis- 
tribution of land ownership or of rental income. 

Prevention of depletion. The second goal, of 
course, was frequently in conflict with a third pol- 
icy: the prevention of rapid depletion and eventual 
disappearance of resources. This third policy has 
been called “conservation proper”: a program of 
state action or intervention to change modes or 
rates of use of natural resources. Both casual ob- 
servation and economic analysis strongly suggested 
that “waste,” erosion, and overrapid mining and 
logging were most common where holdings were 
small or where users had an incomplete title to the 
land. “Democratization,” as it is called in the pre- 
ceding article, was frequently antipathetic to long- 
term conservation. 

For conservationists, however, the pressing task 
was not the reconciliation of their objectives but 
the demonstration of the imminence of resource 
depletion. This was no small task. At first they had 
no evidence to offer except the logical proposition 
that because production was continuing and in- 
creasing, reserves must be declining—a proposition 
that was applied indiscriminately to reserves of 
timber, coal, oil, helium, whales, groundfish, sal- 
mon, and gold. When scientific inventories and 
geologic surveys eventually became available, near 
the end of the nineteenth century, they did seem 
to indicate increasing scarcity, although they also 
imposed a certain discipline on conservation ora- 
tors. Among the facts stressed was a decline in the 
number of large trees in the annual cut and a 
decrease in the size of fish being caught. It was 
shown, too, that petroleum reserves amounted to 
less than twenty years’ production and that the 
delta of the Mississippi was apparently building up, 
through upstream soil erosion, at an alarming rate. 

In the present century, however, the revision and 
perfection of these inventories has weakened the 
impetus of the conservation movement. In the first 
place, improved knowledge of animal and forest 
science has shown that population dynamics and 
ecology, rather than static inventories, are required 
for an understanding of future yields from the 
so-called renewable resources, so that attention has 
become focused on the many problems of manage- 
ment, rather than on the unique goal of hoarding. 
In the second place, it was realized that inventories 
have an economic dimension: the estimate of “ore” 
and “commercial” timber reserves available de- 
pends upon prices and costs, as modified by ever- 
changing technologies. Thus, not only did in- 
creasing resource scarcity become difficult to 
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demonstrate, but also the essential unity that a 
feared generalized depletion of all resources had 
imposed on the movement was lost. From the tech- 
nical point of view, the United Nations Scientific 
Conference on the Conservation and Utilization of 
Resources (UNSCCUR) meeting in 1949 showed 
the impossibility of taking a global view of natural 
resource inventories, so complex were the uncer- 
tainties and the interconnections between resource 
industries (United Nations . . . 1953). And from the 
policy point of view, the United States President's 
Materials Policy Commission report (1952), subse- 
quent European energy studies (Organization for 
European Economic Cooperation 1960), and the 
mammoth investigations by the staff of the Re- 
sources for the Future foundation (1963) utilized 
an economic approach to the problem of pos- 
sible scarcities that left little room for the conserva- 
tion faith, 

Management of common-property resources. 
The conservation movement was also influenced, 
although to a minor degree, by the problems of 
common-property resources—those in which pri- 
vate investment aimed at changing the form, scale, 
or management of a resource would be unprofitable 
because the investor cannot be sure he will harvest 
the returns. Much of the literature on this subject 
was bequeathed by the eighteenth-century and 
nineteenth-century writers on the European com- 
mon pastures and woods to those who later ob- 
served the low yields and unprofitable investment 
in oil fields, ocean fisheries, and certain water re- 
sources, where unrestrained private utilization 
caused external diseconomies of production. In 
addition, when the resources were capable of pro- 
ducing recreation benefits, there were external 
diseconomies of consumption. [See EXTERNAL ECON- 
OMIES AND DISECONOMIES.] Thus, when the con- 
servationists turned to the prevention of depletion 
and the advocacy of “prudent use” of oil fields and 
water bodies, they found there was no single, re- 
sponsible owner or manager. 

Eventually it was left to the technologists and 
scientists to define some conservation ideal for such 
resources, Petroleum engineers invented the con- 
cept of “maximum efficient rate” of production, de- 
signed to recover a high percentage of the under- 
ground crude. And fisheries biologists designed 
fishing regulations that would achieve a fish stock’s 
“maximum sustained catch.” Both these concepts 
implied regulation of the users of common-property 
resources, and holding to the prescribed rates was 
designated “conservation.” But the technical com- 
plexities of these subjects and the limited number 
of persons interested in them have left them some- 
what outside the wider conservation movement. 


The conservation movement. The four subjects 
—preservation of wildlife, redistribution of landed 
wealth, prevention of depletion, and prudent use 
of common property—became the basic precepts 
of the conservation movement. Upon them was 
based a structure of thought that, as formu- 
lated by scientists, attempted to show the loss to 
mankind from continued misuse and abuse; as 
formulated by more extreme and romantic writers, 
it attempted to find villains and scapegoats among 
those who “exploited” the natural heritage. While 
the former group projected nineteenth-century 
trends into the future, predicting shortages, low 
living standards, and high costs, the latter group 
attacked the farmers who used bad agricultural 
practices, the insatiable hunters and fishermen, 
and above all, the large mining and logging com- 
panies who, negligently or malevolently, despoiled 
the earth. 

The scientists’ work has led them into many by- 
ways. As they have learned more about new tech- 
niques of exploration, production, and concentra- 
tion, the oversimplified conservation ideals have 
been abandoned. National and international multi- 
resource conferences, such as the UNSCCUR meet- 
ing mentioned above, have been highly interesting 
but have failed to reveal the common aims and 
problems proclaimed in the 1900s. The more so- 
phisticated contemporary discussions of water re- 
cycling, desalinization, and purification, of mineral 
development, forest genetics, fisheries’ biology and 
oceanography, „game behavior, and population dy- 
namics, have, instead, revealed a multitude of 
social science problems concerning the re-education 
and adjustment of producers and consumers to 
new practices, costs, and opportunities. 

Further, there is nowadays no single “economics 
of conservation”; many methods of analysis, from 
the use of econometrics in demand and production 
studies through regional input-output studies to de- 
cision theory in the business firm, are now relevant. 
In what follows, therefore, we must confine the 
discussion to the economic analysis of the older 
conservation assertions, particularly to the claim 
that the market mechanism is powerless to deal 
with imminent depletion and shortage. 


Economic analysis of conservation 


At any given time, there exists for each resource 
what Ciriacy-Wantrup (1952) has called its “state 
of conservation,” the expected temporal distribution 
of use rates. A conservation policy, therefore, is one 
that would induce those who manage resources to 
change this state so that more use will take place 
at a later date. 

How does the original degree of conservation 


come about? The economic forces at work can best 
be understood by studying the behavior of the firm 
making planning and production decisions. The 
firm is deterred from postponing all use of its re- 
sources by the discount on future net revenues. A 
profit of $1 postponed one year is worth $1/(1 + i), 
where i is the rate of interest relevant to the firm’s 
lendings and borrowings. This expression is less 
than $1 and diminishes exponentially with the 
number of years the profit on resource use is de- 
layed. Consequently, there is an incentive to reap 
the harvest of its durable and its growing assets as 
soon as possible, The rate of interest, i, here stands 
for all the forces of time preference, shortsighted- 
ness, and liquidity preference that the conserva- 
tionist believes are leading to early depletion and 
future shortages. 

But if the firm advances the dates of use of its 
resources, its output per day must then increase. 
This increased rate will be possible only at higher 
unit costs, so the profit per unit will be smaller and 
the incentive to early depletion will be counter- 
balanced. 

The precise nature of such increasing costs has 
been debated by economists since Ricardo wrote 
his chapter “On the Rent of Mines” ([1817] 1962, 
pp. 46-47). It is now agreed that three types of 
increasing costs may be distinguished. In the short 
run, a resource operation with a given plant, trans- 
portation, and underground or surface layout will 
experience ordinary diminishing returns. In the 
long run, however, these facilities and layouts are 
flexible; the question now becomes whether the 
“law of diminishing returns” applies. Marshall's 
assertion that it did not led to a controversy with 
Taussig, Wicksell, and Cassell. Today it is agreed 
that higher costs associated with ores located 
farther away from the plant are irrelevant to the 
discussion of the costs of alternative rates of out- 
put, indicating only that costs will be higher in 
later periods than in early periods or that more 
elaborate workings will become necessary. They do 
not indicate that the curve of costs must, in the 
long run, rise with higher rates of output per year 
or per shift. In general, it would appear that the 
economic debate about proportionality and returns 
to scale applies as well to resource operations as 
to other businesses, with the added element of the 
limited space within which commercially attractive 
materials are to be found. This factor of limited 
space would appear to be an additional reason for 
expecting the long-run cost curve to be U-shaped. 

Amortization of fixed capital is a third source of 
rising costs. Except in extremely large sites, the 
life of a resource operation is shorter than the du- 
rability of the fixed capital needed to work it. Apart, 
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therefore, from those items of capital that can be 
moved to new workings, the equipment costs must 
be amortized over the number of units of material 
to be removed. Consequently, higher rates of ex- 
traction can be accomplished only by higher fixed 
costs per unit. 

The firm may encounter any or all of these 
sources of increasing costs and so be deterred from 
attempting an immediate exhaustion of raw ma- 
terials. In addition, it must contemplate the ex- 
pected course of future prices for its product. 

In the typical situation deplored by the conserva- 
tionists, mineral extraction is proceeding, reserves 
are shrinking, mines are closing, and metal prices 
are gradually rising. Under such circumstances the 
firm has an additional incentive to lengthen the 
state of conservation, Future use will be expected 
to receive higher market prices and unit profits 
than current use. It will pay the firm to adjust its 
production plan in favor of output in later periods. 
This incentive will be presented to all firms, so the 
evil day of future scarcity, dreaded by the con- 
servationist, is postponed by today’s planning for 
higher future profits. 

Thus, the firm must balance present profits, fu- 
ture profits, and the rate of interest. Discounted 
profits become an opportunity cost, or more pre- 
cisely a user cost, of present production. (User cost 
is the present worth of the profit that could be 
earned on an extra unit if its extraction were post- 
poned to the best future date.) Present operations 
should therefore be expanded until marginal cur- 
rent profits equal marginal user costs, in order to 
maximize the present value of the natural resource 
enterprise. 

In a perfectly competitive commercial economy, 
it could be said that the firm that only maximizes 
current profits is acting on the expectation that 
future prices and profits will be very poor. Assum- 
ing that the firm’s foresight is not only normal but 
is sharpened by the opportunity to gain from ac- 
curate perception of future markets, society can 
have no complaint about such a firm's state of con- 
servation. On the contrary, such present-worth 
maximizing firms will produce an optimum degree 
of conservation. This is why the academic econo- 
mist rejects the conservationist’s contention of the 
probability of a future starved of natural resources. 

But the conservationist can rightly point to the 
many assumptions underlying this economic model. 
If, for example, the firm (like many small miners, 
farmers, and loggers) discounts the future at a 
rate higher than that used in the rest of the 
economy, user costs will be too small to prevent 
present extraction of material that competitive 
forces would have awarded to the future. Or if the 


284 CONSERVATION: Economic Aspects 


firm is irrational, unthinking about its own best 
interests, future consumers may lose. Such market 
imperfections are the first of three important justi- 
fications for conservation policies. 

Market imperfections. The presence of market 
imperfections in the capital market, the labor mar- 
ket, or the final-product market can weaken the 
economists’ case that present-worth maximization 
by individual sellers will ensure an optimum state 
of conservation. Ciriacy-Wantrup (1952), Gordon 
(1958), and Scott (1955) investigate the conse- 
quences of such imperfections and weigh the argu- 
ments for remedial policies, such as regulation, 
subsidies, public ownership, and so forth. It is 
usually held that an imperfect land market—that 
is, a common-property situation—presents the best 
prima-facie case for interference in the interest of 
conservation. 

Other imperfections may also be serious. But 
conservation action is not justified by a few firms 
having a strong time preference, extreme illiquidity, 
or lethargic indifference to their own gain. For the 
too rapid production of a few firms will be counter- 
balanced by compensatory action of the other 
firms in their industry. The relatively high rates of 
output of the “imperfect” firms will raise market 
supplies and so depress prices; the depressed prices 
will discourage production by other firms and also 
discourage the opening of new operations. The 
competitive interperiod allocation of total supplies 
will be restored. Only if such a large fraction of 
total stocks is dumped on the market that the re- 
maining producers have insufficient reserves to 
restore the intertemporal balance will conservation 
intervention be justified. Hence, the mere citing of 
imperfections in some portions of the market does 
not in itself justify conservation policies. The mar- 
ket must be overwhelmingly imperfect. 

A better case for conservation intervention can 
be made if the owner has been placed in a situation 
where he cannot gain by delaying sale of his stocks 
until the future period comes. At least five types 
of circumstance may be responsible. First, gov- 
ernments may, by tax-free inducements to new 
resource firms, make it less profitable to extend pro- 
duction much beyond the duration of the tax holi- 
day. Second, government purchase, stabilization, or 
stock-piling schemes may create an incentive to 
take early advantage of supported prices, (In this 
circumstance, however, conservation objectives 
may be achieved by the stock-piling, even though it 
is done away from the original site.) Third, as dis- 
cussed above, users of common-property resources 
are incapable of taking advantage of future mar- 
kets and must extract as much as is profitable to- 
day. They may, however, “unitize” the management 


of the resource in order to gain from the process 
outlined earlier; or they may encourage the govern- 
ment to intervene to slow down present extraction; 
or, finally, they may individually produce in the 
present for their own inventory (because of the 
high costs of storage off the site, this is rarely an 
important alternative), Fourth, future values may 
be so enveloped in uncertainty that firms may indi- 
vidually opt for present profits. Fifth, future values 
may be in different uses than present values and 
may be unmarketable. That is, they may be the 
source of external economies to other producers 
(in controlling floods or erosion) or “intangible” 
services (of scenery or recreation ) to consumers. 

All of these circumstances are common in the 
resource industries, and many have been dealt with 
by special government institutions, such as those 
mentioned in the last section of this article. While 
it is true, as suggested earlier, that small resource 
owners are sometimes irrational and do not know 
their own best interest, investigation often shows 
that nonconservation behavior can be explained 
satisfactorily by one or more of the five circum- 
stances above. 

A less clear-cut situation exists when the value 
of production to some portion of society, such as 
the inhabitants of a province or country, is greater 
than the market value, Thus, a unit of timber out- 
put postponed may prevent a town from degener- 
ating into a “ghost town.” Forest and oil conserva- 
tion may well prolong the industrial importance of 
communities and nations that might otherwise suf- 
fer from the migration of labor and capital to 
locations where their incomes and productivity 
would be higher. Such cases are complicated, and 
it does not follow that anyone's interest is actually 
served by postponing the day of final exhaustion 
and closing down. There are indeed many direct 
and indirect subsidies for production from dwin- 
dling reserves, to support otherwise stranded com- 
munities. But such subsidies are hotly debated. 
Even if they are justified, it does not follow that it 
would have been better to reduce the early rates of 
extraction of such deposits than to subsidize them 
later. 

But the strongest case for conservation ulti- 
mately depends upon opposing views of responsi- 
bility to the future and opposing views of the future 
Situation. It is true that there are many externali- 
ties, but taken together they are insufficient to 
motivate the conservation movement. Let us take 
up these two matters one at a time. 

Responsibility to the future. The preceding 
account of the theory of use has shown that the 
future is not ignored by resource firms. The future 
is provided with such raw materials as are justified 


1 


by the owner's cost of holding wealth (the rate of 
interest). Although monetary fluctuations and 
changes in desire for liquidity may change the rate 
of interest from period to period, it is here a rough 
indicator of both the rate of return on other types 
of capital investment and the desire of present 
generations to save, It is this connection between 
saving and the state of conservation that has led 
many writers to conclude that the heart of the 
conservation problem is in the conflict between 
present and future (Gray 1913). Clearly, a lower 
rate of interest would reduce a firm’s tendency to 
discount user costs and to divert consumption of a 
raw material from the near future to the more 
remote future. 

In general equilibrium this diversion, however, 
might not occur for all resources. If, for example, 
savers became more willing to provide for the fu- 
ture, the supply of savings would increase (and 
interest rates fall) not only for resource owners but 
also for all potential borrowers, As the amount 
borrowed for private and social capital formation 
rose, the current demand for raw materials for 
these investment goods would expand; meanwhile, 
the demand for raw materials specifically needed 
for consumer goods would fall. It is quite possible 
that some index of demand for all natural resource 
raw materials would rise, not fall. Without knowl- 
edge of the elasticities of substitution between 
resource-based inputs and other inputs, it is, of 
course, impossible to predict what the outcome 
would be. Unless substitute inputs are generally 
adopted, however, the demand for resource-based 
raw materials by makers of both capital and con- 
sumer goods should be about the same as with the 
former, higher rate of interest. If the resource in- 
dustries supply this unchanged demand as before, 
then the change in the interest rate has had no 
effect. If they do so at higher prices, the final out- 
come may be that although the money rate of 
interest has fallen, the real (Wicksellian) rate has 
returned to its previous level. 

A third outcome is that the lower rate might in- 
duce resource owners to reject permanently some 
of the current demand and hold their resources 
longer. This possibility might emerge if it is ex- 
pected that all (present and) future prices for raw 
materials will be higher, but the discount rate re- 
mains lower, than before the change in rate. This 
third outcome, similar to the effect described by 
the Austrian economists as “lengthening the period 
of investment,” is the only one of the three possi- 
bilities favorable to general conservation of stock, 
or durable resources. 

Which of the three possible outcomes will ma- 
terialize depends on the new general equilibrium 
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following a change in the propensity to save, which 
new equilibrium may take many years to emerge. 
The raw materials demand from other types of new 
investment may develop rapidly or slowly; and the 
contraction of consumer goods requirements may 
also depend upon the durability of the consumer 
goods in question. Thereafter there will be a re- 
sponse lag as the resource industries, especially 
those producing renewable resources, make and re- 
make decisions about investment in extractive and 
regenerative activities. In other words, a simple 
change in the rate of interest will produce a new 
degree of conservation only after a long adjustment 
period, during which resources may be depleted 
more or less rapidly than before the change and 
more or less rapidly than after the final equilibrium 
is reached. 

Another difficulty about applying the interest 
rate concept is that although many writers concede 
a lower rate of interest will lengthen the period of 
investment, such theories do not offer predictions 
of which types of investment will actually increase 
in “duration.” The general lengthening may be ac- 
complished by the shortening of some types and 
the lengthening of others (Hayek 1941). 

It is, of course, impossible to dispense with the 
rate of interest in resource calculations. Some guide 
is needed by which today’s managers can spread 
possible output over a stated period of years. Other- 
wise they would have no measure of the amount of 
stock resources to be taken out of possible future 
use for present consumption. Graaff has described 
decision making about intertemporal allocation as 
planning to bequeath a certain “terminal capital” 
at a certain date; of the problems beyond this date, 
man takes no heed. This approach has proved to be 
a useful approximation in recent papers on welfare 
economics and growth, but its value for reasoning 
about resource allocation is limited. The planning 
horizon recedes as we approach it, so that some 
discounting is still required, with perhaps a sudden 
discontinuity at a certain point in the stream of 
future revenues and costs. 

Predicting the needs of the future. If some dis- 
count must be used and if society is to depend upon 
commercial estimates of future prices to indicate 
future needs, can we be sure that these needs are 
being correctly estimated? It is clear that many 
owners are not foresighted, but in view of the many 
government studies now available, it is hard to be- 
lieve that the information is now any worse than 
the rest of the information by which persons and 
governments govern their behavior. 

The real difficulty about these predictions is the 
certainty that requirements are going to change. 
The dynamism of the world economy means that 
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once-essential materials may become mere chunks 
of matter, of low economic value. Not only do 
fashions change (e.g., from cotton to man-made 
fibers), but new techniques change entirely the 
type of process responsible for providing a certain 
part of the consumer budget (e.g., from mail car- 
ried by coal-powered railroad to telephone and tele- 
vision), and these changes are apparently acceler- 
ating. Hence, producers are feeling less confident 
that their resource will have a future market. 

Furthermore, there is increasing evidence that 
expected future scarcity brings about its own solu- 
tion—through “self-generating technical change,” 
as Barnett and Morse (1963) have called it. An 
anticonservation view is now plausible: that man- 
kind can invent itself out of any shortage. Barnett 
and Morse feel that the only difficulty here is that 
increasing recourse to substitutes and increasing 
shortage of space may lead to some general de- 
terioration in the “quality of life.” 

Hence, estimates of future scarcity not only are 
uncertain but also are probably overstated. The 
resource owner can be forgiven if he decides that 
it is not worthwhile arranging to have supplies 
available for some remote posterity. Such uncer- 
tainty is now likely to provide a large discount, 
additional to the interest rate, in computing user 
costs of current production and depletion. 

Conservationists have disputed this conclusion 
on two grounds. The first is that energy sources are 
likely to become an ultimate bottleneck. Fossil 
fuels and falling water are known to be limited: 
tidal energy may be expensive to harness; we might 
soon have to depend upon sunlight and other solar 
radiations. This argument has been recently weak- 
ened by the discovery of the practicability of 
nuclear power sources. In any case, it is not at all 
clear what should be done, in a large way, to delay 
the time when lack of energy might become a seri- 
ously limiting factor to production or growth. 

The other ground for conservation action, based 
on uncertainty about technical progress, has been 
formulated by Ciriacy-Wantrup. Confining himself 
to renewable or flow resources (soil, water, plants, 
and animals), he argues that society should not 
gamble on the possibility that changes in tech- 
nology, tastes, and institutions will be favorable. 
He advocates, instead, that the level of such re- 
sources should be kept above a “critical zone,” be- 
low which any decline is irreversible. It should be 
the business of society to keep a stock of timber, 
soil, plants, fish, and wildlife available, which can 
be expanded to fill an “immoderate” need if such 
should arise. This view is close to that of the ecolo- 
gists, who advocate keeping the relationship of 


various living species to each other and to their 
habitat (swamps, rivers and lakes, wilderness, 
desert, and so forth) more or less intact. In effect, 
Ciriacy-Wantrup claims that the maintenance of 
this “safe minimum standard” is a fairly inexpen- 
sive insurance policy against the possibility that 
unpredictable changes may put a high value on 
living forms and water and soil complexes after 
they have become irrecoverable. 

Conclusion—the role of economics. It will be 
seen that the economists’ analysis of the conserva- 
tion question depends heavily upon the theory of 
capital, which (as in welfare economics, macro- 
economics, and growth theory) is probably the 
least-rewarding branch of economic theory in terms 
of useful generalizations about the real world. The 
objective fears of conservationists, under particular 
circumstances, could be realized; their assertions 
about subjective benefits and costs are more ques- 
tionable but have indicated fields for valuable 
analysis, so far mostly ignored by economics. What 
economics has done up to now is to show the 
variety of problems and objectives implicit in the 
conservation question; to attack the notion that all 
resource problems are susceptible to a common 
analysis or policy; to stress the scarcity of labor 
and capital, as well as that of natural resources; 
and to undertake the detailed description and 
analysis of land and raw material situations. 

Such applied research began early in the nine- 
teenth century, in forest economics, an ingenious 
combination of population theory and actuarial 
finance most fully developed in Germany and Scan- 
dinavia. The objectives, such as maximum sus- 
tained yield, were more technocratic than eco- 
nomic, but the methodology is easily adapted to 
other ends. Agricultural economics, too, got off to 
an early start and was well equipped to analyze 
soil exhaustion and conflicting land use from both 
institutional and marginalist points of view. 

The economics of mining, petroleum, fisheries, 
water-supply, and recreational land operations, 
however, were largely ignored by economists until 
the late 1930s. Most progress in these fields was 
made in peripheral studies of market organization, 
labor conditions, taxation, or public utility pricing, 
and most of these had little impact on resource 
policy or business practice. Indeed, today’s special- 
ists in conservation economics frequently find that 
the best pioneering contributions have been made 
by geologists, engineers, biologists, and admin- 
istrators. 

From the marriage of such “engineering eco- 
nomics” with orthodox welfare economics have 
emerged the variants of benefit-cost analysis, @ 


technique of decision making first applied in Eu- 
rope and the United States to the evaluation of 
water-resource projects [see WATER RESOURCES; 
WELFARE ECONOMICS]. It has been generalized to 
deal not only with fixed investments but also with 
conflicting uses of land resources. It is discussed 
more fully in another article [see INVESTMENT, 
article on THE INVESTMENT DECISION]. All that 
needs to be said here is that it makes possible the 
full and systematic study of any investment which 
claims to increase income or stability or fulfill 
other objectives. Variants of this approach, de- 
signed especially to deal with uncertainty, the dis- 
covery of new resources, intangible services, and 
opportunity costs, have been worked out in many 
countries, In view of the numerous distortions in- 
terfering with the optimum use of actual resources 
(such as the market imperfections previously dis- 
cussed), it is a particularly useful substitute for 
market forces in government decision making 
about the form and rate of resource exploitation. 


Conservation institutions 


The state of conservation of any particular 
natural resource is not alone the outcome of a 
variety of personal and economic determinants, 
such as the intelligence, education, taste, and 
means of the owners and managers. In addition, 
special pressures are brought to bear by taxes and 
tax rebates, subsidies, licenses and regulations, 
tariffs, quotas, labor laws, safety rules, and zoning 
statutes. These institutional arrangements temper 
and divert the forces of the market for each raw 
material and resource location. In some countries 
public ownership and the force of custom and habit 
are so overwhelming that the price mechanism 
works only slowly and spasmodically. In others, 
markets for the resource site, for shares in owner- 
ship, for capital, labor, and the final product, are 
so well developed that criticism of the state of con- 
servation is implicitly criticism of the open-market 
system itself. 

In all countries, however, a variety of special 
institutional arrangements have been made to 
modify the private regime of resource exploitation 
and renewal. Some of these supplement market 
mechanisms by providing information, education, 
or special assistance where these seem to be lack- 
ing. For example, many countries have systems of 
farm credit, which have, as one of their purposes, 
the supplementing of poorly organized private sys- 
tems of farm, livestock, and crop loans. 

Similarly, many countries maintain some kind of 
geologic survey, which performs for the mining in- 
dustry as a whole a basic research function similar 
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to the basic research carried on by the government 
for agriculture and by the universities for industry. 

A number of other such public institutions, im- 
portant but auxiliary to the basic market mecha- 
nism, might be identified. In the following para- 
graphs, however, it is intended to mention some 
institutions that go further and that compensate 
for the absence of a market or replace it. In addi- 
tion, there are institutions concerned with common- 
property resources that basically compensate for 
the absence of sole ownership. 

International institutions. From the adminis- 
trative point of view, the most interesting inter- 
national institutions are those concerned with re- 
sources that are open to several nations, either 
because they are in the high seas or in other areas 
outside national jurisdiction (Antarctica or outer 
space) or because the resource moves from coun- 
try to country (fish, migratory birds, international 
rivers and air). 

In the absence of specific treaties, customary in- 
ternational law governs the conditions of use of 
these resources. Usually the effect is that there is 
little restriction on the resource’s use, Thus, in the 
absence of specific agreements, high-seas fisheries, 
migrating birds and animals, rivers and air flowing 
across boundaries, may usually be exploited much 
as the nationals of each country wish, subject only 
to their domestic laws. And since the country has 
no power to regulate the actions of other countries’ 
nationals, there is little to be gained by making 
restrictions. 

For that reason, there have long been agree- 
ments whereby certain countries were given rights 
or sole rights to certain fisheries or bodies of water, 
although the older of these agreements rarely 
called for regulation to protect fish or animal popu- 
lation. Since 1900, however, a variety of conserva- 
tion treaties have dealt with fisheries in waters 
used by two countries, fish migrating between two 
countries’ seas, and high-seas fish populations. The 
first important example was the 1911 fur seal con- 
vention between Great Britain (for Canada), 
Japan, Russia, and the United States. Since then, 
fisheries treaties governing research, catching, and 
improvement have been made for whales, Pacific 
tuna, and Pacific salmon and halibut, and agree- 
ments have been made regarding North Sea and 
Baltic Sea trawling, and the northwest Atlantic 
fisheries. The general effect of such treaties is to 
leave fishing open to the members but to restrict 
the time, place, or technique of fishing in such a 
way as to increase the sustained yield. This is cer- 
tainly a gain, in that more fish are available for 
consumers. However, the regulations have been 
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criticized because they aggravate the overemploy- 
ment and overinvestment that are already inherent 
in the utilization of common-property resources. 
First, by raising the potential catch without limiting 
the entry of vessels, they draw still more vessels into 
the fishery (Christy & Scott 1965). Second, the 
method of protecting the fish stock (for example, 
the prohibition of modern methods or the setting 
up of closed seasons) frequently creates idle ca- 
pacity or requires expensive off-season storage 
facilities. Recent studies of the operations of the 
Pacific halibut convention have demonstrated that 
the industry is by no means healthy, although the 
treaty is frequently and correctly cited as a striking 
success in the management of wildlife, in that it 
has produced or maintained large catches com- 
pared with what would have been the case in the 
absence of international agreement. 

In this connection, it should be noted that the 
UN Geneva meetings on the law of the sea, in 
1958, succeeded in introducing two conventions, 
one on high-seas fisheries and the other on the 
mineral resources of the continental shelf. Prob- 
ably on the way to becoming accepted international 
law, these conventions should increase the possi- 
bility of bringing marine resources under economic 
management. 

There are a number of international river agree- 
ments, negotiated between the riparian states. 
Among them are agreements on the Indus, the 
Rhine, the Danube, the Rio Grande, the St. Law- 
rence, and the Columbia. All permit more valuable 
development than would have been possible had 
the signatories confined their activities to their own 
waters. Few, however, have gone so far as to en- 
courage the full utilization of the river without 
regard to the boundary, 

Some action has also been taken to protect mi- 
gratory wildlife. Eleven European states interested 
in bird protection signed a convention for this pur- 
pose in 1902, and Canada and the United States 
agreed on joint action in 1916. International action 
has also produced the 1936 convention on the pro- 
tection of African flora and fauna and the 1954 
convention on the prevention of pollution of the 
sea by oil. Such conventions are interesting be- 
cause they seek “nonmarket” objectives: they have 
been supported mainly by sportsmen, on the one 
hand, and by amateur and scientific ornithologists, 
on the other. The great destructiveness of modern 
hunting methods makes it possible to annihilate 
certain species entirely (an outcome that is much 
less likely in most fisheries), so these treaties 
should indeed be classified as preserving certain 
resources “for future generations.” 


National institutions. The conservation of re- 
sources within each country is implemented by a 
rich complex of specialized institutions. This is 
particularly so for common-property resources, 
which would otherwise be overexploited, depleted, 
and perhaps destroyed. Almost all countries now 
have fisheries’ regulations, applicable within their 
territorial and inland waters. Some of these regu- 
lations are explicitly to protect the exploiting rights 
of certain communities or tribes or the owners of 
certain outmoded kinds of equipment. But others 
exist to protect fish stocks by regulating time, 
place, and method of catching. Usually, however, 
permission to fish is free or cheaply obtained, so 
that conservation (actually, management) of the 
fish stock may be achieved while labor and capital 
are, as already indicated, economically under- 
employed. 

Similarly, pressure of oil and gas in petroleum 
fields and levels of water tables in aquifers may be 
regulated to prevent destructive loss from competi- 
tive exploitation. Prorationing of oil fields is a well- 
known technique. But spacing by unitization is rare 
in North America. It is more common in other 
countries, where concessions are given for manage- 
ment of, and investment by firms and syndicates 
in, entire fields. Conservation of the resource may, 
therefore, be achieved, but at the opportunity cost 
of a substantial waste of labor and capital. 

In the same context, the special tax treatment 
for petroleum and mineral exploration and devel- 
opment should be noted. Generally based on the 
“unusual degree of risk” in exploration ventures, 
these amount to an amelioration of income tax by 
depletion allowances and the “expensing” of pros- 
pecting costs against other income. It is, however, 
hotly disputed whether this undoubted risk is a 
sufficient reason for special treatment; in addition, 
it would, in general, seem to be an anticonservation 
treatment because it leads to more rapid explora- 
tion, production, and depletion than would other- 
wise be the case. 

Another device is the export control board. This 
has occasionally been used in time of national 
emergency to prevent the export of scarce raw ma- 
terials and is, in that restricted sense, a conserva- 
tion institution. Canada’s National Energy Board, 
however, has powers to prevent the export of fuels 
or electricity until it is satisfied that Canada’s 
future needs have been covered. A Norwegian pol- 
icy respecting the export of electricity has some- 
what the same effect. This type of policy is often 
indistinguishable from that of forbidding the export 
of certain raw materials in order to encourage fur- 
ther processing by home industries. It is, therefore, 


frequently a type of industrial protection, akin to 
the tariff. 

Forests and forestry are also the business of a 
large group of special institutions. On the one 
hand, as has been noted, many forests are publicly 
owned and come under management rules that, if 
anything, exaggerate future needs. Not only have 
some governments invested more than private 
owners in land acquisition away from agriculture 
and in the planting to forest of such lands, they 
also tend to grow plantations to ages (“rotations”) 
that aim at maximizing the physical volume of 
wood produced rather than its maximum present 
value. Many European countries, especially in Scan- 
dinavia, now obtain a large proportion of their wood 
from public lands (which are frequently in regions 
that were not wanted for any other purpose and 
remained, by default, in public ownership), The 
tropical countries, also, own most of their own 
timber resources. In addition, logging and process- 
ing lumber, on some, at least, of these lands, tends 
to be a nationalized industry, although in the United 
States and Canada it is common to have short- 
tenure loggers or logging contractors undertake the 
harvesting. 

On the other hand, there is an increasing tend- 
ency for private forests to be regulated by govern- 
ment, in the interest of ensuring reforestation and 
fuller utilization. The economic value of regulation 
by law is uncertain, but such regulation may be 
defended as a means of drawing to the attention of 
small owners the profitability of more intensive 
forest management; the burden of compliance with 
the law is frequently very small. 

This legalistic approach is usually supported by 
a system of property taxation applicable only to for- 
est lands, which reduces the incentive created by 
value-based taxes to cut while trees are still small. 
In addition, there are special income tax and inheri- 
tance tax arrangements available that reinterpret 
the usual tax distinctions between income and 
wealth, to deal with the fact that a growing forest 
is both a capital good and a crop. Scandinavian 
taxation is most elaborate in these respects, but 
attention should also be drawn to the capital gains 
status of forests in American federal taxation and 
to the reliefs for forests provided under the British 
death duties. Most important, however, is the ad- 
ministration of these laws and taxes, which has an 
educative effect similar to that of farm-extension 
work. 

Conservationists, often biologists, have long been 
concerned about recreation, but the study of park 
and recreational-site planning in general is just 
commencing. As explained at the outset of this 
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article, most public lands and parks “survived” the 
nineteenth-century period of liberal land alienation 
into private use. In most countries these parks 
today—in a period of increased population, higher 
incomes, and increased leisure—are intensively 
used by the general public and are imaginatively 
managed by conscientious park services. But be- 
cause their services are given freely, there is no 
market profit reaped; access is unrationed; parks 
are congested; and a “problem” has emerged. There 
is a clear public desire for more facilities, but gov- 
ernments have been slow to respond. Economic 
analysis of land-acquisition policies is just com- 
mencing, and conflicting users of land close to 
centers of populations have not been slow to dis- 
pute the establishment of public parks, nature con- 
servancies, camp grounds, green belts, playing 
fields, and so forth. A few private substitutes have 
appeared, charging for their services, but the prob- 
lem of land allocation between recreation and 
market uses is largely neglected by the social sci- 
ences, although it should be amenable to techniques 
now used in the economics of education. 

Finally, we may refer to the preservation of sport 
fish and wildlife. As has been mentioned, interna- 
tional agreements exist that have some effect on 
the preservation of some species. But for the most 
part, protection is achieved by the domestic control 
of hunting and, to a lesser extent, of habitat. 

In general, rights to hunt and fish are conveyed 
by licenses or permits, it being an offense to hunt 
without one. In addition, certain species are pro- 
tected absolutely in certain seasons or places or up 
to a certain quota. Usually permits are free or cost 
only a nominal amount, although permits for the 
hunting of big game (especially by foreigners) may 
be sold close to what the market will bear. Some- 
times hunting or fishing rights are attached to the 
land or water, are owned privately, and may be 
leased or sold. Such private rights produce rev- 
enues that can be used to conserve the resource, 
and these revenues also ration the demand. 

But “public” hunting is not usually rationed in 
this way. Enforcement of prohibitions against an 
extensive activity that takes place in open country 
is, of course, very expensive and in some countries 
has been quite ineffective. Elsewhere, a trio of prob- 
lems prevents a sophisticated conservation regime: 
the problem of acquiring or protecting habitats from 
other land uses; the problem of learning the needs 
of the various species inhabiting the environment; 
and the problem of arranging an understandable 
set of regulations, permits, and prices that will 
encourage the harvesting of surplus stocks of some 
species while protecting the diminishing stocks of 
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others. In the long run, there is the additional 
danger that a really effective management system 
will lead to the virtual domestication of the con- 
served species, thus negating much of the ethical 
and recreational value claimed for the preservation 
of wildlife. 

ANTHONY SCOTT 
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CONSERVATISM 


“Conservatism” is a word whose usefulness is 
matched only by its capacity to confuse, distort, 
and irritate. Since the patterns of thought and 
action it denotes are real and enduring, and since 
no substitute seems likely to be generally accepted, 
“conservatism” will doubtless have a long life as a 
handy, if dangerous, tool of social science, Scholars 
who use it lie under a severe obligation to be as 
exact as they can ever be in the handling of words 
that are encrusted with tradition and saturated 
with emotion. In particular, they must recognize, 
and thus distinguish among, the uses of this word 
that have become fairly standard in the years since 
World War 11. There are, it would appear, four 
such uses. 

Temperamental conservatism. Conservatism, 
by one definition, is both the “natural” and the 
culture-determined disposition to resist dislocating 
changes in a customary pattern of living and work- 
ing. It describes, crudely and yet effectively, a tem- 
perament or psychological stance, a cluster of 
traits that are on daily display by most men in all 
societies, The important elements in the conserva- 
tive temperament would appear to be habit (what 
William James called “the enormous fly-wheel of 
society, and its most precious conservative agent”), 
inertia (a force that often seems to be as powerful 
in the social world as in the physical), fear (espe- 
cially fear of the unexpected, the irregular, and the 
uncomfortable), and emulation (a product of both 
fear of alienation from the group and a craving for 
its approval). 

All these traits of character flourish with par- 
ticular vigor among some of the most unfortunate, 
insecure, and expendable groups in society. One 
may speak with propriety, if also with pity, of the 
conservatism of the poor, the conservatism of the 
aged, and the conservatism of the ignorant. At 
the same time, one must assign a high value to the 
conservative temperament in the pattern of social 
survival and even of social progress. A community 
made up entirely of men engaged in the quest for 
security and order would be dreary, decayed, and 
perhaps even cruel, but a community that counted 
no such men would be a scene of anarchy. Neither 
the division of labor nor the maintenance of law 
and order, neither the gathering of knowledge nor 
the transmission of experience from generation to 


generation would be possible if the conservative 
temperament, especially as it is transformed into 
the imperatives of organization, were not an active 
force in the lives of men. 

Situational conservatism. Conservatism, by a 
second definition, related to the first, is an attitude 
of opposition to disruptive change in the social, 
economic, legal, religious, political, or cultural 
order. It describes, somewhat less crudely and 
somewhat more effectively, a pattern of social be- 
havior, a cluster of principles and prejudices that 
are on daily display by many men in all developed 
societies. The distinguishing mark of this conserva- 
tism, as indeed it is of any brand of conservatism, 
is the fear of change, which becomes transformed 
in the political arena into the fear of radicalism— 
in this instance, the radicalism of men who pro- 
pose to “make the world over,” or at least to 
“improve” it at the expense of old values, institu- 
tions, and patterns of living. Situational conserva- 
tism is not confined to the well-placed and well-to- 
do. Persons at all levels of being and possessing 
may lament change in the status quo. For reasons 
that range from the intuitive to the pragmatic by 
way of the traditional, many men in developed 
societies can be counted upon to express a highly 
personal sense of satisfaction and identity with 
the established order. One happy home of situa- 
tional conservatism is the upper reaches of society; 
another is the land. Where these two shaping 
forces come together, as in the situation of an Eng- 
lish duke, the result can often be a caricature of 
conservatism. 

Somewhat unfortunately for the reputation of 
both temperamental and situational conservatism 
among social scientists, as well as for the reputa- 
tion of social science among political conservatives, 
several studies of political behavior have tended, 
despite repeated disclaimers of ill intent, to equate 
conservatism with authoritarianism, obscurantism, 
racism, fascism, alienation, maladjustment, and 
“the closed mind.” Some social scientists have 
elected to probe the dark frontier between politics 
and personality by constructing a spectrum that 
runs from an extreme labeled “conservatism” to an 
extreme labeled “liberalism,” and then, by eliciting 
and tabulating responses to questions designed to 
reveal attitudes toward controversial social and 
personal problems, to place the persons questioned 
—be they workers in Detroit or criminals in San 
Francisco or citizens chosen at random in Liver- 
pool—along the scale. It is revealing that “liberal” 
responses to the questions of even the most fair- 
minded students of political behavior are almost 
always given “plus” values in the resulting tables 
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and scales, while “conservative” responses are 
tagged with the ever so slightly tainted “minus” 
sign. 

The prototype of such studies was carried 
through by T. W. Adorno and his associates and 
was published as The Authoritarian Personality 
(1950). The question may well be raised whether 
this kind of study has not done damage to the 
image of conservatism and rendered the word itself 
suspect among many social scientists who hitherto 
found it useful and not at all pejorative in content. 
In any case, much careful research must be done 
before political and philosophical conservatism can 
be linked causally with the unpleasant aspects of 
temperamental and situational conservatism that 
are assumed to form this polar phenomenon known 
as “the closed mind.” 

Political conservatism. If we bring together in 
imagination a goodly number of conservatives of 
temperament and situation, then thrust them into 
the hurly-burly of active politics, we move naturally 
toward a third definition of conservatism that is 
roughly synonymous with the worn but still con- 
venient label “the Right.” Most persons who talk 
of conservatism mean political conservatism, that 
is to say, the aspirations and activities—most of 
them defensive rather than creative—of parties 
and movements that celebrate inherited patterns 
of morality and tested institutions, that are skep- 
tical about the efficacy of popular government, that 
can be counted upon to oppose both the reforming 
plans of the moderate Left and the deranging 
schemes of the extreme Left, and that draw their 
heaviest support from men who have a substantial 
material and psychological stake in the established 
order. 

Political conservatism is a phenomenon which, 
if we stretch the definition beyond the boundaries 
of common sense, is a universal of organized so- 
ciety. One may find, if one seeks them intensively 
enough, political conservatives in every country 
and culture. One may even say that since conserva- 
tism is essentially the defense of a going society, 
the leaders of the Soviet Union are conservatives. 
This, however, is to ignore both the history and 
logic of this phenomenon, which comes fully to 
life—as does its great partner and adversary, 
liberalism—in the civilized political and cultural 
struggles of the open, ordered, constitutional so- 
ciety. The Tories of Great Britain, the Republicans 
of the United States, the Gaullists of France, and 
the Christian Democrats of a half-dozen European 
countries are conservative parties in the most 
meaningful sense, the Liberal Democrats of Japan 
and the Swatantra party of India in a rather less 


292 CONSERVATISM 


meaningful sense, the Communist party of the 
Soviet Union in no sense at all except the arbi- 
trary or whimsical. The political situation in which 
they lived and wrought made it fully possible for 
Herbert Hoover, Stanley Baldwin, Dwight D. Eisen- 
hower, Konrad Adenauer, Sir Winston Churchill, 
and Charles de Gaulle to act as conservative states- 
men, but we would not easily find counterparts of 
such men in most parts of Asia and Africa. 

A distinction should be made, although it is 
often impossible to maintain in politics, between 
the two classic political positions of the Right: 
conservatism and reaction. Reaction is the position 
of men who sigh for the past more intensively than 
they celebrate the present and who feel that a 
retreat back into it is worth trying. While the 
conservative, a man essentially at rest, may in- 
dulge from time to time in reveries of 1945 or 
1928 or 1896 or even 1788, he is generally well 
adjusted, psychologically as well as programmati- 
cally, to “a world he never made.” 

The reactionary (or, as some might prefer, the 
restorationist), a man in motion, refuses to ac- 
knowledge that whatever has been settled must 
henceforth be considered good or at least tolerable, 
and he seems willing to erase some laws, scrap 
some institutions, even amend his nation’s consti- 
tution, so that he can roll back the social process 
to the time at which his countrymen first went 
foolishly astray. The restorationist should not be 
confused with the violent reactionary, who, like 
the violent revolutionary, seems ready to shed 
blood and to subvert all order in pursuit of his 
immediate ends. 

There is, it can be seen, an essential relation- 
ship—or perhaps nonrelationship—between con- 
servatism and revolution. The historic mission of 
political conservatism in the West has been not to 
defeat but to forestall revolutions, not to crush but 
to anticipate them. Conservatism in a truly revolu- 
tionary situation is a politics of delusion; a society 
that bursts into revolution will not be able to 
smother it by conservative means. The conserva- 
tive is above all a man of order, and a shattered 
society has no place for him. To speak of conserva- 
tism in France in 1792 or Russia in 1917 or Cuba 
in 1962 is to speak of political arts that no man 
could have practiced. The explosion of a genuine 
social revolution is rather certain evidence that po- 
litical conservatism has never flourished in a com- 
munity or, if it has flourished as an important 
movement, has failed in its mission. 

Political conservatism is a posture, whether 
adopted by individuals, classes, or parties, that is 
increasingly difficult to maintain in the twentieth 


century. In many parts of the world—for example, 
in totalitarian countries like the Soviet Union and 
Hungary, unstable countries like Colombia and 
Peru, and only half-formed countries like some of 
the new states of Africa and Asia—genuine con- 
servatism as a political force can hardly be said 
to exist, and persons who might make excellent 
conservatives in more ordered societies must choose 
between the nihilism of “standing pat,” the frus- 
tration of trying to recreate a dead past (which 
may never have existed in the first place), or the 
cryptoradicalism of riding the tiger of revolution. 
Even in those parts of the world in which the poli- 
tics of tradition, wealth, and aristocracy has been 
a historic force—for example, in Great Britain, the 
United States, Canada, France, Japan, and the 
Scandinavian countries—conservatives are sorely 
tried by the pressure of events over which neither 
they nor, as it often seems, any men seem to exer- 
cise effective control, 

The political conservative is almost always the 
prisoner of the social process as it is embodied in 
the traditions and institutions of his country, and 
thus the foil of those men who, knowingly or un- 
knowingly, keep the process in motion. They act; 
he only reacts—except on those rare occasions 
when he can emulate Benjamin Disraeli and “steal 
the Whigs’ clothes.” If they act as liberals, if the 
social process moves steadily but not explosively, 
his reactions can take the form of conservatism. 
But if they act as radicals, if the process begins to 
speed up sharply, his reactions must aim beyond 
mere preservation and at restoration, If, as now 
seems to be the case all over the world, the pace 
of history gets out of control, the conservative can 
no longer fall back on the simple, instinctive acts 
of conservation. He, like the liberal, must reason 
and discriminate; he, like the radical, may have to 
plan and gamble. The “conservative as reformer,” 
the right-wing politician who tries to outpromise 
liberals in the area of welfare legislation, is an 
uncomfortable man. The “conservative as revolu- 
tionary,” the traditionalist who acts “radically” to 
preserve the crumbling values and institutions of 
his community, is no conservative at all. Small 
wonder that many conservatives in Western coun- 
tries have turned away from politics to fight the 
good fight of tradition in such areas as art, criti- 
cism, and education. 

In no country in the world is the dilemma of the 
true conservative, the man who wants to be neither 
a reactionary nor a pseudoliberal, more poignant 
than in the United States. For many reasons—the 
suddenness with which both democracy and indus- 
trialism came to stay as the American way of life, 


the absence of such comforting relics of a more 
ordered age as an established church or a mon- 
archy, the crushing of self-conscious aristocracy 
by the forces of an upstart plutocracy, the feeble- 
ness of political and theoretical radicalism (to 
which men of status and substance could react 
creatively )—conservatism has never flourished in 
America as successfully as it has in Great Britain. 
Now that the kind of social change resulting from 
automation and the reach for space has become the 
style of the American way of life, the true agents 
of change—the business and managerial classes— 
are proving conclusively that sponsorship of social 
revolution and opposition to political reform can 
go hand in hand. The dilemma of the American 
conservative will never be more dramatically dem- 
onstrated than it was in the career of Henry Ford— 
in his personal habits and opinions the most con- 
servative of men, in his influence upon America 
one of the most marvelous agents of profound and 
protracted social change the world has ever known. 

Conservatism as philosophy. Beyond the con- 
servatisms of mood and deed, yet almost always 
relating directly back to them, is the conservatism 
of thought. As a philosophy dedicated to the de- 
fense of an established order, and also to the 
leadership of certain groups or classes within the 
order, conservatism is an important intellectual 
force in most countries of the West. In the years 
since World War 11, a season of disillusionment 
over the broken promises of the once-ascendant 
liberal tradition, conservative and pseudoconserva- 
tive thinkers have been more active than at any 
period in the past two centuries. 

Wherever it is an intellectual force, conserva- 
tism has earned its measure of respect and influ- 
ence the hard way. Unlike the radical or liberal, 
the genuine conservative engages reluctantly in 
political speculation. The most famous conserva- 
tive statesman of the twentieth century, Sir Winston 
Churchill, steadfastly refused, for all his literary 
skills, to reflect upon and write down the prin- 
ciples that animated his career. The mere inten- 
tion to spin out a theory of conservatism is some- 
how an unconservative impulse, and the pursuit of 
this intention carries a person dangerously far 
from that simple social piety, that hesitation about 
poking a finger into the “cake of custom,” that is 
the essence of the conservative point of view. There 
would seem to be a few grains of truth in the ob- 
servation of many critics that a conservatism in 
search of clear-cut principles is a conservatism 
already in retreat. 

Whether it is, in fact, in retreat or on the attack 
in the realm of politics, conservatism is flourishing 
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in the realm of ideas, and one must recognize the 
existence of a core of principles that is the com- 
mon property of the moderate Right wherever it 
fiourishes as a legitimate force. These, it would 
seem, are the persistent themes of the philosophers 
of modern conservatism: 


The existence of a universal moral order sanc- 
tioned and supported by organized religion. 

The obstinately imperfect nature of men, in 
which unreason and sinfulness lurk always be- 
hind the curtain of civilized behavior. 

The natural inequality of men in most quali- 
ties of mind, body, and character. 

The necessity of social classes and orders, and 
the consequent folly of attempts at leveling by 
force of law. 

The primary role of private property in the 
pursuit of personal liberty and defense of the 
social order. 

The uncertainty of progress, and the recog- 
nition that prescription is the chief method of 
such progress as a society may achieve. 

The need for a ruling and serving aristocracy. 

The limited reach of human reason, and the 
consequent importance of traditions, institu- 
tions, symbols, rituals, and even prejudices. 

The fallibility and potential tyranny of ma- 
jority rule, and the consequent desirability of 
diffusing, limiting, and balancing political power. 


One could go further with this catalogue, for 
example, by taking note of the conservative pref- 
erence for liberty over equality, the conservative 
insistence that education must discipline before it 
can liberate, the conservative delight in such 
phrases as “the tragedy of history” and “the inade- 
quacy of politics,” and the conservative celebra- 
tion of the prudent man and the ordered society; 
but the principles listed would seem to be the 
essence of self-conscious conservatism in the twen- 
tieth century. 

Two points should be noted about the list. First, 
it is a series of abstractions, and since conserva- 
tives express a horror of abstract thinking each of 
these ideas must be referred back to a particular 
society and tradition. Second, in the special case 
of the United States, society is confidently pro- 
gressive and the dominant tradition happily demo- 
cratic, indeed liberal. As a result, the philosophy 
of conservatism in modern America is a jumble of 
crosscutting answers to the persistent questions of 
political theory, especially the question of where 
the line is to be drawn between the rights of the 
individual and the demands of the community. 
Deep inside the shell of his half-Jeffersonian, half- 
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Hamiltonian ideology, however, the American con- 
servative nurses principles and prejudices that are 
closely related to those of conservatives in other, 
less self-consciously democratic countries. 

The conservatism of Burke. These considera- 
tions lead to a discussion of the most famous 
brand of philosophical conservatism, the school of 
political thought identified with Edmund Burke. 
This conservatism sprang to life in the turmoil of 
the 1780s and was especially a reaction to the ex- 
cesses of the French Revolution. Although men as 
separated in time and purpose as Aristotle, Cicero, 
St. Augustine, St. Thomas Aquinas, Richard 
Hooker, and John Locke had created essentially 
conservative systems of political thought, Burke’s 
Reflections on the Revolution in France (1790) is 
properly considered the purest source of conscious- 
ly conservative principles. Other important events 
in the birth of this conservatism were the indus- 
trial revolution, which made change rather than 
stability the style of the social process, and the 
Enlightenment, which put reason in place of tra- 
dition as the surest guide to human conduct. The 
result was the emergence of a political faith that 
celebrated the beauties of stability and tradition. 

The conservatism of Burke has been and re- 
mains a Western phenomenon, a philosophy pe- 
culiar to the Atlantic community and some of its 
extensions throughout the world. Indeed, although 
it has loyal adherents in many countries, this con- 
servatism has held continuous sway as a major 
political and intellectual force only in Great Britain. 
It has not flourished as it might have in France 
or Germany or Italy because, among other reasons, 
there has not been sufficient agreement among the 
men of the Right as to what institutions and values 
they want to conserve. It does not flourish as it 
once did in the United States because, among other 
reasons, democracy and industrialism have swept 
so much more powerfully, abruptly, and success- 
fully over the American scene than they have over 
any other country of the West. While John Adams 
could be a conservative in the style of Burke and 
serve as president of the United States, his real 
and spiritual descendants—Henry Adams, Brooks 
Adams, Irving Babbitt, Ralph Adams Cram, Paul 
Elmer More, and, more recently, Russell Kirk— 
have been cast, as Kirk himself has admitted rue- 
fully, in “the role of Don Quixote.” 

At its present stage of development, this con- 
servatism is prepared to defend most of the values 
and institutions of the West. The conservatism of 
Burke, as it is proclaimed in the writings of his 
political and intellectual heirs, is full of doubts 
about the goodness and equality of men, the pos- 


sibilities of progress, and the wisdom of the ma- 
jority—that is to say, about the democratic dogma 
—but it has long since accepted constitutional 
democracy as the only viable alternative to the 
totalitarianisms of Right and Left. The Burkean 
conservative has learned to suppress his persistent 
antidemocratic urges and to give new definitions, 
and thus a new influence, to such cherished con- 
cepts as tradition, order, and aristocracy. It must 
be remembered that he draws his inspiration from 
the Whiggish Burke, not from the reactionary 
Joseph de Maistre; his concern is ordered liberty, 
not order pure, simple, and at any cost. If he is a 
democrat by chance rather than choice and is 
gripped by a mood of pessimism rather than opti- 
mism about the prospects of democracy, he is 
nonetheless a democrat. 


Whether as a pattern of personal behavior, a 
cluster of social attitudes and prejudices, a force 
in the realm of politics, or an enveloping way of 
life and thought, whether as a phenomenon to be 
found on display by individuals, classes, interests, 
sections, parties, or even entire nations, conserva- 
tism bids well to be a major force in the latter part 
of the twentieth century. Even as great winds of 
social change sweep across the world and make it 
more difficult for men to think and act as con- 
servatives, the desire to discipline change, if no 
longer to arrest it, becomes, paradoxically, an ever 
more powerful urge. If conservatives can no longer 
afford to repeat after Lord Falkland that “when it 
is not necessary to change, it is necessary not to 
change” (White [1950] 1957, p. 127), or after 
Samuel Johnson that “most schemes of political 
improvement are very laughable things” (Rossiter 
1962, p. 51), they can at least insist, in the ad- 
monishing words of Disraeli, that those men who 
would pile change upon change and reform upon 
reform be properly deferential to “the manners, the 
customs, the laws, the traditions of the people” 
(White [1950] 1957, p. 127). It is their task to 
prove to an always doubting world that conserva- 
tism is an essential part of any pattern of ordered 
liberty. 

CLINTON ROSSITER 


[See also LIBERALISM; PERSONALITY, POLITICAL; RADI- 
CALISM; and the biography of BuRKE.] 
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I. INTRODUCTION 

1. DISTRIBUTION OF POWERS 
mi. Civit LIBERTIES 

Iv. Cıvıl RIGHTS 


I 
INTRODUCTION 


Constitutional law is law derived from, related 
to, or interpretive of a constitution. The term may 
in fact be used synonymously with constitution, 
since the function of a constitution is to provide 
the basic law for a governmental regime. But it is 
more common, at least in governments with writ- 
ten constitutions, for the term constitution to refer 
to the actual text of the basic document, while con- 
stitutional law connotes the constitution as inter- 
preted and applied by the organs of government 
and as elaborated and rationalized by scholarly 
commentators. 

Every regime must have an organizational frame- 
work, an agreed division of responsibilities among 
public instrumentalities, and a system of defini- 
tions of individual rights and status in the com- 
munity. In a traditional society these arrangements 
and understandings are established by custom, but 
in more modern societies they are subject, through 
a process of conscious choice, to determination or 
revision by the policy-making institutions of gov- 
ernment. These determinations have, since the 
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adoption of the American and French constitutions 
toward the close of the eighteenth century, gen- 
erally been codified in a comprehensive written 
document that has status superior to ordinary stat- 
ute law and that can be amended only by a special 
legislative procedure. 

In England, however, it has proved possible to 
proceed on into the twentieth century without any 
systematic codification of the rules of the political 
system. Constitutional law in England consequently 
cannot be clearly distinguished from other public 
law, since it has no superior legal authority; and 
any constitutional practice, even the most funda- 
mental, can be overridden or revised by act of 
Parliament. A treatise on English constitutional law 
is simply a treatment of all the written and un- 
written law generally considered essential to the 
operation of the governmental system. 

Where, as is now generally the case, a written 
constitutional instrument is in existence, consti- 
tutional law is a formal category of law defined by 
the scope and provisions of the document. Any 
matter covered in the constitution, even arrange- 
ments of slight general importance, takes on the 
stature of constitutional law. Conversely, matters 
not covered in the constitution, no matter how im- 
portant—such as political parties not being included 
in the U.S. constitution—are not subjects of con- 
stitutional law. 


Responsibility for interpretation 


Where a written constitution provides the foun- 
dation for a political regime, the principles of con- 
stitutional law for that regime are derived by inter- 
pretation of the language of the document; these 
interpretations are then applied in making govern- 
mental decisions and settling constitutional con- 
troversies. Some constitutional language is so 
specific that little interpretation is required, as, for 
example, the provision that the U.S. president 
“shall hold his office during the term of four years.” 
But most constitutional language leaves room for 
varying views as to its meaning, and such pro- 
visions as “due process of law” or “freedom of 
speech” or “unreasonable searches and seizures” 
can never be applied without making a choice 
among alternative interpretations. 

Whose responsibility is it to interpret a written 
constitution and authoritatively to declare consti- 
tutional law? An important part of this function 
falls to the executive branch, which must inevit- 
ably and continually interpret the constitution in 
making decisions on the use of executive power. 
In 1841 a U.S. president died in office for the first 
time. The language of the constitution left it un- 
clear whether it was the “office,” or only the “powers 


and duties” of the office which devolved on the 
vice-president. This important constitutional ques- 
tion had to be determined by the then vice-presi- 
dent, John Tyler, who, after some initial hesitation, 
decided that he was actually president and not 
simply vice-president acting as president. This view 
was quickly accepted as the constitutional law on 
presidential succession. 

Congress also must continuously construe the 
constitution as a guide to action. For example, 
the constitution authorizes the president to make 
treaties, “by and with the advice and consent of 
the Senate.” George Washington assumed that this 
provision meant that he should sit down with the 
senators and get their advice on treaties while they 
were being negotiated, but when he went to the 
Senate for this purpose, the senators preferred not 
to discuss the matter in his presence and voted to 
refer it to a committee. Washington had no choice 
but to withdraw, yielding to the Senate's interpre- 
tation of “advice and consent,” and no subsequent 
president has ever sought to repeat this tactic. 

These two instances indicate how it is possible 
for the executive and the legislature to make bind- 
ing interpretations of their own respective powers 
under a constitution. In fact, it was argued by 
many during the early years of the American repub- 
lic that each branch of government should be the 
authoritative interpreter of all those provisions 
dealing with its own status and powers. However, 
this tripartite theory of constitutional interpreta- 
tion was strongly challenged by Justice Marshall 
in the 1803 case of Marbury v. Madison (1 Cranch 
137), where he asserted the primacy of judicial 
interpretation of the constitution. 

In the Marbury case the Supreme Court declined 
to enforce a statute duly enacted by Congress be- 
cause, as Marshall interpreted the act, it was con- 
trary to the constitution. The judicial power to 
declare acts of Congress unconstitutional is not 
explicitly stated in the constitution, and the argu- 
ment as to whether the drafters intended the Court 
to have such power has never been entirely settled. 
Some of the major battles of American politics have 
resulted from legislative or executive resistance to 
the Courts interpretations, and notable efforts 
have been made to deny the supremacy of judicial 
views on constitutional interpretation. Both Andrew 
Jackson in his veto of the Bank Bill in 1832 and 
Abraham Lincoln in his first inaugural address 
took this position. 

In spite of such opposition, the Supreme Court’s 
qualifications to act as pre-eminent interpreter of 
the constitution have been generally accepted. The 
Court has minimized resistance to its role by adopt- 
ing rules of self-restraint, which keep it from pass- 
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ing on certain constitutional questions of a “polit- 
ical” nature, and has generally been deferential 
toward the constitutional interpretations by the 
president and Congress. It has seldom asserted a 
constitutional position that challenged a dominant 
popular opinion, although there have been such 
“self-inflicted wounds” as the Dred Scott decision 
in 1857 and the invalidation of the income tax 
in 1895, 

During the twentieth century the Court's most 
serious miscalculations on constitutional issues 
came in 1935 and 1936, when it declared a num- 
ber of New Deal statutes unconstitutional. Presi- 
dent Roosevelt was unable to get through Congress 
legislation authorizing him to appoint additional 
justices who would support the New Deal; but in 
1937 the Court itself reversed its position, thus 
terminating the dispute. Two decades later the 
Court’s constitutional views again became the 
center of controversy, primarily on the issue of 
racial segregation in the public schools, but also 
on certain national security issues in 1957, on legis- 
lative apportionment in 1962, and on the problem 
of religion in the public schools in 1962 and 1963. 

The enormous prestige of the United States Su- 
preme Court has helped to popularize judicial re- 
view of legislation and judicial supremacy in con- 
stitutional interpretation. Prior to World War 1 it 
was primarily in federal systems within the British 
Commonwealth—notably Canada and Australia— 
that courts with constitutional responsibilities some- 
what comparable to the U.S. Supreme Court were 
found. However, following World War 11 constitu- 
tional courts were established in West Germany, 
Italy, India, and elsewhere. 


Principles of interpretation 


Wherever the process of constitutional interpre- 
tation goes on, it must be guided by some more or 
less articulate theory about the extraction of mean- 
ing from constitutional language. The interpreter 
cannot merely insist that the constitution means 
whatever he wants it to mean, or at least he can- 
not admit such an approach to the interpretative 
process. For the essence of constitutional purpose 
is to establish a degree of certainty, to impose limi- 
tations that will affect all alike, and to give effect 
to rules that are external to the value system of the 
interpreter. Various approaches to the establish- 
ment of stable constitutional meanings have been 
proposed. 

First, it may be contended that the constitution 
should mean what its framers meant it to mean. 
In the United States it is a rare constitutional de- 
bate in which someone does not appeal to “the 
intention of the framers.” Thus, when the question 


297 


is raised whether reciting prayers in public schools 
is an “establishment of religion,” it will be argued 
by some that this depends upon what the members 
of the first Congress meant by this phrase when 
they were drafting the first amendment. In the case 
of some provisions, the purpose of the drafters may 
seem reasonably clear from the language of the 
document or the historical data. But generally this 
is not true. Constitutional language is always the 
product of group effort and compromise and may 
be deliberately chosen to bridge over differences of 
opinion, The intentions involved are the intentions 
of many individuals who participated to a greater 
or lesser degree and at various stages in the process 
of constitutional draftsmanship and subsequent 
ratification. The proceedings of the Constitutional 
Convention are known largely through the incom- 
plete notes taken by James Madison. Some of the 
men who participated in drafting the constitution 
were in wide disagreement about its intention 
within a few years. 

Because individual and group intentions are dif- 
ficult to determine, a second theory of constitu- 
tional interpretation proposes concentration on 
word meanings. As Justice Oliver Wendell Holmes 
put it, “We ask, not what this man meant, but 
what those words would mean in the mouth of a 
normal speaker of English, using them in the cir- 
cumstances in which they were used” ([1885-1918] 
1952, p. 204). This method employs somewhat 
narrower lexicographic skills as compared with the 
social historicism on which the first method relies. 
It is more rigorously confined by the document it- 
self and more closely related to the processes by 
which the written instruments of private law are 
construed. 

An alternative to these two basically historical 
methods of determining constitutional meaning is 
the approach of logical analysis, which was heavily 
used by Chief Justice Marshall in his great de- 
cisions. In Marbury v. Madison, for example, Mar- 
shall cites no judicial decisions to support his 
arguments and, although referring to “original in- 
tention,” makes no effort to quote contemporaneous 
evidence or opinion. His argument is primarily an 
exercise in logic. “It seems only necessary to recog- 
nize certain principles,” he says, “supposed to have 
been long and well established, to decide [the 
case].” The major principle is that the constitution 
is the supreme law of the land. The Supreme Court 
has taken an oath to uphold the constitution. The 
conclusion logically follows that when an act of 
Congress conflicts with the superior law, the Su- 
preme Court cannot enforce it, but must declare it 
null and void. 

This position has been so long accepted that the 
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logic supporting it may seem unassailable. But it is 
equally logical to argue that the constitution is the 
supreme law of the land, and since the president 
has taken an oath to support the constitution he 
cannot enforce a Supreme Court decision that con- 
flicts with the constitution, but must declare it null 
and void. The problem is simply not one to which 
logic can guarantee a correct answer. The limita- 
tions on the contribution of logic can be made 
clearer by stating a part of Marshall’s argument as 
a syllogism. 


Major premise: A law repugnant to the consti- 
tution is void. 

Minor premise: This law is repugnant to the con- 
stitution. 

Conclusion: This law is void. 


Assuming the validity of the major premise, the 
soundness of the conclusion depends upon whether 
the minor premise is factually true. But logic can- 
not tell us whether a particular law is repugnant 
to the constitution. That is a matter of informed 
opinion and judgment. Justice Holmes, in one of 
his most famous passages from The Common Law 
([1881] 1963, p. 5) disparaged the logical ap- 
proach: “The life of the law has not been logic: it 
has been experience, The felt necessities of the 
time, the prevalent moral and political theories, in- 
tuitions of public policy, avowed or unconscious, 
even the prejudices which judges share with their 
fellow-men, have had a good deal more to do than 
the syllogism in determining the rules by which 
men should be governed.” 

It seems clear that no one rule of constitutional 
interpretation can be asserted to be the proper or 
sole approach to constitutional understanding. A 
constitution is more a political than a legal docu- 
ment. Consequently, all of the factors that go into 
the formation of divergent political preferences 
will also be operative in the minds of constitutional 
interpreters, be they executives, legislators, or 
judges. How a constitution will be read depends in 
large part upon the men who are doing the read- 
ing and how the world looks to them. The constitu- 
tional system is not separate from the political 
system, but a necessary part of it, performing the 
vital function of giving order and structure to the 
processes of policy formation. Holmes put the con- 
ception of a “living” constitution into eloquent 
language when he wrote: 


. . when we are dealing with words that also are a 
constituent act, like the Constitution of the United 
States, we must realize that they have called into life 
a being the development of which could not have been 
foreseen completely by the most gifted of its begetters. 


It was enough for them to realize or to hope that they 
had created an organism; it has taken a century and 
has cost their successors much sweat and blood to 
prove that they created a nation. The case before us 
must be considered in the light of our whole experi- 
ence and not merely in that of what was said a hun- 


dred years ago. (Missouri v. Holland, 252 U.S, 416, 
1920) 
Stability and change 


The central problem in the development of a 
system of constitutional law is, as Holmes suggests, 
to recognize the need for adapting and changing 
to meet new circumstances, while preserving the 
basic values of the constitutional system. In a 
world growing constantly more crowded and com- 
plex with new ideas, new technical developments, 
new ways of living, new standards of social re- 
sponsibility, and new world crises, the pressure on 
constitutional rules established to meet the differ- 
ent conditions of an earlier day will inevitably force 
their revision or abandonment. 

The most obvious method of adapting constitu- 
tional law to changed conditions is to replace the 
old constitution with a new one. When an existing 
political system is overthrown, the new holders of 
power customarily seek to organize and legitimize 
their regime by the adoption of a new constitution. 
It is an index to the political instability of France 
that it has had 15 constitutions since the Revolu- 
tion. American states also adopt new constitutions 
often, but not because of political upheaval. State 
constitutions tend to be very long and detailed, 
regulating the minutiae of the state and local gov- 
ernmental system, and can become obsolete quite 
rapidly. Periodically, therefore, states find it more 
expedient to draft a new constitution than to 
attempt to patch up the old one. 

More commonly, the process of constitutional 
adaptation is achieved by amendment, interpreta- 
tion, and custom. The drafters of the U.S. consti- 
tution were the first to recognize the need for 
making provision within the document itself for its 
own revision. The presence of the amending clause 
was one of the factors that led Thomas Jefferson, 
originally inclined to oppose the constitution, to 
decide in its favor. 

The method of amending the constitution is 
usually more difficult than the passage of normal 
legislation, although this is not true in some coun- 
tries, such as New Zealand and the Republic of 
South Africa. The arrangements for amending the 
U.S. constitution, requiring approval by two-thirds 
of each house of Congress and three-fourths of the 
states, have on occasion been criticized as too dif- 
ficult. Following the Civil War amendments ( thir- 


CONSTITUTIONAL LAW: Introduction 


teenth, fourteenth, and fifteenth), there was a 
period of more than forty years during which the 
constitution appeared unamendable, in spite of 
strong pressure for revision to meet the problems 
of a rapidly expanding economy. But between 1913 
and 1933 six amendments were added to the con- 
stitution, and the experience with the eighteenth 
(prohibition) amendment showed that it was even 
possible for a small but dedicated pressure group to 
exploit the amending machinery. 

‘Article 89 of the French constitution provides for 
amendment of the constitution by vote of parlia- 
ment. In 1962, however, President de Gaulle ig- 
nored this method and submitted to a popular ref- 
erendum an amendment providing for election of 
the president by popular vote instead of by limited 
suffrage. In Australia proposed amendments, after 
passage by each house of Parliament, must be ap- 
proved by a popular referendum throughout Aus- 
tralia and also by a majority vote in four of the six 
states, The Canadian constitution (British North 
America Act of 1867) contains no provision for 
its own amendment; consequently, constitutional 
change has required recourse to the original source 
of the constitution, the British Parliament. 

The amending power has been less important in 
the development of the U.S. constitution than has 
the previously discussed process of interpretation. 
The document, drafted in 1787, could scarcely have 
met the needs of a world power in the twentieth 
century unless it had been construed with some 
flexibility. It was one of John Marshall's greatest 
achievements that he saw the need for a broad con- 
struction of the constitution. “We must never for- 
get, that it is a constitution we are expounding,” 
he said, one that is “intended to endure for ages to 
come, and consequently, to be adapted to the vari- 
ous crises of human affairs” (McCulloch v. Mary- 
land, 4 Wheaton 316, 1819). 

Revising a constitution by interpretation does 
create both theoretical and practical problems. 
Justice Frankfurter once suggested that “nothing 
new can be put into the Constitution except through 
the amendatory process. Nothing old can be taken 
out without the same process” (Ullmann v. United 
States, 350 U.S. 422, 1956), It must be presumed 
that he was using this rather extreme method of 
warning that if a constitution is to fulfill its func- 
tion of acting as a stabilizing and controlling influ- 
ence in a political system, the interpretations given 
to it must be maintained with some measure of 
continuity, and the agencies that interpret and 
apply the constitution must feel under some com- 
pulsion to accept the views announced by their 
predecessors. 
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Regard for precedent is, of course, one of the 
foundation stones of legal systems generally, ap- 
plicable in both constitutional and statutory inter- 
pretation. However, the principle of stare decisis, 
necessary as it is in the settlement of normal legal 
controversies, presents unusual difficulties when 
applied to constitutional meanings. If a constitu- 
tional interpretation, once announced, can be modi- 
fied only by constitutional amendment, an impos- 
sible burden is thrown on the amending machinery. 
The U.S. Supreme Court has, therefore, on occasion 
overruled its earlier decisions. One of the most note- 
worthy instances of the overruling of a prior con- 
stitutional position occurred in 1954 in Brown v. 
Board of Education (347 U.S. 483). In an 1896 
decision the Supreme Court had held that segre- 
gated facilities for Negroes were not a violation of 
the equal protection clause of the fourteenth 
amendment, provided they were the “equal” of the 
facilities provided for white persons. But when the 
Court had to decide in 1954 whether racial segre- 
gation was permissible in the public schools, it held 
unanimously that “we cannot turn the clock back 
. . . to 1896. . . . We must consider public educa- 
tion in the light of its full development and its 
present place in American life throughout the Na- 
tion.” Such consideration convinced the Court that 
the rule of “separate but equal” was no longer com- 
patible with the concept of equal protection under 
the laws, and consequently it was abandoned. 


The study of constitutional law 


Traditionally, students of constitutional law have 
concerned themselves with collecting and analyz- 
ing constitutional interpretations and organizing 
and rationalizing them into a coherent system. 
Some constitutional commentators gain such pres- 
tige by the force of their reasoning that they them- 
selves become authoritative sources of constitutional 
law. Thomas Cooley's A Treatise on Constitutional 
Limitations (1868), which appeared in the same 
year the fourteenth amendment was adopted, so 
effectively argued for the doctrine of implied con- 
stitutional limitation on the powers of American 
states that it was a powerful force toward the 
stricter judicial review of state legislative action 
that characterized the latter part of the nineteenth 
century. 

In the twentieth century the general constitu- 
tional commentary has tended to give way to trea- 
tises on the constitutional status of particular insti- 
tutions, such as Corwin’s The President: Office and 
Powers (1940) or Jennings’ Parliament (1940). 
There have also been frequent analyses of the ap- 
plication and interpretation of specific constitu- 
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tional provisions, such as Chafee’s study of the first 
amendment in Free Speech in the United States 
(1941), Constitutional law has been taught in U.S. 
colleges and law schools almost entirely by use of 
Supreme Court decisions collected in casebooks, 
reflecting the degree to which constitutional inter- 
pretation is monopolized by the courts. 

Behavioral trends in the social sciences have had 
a significant impact on the study of constitutional 
law. As early as 1913, Charles Beard argued that 
economic advantage had motivated the framers of 
the constitution. Later, prominent political scien- 
tists such as Corwin and Robert Cushman, biogra- 
phers of Supreme Court justices such as Swisher 
(1930) and Mason (1956), and the school of legal 
realists headed by Jerome Frank (1930) all empha- 
sized the influence of judicial personality on con- 
stitutional interpretation. Finally, in the 1950s, 
judicial behavior, particularly the decision making 
of the Supreme Court, began to be subjected to 
study through a whole armory of sociological and 
psychological tools, including voting bloc analysis, 
small group theory, Guttman scaling, vector analy- 
sis, and game theory (Schubert 1963). 


C. HERMAN PRITCHETT 


[See also PRESIDENTIAL GOVERNMENT.] 
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It 
DISTRIBUTION OF POWERS 


From the beginnings of political speculation, 
much time and theoretical effort have been as- 
signed to the demarcation and definition of juris- 
dictions between competing elites within various 
societies. While the bases of jurisdiction have 
shifted over the centuries, one can hear in con- 
temporary Israel echoes of Old Testament disputes 
between kings and prophets, and in contemporary 
America the rhetoric of states’ rights has archetypal 
resemblances to the polemical literature of the 
early modern struggle between royal centralists 
and feudal autonomists. And although the external 
form of modern disputes is radically different from 
earlier jurisdictional disputes, the key issue has 
not altered in two thousand years, namely, where 
does the final jurisdiction to define jurisdictions 
rest? 

This issue has always been, and remains today, 
the crux of the matter; in the vocabulary of mod- 
ern politics it is known, of course, as the question 
of sovereignty. Even the most ultramontane pope 
never denied the existence of the imperium, the 
emperor's sector of jurisdiction. Nor did the most 
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ambitious of the Holy Roman emperors ever assert 
that the sacerdotium, the ecclesiastical jurisdiction, 
was nonexistent. But a struggle raged for centuries 
that, although seemingly centered on the scope of 
the respective jurisdictions, was in fact directed 
to the antecedent problem of ultimate determina- 
tion of the appropriate forum for settlement of 
jurisdictional disputes. Archbishop Thomas a 
Becket’s argument in his confrontation with Henry 
11 in 1164 over the question of the criminous clerk 
(Pollock & Maitland [1895] 1952, vol. 1, pp. 447- 
457) was thus at base analogous to the logic of the 
Virginia and Kentucky resolutions of 1798-1799; 
like the archbishop, Thomas Jefferson and James 
Madison denied their opponents the right to define 
their own jurisdictions. 

Every political community larger than the Greek 
polis has been faced with the problem of distribut- 
ing power, at least in functional terms. Subgovern- 
ments—provinces, exarchates, shires, communes, 
etc.—have been established for purposes of admin- 
istrative convenience. Other varieties of subgovern- 
ment have originated from more complex origins, 
for example, Huguenot autonomy in France as a 
consequence of the Edict of Nantes, Scottish privi- 
leges under the Act of Union with England, Turkish 
communal rights as stipulated in the Cypriot con- 
stitution. In this context, what problems arise do 
so as a result of ambiguity in the relationship of 
the peripheral units to the center. When sover- 
eignty, i.e., the final power to define jurisdictions, 
clearly resides in the center, we have what is com- 
monly called a “unitary” system. In France, for 
example, the authority of départements, arrondisse- 
ments, and communes—the units of subgovern- 
ment—is clearly subject to determination by the 
central government. This does not prevent occa- 
sional conflicts: communist-dominated communes 
from time to time invoked the wrath of the min- 
ister of the interior by banning hydrogen bombs 
from their territory, passing resolutions against 
the Algerian war, or renaming streets after com- 
munist heroes, but the results contributed more 
in the way of publicity than jurisprudence. The 
offending local council, in such instances, is sim- 
ply dissolved by order of the central government. 

At the other end of the spectrum we find the 
confederation, which is characterized by the fact 
that the powers of the central government are 
subject to definition by the peripheral units. A 
classic instance of this type of government was 
the United States under the Articles of Confedera- 
tion; a modern example is Switzerland, where re- 
sidual power still—in theory if not always in prac- 
tice—rests with the component cantons. 
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Between these two polar positions can be found 
a huge variety of intermediate forms of which the 
most significant for our purpose is American fed- 
eralism. For almost two centuries political theorists 
have been attempting to define “federalism,” and 
have had uniformly disastrous results because in 
analytical terms federalism is an institutionalized 
technique of question begging. It is, to put the 
matter differently, an effort to exorcise sovereignty 
from the political sector. Since definition is there- 
fore understandably elusive, we shall have to settle 
for a description drawn from K. C. Wheare, which 
suggests that federalism is a system in which two 
levels of government operate within the same geo- 
graphical limits and neither has the power to de- 
stroy the other. [See FEDERALISM. ] 

However, before turning to a historical exami- 
nation of the distribution of powers between the 
American national government and the states, it 
would be advantageous to define the other major 
dimension of the distribution of power: the rela- 
tionship of various agencies of government to one 
another—or, in the American context, the issue of 
the separation of powers within the national gov- 
ernment. Again, the problem is one that has been 
canvassed since the time of Aristotle (Politics 
1298a40), although the form that the “separa- 
tion of powers” took among political theorists 
through the eighteenth century was almost totally 
irrelevant to the American innovation. From Poly- 
bius and Cicero through Montesquieu, the “separa- 
tion of powers” involved the allocation of gov- 
ernmental functions to orders of the realm or to 
estates, with the ideal result of a mixed govern- 
ment or “polity.” Thus, Thomas Aquinas (Summa 
theologica Quaestio cv), discussing the “right 
ordering of power in a principality,” argued that 
the best form of constitution incorporated a “judi- 
cious admixture of the kingdom . . . of aristocracy 

. and of democracy.” 

This was the essence of medieval constitution- 
alism—the conciliar movement, indeed, attempted 
with ephemeral success symbolized in the Frequens 
decree, issued in 1417 by the Council of Constance, 
to apply this formula to the government of the 
church—and it provided the foundation for Mon- 
tesquieu’s famous chapter in The Spirit of the 
Laws (1748): “On the Constitution of England.” 
To state the point differently, the tripartite division 
of the United States constitution was not intended 
to balance the authority of the “one, the few, and 
the many”; it was a functional differentiation be- 
tween governmental organs. John Locke’s notion 
of the “separation of powers” was founded on 
another premise, but one equally irrelevant to 
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American constitutional law. It is true that Locke 
identified a threefold division of powers (“The 
Second Treatise of Government” 1690), but he 
never asserted the equivalence of his three branch- 
es: his “executive” was a thoroughly dependent 
figure (§152) and, so far as domestic jurisdiction 
was concerned, was little more than an agent of 
the “supreme” legislative power (§149). And his 
third division, the “federative,” was indeed given 
wide authority and prerogative—but only for the 
purpose of conducting foreign relations (§147). 
The judiciary was attached to the executive. 

What then are the origins of the American doc- 
trine of the “separation of powers”? What did this 
doctrine involve in practice? By what means and 
to what degree has it been incorporated in Ameri- 
can constitutional law? These are the issues that 
will concern us in the last section of this analysis. 
First, however, we shall examine the division of 
powers in the American federal system, that is, 
the relationship between the national government 
and the states. 


The division of powers 


The American federal system has been widely 
acclaimed as a great, even original, contribution 
to political science, and it has provided the model 
for many subsequent governments. Federalism has 
always appealed to spokesmen for vested minority 
interests—whether religious, ethnic, or economic— 
who are seeking institutional guarantees against 
the power of a majority (Riker 1964). Nowhere 
has the connection between the integrity of the 
peripheral jurisdiction, i.e., the states, and the 
privileges and property of a national minority— 
slaveholders—been expressed more cogently than 
in John C. Calhoun’s A Disquisition on Government 
(1851). Today we can discern variations on the 
Calhoun theme in the rhetoric of the Cypriot Turks, 
the French Canadians, the Ceylonese Tamils, the 
Fijis (outnumbered on their ancestral islands by 
Indians), the Nigerian Yoruba—to name only a 
few instances. 

The theory that underpins this adulation is that 
federalism provides a viable framework for the 
reconciliation of majority power and minority 
rights. In terms of the distribution of power, the 
model rests on two assumptions: first, the central 
government has a plenary jurisdiction over certain 
sectors of policy making; second, certain other 
sectors of decision making are reserved in equally 
plenary fashion to the peripheral units (states, 
provinces, ethnic or religious communities). Obvi- 
ously, such a system requires a written constitu- 
tion, and for our purposes it is irrelevant whether 


residual power remains with the peripheral units 
(for example, U.S. constitution, article x of the 
bill of rights) or the center (for example, Canada 
under the provisions of the British North America 
Act of 1867). 

In principle, then, an impenetrable wall is built 
between the jurisdiction of the center and the juris- 
diction of the peripheral units. The nation as a 
whole exercises the necessary sovereignty to hold 
its place in the international community, while 
within the society the rights of minorities are pro- 
tected from “nationalization.” The character of the 
minorities may vary widely; in the United States— 
as in Australia—geographical prescription served 
as the justification for the establishment of the 
peripheral units: the previously self-governing 
units were integrated as the operating components 
of federalism. In Canada and the Union of South 
Africa, geography mixed with ethnic considerations 
(the French in Quebec, the Afrikaners in the Trans- 
vaal and Orange River Colony) supplied the ra- 
tionale, 

It has been suggested (McLaughlin 1932) that 
when the framers of the United States constitution 
met in Philadelphia in 1787, they found the theo- 
retical and practical arguments for federalism 
overwhelming, largely as an outgrowth of their 
colonial experience. In their struggle with the Brit- 
ish over the status of the colonies, American 
spokesmen developed a wide-ranging theory of 
American rights. However, it is hard to assert that 
the colonial experience provided a formula for the 
new federal experiment; on the contrary, it sup- 
plied full support for precisely the form of associa- 
tion that the Constitutional Convention destroyed: 
the Articles of Confederation. The essence of the 
colonial position vis-à-vis the British was that the 
Americans had the right to define the jurisdiction 
of king and parliament, i.e., the peripheral units 
defined the authority of the center, the very prin- 
ciple that later was incorporated into the Articles 
of Confederation. Curiously, then, when the fram- 
ers set to work to limit the authority of the states 
(and the caprice of “Rogues Island”), the argu- 
ments they utilized were drawn from the British, 
not the American, arsenal. They were in search of 
sovereignty in the classic Hobbesian sense, and, 
with few exceptions, took a dim theoretical view 
of states’ rights. 

This may seem to be an eccentric interpretation, 
flying as it does in the face of conventional wis- 
dom. However, it has the great merit of being sol- 
idly grounded on the proceedings in the Consti- 
tutional Convention (Roche [1952-1963] 1964, 
pp. 91-126), rather than on the a posteriori theo- 
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retical patina supplied by The Federalist. The 
views of “Publius” on the high merits of the 
constitution were analogous to a man’s eulogies 
of a wife he had secretly been forced to marry: 
both James Madison and Alexander Hamilton had 
supported the establishment of a unitary system 
at the convention, but debates were secret and no 
significant record was published until after Madi- 
son's death in 1836. What had occurred at the 
convention was that a strong unitary system—the 
“Randolph Plan,” prepared by Madison—which put 
the states completely at the mercy of the nation, 
was modified into what later became known as 
federalism. The latter was an improvisation that 
only subsequently was promoted into a political 
theory. In practical terms, it was a compromise 
between the unitary convictions of the delegates 
and their sense of political reality; the leading op- 
ponents of the Randolph Plan endorsed its goals 
but argued that it would be an act of political 
suicide to return to their constituent states with 
such a radical innovation. Thus the scope of na- 
tional power over the peripheral units was cut 
down, the sectors of national jurisdiction stipu- 
lated, and the states given equal representation in 
the Senate. In the course of ratification, propo- 
nents of the new government were forced to 
promise a bill of rights as a further safeguard to 
the jurisdiction of the states. 

When the constitution went into operation and 
the newly elected officials assembled in New York, 
there was nothing particularly lucid about feder- 
alism, The constitution was a model of studied 
ambiguity at certain key points, reflecting its 
authors’ pragmatic notion that the best way to 
deal with sticky problems was to turn them over 
to the future unclarified. The institutions of the 
new republic were provided for, but even here a 
conflict in the convention had resulted in a seman- 
tic compromise: Congress was given the power to 
establish an independent federal judiciary (below 
the Supreme Court, which was explicitly created) 
but left with discretion to utilize inferior state 
judicial machinery for national purposes if it so 
chose, Yet the distribution of power within the 
national government was a model of clarity when 
compared with the relationship between the latter 
and the states. 

An early dictionary, which Sir Ernest Barker 
was fond of quoting, defined a cello as a big violin 
and a violin as a small cello. The relationship 
between national and state powers under the con- 
stitution was a similar exercise in logical circu- 
larity. In the body of that document there was no 
discussion whatever of the distribution of power 
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between the center and the peripheral units; state 
objections led to the addition of the ninth and 
tenth amendments in 1791, but these did little 
more than restate the central ambiguity. Article 1x 
announced that “the enumeration in the Constitu- 
tion, of certain rights, shall not be construed to 
deny or disparage others retained by the people.” 
Article x provided that “the powers not delegated 
to the United States by the Constitution, nor pro- 
hibited by it to the States, are reserved to the 
States respectively, or to the people.” 

In other words, the ninth and tenth amend- 
ments, like the constitution itself, ignored the cru- 
cial institutional question: How, and by whom, were 
jurisdictional conflicts between states and nation 
to be resolved? What was to be the truth-finding 
mechanism in this open-ended situation? The im- 
mediate answer is the United States Supreme 
Court, and although there is no direct evidence 
that the framers intended the Court to exercise 
this supervision over the distribution of powers 
(there was no recorded discussion of the problem 
in the Constitutional Convention), circumstantial 
evidence supports this contention. The Supreme 
Court's power to review both acts of Congress and 
state legislation was probably taken for granted; 
indeed, the notion of a written constitution con- 
taining limitations on states and nation carried 
with it a need for machinery of enforcement, and 
the Supreme Court seems to have been assigned 
this task sub silentio. 

The constitution was hardly ten years old before 
the question of states’ rights shook the new repub- 
lic to its foundations. In a desperate effort to hold 
their slipping popularity, the Federalists, marshaled 
by Alexander Hamilton, attempted in 1798-1799 
to cripple the Jeffersonians by national legislation, 
primarily in the Sedition Act of 1798, which was 
designed to muzzle the Jeffersonian press. The 
Jeffersonians lost the battle in Congress and then 
appealed to the courts to declare the Sedition Act 
unconstitutional as an intrusion on states’ rights. 
The litigation never reached the Supreme Court, 
but at the circuit level the act was sustained as a 
legitimate exercise of congressional power. At this 
point, Jefferson and Madison realized that the rules 
had to be changed, that it was improper for an 
agency of the general government to have the right 
to assess the national jurisdiction, that, in Locke’s 
phrase, “no one should be a judge in his own 
cause.” The result was the Virginia and Kentucky 
resolutions of 1798-1799, the former prepared by 
Madison, the latter secretly drafted by Vice-presi- 
dent Jefferson, which attempted to reopen the 
whole issue of distribution of powers. These reso- 
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lutions have served ever since as the intellectual 
fountainhead for states’ rights formulations. 
Jefferson put the question concisely: 


. .. the several States composing the United States of 
America, are not united on the principle of unlimited 
submission to their general government; but that by 
compact . . . they constituted a general government for 
special purposes, delegated to that government certain 
definite powers, reserving each State to itself, the re- 
siduary mass of right to their own self-government; 
and that whensoeyer the general government assumes 
undelegated powers, its acts are unauthoritative, void, 
and of no force: . . . That the government created by 
this compact was not made the exclusive or final judge 
of the extent of the powers delegated to itself; . . . but 
that as in all other cases of compact among parties 
having no common Judge, each party has an equal 
right to judge for itself, as well of infractions as of the 
mode and measure of redress. (Kentucky resolutions, 
Nov, 16, 1798) 


Thus began the states’ rights tradition. In char- 
acteristic American fashion, it was an ad hoc re- 
sponse to political peril by the losing faction in 
national politics; Jefferson and Madison took a 
very dim view indeed of the logic of the Virginia 
and Kentucky resolutions when the latter were 
employed against the embargo of 1807 by Feder- 
alist state governments in New England (Levy 
1963, pp. 93-141). Moreover, in a paradoxical 
fashion, Jefferson and Madison were engaged in 
precisely the behavior they were criticizing: their 
concept of “compact” made the states the final 
judges of the extent of the powers of the general 
government, i.e., made the states judges in “their 
own cause.” Each side engaged in petitio principii 
by silently incorporating their conclusions in their 
premises: on the one hand, the centralist view was 
founded on the assumption that the national gov- 
ernment was founded by “the people”; on the other, 
the states’ rights position rested on the notion that 
the states as integral units of sovereignty had made 
a “compact.” And history was ransacked by all 
hands to “prove” the validity of the respective cases 
—with what a historian must report as inconclusive 
results. 

But the problems of Jefferson and Madison fade 
into insignificance when compared with the defi- 
nitional crisis that precipitated the Civil War. This 
is not the place to examine the constitutional tur- 
moil over slavery except to note that the crucial 
question was not juridical, but moral, and was thus 
insoluble by legal mechanisms. If one took for 
granted that a Negro slave was “property,” then the 
injunction of the fifth amendment that no person 
“shall be deprived of life, liberty, or property, with- 


out due process of law” stood as a massive barrier 
against national regulation of the South’s “peculiar 
institution.” If, in contrast, one assumed that a 
Negro slave was a “person,” then slavery consti- 
tuted an unconstitutional limitation on his life and 
liberty. Pace Chief Justice Taney’s efforts in Dred 
Scott v. Sanford (60 U.S. 393, 1857), one does not 
settle this order of problem in a court of law. 

The Union victory in the Civil War gave the 
compact theory its coup de grâce, and in State of 
Texas v. White (74 U.S. 700, 1869), the Supreme 
Court gave ideological approval to the work of Lin- 
coln’s armies. In constitutional terms, however, 
states’ rights lived on and was from time to time 
extracted from the museum of antiquities as a de- 
vice for limiting federal legislation deemed “radical” 
by a majority of the Supreme Court (see Corwin 
1936; 1938; 1941). It is reasonable to argue that 
in the 1960s states’ rights constituted no meaning- 
ful legal barrier to the exercise of national au- 
thority, and that the federal government operating 
in a national economy could reach any objective 
it wished through the exercise of the commerce 
power, the taxing power, or—as in the Civil Rights 
Act of 1964—a combination of the commerce pow- 
er and the long-ignored jurisdiction provided by 
the fourteenth amendment. 

Yet, curiously, to say this is not to assert that 
the United States has achieved a unitary form of 
government. Indeed, in net terms the substantive 
power of the states today is greater than it has 
ever been. What has occurred is a remarkable de 
facto distribution of power between states and na- 
tion (sometimes called “cooperative federalism”), 
in which the state governments exercise enormous 
autonomy. For many areas of public policy, for 
example, education, public welfare, urban renewal, 
road construction, federal funds are extensively 
provided to supplement local resources. In these 
areas the states are, in effect, “sovereign,” pro- 
vided they do not push their autonomy too hard or 
attempt to assert a theoretical basis for their juris- 
diction. However, if state officials try to assert final 
power through gerrymanders, discriminatory prac- 
tices in race relations, taxation of interstate com- 
merce, the Supreme Court tends to reaffirm the 
principle of national supremacy. Moreover, since 
the states are the effective units of political party 
organization they often provide political obstacles 
to centralization that are far more effective than 
their constitutional position might indicate. 

In short, the United States in a highly non- 
theoretical fashion has developed a relationship 
between the center and the peripheral units built 
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around the operating principles of de jure national 
supremacy and de facto local autonomy. Most of 
the time this relationship works extremely well on 
the basis of a political consensus not to clarify 
issues of sovereignty but, rather, to get on with the 
business of governance. 


The separation of powers 


The doctrine of the separation of powers has oc- 
cupied a distinguished position in the history of 
American constitutionalism. In its classic formula- 
tion the doctrine had two operating principles. The 
first was set forth by John Adams in article XXX 
of the Massachusetts constitution of 1780: “In the 
government of this commonwealth, the legislative 
department shall never exercise the executive and 
judicial powers, or either of them: the executive 
shall never exercise the legislative and judicial 
powers, or either of them: the judicial shall never 
exercise the legislative and executive powers, Or 
either of them: to the end it may be a government 
of laws and not of men.” The second, drawn from 
a maxim attributed to Bracton, announced that 
delegata potestas non potest delegari, that dele- 
gated powers could not be further delegated, for 
example, that authority delegated to Congress by 
article 1 of the constitution could not be passed on 
by the legislature to the president. 

The difficulty with this doctrine, however, is that 
from the outset it was honored more in the breach 
than in the observance. The constitution of Massa- 
chusetts, which contained Adams’ absolute inter- 
dict, also gave the governor a veto over legislation 
that could be overridden only by a two-thirds vote 
of the General Court. In other words, the governor 
of Massachusetts exercised immense “legislative” 
power. Elsewhere in the same document there was 
a provision that a number of “executive” officers 
(by our definition) would be “chosen annually by 
joint ballot of the senators and representatives in 
one room.” All judicial appointments had to be ap- 
proved by the Council, which was a curious hybrid 
of executive, legislative, and judicial powers chosen 
by joint ballot of the two houses. In sum, the 
Massachusetts paradigm of the separation of pow- 
ers on close examination is a shambles of inter- 
locking powers and overlapping jurisdictions. 

Yet, if we examine Adams’ handiwork rather 
than his rhetoric, we discover a principle of sepa- 
ration founded on the proposition that the execu- 
tive, legislative, and judicial institutions should 
have autonomous constituencies. The governor was 
to be elected annually by the people as a whole; 
the legislature by geographical subdivisions of the 
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commonwealth; and the judiciary was to hold 
tenure for life on good behavior. Adams was not, in 
other words, interested in separating powers in 
terms of their substantive content (executive, ju- 
dicial, legislative) but divided authority among 
discrete agencies (governor, General Court, judici- 
ary, Council), each of which exercised judicial, 
executive, and legislative jurisdictions. Conflict 
among these quasi-autonomous power centers, he 
believed, would prevent the growth of any central- 
ized autocracy. The separation of powers, then, 
rested on political physics, not metaphysics. 

The original Madison draft of the constitution, 
which served as the basis for discussion and modi- 
fication at the convention, contained no trace of 
the separation of powers. All power was vested in 
a supreme legislature; the upper house was picked 
by the lower house; the president and judges were 
selected by both houses. Gradually over the sum- 
mer of 1787 this centralist model was weakened, 
over the militant objections of Madison, Hamilton, 
and James Wilson, and in an ad hoc fashion the 
“separation of powers” (like “federalism” ) emerged 
from the compromises. Later, in the 47th Federalist, 
Madison observed that “the accumulation of all 
powers legislative, executive and judiciary in the 
same hands, . . . may justly be pronounced the 
very definition of tyranny” (Hamilton et al. [1788] 
1961, p. 324). Later in the same essay he adopted 
John Adams’ operational model, noting that the 
separation of powers did not mean that the “depart- 
ments ought to have no partial agency in, or no 
controul over the acts of each other. [Montesquieu’s] 
meaning, . . . can amount to no more than this, 
that where the whole power of one department is 
exercised by the same hands which possess the 
whole power of another department, the funda- 
mental principles of a free constitution, are 
subverted” (ibid., pp. 325-326). 

Historically speaking, the doctrine of the sepa- 
ration of powers entered American constitutional- 
ism as a fairly simple and matter-of-fact division 
of the agencies of the national government rather 
than as a subtle effort to differentiate powers in 
terms of their innate characteristics. The president 
exercised judicial power (pardons) and legislative 
power (the veto); the legislature undertook execu- 
tive functions (approval of appointments, treaties) 
and judicial functions (amnesties); and the courts 
to a lesser degree dabbled in executive and legis- 
lative activities. Moreover, there is no empirical 
evidence to suggest that the maxim about dele- 
gated power not being subject to further delegation 
had any standing: the statutes of the first 20 years 
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of the republic are full of laws delegating national 
authority to state institutions; state courts, for in- 
stance, were given the power to naturalize aliens 
and to handle a number of offenses against the 
laws of the United States. 

There were a number of disputes in the early 
nineteenth century that touched marginally on the 
doctrine of the separation of powers. President 
Jackson, for example, justified his employment of 
the veto on policy grounds (his predecessors had 
felt that the veto should be reserved for legislation 
the chief executive believed to be unconstitutional ) 
by asserting the full autonomy of the presidency. 
And, of course, a similar concept of institutional 
independence underlay Chief Justice Marshall's 
exercise of judicial review over acts of Congress. 
But a careful search of the precedents suggests 
that the separation of powers, although it occupied 
an honored position in the polemical literature, was 
not articulated as a serious principle of constitu- 
tional interpretation until the 1850s, when Justice 
Benjamin Curtis invoked it in two leading decisions. 

Denying the practice of 60 years, Curtis asserted 
in Cooley v. Board of Wardens of Port of Phila- 
delphia (53 U.S. 299, 1851) that “if the States 
were divested of the power to legislate on [inter- 
state and foreign commerce] by the grant of the 
commercial power to Congress. . . Congress can- 
not regrant, or in any manner reconvey to the 
States that power.” Suddenly delegata potestas non 
potest delegari attained constitutional status—and 
by ipse dixit: Curtis did not linger to justify his 
pronouncement. Nor did the justice stipulate the 
precise constitutional foundation for his views. 
Further reflection presumably indicated to him that 
a violation of the separation of powers fell afoul of 
the due-process clause of the fifth amendment; at 
least this was the thrust of his presentation in 
Murray's Lessee et al. v. Hoboken Land and Im- 
provement Co. (59 U.S. 272, 1855). Ignoring 
Adams’ model, Curtis attempted to define the sepa- 
ration of powers in terms of substance; embarking 
on an elaborate gloss on the nature of judicial au- 
thority, he rejected the time-honored operational 
definition of judicial power as that power exercised 
by judges. 

It would be pointless to catalogue the subsequent 
instances where the separation of powers turned 
up as a ratio decidendi in Supreme Court litiga- 
tion. What is important is to note that this doc- 
trine became a major theoretical roadblock to the 
development of administrative law, regulatory com- 
missions, and delegations of emergency authority 
to the president. The nub of the argument was that 
Congress could not delegate “legislative” or “ju- 
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dicial” power to administrative officers (whether 
located in the executive branch or in the so-called 
independent regulatory commissions )—a conten- 
tion which, if accepted by the Court, would have 
destroyed the whole regulatory apparatus that de- 
veloped in the wake of the industrial revolution 
and the growth of a national economy. An agency 
such as the Interstate Commerce Commission, for 
example, clearly fuses executive, legislative, and 
judicial power: it administers, it makes rules, and 
it engages in enforcement. 

The Supreme Court, however, was seldom pre- 
pared to fortify this high ground. Instead, the 
justices devised various escape mechanisms from 
the logical chains of separation doctrine: on one 
hand, they found that the powers exercised by ad- 
ministrative bodies were not legislative and ju- 
dicial, but “quasi-legislative” and “quasi-judicial” 
(see Rathbun v. United States, 295 U.S. 602, 
1935); on the other, they held that when Congress 
delegated authority to various administrative in- 
struments, this did not constitute delegation of 
“legislative power” if meticulous standards of ad- 
ministration were specified (see J. W. Hampton 
Co. v. U.S., 276 U.S. 394, 1928)—it was merely 
an implementation of legislative power (Cushman 
1941). Indeed, in 1934 the leading authority on 
the question, Edward S. Corwin, declared the doc- 
trine of the separation of powers moribund (1934, 
p. 145). 

Ironically, in 1935 the corpse arose from its bed 
and provided the Supreme Court with the basis 
for striking down the Oil Code promulgated under 
the National Industrial Recovery Act (Panama Re- 
fining Co. v. Ryan, 293 U.S. 539, 1934) and for 
several other subsequent judicial forays against the 
evils of the New Deal. But later decisions by the 
“Roosevelt Court,” notably that reached in U.S. v. 
Rock Royal Co-operative, Inc. (307 U.S. 533, 
1939), seem to have put this tumultuous spirit 
finally to rest. The closest the Supreme Court came 
to a separation-of-powers ruling in the years since 
1941 was in Sweezy v. New Hampshire (354 U.S. 
234, 1957), when Chief Justice Warren, while 
denying the applicability of the separation of pow- 
ers, seemed to suggest that New Hampshire had 
violated due process of law by delegating to its 
attorney general plenary authority to act as a 
subcommittee of the state legislature. 

To conclude, the separation of powers is thus 
back where it began in 1789 as an abstract affirma- 
tion of the independent constitutional existence of 
the president, the Congress, and the Supreme 
Court. And the notion that delegated power can 
not be further delegated has been demolished by 
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constitutional logic drawn from John Marshall: 
that congressional power where it exists is plenary, 
and that plenary jurisdiction includes the power to 
give power away. Limits on delegation do exist, but 
they are political, not constitutional, in character 
(Roche [1952-1963] 1964, pp. 127-161). 


Jonn P. ROCHE 


[Directly related are the entries FEDERALISM; PRESI- 
DENTIAL GOVERNMENT. Other relevant material may 
be found in CENTRALIZATION AND DECENTRALIZA- 
TION; DELEGATION OF POWERS.] 
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mt 
CIVIL LIBERTIES 


“Civil liberties,” as commonly used, is not a tech- 
nical, precise term but a loose one denoting the 
personal rights and freedoms that are—or ought to 
be—respected by government. The phrase is not 
quite so broad as “liberty.” It does not apply to the 
freedom of Robinson Crusoe on his island, where 
there was no government; it does not embrace 
those areas of private option where the law can 
play no part, as when a man freely chooses to be 
a fool, or a gentleman, or a knave; nor can the 
term be used very meaningfully in connection with 
such a concept as the “right of revolution,” which 
is by nature a nonlegal privilege. But in its broad- 
est usage the term is applicable to all those many 
claims of right that involve an actual or potential 
legal nexus between the individual and govern- 
ment. However, lately, especially in the United 
States, there has been some tendency to single out 
“civil rights” (the protection of minorities) as a 
separate category and to use “civil liberties” to de- 
scribe all other claims of personal right. The dis- 
tinction is not entirely stable, but it has the merit 
of subdividing an almost impossibly multifarious 
subject, and in the discussion that follows “civil 
liberties” is used in this somewhat narrower sense. 

Even within these defined limitations the subject 
is very extensive, and further problems of defini- 
tion and classification remain. It is evident that 
civil liberty can be thought of either negatively, as 
the individual’s right not to have something done 
to him, or positively, as his right to have something 
done for him—for example, as the right against 
state interference with the publication of a polit- 
ical pamphlet or as the right to be provided with 
the facilities for publishing it. The negative cate- 
gory is the traditional one, and it will command 
the lion’s share of attention in this article. But the 
truism that underlies the idea of positive liberty 
should not be overlooked: the freedom to read is 
meaningless if no books are available. Negative 
liberties can themselves be further subdivided into 
rights against interference by government and 
rights against interference by private individuals 
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or groups. Again the former category is the more 
traditional, but again the less orthodox view 
merits a word by way of emphasis. In order to be 
really free to speak, a street-corner orator may 
need not only the assurance that the police will 
leave him alone but also the assurance that they 
will protect him from the angry reactions of his 
audience, that the state will “hinder hindrances” 
to his freedom (see Table 1). 

Rights against government are sometimes still 
further divided into three types: political rights 
(those bearing on the political process, such as the 
right to vote or to engage in political controversy), 
economic rights (such as entrepreneurial freedom 
or the right to practice a profession), and private 
rights (which is a catch-all term meant to cover 
all rights that are neither political nor economic). 
Obviously these are imperfect categories, since 
they are not always mutually exclusive (is freedom 
of artistic expression a private right, or can it be 
said to bear indirectly on the political process? ), 
but they do represent distinctions that have been 
drawn in practice and in the literature, and they 
have, therefore, a loose pragmatic value. 

Finally, each of these three classes of rights 
against government subsumes two kinds of rights, 
the “substantive” and the “procedural.” The sub- 


stantive civil liberties are those regarded, in some 
degree, as ends in themselves; procedural rights 
are those having to do with the way in which gov- 
ernment must proceed in dealing with substantive 
liberties. 


Rise of the modern concept 


Some concept of civil liberty can be traced far 
back into European history. Athenians of the age 
of Pericles gloried in their freedom of discussion 
and in their right to participate in public decisions. 
In the Middle Ages the feudal order rested heavily 
on the idea of legal rights that even the greatest 
lord was bound to respect, always in theory and 
often in fact. The barons who wrung the Magna 
Charta from King John were not, as they saw it, 
claiming new privileges but demanding that tra- 
ditional immunities be reconfirmed. Nevertheless, 
the modern concept, although related to these 
older forms, is different enough to be regarded as 
a new species. In the Greek city-state freedom was 
a matter not so much of private right as of public 
good: the individual's liberty was instrumental to 
and defined by the welfare of the city. And in both 
the Greek and the medieval understanding men 
held their rights by virtue of their status rather 
than by virtue of their manhood. The freedom of 


Table 1 — A schematic description of civil liberties 


LIBERTY 


Civil liberties 


POSITIVE RIGHTS 


NEGATIVE RIGHTS 


Against private persons 


Against government and groups 
l 
POLITICAL RIGHTS ECONOMIC RIGHTS PRIVATE RIGHTS 
Substantive Procedural® Substantive Procedural Substantive Procedural 
Adequate income Voting Possession and life Protection from: 
Educational facilities Political candidacy use of property Physical liberty Bodily harm 
Health services Political discussion Occupation Religious belief Economic coercion 
Housing Assembly Buying and selling and practice libel 
Organization Contracting Artistic expression Discrimination 
Petition Inquiry Involuntary 
Teaching servitude 
Privacy 


a. The enumeration of specific rights and liberties in this table is meant to be suggestive, rather than exhaustive. 


b, The same procedural rights apply generally to the three categories of substantive rights: security against arbitrary administrative action (e.g., 
detention, coerced confession, unreasonable search and seizure, confiscation of Property), security against unfair trial procedure: 
hearing, denial of counsel, compulsory self-incrimination, double jeopardy, cruel and unusual Punishment), and security against 


ex post facto laws, and bills of attainder. 


illegal 
inadequate e and 
statutory prohibitions, 
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the Greek citizen did not extend to noncitizens, 
and the feudal vassal enjoyed the rights associated 
with his rank and order and was obliged, of course, 
to perform reciprocal duties. The dominant modern 
idea acknowledges a realm of personal value that 
is a good in itself, quite apart from the welfare of 
the community, and with some exceptions it as- 
sumes that rights accrue to men because they are 
men and not because of their status or their 
performance. 

This idea, indeed, also finds some antecedents 
in premodern times; it smacks of the viewpoint 
that penetrated Roman law by way of the Stoics 
and Cicero. But in modern history the idea was 
given a new and irresistible thrust by the Reforma- 
tion, which fostered a belief in individualism, and 
by the rise of capitalism, which provided that be- 
lief with the indispensable support of the nascent 
bourgeoisie. At various times since the seventeenth 
century, historical forces have emphasized one or 
another of the classes of civil liberties described in 
the opening paragraphs above. At first the main 
thrust was for freedom from the private power of 
the nobles and others who monopolized trade and 
restrained its development; this antifeudal move- 
ment supported a strengthened monarchy and a 
unified nation-state. Then, as the stronger monarch 
himself became a threat, the partisans of freedom 
sought to curb the state authority he embodied, 
first by demanding that his agents, judicial and 
administrative, respect certain procedural limita- 
tions when enforcing his will, then by claiming po- 
litical rights that challenged his monopoly of state 
powers. The latter movement gathered force in the 
seventeenth and eighteenth centuries and reached 
a climax in the American and French revolutions. 
Its full implications have not been realized even 
to the present day, but under its impetus govern- 
mental power in western Europe and in North 
America tended gradually to devolve from the 
monarch to an oligarchy and finally to the ma- 
jority of the populace. As this development pro- 
ceeded it became apparent that political rights, 
although vital, were not enough: the majority, like 
the monarch, might use state power for good or ill. 
One result of this realization was a re-emphasis 
of substantive rights, a re-emphasis that began in 
the nineteenth century with an increased demand 
for economic freedom and shifted in the twentieth 
century to a new concern for other rights against 
government. This shift was accelerated by the 
spectacle of the “totalitarian” systems that arose in 
Europe after World War 1. Another result was the 
development of the concept of positive liberty— 
“freedom for” rather than “freedom from”—which 
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in some modern systems is regarded as a supple- 
ment to, and in others as a substitute for, the 
negative view of civil liberty. 


Problems of conflicting liberties 


Although, as has been said, different concepts 
of civil liberty have been emphasized at different 
times in the Western world, in general the devel- 
opment has also been cumulative—that is, the 
shift to a new emphasis has seldom led to an ex- 
plicit repudiation of the old. The result is that most 
modern states at least profess to respect nearly the 
whole vast and sometimes bewildering congeries 
of civil liberties that has been described. Evidently 
this profession creates problems, not the least of 
which arises from the fact that these heterogene- 
ous liberties may conflict with one another. If gov- 
ernment undertakes to ensure positive rights—for 
example, the right to an adequately paid job—it 
must assume broad authority to control the eco- 
nomic order, and a government thus endowed with 
powers may use them to encroach not only on 
economic liberty but on liberty in general. In the 
communist world this dilemma has been more or 
less frankly resolved by the subordination of even 
such rights as free speech and fair trial to the 
cause of “strengthening the socialist system.” In 
the West peoples have tried, with varying success, 
both to enjoy the “welfare state” and to maintain 
private and political freedom, although the range 
of economic freedom from public control has cer- 
tainly contracted since the heyday of laissez-faire 
ideology in the nineteenth century. There is an old 
and still controversial question whether other 
rights can survive in a state where economic rights 
have been thus demoted. So far the evidence sug- 
gests a very tentative “yes.” In the United States, 
for example, the increase in government economic 
control since 1930 has been accompanied by a 
growing (although still inadequate) concern for 
other personal freedoms. But a different answer 
might have seemed justified in the 1930s, when 
the twin phenomena of economic depression and 
fascism dominated the world landscape, and his- 
tory has perhaps not yet provided enough data so 
that the book can be closed on this issue. 

However, even if the problem of economic liberty 
is defined away or bypassed, potential conflict be- 
tween other civil liberties remains a difficulty. The 
right to comment on public affairs is no doubt 
essential in any free society, yet if the comment 
concerns a pending court case, it may impair the 
right to fair trial, which is equally essential, The 
religious zealot must be allowed to urge his views, 
but he may in doing so encroach on the right of 
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others to be let alone, the “right to privacy.” Most 
discussion of such perplexities has so far been 
limited to ritualistic advocacy of “freedom” or 
“order,” categories of little analytic value in this 
context. There is need for more treatments that 
recognize and cope with both horns of the dilemma. 

Much the same thing can be said about the over- 
arching problem of drawing the line between per- 
sonal rights and public authority in concrete in- 
stances. Because they were struggling against an 
outright state policy of repression, the historic 
spokesmen of freedom usually employed the lan- 
guage of exhortation: their declarations took the 
form of general and often absolutist propositions. 
Opposition to them, on the other hand, was equally 
unqualified: liberty was “license,” which in turn 
was defined as anything the governors wished to 
proscribe, Such polarization was inevitable in a 
predominantly authoritarian system, and exhorta- 
tion will always have its uses. But in states whose 
governments are committed to recognizing a range 
of legitimate civil liberty (and this is true today in 
most states in the noncommunist world), these 
uses are limited. The problem in such polities is 
to develop a just and workable reconciliation of two 
acknowledged goods—personal rights and com- 
munity need, Hortatory, absolutist generalizations 
on either side can carry only part of the way to the 
problem's solution. Indeed, if depended on too 
much, they may hamper the solution by obscuring 
the fact that such a solution exists, that both the 
claims of the individual and the claims of au- 
thority must usually be weighed in the scales. 

It is arguable that certain personal rights ought 
to be absolutely immune from state transgression: 
the “right to believe” has sometimes been so re- 
garded; some procedural immunities, such as the 
right against forced confession, may fall in the 
same category. But all legal systems have recog- 
nized that at some point most freedoms may be 
restricted; the difficulty is where and how to set 
that point so as to admit valid community claims 
and yet ensure the maximum of personal liberty. 
The general principle is that government should 
inhibit freedom only as much as it has to in order 
to serve important community needs, and various 
attempts have been made to formulate a rule or a 
test that would implement this premise. The “clear 
and present danger” concept (that speech can be 
restricted only when it threatens an immediate and 
serious evil) represents one such attempt; the 
“balancing test” (that speech can be restricted 
when the state interest in suppression outweighs 
the private interest in freedom) represents an- 


other. But insofar as these “rules” are not merely 
tautological, they are shorthand phrases for a large 
number of alternatives that must be considered 
before a reasonable conclusion is reached—for ex- 
ample, the distinctions between “advocacy” and 
“incitement,” between “prior restraint” and “subse- 
quent punishment,” between “state convenience” and 
“state necessity.” It would be desirable to analyze 
these alternatives in terms of the issues posed by 
the conflict between various exertions of state 
power and various kinds of liberty (e.g., an out- 
right prohibition of free speech may raise ques- 
tions different from those raised by an ordinance 
that only regulates the time, place, and manner of 
such speech; freedom of religious utterance and 
freedom of political advocacy may stand on dif- 
ferent grounds). 

Although conceptual treatments of the kind dis- 
cussed above would be useful, they would leave un- 
answered other very important questions that 
should not be overlooked. Even if analysis can pro- 
vide formulas for the reasonable protection of 
civil liberties, there remain the issues of what con- 
ditions are most likely to secure that protection in 
practice and, more specifically, what constitutional 
arrangements are most useful in this regard. 


Conditions for protection of civil liberties 


The first question is really an aspect of a much 
broader one—what conditions make for a “demo- 
cratic” or “competitive” political system—and this 
is an issue too large and complex to be adequately 
treated here. However, a few contingent sugges- 
tions can be ventured. As might be expected, eco- 
nomic abundance in a given society seems to en- 
hance the likelihood that civil freedom will be 
tolerated: broadly speaking, the more highly de- 
veloped the country, the less the chance of arbi- 
trary, authoritarian government. Yet this correla- 
tion may not be entirely dependable: it has been 
argued that in the early stages of economic prog- 
ress a country may be more, rather than less, prone 
to generate the tensions that lead to authoritarian- 
ism. Education is another factor that appears to 
produce a national milieu favorable to civil liber- 
ties. Indeed, studies have found that even within 
nations the most educated tend to be the most 
favorable to “democratic values.” But it has been 
contended that the first steps toward widespread 
literacy may turn a nation in quite the opposite 
direction, and the case of Nazi Germany demon- 
strates that even a high educational level is not 
enough. Habit and custom also play a part: a well- 
established tradition of libertarianism tends to per- 
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petuate itself, It has been urged that “pluralism” 
with respect to economic and social interests con- 
duces to a free polity; certainly a nation sharply 
divided into two camps (e.g., the rich and the 
poor) is not likely to be a free one. Scholars have 
also from time to time argued that such factors as 
“national character” and “consensus on funda- 
mentals” are determinative, but empirical study so 
far has not fully confirmed these insights. Finally, 
it is worth noting that certain political character- 
istics may affect the matter. Heavy involvement in 
politics of the military or of religious groups is 
often unfavorable to civil freedom; a one-party 
state need not be tyrannical, but it is likely to be. 
With the proliferation of new nations in modern 
times, political scientists are finding examples of 
an almost endless variety of objective circum- 
stances, and further progress can be expected 
toward systematic understanding of the environ- 
mental prerequisites for civil liberty. 


Constitutional guarantees 


The emergence of new nations should shed 
brighter light on the constitutional arrangements 
for civil liberties. The past offers several prescrip- 
tions, but two stand out: the British system of 
parliamentary supremacy, in which civil liberties 
are preserved by the tradition of governmental 
self-restraint, and the American system, with a 
written bill of rights interpreted and enforced by 
judicial review. The difference is not so sharp as 
this bare description suggests—in practice Parlia- 
ment also feels committed to historic written docu- 
ments, such as the Act of Settlement, and custom 
plays a part in determining the actual constitu- 
tional structure in the United States—but a differ- 
ence it is nonetheless. In spite of Britain’s impres- 
sive success in maintaining civil liberties, the 
device of a written, nominally binding bill of rights 
has been adopted very widely by modern nations 
(Israel and some of the states of the British Com- 
monwealth are among the exceptions), and de- 
bates on this matter seem rather academic. But 
questions remain about the content and character 
of such a statement. What kinds of rights should 
be included? Should the declaration list “positive 
tights,” such as the right to employment, which 
was formally secured by Germany's Weimar con- 
stitution and by the Soviet constitution of 1936? 
Should it include guarantees of rights against pri- 
vate action, such as the right against “abuse of 
economic power,” which is specified in the consti- 
tution of modern West Germany? The difficulty is 
that such rights are not self-executing; they de- 
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pend on the willingness of the government to take 
the positive steps that are necessary to make them 
real—e.g., to stimulate the economy, or to adopt 
adequate regulatory measures. It has been argued 
that such guarantees are useful as statements of 
aspiration and as admonitions to the governors. 
Contrariwise, it has been suggested that being 
legally unenforceable, such merely moral prescrip- 
tions may cheapen the whole concept of rights in 
the minds of both governors and governed. 

Even if the bill of rights takes the traditional 
form of negative restrictions on government, should 
these restrictions be expressed in general terms 
(“due process of law,” “freedom of religion”) or 
should they be spelled out in detail? Should the 
rights be stated as absolutes (“Congress shall make 
no law . . . abridging the freedom of speech”) or in 
qualified terms (as in the Nigerian provision that 
rights may be limited in ways that are reasonably 
justified in a democratic society)? Should the dec- 
laration list all negative rights that seem important 
(the Weimar constitution protected motherhood 
and forbade public instruction that hurt the feel- 
ings of nonconformist pupils); if not, which rights 
do merit explicit statement? Common sense and 
past experience suggest tentative answers to such 
questions; further observation and future experi- 
ence should bring more enlightenment. 

Finally, supposing a nation’s cultural and po- 
litical environment makes civil liberty plausible 
and the nature and content of the bill of rights has 
been determined, how can those rights be best 
enforced? Probably the primary, and surely an in- 
dispensable, agency is an independent judiciary. 
It is hard to see how civil liberty can be more than 
an empty promise unless there are courts free to 
handle claims between persons and to check the 
arbitrary acts of administrative officials. Whether 
the judiciary should also be granted the power to 
enforce the bill of rights against the legislature 
itself—that is, whether it should exercise the power 
of judicial review—is an important question dis- 
cussed elsewhere [see JUDICIAL PROCESS, article on 
JUDICIAL REVIEW]. A few decades ago many 
scholars, even in the United States, were skeptical 
of judicial review, which they felt had been used 
by the Supreme Court to restrain economic reform 
rather than to protect the rights of man. Since then 
that court and, to a lesser extent, the judiciaries of 
some American states have done more to defend 
civil liberties than any other agency of American 
government, and the attitude of scholars has 
changed accordingly. During the same period there 
has been a tendency, although not an overwhelm- 
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ing one, for other countries (e.g., the German Fed- 
eral Republic, Italy, India) to adopt judicial review 
in some form, and as history proceeds there will 
be a growing body of evidence on the relation be- 
tween civil liberties and this once uniquely Ameri- 
can institution. 

ROBERT G. MCCLOSKEY 


[See also CIVIL DISOBEDIENCE; DEMOCRACY; EQUALITY; 
FREEDOM; HUMAN RIGHTS.] 
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Iv 
CIVIL RIGHTS 


Although the terms “civil rights” and “civil liber- 
ties” are often used interchangeably, when they are 
differentiated the latter generally denotes the rights 
of individuals, while the former refers to the con- 
stitutional and legal status and treatment of 
minority groups that are marked off from the ma- 
jority by race, religion, or national origin. The fol- 
lowing illustration will show the difference. In the 
United States since the end of slavery, there has 
never been any question about the Negro’s equal 
rights to freedom of religion or freedom of the 


press—basic civil liberties; at the same time, he 
could be treated as a member of his race, and not 
as an individual, with respect to the schools he 
could attend and the public facilities he could 
enjoy—basic civil rights. The distinction between 
the person as an individual and the person as a 
member of a group has its roots in history, mo- 
rality, and social psychology. Reinhold Niebuhr 
noted that 


It may be possible, though it is never easy, to establish 
just relations between individuals within a group purely 
by moral and rational suasion and accommodation. In 
inter-group relations this is practically an impossibility. 
The relations between groups must therefore always 
be predominantly political rather than ethical, that is, 
they will be determined by the proportion of power 
which each group possesses at least as much as by any 
rational and moral appraisal of the comparative needs 
and claims of each group. (1932, pp. xxii-xxiii) 


While individual and group rights are to be dif- 
ferentiated, it is probably true that there is no 
chance for the emergence of the latter if the 
former are denied, so that the struggle for civil 
liberties must first be won and the fundamental 
human rights vindicated and secured before minor- 
ity rights will be recognized. The struggle for civil 
rights cannot be conducted for those who are yet 
denied basic human rights. 

In the broad sweep of history—though no doubt 
there have been numerous exceptions—the rela- 
tions of a dominant majority toward a weak minor- 
ity group, or of the conqueror toward the defeated 
enemy, first took the form of total annihilation or 
of cannibalism; then the form of slavery or total 
subjection; then the milder yet still severe form of 
assignment to an inferior caste; then cooperation 
and equality. Yet in modern times all forms have 
coexisted: in Nazi Germany the Jewish people were 
exterminated, in India the Untouchables still suffer 
because of the caste system, and in the United 
States the Negroes are moving into full equality. 
Even in the ancient world, while Aristotle was 
teaching that non-Hellenic peoples were fit only for 
slavery, his former pupil, Alexander of Macedon, 
acted on the principle that Greeks and Persians, 
victors and vanquished, could associate on the 
basis of equality and fraternity. 

On one hand, one finds everywhere and at all 
times fear and hatred of the foreigner, the stranger, 
the man of different color or tongue or beliefs; on 
the other hand, there is evidence of an effort of 
the human consciousness to be aware of the uni- 
versal in all men, of a common bond and a com- 
mon destiny. “The universal in its true and inclu- 
sive sense is a thought,” Hegel said, “that it has 
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cost thousands of years to bring to human con- 
sciousness, and that received its full recognition 
only through the aid of Christianity. The Greeks 
knew neither God nor man in their true univer- 
sality” ({1817] 1841, p. 321). This is too sweeping 
a judgment, for Stoicism of the Hellenistic age 
taught the ideal of the cosmopolis, or world state, 
in which all men, Greek and barbarian, urban and 
rural, would enjoy equality; but it is true that the 
ideals of the Fatherhood of God and the brother- 
hood of men found their clearest expression in the 
Hebrew prophets and Christian teachings. In time, 
Stoic and Biblical ideals became fused in various 
schools of natural law that dominated political 
thought in the West down to the nineteenth cen- 
tury. Thus, denial of equality to minority groups 
could and was challenged in the name of religion 
or political theory or philosophy. In modern times 
the denial of equality to religious, racial, and eth- 
nic minorities also assumed ideological and even 
religious formulations in theories of racial inferi- 
ority and in various political ideologies; but with 
respect to all such formulations, Arnold Toynbee's 
judgment seems appropriate: “The present vogue 
of racialism in the West,” he wrote, “however, has 
really little to do with current scientific hypotheses. 
A prejudice so strong as this cannot be accounted 
for by a cause so rational” (1934). 

Until World War 1 one could hardly speak of a 
struggle for civil rights; one could see only a record 
of oppressions and persecutions of minority groups. 
But then came an awakening of nationalism, and, 
at least in Europe, oppressed nations became inde- 
pendent, and minority religious and ethnic groups 
within nations were guaranteed their collective 
rights by constitutions and international treaties. 
Although these guarantees generally proved inef- 
fective, they were significant for recognizing the 
moral claims of minorities. 

The claims became universal and reached a 
climax at the end of World War m. First the 
Indian subcontinent won freedom in 1947, then 
Indonesia, then in 1957 the Gold Coast (Ghana), 
in Africa. In 1945, when the United Nations was 
organized, the white race of Europe and North 
America dominated the world. The UN Charter was 
signed by fifty nations. By the 1960s the UN had 
over 100 members, more than thirty of them new 
African nations; and the white race had begun to 
recognize its own minority status. Almost through- 
out the world there was violence. However, where- 
as in former centuries what was obvious was the 
oppressive measures imposed upon minorities by 
the dominant group, now one saw minority groups 
resort to violence against what they considered 
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to be oppressive majorities: for example, Turks 
against Greeks in Cyprus; Kurds against Arabs in 
Iraq; Negroes against East Indians in British 
Guiana; Muslims against Hindus in India; and 
Hindus against Muslims in Pakistan. In some coun- 
tries, as in Belgium and India, there were strug- 
gles over language rights. No minority group was 
willing to remain in a position of relative inferi- 
ority and subjection. In some instances the strug- 
gle for civil rights was against a dominant minor- 
ity, as by Buddhists against Roman Catholics in 
South Vietnam or by the nine million African 
Negroes against the three million whites in the 
Republic of South Africa. Sometimes the struggle 
for civil rights, as on behalf of the three million 
Jews in the Soviet Union, was conducted by con- 
cerned persons living outside the area (the Jews 
of Israel and of the West). The pattern, when 
viewed ecumenically, is extremely complex. 

Apart from national emancipation after World 
War 1 and the winning of independence by the 
peoples of Asia and Africa after World War 11, the 
most dramatic developments in the struggle for 
civil rights have been in India, the United States, 
and in international organizations. These will be 
reviewed briefly. 

India. More than fifty million Indians were 
deemed to be Untouchables by birth and were pre- 
vented by custom and law from social and re- 
ligious contact with other Hindus. For decades 
Mohandas K. Gandhi had led a nonviolent struggle 
for their equality. When India became a member 
of the (British) Commonwealth of Nations and 
adopted a constitution in 1949, the Untouchables 
won official political, economic, and social equality. 
This equality affected their status as well as their 
competitive opportunity, thus striking at the roots 
of the caste idea of “outcastes.” In guaranteeing the 
right of equality, the constitution states that the 
state shall not discriminate against any citizen on 
account of caste; that no citizen shall, on the 
ground of caste, be subject to restriction in ac- 
cess to or use of shops, restaurants, public wells, 
and tanks; and that the practice of untouchability 
is forbidden. The constitution also guarantees the 
practice of any calling without restriction and pro- 
vides that the state shall promote “with special 
care” the educational and economic interests of 
“the weaker sections of the people” and shall pro- 
tect them “from social injustice and all forms of 
exploitation.” The law grants former Untouchables 
preferential treatment by the government in jobs 
and offices and reserves for them educational and 
professional opportunities. 

The struggle for equality has been helped not 
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only by law but also by certain material forces, 
notably expanding urbanism and industrialism and 
the growth of transport and communications. 

However, factors have developed that tend to 
perpetuate, and even to strengthen, the caste or- 
ganization of Indian society: all castes, high and 
low, have become corporate political bodies that 
court favor and power, so that elections are fought 
on caste lines, party tickets tend to be formed on 
the same bases, and recruitment and promotions 
reflect caste consciousness. Caste associations have 
grown in number and strength. At the National 
Integration Conference in 1961, President Radha- 
krishnan stated that although caste is ceasing to 
be a social evil, it has become a political and ad- 
ministrative evil. Caste loyalties are utilized for 
the purpose of winning elections or getting people 
into jobs, for exercising some kind of favoritism 
or nepotism. It is doubtful, however, that untouch- 
ability is no longer a “social evil,” for the vast ma- 
jority of the caste, illiterate and economically weak, 
submit silently to continued discriminations; and 
in a country where there is substantial unemploy- 
ment, it is natural that members of the most de- 
pressed classes should suffer continued discrimina- 
tion rather than run the risk of losing their jobs 
through retaliation and intensified prejudice. In 
any case, India demonstrates the difficulties in- 
volved in establishing civil rights in traditional 
societies. 

United States. The first civil rights challenge 
in the United States was presented by the Indians. 
As the white man pushed westward, he demanded 
more and more of the Indians’ land. Congress in 
1830 enacted the Indian Removal Act, which was 
based on the assumption that segregation would 
end the conflict between the races, as land would 
be provided for the Indians in western territory 
and the whites would occupy all of the eastern 
lands. Coupled with land hunger was the policy of 
putting Indian cultures and tribal organizations 
into the “melting pot,” where they would be de- 
stroyed, and of converting Indians from hunters 
and fishermen into farmers and cattlemen. Cul- 
tural, social, and economic assimilation would, at 
the same time, make available much of the “sur- 
plus” Indian lands, which consisted of 150 million 
acres in 1873. The policy was expressed in the 
General Allotment Act of 1887, called the Dawes 
Act, under which it was possible to individualize 
the Indian landholdings and to permit Indians to 
dispose of the land as they wished. Citizenship 
could be acquired by Indians as they left their 
tribes and chose to live among the civilized people. 
By 1933, two-thirds of the lands held by the Indians 
in 1887 had been lost. 


An act of 1924 gave citizenship to all Indians, 
but as late as 1956 there were states that denied 
them the franchise, Other discriminatory practices, 
some of them required by law or regulation, were 
prohibited by a 1934 statute. In the same year Con- 
gress enacted the Indian Reorganization Act, the 
most comprehensive law since 1887. Under its 
terms, Indian landholdings were increased and 
land conservation measures were introduced; the 
organization of Indian tribes was reinvigorated, 
and tribal customs and laws were given dignity and 
power; liberal credit policies were established to 
aid the tribes and individual Indians. The law en- 
couraged respect for Indian cultural life and insti- 
tutions, especially Indian arts and crafts (under 
an act of 1935). 

Government policy in the 1950s reversed the 
trends and reverted to pre-1934 positions; in 1961, 
however, there was again a fundamental change 
in course when the federal government apparently 
returned to the Indian policies of the Franklin D. 
Roosevelt and Truman administrations, Federal 
policy thus fluctuates between a desire not to recog- 
nize the Indians as in any way differentiated from 
the rest of the population, and thus not entitled to 
any special claims or rights, and a desire to afford 
them an opportunity to develop their own styles of 
life and to protect them against greed and preju- 
dice. Thus, there is a conflict between total assimi- 
lation and cultural pluralism. While the Indian 
encounters varying degrees of prejudice and dis- 
crimination in many parts of the country, his 
friends continue to debate his future between the 
poles of integration and disintegration of Indian 
life and culture. 

The Negro presented problems of a different 
complexity and order, for after emancipation he 
had to bear the memory, habits, and history of 
slavery; he had no indigenous culture and tribal 
life that he wanted to preserve; he could never 
claim that he once owned all the land. His skin 
was darker, he originally came from another con- 
tinent, and he could be counted in the millions; 
and there were no large reservations where he 
could make his home—he had to live in close 
proximity to those who had been his masters. 

At the end of the Civil War and after adoption 
of the thirteenth amendment, which outlawed 
slavery and peonage, Congress proceeded to ensure 
equality for the four million freedmen by providing 
for their “personal liberty, personal security,” and 
their “free enjoyment” of “immunities and rights.” 
“Civil rights” began with passage on April 9, 1866, 
of “An Act to Protect All Persons in the United 
States in Their Civil Rights, and Furnish the Means 
of Their Vindication.” Since the act’s constitution- 
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ality was doubted, two months later Congress 
passed the fourteenth amendment, which was rati- 
fied in 1868. Under its terms, Negroes became citi- 
zens of the United States and of the states wherein 
they resided; the amendment provided, too, that no 
state shall deprive any person of his life, liberty, or 
property without due process of law nor deny to 
any person the equal protection of the laws. The 
fifteenth amendment, ratified in 1870, provided 
that the right to vote shall not be denied or abridged 
by the United States or by any state on account of 
race, color, or previous condition of servitude. 

To implement these constitutional provisions, 
Congress, until the end of Reconstruction in 1877, 
enacted a series of civil rights acts. The most im- 
portant of them was the Civil Rights Act of 1875, 
which prohibited racial discrimination in inns, pub- 
lic conveyances, theaters, and other places of 
public amusement. From the debates on this act 
in Congress, it is clear that its purpose was to wipe 
out all the incidents and badges of slavery. In 
1883 the Supreme Court held the act unconstitu- 
tional on the ground that the statute prohibited 
racial discrimination by private individuals, while 
the fourteenth amendment authorized congres- 
sional action only to enforce prohibitions on state 
action. Then in 1896 the Supreme Court, in Plessy 
v. Ferguson, upheld as constitutional state laws 
that required racial segregation in public convey- 
ances, on the theory that “separate but equal” fa- 
cilities were not a denial of equality. This decision 
was interpreted as extending the cloak of consti- 
tutionality to racial segregation in schools, all pub- 
lic buildings and institutions, restaurants, theaters, 
and all other public accommodations, publicly or 
privately operated. Racial segregation was the pat- 
tern in 17 southern and border states and in the 
District of Columbia. 

The “separate but equal” rule was flagrantly 
flouted—separation was enforced, but equality was 
not provided. The schools and other facilities af- 
forded Negroes were patently, and often grossly, 
inferior; for example, for 1951 the expenditure 
per pupil in average daily attendance for nine 
segregated states was $136.73 for white and 
$74.67 for Negro; and the average salary per mem- 
ber of staff was similarly disproportionate. In 1938, 
in a case brought by a Negro who sought admis- 
sion to the law school of the state University of 
Missouri, and in 1950, in cases involving a Negro 
student of law at the University of Texas and a 
Negro graduate student in education at the Uni- 
versity of Oklahoma, the Supreme Court questioned 
the way the “separate but equal” rule was oper- 
ating and raised serious doubt whether this rule 
could be used at all in graduate and professional 
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studies without denying the Negroes’ constitutional 
right to equal protection under the fourteenth 
amendment. In the field of interstate transporta- 
tion, also, in cases decided in 1946 and 1950, the 
Supreme Court considerably weakened the consti- 
tutional underpinnings of racial segregation. The 
deathblow to de jure racial segregation came in 
1954, in Brown v. Board of Education of Topeka 
(347 U.S. 483), in which the Court unanimously 
upheld the contention of the National Association 
for the Advancement of Colored People (NAACP) 
that segregated public schools are not “equal” and 
cannot be made “equal.” In subsequent cases the 
Court outlawed segregation in state colleges and 
universities, transportation, parks, municipal golf 
courses, public beaches, and wherever the state 
participated in a property's maintenance. 

Compliance with the mandate to desegregate 
public facilities was resisted in the 17 affected 
states, especially with respect to public schools, 
and particularly in Mississippi and Alabama. Under 
the banner of “states’ rights,” all sorts of devices 
were resorted to by segregationists to nullify the 
orders of the federal courts; and the segregation 
forces attempted to paralyze the NAACP so that 
it might not continue its legal defense operations 
in the south. 

In 1955 and 1956, under the leadership of Mar- 
tin Luther King, Jr., a Negro minister, the Negroes 
of Montgomery, Alabama, engaged in a boycott of 
the segregated buses in that city. King was a fol- 
lower of Gandhi and Thoreau in his philosophy of 
nonviolent resistance to immoral or unconstitu- 
tional laws and customs. Three years later, under 
NAACP leadership, Negro high-school students in 
Oklahoma City, Oklahoma, launched a sit-in dem- 
onstration at lunch counters in chain stores to win 
nonsegregated service. At Greensboro, North Caro- 
lina, in 1960 several Negro college students staged 
a sit-in at a lunch counter, spontaneously and 
without organizational support. These instances 
served as a pattern for what soon became a wide- 
spread movement throughout the southern and the 
border states against segregation in privately 
owned places of public accommodation. In many 
cities the sit-in demonstrations were successful; 
but everywhere in the affected states and cities 
there was massive resistance, and white extremists 
often resorted to violence and even murder. The 
situation was aggravated by attempts made by 
Negro organizations to induce Negroes to register 
as voters in places where they were systematically 
kept off the suffrage rolls. 

When it became apparent that the situation had 
become intolerable, President John F. Kennedy 
submitted to Congress a comprehensive civil rights 
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bill to outlaw racial segregation in places of public 
accommodation, to eliminate discrimination in em- 
ployment, to assure free suffrage to the Negro, and 
to end discriminatory practices in housing. The 
bill was based on the assumption that the Supreme 
Court would overrule or circumvent its decision 
in the so-called Civil Rights Cases of 1883, just as 
it had overruled Plessy v. Ferguson (1896) in 
1954. Congress (following a filibuster in the Sen- 
ate) passed the bill on July 2, 1964, and it was 
immediately signed by President Lyndon B. John- 
son. Although Congress had passed civil rights acts 
in 1957 and 1960, the act of 1964 was the first 
really significant civil rights breakthrough in Con- 
gress since the end of Reconstruction in 1877. 

The new federal law was not as necessary in the 
north as in the south. In 1865 Massachusetts en- 
acted the first state law in the country banning 
racial discrimination in places of public accommo- 
dation. By 1900 there were similar statutes in 18 
states; by 1960 there were 24. In 1945 New York 
became the first state to enact a fair employment 
practice act; by 1960 there were similar laws in 
17 states. There was a network of related laws 
dealing with discrimination in housing and in edu- 
cation. The statutes barred discrimination on ac- 
count of religion or national origin as well as on 
account of race or color. Yet the combination of 
criminal, civil, and administrative remedies pro- 
vided by this complex of laws did not prevent the 
rise and spread of de facto racial segregation, 
especially in the northern urban centers, where 
Negro ghettos came into being and were enlarged 
by the Negro migrations from the south. In these 
ghettos the schools were almost exclusively Negro; 
the rate of unemployment was substantially higher 
than it was for white workers; the housing was 
substandard or constituted a slum area that was 
a breeding place for crime, delinquency, drug 
addiction—evils that in turn fostered unemploy- 
ment, apathy, frustration, Under these circum- 
stances, enactment of the Civil Rights Act of 1964 
did not bring to the Negroes of Harlem and other 
slum centers in the north promise of a new day, 
and in the summer of that year there were out- 
bursts of violence in the Negro ghettos in Man- 
hattan, Brooklyn, and Rochester, New York; Jersey 
City, New Jersey; and elsewhere. 

These outbreaks of anger against the evils of 
de facto segregation precipitated two reactions: on 
the one hand, a white “backlash” against civil 
rights; on the other hand, a more widespread recog- 
nition of the fact that the movement for civil rights 
must be intimately linked with policies of full em- 
ployment, greatly improved educational facilities 


and techniques for all economically underprivi- 
leged children, urban redevelopment that would 
provide better housing and neighborhoods for 
lower-income groups, and, in general, policies that 
would cope with the social, economic, and cultural 
aspects of the modern megalopolis. The Economic 
Opportunity Act of 1964 and the creation, in 1965, 
of the Department of Housing and Urban Develop- 
ment, headed by the first Negro in the cabinet, 
were in part responses to civil rights challenges. 
The Voting Rights Act of 1965 was a major step 
toward implementing the fifteenth amendment and 
promised greater political power for the large 
masses of Negroes in the southern states. 
International arena. The ideas of human dig- 
nity, equality, and fundamental human rights 
are deeply rooted. Numerous passages in the Old 
Testament command nations not to oppress the 
alien and the stranger and to protect the poor, the 
orphan, and the widow. At the same time, the 
pages of history record “man’s inhumanity to man” 
that has made “countless thousands mourn.” The 
struggle for equality has been aided by the revo- 
lution in travel and communication, and in tech- 
nology generally, so that men see themselves as 
being interdependent more than they had ever been 
before. For example, Negroes in the United States 
know that the African peoples have won their in- 
dependence and that their representatives have 
places of equal dignity and rights in international 
organizations and meetings; and this phenomenon 
has brought moral strength to U.S. Negroes, so 
that they demand, and struggle for, their rights as 
Americans and as members of the human family. 
Lord Acton, in an address in 1877, stated that 
the “most certain test by which we judge whether 
a country is really free is the amount of security 
enjoyed by minorities” ({1861—1910] 1948, p. 33). 
It was not until the twentieth century that official 
agencies for judgment came into being. Following 
World War 1, as we have noted, there was an at- 
tempt to protect some national minorities by trea- 
ties; the mandates system under the League of 
Nations should also be noted in this connection. 
But the international, programmatic promotion of 
minority protection and human rights in general 
was undertaken only after World War 11. Articles 1 
and 55 of the Charter of the United Nations were 
designed to promote and encourage respect for 
human rights and fundamental freedoms for all, 
without distinction as to race, sex, or religion. UN 
bodies that work toward this end are the Third 
Committee of the General Assembly, the Economic 
and Social Council, the Human Rights Commis- 
sion, the Commission on the Status of Women, 
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and the Sub-Commission on Prevention of Dis- 
crimination and Protection of Minorities. 

In 1948 the United Nations General Assembly 
adopted the Universal Declaration of Human Rights 
as a statement of principles to be implemented 
by subsequent conventions that would be binding 
on ratifying member states. In December 1965 the 
Assembly approved the convention on the elimina- 
tion of all forms of racial discrimination. An im- 
portant feature of the convention was the provision 
that allowed ratifying nations to bind themselves 
to permit individuals or groups to charge their 
governments with violations. 

Other international organizations that are con- 
cerned with these matters are UNESCO, the Inter- 
national Labour Organisation, the European Com- 
mission on Human Rights and the European Court 
of Human Rights, and the Inter-American Com- 
mission on Human Rights. Mention may also be 
made of the International Commission of Jurists 
and the International League for the Rights of 
Man. 

While the European Convention on Human 
Rights has established compulsory machinery, the 
Commission and the Court, for dealing with cases 
involving violations of human rights, and the In- 
ternational Labour Organisation has established 
effective machinery for investigation and reports 
respecting certain areas of human rights, other 
agencies normally rely on implementation through 
reporting systems. 

The chief utility of the agencies has been to ex- 
pose to general view the facts of human rights 
violations and to win support for the claim that 
there is a universal law of humanity, under which 
individuals and groups (religious, racial, cul- 
tural, and national) should somehow be protected 
against attempts to destroy (genocide), discrimi- 
nate against, or humiliate them, What Walter 
Bagehot called “government by discussion” (1872, 
chapter 5) has not come into existence in the inter- 
national field; yet the open and vigorous discus- 
sion of human rights in governmental and non- 
governmental agencies has done much to make 
meaningful the preamble to the UN Charter, which 
proclaims the determination of the nations “to re- 
affirm faith in fundamental human rights, in the 
dignity and worth of the human person, in the 
equal rights of men and women and of nations 
large and small. . . .” 

Thus, a denial of civil rights—whether it takes 
place in Birmingham, Alabama, or in Birmingham, 
England, or in the U.S.S.R., or in South Africa, or 
in Bechuanaland, or in Algeria—is no longer a 
parochial domestic question, but has become a 
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matter of universal concern, a concern that implies 
the belief (as stated by Pope John xxm in Pacem 
in terris 1963) that human society is founded on 
the principle that every human being is a person, 
that his nature is endowed with intelligence and 
free will, that he has rights that flow directly from 
his very nature, and that these rights are universal, 
inviolable, and are such that they cannot in any 
way be surrendered. 

MILTON R. KONVITZ 


[See also ASSIMILATION; HUMAN RIGHTS; MINORITIES; 
Race; RACE RELATIONS; SEGREGATION.] 
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CONSTITUTIONAL PSYCHOLOGY 
See under PSYCHOLOGY. 


CONSTITUTIONS AND 
CONSTITUTIONALISM 


Constitutionalism in its distinctive sense is a 
modern phenomenon which can be defined only by 
facing the complexities of defining a constitution. 
From Aristotle to the present day, many such 
definitions have been offered, and some of these 
will be discussed below in connection with the his- 
tory of constitutionalism. 

At the outset, however, the eighteenth-century 
and nineteenth-century concept of a constitution 
as a formal written document ought to be dis- 
carded. All such documents are subject to a steady 
evolution; and the living constitution, like all living 
law, is something transcending such formal enact- 
ment as well as preceding it. Furthermore, a con- 
stitution such as that of Britain or Israel is just as 
much “written” as the American or French consti- 
tution, that is to say, embodied in written docu- 
ments of all kinds even though not codified and 
assembled in a single document. Although out- 
moded, this documentarian, or code, concept of a 
constitution played a significant role in the heyday 
of constitution-making after the French Revolu- 
tion—as it does, in fact, even today in many of the 
emergent nations. 

In addition to this documentarian concept, one 
finds several broad philosophical and legal con- 
cepts of constitutions which have been important, 
even though they lack the distinctiveness of the 
modern Western conception. Aristotle in a sense 
set the stage for the equivocation which has char- 
acterized the basic term throughout its history. 
When speaking of the politeia, he employed the 
term to refer both to a distinctive political order, 
the so-called mixed constitution of the “polity,” and 
to political order or regime in general. In other 
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words, every regime, even a tyranny, was said to 
have its constitution or politeia. 

Related to this Aristotelian notion is the more 
modern idea of a constitution as the organization 
of a government, its offices, and the relation of the 
offices, A variant is the conception which considers 
the actual power relations the “living” constitution 
(McBain 1927; Sternberger 1956). This, too, is a 
term that might be (and has been) applied to an 
absolute monarchy or a totalitarian dictatorship as 
readily as to a political order such as the United 
States or Great Britain. 

Similar to these two conceptions, although dis- 
tinct from them, is the notion that the constitution 
is the “basic law” in the sense of incorporating the 
basic legal rules and conceptions of a given com- 
munity; it, too, would apply to an absolute mon- 
archy or dictatorship as readily as to any other 
regime. In contemporary juristic works, however, a 
constitution is more commonly defined as a de- 
cision concerning the organization of government 
(Kägi 1945; Schmitt 1928), as a legal system of 
integration (Smend 1928), or as the basic norm 
(Kelsen 1945). Kelsen would trace any constitu- 
tion to the one from which it is derived. “The docu- 
ment which embodies the first constitution is a 
real constitution, a binding norm, only on the con- 
dition that the basic norm is presupposed to be 
valid,” he wrote (1945, p. 115), after having pointed 
out that “the validity of this first constitution is the 
last presupposition, the final postulate, upon which 
the validity of all the norms of our legal order de- 
pends” (1945, p. 115). These and similar defini- 
tions clearly embody a genetic theory of law; they 
all derive from a positivist notion of law, accord- 
ing to which law has its origin in the power 
(usually seen as force) of a government. Still dif- 
ferent are those definitions, embodying morpho- 
logical theories, which describe a constitution in 
such terms as a system of divided powers (Lowen- 
stein 1957), as a mixture of monarchy, aristocracy, 
and democracy, and similar indications of the pat- 
tern or design of a government. Definitions of this 
kind are usually given in more concrete terms and 
refer to a particular pattern or design, making it at 
times a paradigm or norm for all. Not only in popu- 
lar parlance but also in advanced juristic thought 
do we find such statements. A sophisticated variant 
of morphological theorizing is represented by prop- 
ositions alleging that the constitution, usually a 
particular one, is merely a symbol or a myth. 

None of these generalized concepts of a consti- 
tution—whether philosophical, political, or legal 
(juristic)—are capable of providing the under- 
pinning for the distinctive notion of constitution- 


alism as a kind of political order which contrasts 
sharply with nonconstitutional systems, such as a 
totalitarian dictatorship. In order to develop such a 
concept, a constitution must be defined in a way 
that indicates the features which make it contrast 
with other kinds of political order. These features 
come into view when we ask: What is the political 
function of a constitution? If that question is 
asked, the constitution is seen as a process by 
which certain political objectives are realized. What 
are these objectives? 

The first and foremost objective is that of pro- 
tecting the individual member of the political com- 
munity against interference in his personal sphere 
of genuine autonomy. It is his self that each man 
presumably wishes to have safeguarded. The roots 
of this concern with the self are predominantly 
Judaic and Christian, although it must be recog- 
nized that self-concern is not completely lacking in 
Islam, Hinduism, Buddhism, and Confucianism. 
Such a self is first of all defined by a convictional 
core which is seen as “inviolable,” if the self is to 
be maintained in its uniqueness and independence. 
It is seen as possessing the right (or freedom) of 
religion. Beyond this core, the individual’s sphere 
is variable; constitutionalism has stressed different 
rights at different times, and the content of such 
rights has undergone significant changes. The 
seventeenth and eighteenth centuries liked to talk 
about such rights as “natural” and by that adjec- 
tive to suggest that they inhere in human nature 
and hence are unalterable. Nowadays the term 
“human” rights is preferred, because comparative 
historical observation has shown that these rights 
are subject to differentiation in time and place and 
that their real significance must be seen in terms 
of a minimum sufficient to protect the individual's 
convictional core. 

The basic objective of protecting the individual 
member of the political community is reinforced 
and institutionally safeguarded by the division of 
political power, both functionally and spatially. 
Such division may therefore be considered the sec- 
ond objective of a constitution. Typically, the 
“separation of powers” serves as the functional di- 
vision, while federalism serves as the spatial. Both 
require a constitution for their effective operation. 
They operate as restraints on governmental power. 
In this perspective, then, a constitutional govern- 
ment is one in which effective restraints divide po- 
litical power, or, to put it negatively, prevent the 
concentration of such power. Thus, constitution- 
alism is both the practice of politics according to 
“rules of the game,” which insure effective re- 
straints upon governmental and other political 
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action, and the theory—explanatory and justifica- 
tory—of this practice. 


Historical development 


Modern Western constitutionalism, with its em- 
phasis upon the individual's rights, is not the only 
form in which constitutionalism, defined as a sys- 
tem of restraints upon governmental action, has 
historically been practiced. Not to go too far afield, 
the historical discussion will be limited here to 
Greek, Roman, and medieval constitutionalism, be- 
fore turning to the history of English, French, and 
American constitutionalism. For Greek and Roman, 
as well as medieval, constitutional ideas have been 
so important in shaping modern constitutionalism 
that they greatly help us to understand it. At the 
same time, the failure to distinguish them clearly 
from modern constitutionalism has been the source 
of many confusions and misunderstandings. 

Greek constitutionalism (as well as Roman) was 
largely practice, rather than theory, although Aris- 
totle’s doctrine of the politeia in the specific sense 
of the model regime constituted a significant first 
theory. Before we turn to it, Plato may be said to 
have pointed the way by making nomos the cri- 
terion by which to distinguish good from bad re- 
gimes. For the nomos, while not oriented toward 
the individual, embodied the prevalent communal 
notions about what is right and just, and provided 
a standard that transcended the particular system 
of rule. Plato was convinced, however, that the 
observance of nomos could be insured only by con- 
centrating power in the hands of the wise. In his 
later years, especially in the Laws, he was inclined 
to concede that much of the nomos might be 
spelled out in nomoi that were observed by all. 
However, the mode of finding these nomoi—by 
means of a nomothetes, or legislator—as well as 
their ultimate enforcement through the Nocturnal 
Council, shows him to have retained his ultimate 
confidence in the wise man rather than in the safe- 
guarding constitution—as do both Confucianism 
and Hinduism [see PLato]. Aristotle, preoccupied 
with the general happiness, advanced further to- 
ward institutional safeguards. His notion of a 
mixed constitution which would be a mean be- 
tween monarchy, aristocracy, and democracy was 
philosophically related to his preference for mesotes, 
the middle road, the mean between extremes. His- 
torically, it constituted a rationalization of political 
practice in a number of Greek poleis, if we are to 
credit the few hints that survive [see ARISTOTLE]. 

Both the Aristotelian argument in favor of a 
mixed constitution and the corresponding practices 
in a polis, such as those which Solon sought to 
establish at Athens, rested upon a value preference 


very different from that of modern constitution- 
alism. The stress was upon stability and strength. 
Such was also the core objective of Roman con- 
stitutionalism, Slowly evolved over the centuries, 
the Roman constitution was a wonder of compli- 
cated and interrelated restraints. All the different 
offices, from that of the consuls down to those of 
the minor functionaries, were subject to carefully 
elaborated rules embodied in law supported by 
powerful religious beliefs. Polybius provided a cele- 
brated analysis of this constitutional order, as it 
presumably worked around 200 8.c.; and Cicero, 
in the Republic and the Laws, added further 
touches of insight and rationale. What Polybius 
marveled at, however, was not how it protected the 
individual but how it provided the strength which 
made Rome great by giving the Roman political 
community a measure of internal stability and pro- 
viding a balance of the different classes. It was this 
strength and stability which later inspired Machia- 
velli, Harrington, and Montesquieu. The problem 
which they and many others contemplated at 
length, and which each solved in terms of his own 
political convictions, was the problem of how this 
strong and stable system came to decline and 
eventually to be replaced by monarchical abso- 
lutism. John Dickinson added his own interpreta- 
tion in Death of a Republic (1963), making an 
analogy between the process and the modern rise 
of totalitarian dictatorship. Like Montesquieu and 
others, he interpreted Roman constitutionalism in 
the perspective of contemporary problems of con- 
stitutionalism. The problem is basically simpler; 
Roman constitutionalism provided strength and sta- 
bility for a city-state. It was unsuited to the larger 
territorial power which Rome became as the result 
of this strength and stability, Rome’s decline was 
inherent in its rise—a built-in dialectic often ob- 
served in nature. As Mcllwain has insisted, there 
can be no doubt that the theory of the Roman con- 
stitution was that “the people and the people alone 
are the source of all law” (1940, p. 48). This 
means, of course, that for an understanding of 
Roman constitutionalism a grasp of the nature of 
lex is vital. The distinction between private and 
public law is essential and is “a distinction that 
lies to this day behind the whole history of our 
legal safeguards of the rights of the individual 
against encroachment of government” (Mcllwain 
1940, p. 48). The Roman notion that law is the 
common solemn promise of the public became a 
vital ingredient of Western constitutionalism. With- 
out such a concept of law, constitutionalism’s po- 
litical function as a system of restraints is greatly 
weakened. 

Medieval constitutionalism built on the basis 


CONSTITUTIONS AND CONSTITUTIONALISM 321 


thus laid. It sprang from the medieval idea that all 
legitimate government is government according to 
law. But that law was held to be largely in exist- 
ence and merely in need of being made “public,” 
although the idea of legislation was never entirely 
lost. How could it be to men who read the Old 
Testament and the corpus juris, which are filled 
with evidence of legislation as a matter of his- 
torical fact? But all this law was already at hand, 
as was the customary law by which men lived in 
their particular national communities. Medieval 
constitutionalism arose, as did Greek and Roman 
constitutionalism, from the struggle of an aristoc- 
racy seeking to restrain a monarchical ruler who 
threatened to become a tyrant. In this struggle, con- 
stitutionalism became associated with the church, 
which in some places and at certain times even 
played a leading role, The share of the bishops in 
the fighting preceding the issuance of Magna 
Charta certainly was considerable. King John’s at- 
tempts to deal with this ecclesiastical opposition by 
enlisting the support of the pope miscarried; he 
misunderstood the position of the church. Vitally 
interested in the restraining of governments, and 
anxious to retain control over certain fields of law, 
such as family law, the church developed the doc- 
trine of natural law as it had come down from the 
Stoics, more especially Cicero, and had been in- 
corporated in the imperial code, the Corpus Juris 
Civilis. To determine whether particular laws were 
in keeping with the natural law—for only then 
could they be considered fully just laws—the 
church felt it ought to participate in the making 
of such laws as well as in the interpretation of 
established law and custom. In the Roman law, a 
constitutio was a law established by the emperor; 
in the medieval world, such collective bodies as the 
“king in parliament” were seen as the successors 
to the emperor. Legem constituere meant to estab- 
lish the law by formal enactment. Ecclesiastics 
ought to participate—and fairly generally did par- 
ticipate—in this process. For example, the Golden 
Bull, which regulated the election of the Holy 
Roman emperor, was a constitutio in this classical 
sense. The archbishops of Cologne, Mainz, and 
Trier participated and were made electors under 
this “constitutional” charter. For many medieval 
thinkers, jurists, and philosophers, no distinct con- 
stitutional problem existed apart from the general 
proposition that all government should be accord- 
ing to and under the law. Had not the great 
Aquinas treated of government just incidentally 
within the context of a discussion of law and 
justice as part of the Summa theologica? [See 
AQUINAS.] 

In England, Bracton is perhaps most representa- 


tive of this medieval stress on law and the legal 
restraints on government. But a more distinctive 
sense of the contrast between English and Con- 
tinental practice is found in John Fortescue, who 
made the distinction between a regimen regale and 
a regimen regale et politicum the keynote of his 
discussion of English government. Here the word 
politicum appeared as representative of the Aris- 
totelian politeia in its differentiating sense of a 
model government of mixed and restrained powers. 
As authority Fortescue cited Aquinas, thereby inci- 
dentally suggesting what has often since been 
overlooked or even denied, namely, that the great 
Scholastic was a constitutionalist. In the Summa 
he clearly states that a mixed government is the 
best (11, 1, 94, 4 and ny, 1, 105, 1); similar state- 
ments can be found elsewhere. This view is in 
accord with the later part of De regimine princi- 
pum, in which Ptolemy of Lucca elaborated the 
views of his master, albeit with some liberty. For 
both Aquinas and Fortescue, it was crucial that a 
government be subject to legal restraints; govern- 
ment was best when instituted by law. From here 
the road leads to English seventeenth-century con- 
stitutionalism, but before this development is 
traced, it is necessary to sketch the constitution- 
alism embodied in conciliarism. 

Conciliarism is, in a sense, the application of 
medieval constitutionalism to the church itself. The 
ecclesiastical insistence upon the need for subject- 
ing all authority to legal restraints was claimed to 
apply to the church, Effective participation of the 
lower ecclesiastical orders and even of the laity 
was demanded in the councils which were called 
upon to formulate the law. In this discussion, the 
constitutional aspect became increasingly explicit. 
From William of Ockham to Nicholas of Cusa, the 
idea of consent as a vital ingredient of law gained 
ground, and the question of how to organize the 
expression of such consent was faced. Church 
councils appeared in analogy to feudal representa- 
tive assemblies, such as the English Parliament, 
and their traditional participation in establishing 
the law was claimed to be applicable to the govern- 
ment of the church. 

Even though the conciliar movement failed, 
there can be little doubt that it spread some of the 
key ideas of constitutionalism. Thus reinforced, 
constitutionalism might have triumphed through- 
out Europe in a broader secular form, had it not 
been thwarted by the countervailing arguments 
arising from religious dissension and civil war. For 
against these divisive tendencies, the ineluctable 
demand arose for a concentration of power in the 
hands of a ruler—the famous doctrine of sover- 
eignty as first enunciated by Jean Bodin [see 
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Bopry]. Although this doctrine was perfected and 
radicalized by Thomas Hobbes, England's insularity 
made the demand seem less urgent [see HOBBES]. 
The constitutionalist position had in the meantime 
been maintained in spite of Tudor “absolutism” 
and was developed in the sixteenth century by Sir 
Thomas Smith and Richard Hooker. In his De 
republica Anglorum (1583), Smith stressed the 
representative function of the “king in parliament” 
and delineated in functional terms the emergent 
notion of a mixed government through a separa- 
tion of powers. Richard Hooker, in his celebrated 
Laws of Ecclesiastical Polity (1593-1597), devel- 
oped a careful elaboration of Aquinas’ philosophy 
of law and the need for general consent, if it is to 
hold. But the consensus in terms of which both 
Smith and Hooker wrote and argued gradually de- 
clined, and the more poignant issues of modern 
constitutionalism presented themselves in the 
course of the revolution and its aftermath which 
filled much of the seventeenth century. 

Probably the most significant and certainly the 
most lasting legal contribution to the moderniza- 
tion of medieval constitutionalism was made by 
Edward Coke. With all the skill of a great lawyer 
and an extraordinary capacity for historical learn- 
ing, combined With a striking lack of historical 
sense, he brought medieval precedent to bear upon 
the issues arising between the king and Parliament 
or, more realistically, between Puritans and An- 
glicans, between old wealth and new wealth, be- 
tween landed property and trading interests. Coke, 
more than any other man, made Magna Charta the 
battle cry of those who insisted on man’s rights 
[see Coke]. The Petition of Rights of 1628, while 
the first major official declaration of such rights, 
was still preoccupied with the rights of English- 
men, as prescriptively recognized since Magna 
Charta. As the revolutionary movement gained mo- 
mentum after the calling of the Long Parliament 
in 1640, the historic and legal guarantees were 
reinforced by the idea that these rights derive from 
the very essence of man’s nature, And while the 
Petition of Rights had been concerned with prop- 
erty rights, the right to a man’s freedom of con- 
science—the right, that is, of freely confessing 
one’s religious conviction—moved into the fore- 
ground. It was at the heart of Oliver Cromwell’s 
outlook and was given eloquent expression in John 
Milton's Areopagitica (1644). The so-called Agree- 
ment of the People proposed by Cromwell's more 
radical following was the first of a series of at- 
tempts toward effectively institutionalizing these 
rights through the protection of a constitutional 
system. In a number of epoch-making statements, 


Cromwell proclaimed the idea that in any consti- 
tution there is “somewhat fundamental” which 
ought not to be subject to change by Parliament. 
Since Parliament insisted on violating such re- 
straints upon its own exercise of power, Cromwell 
eventually had to rule arbitrarily, a dictateur 
malgré lui. 

Cromwell's desperate efforts were accompanied 
by two striking theoretical efforts, each reflecting, 
in a sense, one horn of his dilemma. Thomas 
Hobbes, the philosopher, rejecting outright the idea 
of constitutionalism, pleaded in his Leviathan 
(1651) for a radical concentration of powers in 
the hands of the sovereign. Opposing him, James 
Harrington, the political theorist, in his Oceana 
(1656) recognized that the hoary doctrine of a 
mixed constitution implied a separation of the 
powers of governing and that a “government of 
laws and not of men” can be achieved only if those 
governing are “constrained to shake off this or that 
inclination.” According to him, there are two ever- 
recurring orders, the “natural aristocracy” and the 
common people. They must concur in making laws, 
and together constitute the legislative power. A 
third power, the magistracy, must execute the laws. 
The balance between these three bodies is achieved 
in a constitution, and a commonwealth consists of 
“the senate proposing, the people resolving, and the 
magistracy executing.” It is evident that Harring- 
ton’s generalization was based upon Roman and 
English experience [see HARRINGTON]. 

Soon after Cromwell’s death, English sentiment 
swung back to its traditional constitution and in 
the course of the Restoration recaptured a measure 
of that consensus upon which it had rested. When 
James 1 threatened to disrupt this consensus, it 
powerfully reasserted itself in the so-called Glori- 
ous Revolution, a smoothly efficient coup d'état that 
replaced one king with another and reaffirmed the 
basic rights in a traditional declaration, the bill of 
rights, in 1689. John Locke was, of course, the 
theorist of these events, who skillfully summed up 
and generalized English constitutional thought. 
His Two Treatises of Government (1690), although 
they antedate the Glorious Revolution by nearly a 
decade, have long and rightly been taken to be a 
justification of this proceeding; for, especially in 
the second treatise, Locke plainly asserts a people's 
right to give itself its own constitution [see LOCKE]. 
This right, although first stated by John Milton, 
was part of a congeries of rights that Locke held 
to be natural and universal, and epitomized in the 
formula of the rights of life, liberty, and property. 
Property was, of course, dear to the rising bour- 
geoisie; but in Locke’s understanding, it still was 
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very broadly construed to mean virtually the entire 
personal sphere of what is a man’s own. It was the 
firm belief of Locke and succeeding generations 
that no government which failed to recognize these 
rights could possibly be considered legitimate, be- 
cause no one could be held to have surrendered 
what “he has no power to part with.” Hence, the 
“freedom” to choose a form of government really 
excluded the right to choose a nonconstitutional 
government, Constitutional government was a gov- 
ernment in which the crucial power to make laws 
was divided between king, Lords, and Commons, 
while the other two powers, the executive and fed- 
erative, distinguished by Locke from the legislative 
one, were attributed to the king along with his 
share of the legislative power. Only in the Act of 
Settlement in 1701 was the independence of the 
judiciary recognized, thus laying the basis for 
Montesquieu’s interpretation of the separation of 
powers in more strictly functional terms. 

In a celebrated chapter of The Spirit of Laws 
(1748) Montesquieu undertook to restate the doc- 
trine of restraints in more nearly systematic and 
“logical” terms than Locke’s tradition-derived view 
had offered. His formulation of the doctrine, dis- 
tinguishing the legislative, executive, and judicial 
functions and attributing each to a separate indi- 
vidual or group, achieved universal acclaim, was 
institutionalized in the American and French revo- 
lutions, and became the basis of nineteenth- 
century constitution making. These three functions 
still revolved around the idea of law: the law- 
making function was contrasted with the law- 
administering (executive) and the law-interpreting 
(judicial) functions [see MONTESQUIEU]. Although 
he called these three functions “powers,” Montes- 
quieu pointed out that the judicial power was “in 
a sense nil” (dans une façon nul)—that is, no 
power at all, By this curious phrase, Montesquieu 
did not, of course, wish to suggest that the judi- 
ciary had no function but, rather, that this func- 
tion depended for its implementation upon sanc- 
tions which ultimately required force. (It was 
precisely this “impotence” of the judicial power 
which recommended it to the American constitu- 
tion makers as the “guardian” of the constitution.) 
The French revolutionary movement, which far 
transcended the governmental and constitutional 
sphere, did not stop to consider such niceties. Bent 
upon achieving the millennium, the successive con- 
stitutions were increasingly inspired by Rousseau’s 
radicalism, which would “force men to be free” 
[see Rousseau]. Between the Declaration of the 
Rights of Man and Citizen, issued in 1789, and 
the dictatorship of Napoleon, the French ranged 


through all the phases of revolutionary violence; 
and the truly constitutional beginnings, inspired 
by Montesquieu and Mirabeau, soon yielded to a 
concentration of powers in support of a program 
of social transformation and renewal carried out 
with religious zeal. Even so, the French more fully 
grasped the key notion of a constituent power than 
had previously been the case. 

Very different and sharply contrasting was the 
evolution of constitutional thought in America. 
Starting from English precedent and utilizing the 
experience derived from colonial charters, the 
fathers of American constitutionalismi were any- 
thing but revolutionary in outlook. Washington, 
Adams, Jefferson, Hamilton, and Madison—to men- 
tion only the most illustrious names—were all men 
who believed in order as well as progress. To them, 
the position which independence had occasioned 
required orderly resolution without delay. The two 
successive constitutions which they helped fashion 
were both inspired by the ideas of Locke and Mon- 
tesquieu and of the entire constitutionalist tra- 
dition which they represented and embodied. But 
such inspiration as the American constitutionalists 
received was tempered by their knowledge that 
concrete and unprecedented problems were facing 
them. As a result, they discovered a number of 
highly significant institutional solutions which past 
constitutionalism had failed to resolve, notably 
federalism, judicial review of legislation, and the 
process of constitutional amendment. This achieve- 
ment was theoretically reinforced by its skillful 
defense in The Federalist, in which Hamilton, with 
the help of Madison and Jay, expounded the doc- 
trine of modern constitutionalism in such elabora- 
tion that it could become the basis of nineteenth- 
century constitution making. Along with the 
ideological stimulation of the French revolutions 
—for the great revolution of 1789 was followed by 
a series of coup d’état-like revolutions in 1833, 
1848, 1851, and 1871—the American Revolution 
seemed to prove that a community's political order 
may be rationally constituted and that an act of 
political decision making can organize the govern- 
ment and make it legitimate. 

In spite of lingering doubts which the notion of 
organic growth instilled in the minds of the more 
conservative elements, European nations under- 
took the task of constitution making. Belgium, the 
Netherlands, the Scandinavian kingdoms, the sev- 
eral German kingdoms, Switzerland, Spain, Aus- 
tria-Hungary, and Italy all fashioned constitutions 
in the image of those of Britain and the United 
States. Constitutionalism became the battle cry of 
all progressive forces; and broadly based popular 
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movements, such as that of German unification, 
were conceived in terms of making a constitution. 
The unsuccessful attempt, in 1848, to achieve such 
a constitutional order on a broadly representative 
and liberal basis was, to be sure, replaced by an 
authoritarian solution in the Reich of Bismarck’s 
creation; but even then a constitution crowned the 
newly won unity. 

Indeed, many monarchical rulers sought added 
legitimacy during the post-1848 period by “giving 
a constitution” to their people. Such royal constitu- 
tion making regarded the constitution as a grant 
from the “sovereign” and hence as an alternative 
to the democratic legitimacy of a popularly elected 
constituent assembly, While imperfect as a realiza- 
tion of constitution making, it was nonetheless a 
step in the direction of establishing restraint on 
government, through autolimitation. That it con- 
stituted progress may readily be surmised, if one 
considers the possibility of a totalitarian regime 
today believing itself to be bound by the “constitu- 
tion” it has established, instead of treating it merely 
as a facade. Monarchical constitutionalism was, in 
the sense of autolimitation, government according 
to law. As the democratic forces gained ascendancy 
in the course of the nineteenth century, such mo- 
narchical constitution making became outmoded. 
It lacked the legitimacy of a constitution based 
upon popular approval. In Switzerland and other 
countries, democratically based procedures, similar 
to those used in the United States, were generally 
adopted. 

Making constitutions of this democratic kind 
generally calls for a representative constituent as- 
sembly in which the constitution is debated and 
eventually adopted. The work of such an assembly 
may be reinforced by submitting the constitution 
to popular referendum, but such plebiscites are of 
doubtful value. Rejections have been few, the most 
striking recent instance being that of the first post- 
war constitution, submitted to the French elector- 
ate in April 1946. In the case of federal systems, 
there is also likely to be some procedure for secur- 
ing the assent of a majority of the member units, 
through either legislative action or referenda. As 
constitutional experience has accumulated, the role 
of “experts” has become more and more important. 
Indeed, preparatory commissions have often been 
established to draft a constitutional proposal, as 
was done in the case of Puerto Rico in 1952 and 
the several German Länder under American occu- 
pation in 1946. Experts, whether jurists or political 
scientists, can be most effectively employed at this 
formative stage of constitution making. The prob- 
lem confronting the modern constitution maker is 


that of fitting past experience with constitutional 
government to the particular circumstances of time 
and place. In the emergent nations, this task often 
involves complicated problems of cultural adapta- 
tion. But such adaptations apart, there is the more 
general problem of determining the components of 
a model constitution. Within a particular cultural 
context, such models have been laid out for munici- 
palities and states in the United States. Whether it 
is possible to formulate a broadly conceived com- 
mon denominator of universal validity is an open 
question, 


Contemporary problems 

It remains to delineate briefly some aspects of 
contemporary constitutionalism in Europe, the 
emergent nations, and the Soviet sphere. Since the 
second world war, constitutionalism in Europe has 
served the goal of giving expression to what have 
been called the “negative revolutions” in France, 
Italy, and Germany. By these revolutions a defunct 
and generally rejected totalitarian fascist past has 
been negated and replaced by a more or less con- 
ventional constitutional order. The constitutions 
of the Fourth Republic, of the Italian Republic, and 
of the Federal Republic of Germany closely re- 
semble the orders which existed prior to the seizure 
of power by Mussolini, Hitler, and the Pétain- 
Laval group. There were and are significant dif- 
ferences, of course: the Fourth Republic attempted 
the federalization of France’s colonial empire; 
Italy abolished the monarchy; and the Federal 
Republic is still only a torso, although it is stabiliz- 
ing its executive and moving toward a two-party 
system. Moreover, the Fourth Republic has yielded 
to the Fifth, which is characterized by a vigorous 
presidential system with little more than the trap- 
pings of parliamentarism remaining. At the same 
time, its colonial empire has all but vanished. Both 
changes together constitute a more radical and 
revolutionary transformation than has occurred in 
either Italy or Germany. The constitution under 
which they have occurred did not envisage them, 
even though it has permitted them. It has proved a 
feeble restraint upon de Gaulle’s determination to 
govern the country as he sees fit. While the Ital- 
ian and German constitutions have more nearly 
achieved the functional purpose of restraint, they, 
too, have been bent and twisted in various ways- 
Thus all three constitutions serve to illustrate the 
weakening of constitutionalism in Europe. This de- 
cline is not to be wondered at when one observes 
the lack of interest in and support for constitu- 
tionalism among the citizenry. 

Beyond the national borders, constitutionalism 
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has played a certain role in the broad movement 
for the unification of Europe. Within the European 
movement, there has been considerable discussion 
about the most suitable constitution, with federal- 
ism and parliamentarism as the key issues. Beyond 
the initial Council of Europe, the Community of 
the Six emerged. A draft constitution for this po- 
litical community was fashioned by a constituent 
assembly, the Assemblée Ad Hoc, in 1952/1953; 
quite a few other drafts have been put forward by 
organizations and individuals. A radical group of 
European federalists has pleaded for a popularly 
elected constituent assembly—so far without any 
significant result. In the meantime the unification 
has gone forward slowly within the context of cul- 
tural and economic life, sanctioned by interna- 
tional treaties and enforced by international insti- 
tutions. Even a European bill of rights has been 
agreed upon (within the broader and looser frame- 
work of the Council of Europe), and its enforce- 
ment machinery has been ratified by a number of 
states. The role of constitutionalism in all these 
developments has been limited. To some extent, 
the lingering conviction of its importance has ac- 
tually been a hindrance rather than a help to 
progress, because of its tendency to formalize and 
institutionalize before the underlying political and 
social realities justify such actions. 

Although constitutionalism is apparently weak- 
ening in its heartland, it has been a factor of con- 
siderable importance in the emergent nations. To 
most of them, the fashioning of a constitution for 
their political order has been significant as a sym- 
bol of their newly won freedom. Some of the con- 
stitutions are of extraordinary complexity and 
formal sophistication, notably that of India. Here 
the task of organizing a whole culture of conti- 
nental dimensions presented problems never before 
solved by Western constitutionalism. Working with 
European and American precedents, India had to 
add totally new provisions. It is, however, widely 
felt that the Indian constitution does not really ex- 
press political reality—a criticism which could, of 
course, also be applied to most other constitutional 
systems. Only those parts of politics which can be 
expressed in legal rules can be reflected in a con- 
stitution. Behind the formal organization, an in- 
formal one will always operate. It is an essential 
part of the living constitution, which could not 
function without it. Insight into this aspect of con- 
stitutionalism has often led and continues to lead 
to a cynicism which looks upon a constitution as 
merely a facade behind which the true reality of 
the political order is hidden. Such arguments 
usually overlook some of the most obvious counter- 


arguments. Terms of office, modes of election, ter- 
ritorial divisions, and many other provisions in 
modern constitutions are descriptive of at least part 
of the political reality. Clearly they do not exhaust 
that description and may not even mention cer- 
tain important political institutions—for example, 
parties. In many of the emergent nations constitu- 
tionalism cannot fulfill even this more modest func- 
tion, and does not restrain the government because 
it is not the expression of a firm belief in the im- 
portance of doing so. More especially, bills of rights 
remain empty paper declarations because the rul- 
ing party or clique readily identifies itself and its 
power with the public interest. This tendency is 
enhanced by the practice in totalitarian communist 
states. 

Within the Soviet sphere, and more particularly 
in the Soviet Union itself, the constitutions are 
largely facades. The purely formal character of such 
documents as the successive constitutions of the 
Soviet Union is revealed by the fact that they do 
not evolve. They remain what they are, on paper, 
until one day they are completely altered by the 
effective rulers of the dominant party. They em- 
body essentially what the regime wishes the world 
outside and its own people to believe about the po- 
litical order. They therefore invariably contain ex- 
tended bills of rights devoid of all enforcement 
machinery or possibility of implementation. The 
bill of rights is seen as a declaration of princi- 
ple, and its function was summed up in 1962 by 
the Soviet scholar A. I. Lepyoshkin as follows: 
“. , every constitution . . . is a result of changes 
in the balance of class forces; it expresses the will 
and interest of the classes in power, guarantees the 
principles of such social and state order as is ad- 
vantageous for and agreeable with the interests of 
these classes. . . . The Soviet constitution embodies 
the principles of socialist democracy, it is a genu- 
inely democratic constitution.” Surprisingly enough, 
Lepyoshkin did not hesitate to claim that the 
Soviet constitution “serves as the most important 
instrument of safeguarding the rights and interests 
of the Soviet citizens from any encroachment. . . a 
No details were furnished, however, as to how such 
a constitution actually safeguards these rights; it 
might conceivably be “the most” important instru- 
ment without being an important one, since no 
other instruments exist. 

The broad tradition of constitutionalism has in 
this century been projected onto the world plane. 
The Covenant of the League of Nations and the 
Charter of the United Nations are both embodi- 
ments of this international constitutionalism. Quite 
in keeping with the constitutionalist tradition, a 
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Universal Declaration of Human Rights was adopted 
after vigorous debate by the United Nations in De- 
cember 1948; but no enforcement machinery has 
been set up, except for the weak supervisory ma- 
chinery provided for dependent territories. Indeed, 
it is very doubtful that any such enforcement could 
at present be implemented. International constitu- 
tionalism is not a mere fagade; but the very fact 
of the participation of totalitarian regimes makes 
it inevitable that this constitutionalism partakes to 
some extent of the character of totalitarian consti- 
tutionalism. That such constitutionalism is imper- 
fect, that it does not restrain the governments 
operating under it to any significant degree, is 
obvious. That it may nevertheless become the basis 
for gradual implementation, and thus the starting 
point for the achievement of genuine constitution- 
alism, is the hope of many. Such hope may find 
some confirmation in the past history of constitu- 
tionalism, 

CARL J. FRIEDRICH 


[See also ConsTITUTIONAL Law; Democracy. Other 
relevant material may be found in Crisis GOV- 
ERNMENT; DELEGATION OF POWERS; ELECTIONS; 
FEDERALISM; MODERNIZATION; PARLIAMENTARY GOV- 
ERNMENT; PRESIDENTIAL GOVERNMENT; REPRESEN- 
TATION; and in the biographies of BAGEHOT; BEARD; 
Dicey; HARRINGTON.] 
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Modern Constitutions, Ox- 


CONSTRUCTION 
See Housine. 


CONSUMER CHOICE 
See UTILITY. 


CONSUMER CREDIT 
See INSTALLMENT CREDIT. 


CONSUMER SOVEREIGNTY 


“Consumer sovereignty” is one of those concepts 
that flourish and are widely influential long before 
they are explicitly recognized and named. (Their 
belated recognition is often concomitant with their 
decline.) Much of the substance of consumer sov- 


ereignty is implied in Adam Smith. The focus of 
subsequent classical economics on cost of produc- 
tion as the basic determinant of market decisions 
temporarily sidetracked this emphasis. It returned 
more strongly with the Austrian school of Wieser 
and Menger, and in the work of Jevons, Pareto, 
Marshall, Pigou, and Wicksell. In 1936 W. H. Hutt 
in Economists and the Public coined the term to 
refer to a common fundamental presupposition in 
all these works. 

Consumer sovereignty has been used in both a 
descriptive and a normative form. In the first form, 
the term simply means that all economic processes 
are ultimately focused toward satisfying the wants 
of the final consumer. Production, exchange, and 
distribution are all means; consumption is the end. 
Moreover, in a free market system, market perform- 
ance is in fact responsive to the specific wants of 
the consumers within the system. The question of 
how responsive leads to the normative form. As a 
normative principle, consumer sovereignty asserts 
that the performance of any economy ought to be 
evaluated in terms of how well it fulfills the wants 
of its consumers. Performance will be affected by 
the structural characteristics of the economy, by 
public policy, by behavior of participants that is 
not uniquely determined by structure and public 
policy, and by certain external circumstances. Our 
discussion will center on the normative form of the 
concept, since this will automatically illuminate 
both forms. 

Association with a free market economy. Con- 
sumer sovereignty has been frequently associated 
closely—but misleadingly—with a free market 
economy. Since it can help to isolate the boundaries 
of the principle, we shall examine the alleged as- 
sociation. It comprises the following four steps: 

(1) Knowledge of consumer wants. No one 
knows what a consumer wants as well as he does 
himself. Consequently, his wants will be best re- 
flected in his market demand for commodities. 

(2) Expression of wants, Consumer demand 
is best expressed in terms of actual choices made 
by consumers in the market—in terms of market 
transactions—since, being rational, consumers will 
adequately inform themselves of how best to realize 
their wants in the presence of given opportunities. 

(3) Responsiveness of production to wants. 
Free enterprise, directed by the profit motive and 
intense competition in all markets, brings about the 
best possible adaptation of resources to meet con- 
sumer market demands, given the available re- 
sources and state of technology. “Best” is defined 
as a set of outcomes such that, whichever one of 
these obtains, no feasible alteration can bring about 
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an alternative outcome in which any individual is 
made better off without at least one individual be- 
ing made worse off. 

(4) Laissez-faire. So long as the conditions 
for a free competitive market system obtain, a 
laissez-faire public policy will lead to a maximum 
degree of consumer sovereignty. 

The emphases in this version are on individual- 
ism, on free competitive markets, and on laissez- 
faire. “Wants” are the wants of individual con- 
sumers; individual consumers know their own 
wants; they are self-motivated to become informed 
about the real alternatives available to them; and 
competition both protects them against exploitation 
and guarantees appropriate responsiveness in the 
use of resources to meet their expressed wants. 
Thus, governmental interference is at best superflu- 
ous and, more likely, destructive of consumer sov- 
ereignty. Finally, the very criterion by which the 
appropriateness of market response is evaluated is 
one that refuses to sacrifice any one consumer's 
well-being to that of others. 

This version of the principle has sometimes been 
advanced as its true, or official, version. This is 
incorrect. The position just described is not in fact 
simply an interpretation of consumer sovereignty, 
It is a complex of at least three distinct norma- 
tive principles: consumer sovereignty, freedom of 
choice, and Pareto optimality. Freedom of choice 
asserts that every economic unit should be per- 
mitted to make and implement all decisions bearing 
upon its own welfare. Pareto optimality asserts that 
when comparing any pair of social outcomes, one 
state can be declared superior to the other state if, 
and only if, at least one individual is better off and 
no individual worse off in the first state than in 
the second state. 

Relation to freedom of choice. Comparing con- 
sumer sovereignty with freedom of choice, the first 
refers to the ends of economic activity and the sec- 
ond to the means by which these or other ends can 
be attained. Concretely, freedom of choice refers 
to the administrative procedure of allowing eco- 
nomic units to use their own property to make 
whatever voluntary trades they wish in the market 
(Lange 1938; Bergson 1948). While an adminis- 
trative procedure—a means—can become valued 
as an end in itself, this is not the same end as en- 
visaged under consumer sovereignty. Either of 
these principles can be supported without the other, 
We may have a system adhering to consumer 
sovereignty without freedom of choice where a cen- 
tral authority uses nonmarket means to discover 
what consumers’ commodity preferences are (for 
example, by questionnaires, votes, or psychological 
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projective tests) and then channels resources to 
meet them. Freedom of choice can be supported 
without consumer sovereignty where a central au- 
thority itself decides what the basic goals should 
be, independently of what it thinks consumers 
want, then employs resources to produce in accord- 
ance with its own goals, but allows the output to be 
distributed by means of market choices on the part 
of consumers, setting prices so that all markets 
clear. As another example of this last, consumers 
might be permitted to trade in free, perfectly re- 
sponsive markets, yet, in a sense to be examined 
below, might not really know what they truly want 
and as a result make deluded choices. 

The relationship can be more complicated, so 
that, despite appearances, one principle may be 
present without the other. In purchasing medical 
services from a physician, the patient seems to be 
sacrificing his sovereignty, since the doctor makes 
all decisions of importance about treatment. Yet, in 
effect, the patient delegates the doctor to decide 
for him: the goal of treatment is the patient's best 
interests. On the other hand, a television viewer 
may believe that he votes for programming because 
he is free to choose what to watch; yet the sponsor’s 
programming decisions are responses to viewers’ 
purchases of his product, not necessarily to their 
viewing decisions. It is only a precarious and varia- 
ble association between viewing programs and be- 
ing responsive to the sponsor's commercials—at 
least insofar as any such association is believed by 
the sponsor to exist—that preserves the viewer's 
illusion of influencing program content. His influ- 
ence is actually far less direct and decisive than he 
may believe. 

Notwithstanding these examples showing the 
logical independence of consumer sovereignty from 
freedom of choice, there is, of course, a strong 
empirical affinity between them. The individual 
consumer’s wants can, in a wide variety of situa- 
tions, be accurately reflected in his overt choices. 
Moreover, the alternative of indirect nonmarket 
inquiry is considerably more expensive for discov- 
ering wants that can be so reflected. Thus, as an 
empirical generalization, to satisfy freedom of 
choice in the context of responsive markets is also 
to satisfy consumer sovereignty to a first approx- 
imation; further, to want to satisfy consumer sov- 
ereignty is to be willing to see freedom of choice 
satisfied to a first approximation as well. In prac- 
tical circumstances these first approximations may 
actually involve considerable and important diver- 
gences from the respective principles. We shall 
consider this below. 


Relation to Pareto optimality. The relationship 
between consumer sovereignty and Pareto opti- 
mality is significant just because it is not as close 
as is sometimes thought. It is sometimes believed 
that the former implies the latter. This is incorrect. 
What does consumer sovereignty imply about the 
relative evaluation of alternative outcomes? The 
problem is that different individuals have different 
wants (expressed as implicit preferences). Any 
pair of outcomes is therefore likely to affect differ- 
ent individuals differently, so that in the preponder- 
ant number of cases some individuals will be better 
off with one alternative, while others are better off 
with the other. The principle of consumer sov- 
ereignty would seem to imply that an improvement 
in any individual's position, all others remaining 
the same, represents a net social improvement. It 
has, for example, been used this way in Arrow’s 
influential work in welfare economics (1951) and 
in the resultant literature. (For a bibliography, see 
Rothenberg 1961.) But it has nothing specific to 
say about aggregating some individuals’ gains 
against others’ losses. It neither specifies a particu- 
lar method of aggregation, nor precludes the pos- 
sibility of such specification. Thus, while it is not 
inferable from it, it is also not inconsistent with 
consumer sovereignty to judge that a social change 
that makes 100 million people significantly better 
off (“significantly,” in terms of their own well- 
being), while making one person slightly worse off 
(“slightly,” in terms of his well-being), represents 
social improvement. To do so formally would 
simply require supplementing consumer sovereign- 
ty with a particular normative criterion from which 
this assertion is inferable. 

On the other hand, Pareto optimality, while go- 
ing at least as far as consumer sovereignty, goes 
beyond this and forecloses the meaningfulness of 
any such aggregation, where contrary changes 
occur for different persons. It is inconsistent with 
Pareto optimality to hold that a social improvement 
occurred where ten million gained while one per- 
son lost. Thus, consumer sovereignty is a less com- 
plete criterion than Pareto optimality for judging 
aggregate social changes, On the other hand, it is 
also less restrictive because, needing to be supple- 
mented by additional normative criteria, it can be 
adjoined to any of a number of different criteria. 
Pareto optimality represents only one such possible 
combination—consistent with, but not exhaustive 
of, consumer sovereignty. It is neither the exclu- 
sive, nor even the most appropriate, combination, 
remembering our example of near, but not com- 
plete, unanimity improvement. The whole family 


of normative criteria with the property that non- 
contrary improvements to any individual register 
as social improvements qualify as potential supple- 
ments, regardless of how variously they formulate 
interpersonal comparisons of welfare change. 

The distinctiveness of the three principles signi- 
fies that consumer sovereignty, while not incon- 
sistent with, is not uniquely to be associated with, 
an individualistic, laissez-faire, free market orien- 
tation. A proper appreciation of its scope therefore 
requires deeper examination of the concept itself. 

The aggregation problem. Assuming that con- 
sumer wants are to be fulfilled, there are two major 
problems to be considered in giving this operational 
significance. The first we have already touched on. 
How does one evaluate the extent to which differ- 
ent situations fulfill wants, when “wants” refers to 
the typically heterogeneous collection of a whole 
population? Assuming scarcity to be a universal 
condition, an economy can meet consumer wants 
only by compromising between the consumers’ ends 
and the particular constraints imposed by the re- 
sources and state of technology available. Each pair 
of alternative compromises will typically have dis- 
parate effects on different members of the popu- 
lation. A complete criterion for comparing degree 
of fulfillment must make provision for aggregating 
gains and losses across individuals, Consumer sov- 
ereignty does not itself do this; it must be supple- 
mented by additional normative assumptions. Such 
supplementation is by no means easy. Indeed, the 
search for a highly consensual formulation has 
monopolized much of the attention of welfare eco- 
nomics in the past twenty years (Kaldor 1939; 
Hicks 1939; Bergson 1938; Samuelson 1947, chap- 
ter 8; Arrow 1951; Boulding 1952; Rothenberg 
1961). The upshot is that a supplemented con- 
sumer sovereignty criterion can take many forms. 
Its character will depend greatly on the specific 
supplementation. No particular supplementation 
has succeeded in commanding a consensus among 
economists. 


The identification problem 


The problem of aggregating wants assumes at 
best that each of the elements to be aggregated is 
the appropriate one. Such an assumption begs a 
critical question. Does consumer sovereignty con- 
tain clear guidelines for identifying the relevant 
individual ends to be fulfilled? 

A reasonable model of consumer wants may be 
as follows. Consumers have directional strivings 
known as wants. These are operationally reflected 
in preferences toward different commodities. These 
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preferences can be elicited by a variety of means, 
the most common being simply to observe actual 
market choices (under freedom of choice), The 
translation of wants into market choices involves 
an intermediate step of becoming informed about 
alternative trading opportunities. For our present 
purposes, the information involved concerns the 
prices and qualities of the respective commodities. 

Applied to a model like this, the spirit of con- 
sumer sovereignty begs for consideration of two 
major types of qualifications on elicited prefer- 
ences. One type concerns the concordance between 
choice and underlying wants—supportive qualifica- 
tions. The other concerns qualifications designed to 
“correct” attitudes lying below the level of choice, 
on grounds of a deeper contradiction among wants 
—corrective qualifications. There exists a mid- 
region where the two types are difficult, if not 
impossible, to distinguish. 

Supportive qualifications. Suppose individual 
consumers know ultimately what kinds of com- 
modities they want. Given the constraint of pur- 
chasing power, the factor that mediates between 
this knowledge and the satisfaction of their wants 
is the availability of information about the specific 
commodity opportunities open to them. In the naive 
form of the consumer sovereignty principle, which 
combines freedom of choice and is popularly re- 
flected in the motto Caveat emptor (“Let the buyer 
beware”), this information is largely integrated 
within the principle itself. Information is itself a 
scarce commodity, in that scarce resources (€.g., 
time and effort) must be expended to obtain it. 
Thus, it is not rational for a consumer to seek ex- 
haustive information about every commodity he 
consumes. He will, if left to his own devices, choose 
just that amount of information which the par- 
ticularity of each commodity and the cost of in- 
formation together warrant. More information will 
be sought for commodities where information 
makes a big difference than where it does not, 
more where its acquisition cost is low than where 
it is high. Thus, the consumer will allocate his 
limited budget on an optimal combination not only 
of commodities but of information as well. The 
buyer must be responsible for decisions about in- 
formation—that is, he must “beware”—just as he 
must be responsible for decisions about other 
commodities. 

This version of the principle depends for its per- 
suasiveness on low acquisition costs for “adequate” 
information, a high degree of market competitive- 
ness, and only slight damage to be suffered by con- 
sumers for making an erroneous choice. If the 
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consumer needs to know only very little, if com- 
petition in the market is so great that inferior 
brands will be quickly submerged by superior alter- 
natives, and if an occasional error by him on the 
way to becoming experienced does not do great 
harm, then each consumer can be counted on to 
adjust efficiently to a long-run equilibrium without 
requiring any outside aid or interference on his 
behalf. 

Unfortunately, the real world of choice for the 
consumer violates these conditions in important 
ways. The massive differentiation of products and 
the profuse flow of extremely complicated new 
commodities resulting from innovation have im- 
measurably increased the amount and subtlety 
(and thus the cost) of information needed by the 
consumer. Moreover, they have done so in com- 
modities that matter greatly to the consumer— 
foods, drugs, and durables. There are a number of 
important markets where consumer ignorance is 
substantial and persistent. One manifestation of 
such ignorance is the magnitude and persistence 
of fraud. In markets where frequent repeat sales 
are not important, competitive forces may actually 
engender depreciation, rather than appreciation, of 
quality, Finally, there are many instances where 
single transactions have substantial, even vital, 
impact on the consumer, either in terms of his 
wealth or health or perhaps even his life itself. 
Purchase of a house, of ownership in a corporation, 
of a dangerous surgical operation, are examples. 
In these cases, errors on the way to informedness 
may be disastrous. 

Under these circumstances, the individual con- 
sumer may either rationally or inadvertently re- 
main very poorly informed; thus, the risk that his 
choices will lead to very unfortunate surprises is 
substantial. One may seriously question whether 
in such circumstances the consumer's externally 
unencumbered choices are really accurately di- 
rected toward fulfilling his wants. 

The scope of consumer sovereignty therefore 
makes the degree of informedness a relevant quali- 
fication of free choice. It is an ambiguous qualifica- 
tion, however. How much informedness is enough? 
There is ample room for divergent views within the 
context of fulfillment of wants. Consideration of 
this qualification in practical affairs has had two 
major effects. First, it has rationalized direct gov- 
ernment intervention in the market to regulate, 
control, and forbid certain production and market- 
ing practices, Second, it has led the government to 
require specific levels of private information dis- 
closure and to disseminate information on its own. 
Both effects will be discussed below. 


In sum, actual market choices may reflect vary- 
ing degrees of uninformedness. Application of the 
criterion of consumer sovereignty requires evalu- 
ating such choices in terms of their appositeness 
to underlying wants. These judgments, and any 
public intervention in the market taken to increase 
informedness, are to be interpreted as qualifica- 
tions designed to improve the congruence between 
choices and the underlying wants that engender 
them: they support, rather than compete with, the 
satisfaction of these wants. 

Corrective qualifications. Another set of quali- 
fications that is integral to consumer sovereignty 
is aimed not at correcting means to consumer- 
recognized ends but at the ends themselves. There 
are two forms of such corrective qualifications: the 
first is intrapersonal, the second interpersonal. 

Intrapersonal, The basic argument is that some 
individuals do not know what they truly want. Con- 
sumers have a hierarchy, rather than simply a col- 
lection, of wants. Just as some commodities are 
improper means for achieving certain ends, so some 
more proximate ends are less important than, and 
are inefficiently addressed to attaining, more 
ultimate ends. 

Thus, children are deemed not really “to know 
what is good for them”—what they “really” want. 
An “improved” set of ends is externally substituted 
for their own perceived ends, whether in the form 
of actual interference with their choices or simply 
in how close an outsider judges their choices to 
reflect their “best interests.” Much the same pro- 
cedure is involved with respect to the psychotic 
and even the neurotic. In these latter categories it 
is not immaturity but internal conflict of goals or 
impaired introspection that makes a “corrective” 
qualification of free choice consistent with con- 
sumer sovereignty. 

The category can be extended, Drug addicts are 
deemed not to be able to act in their own best inter- 
ests, They may know in some sense that their over- 
riding want is inconsistent with the fundamental 
pattern of their system of ends, yet be unable to 
control themselves. The same type of conflict may 
well exist in less dramatic form in many normal 
persons, in whom no physical addiction is present. 
The discipline of psychotherapy is substantially 
grounded on the assumption that conscious and 
unconscious conflicts of ends are pervasive. Thus, 
the degree to which given choices fulfill the “real” 
wants or interests of the choosers is clearly a ques- 
tion of some profundity—a question open to a 
whole spectrum of interpretations consistent with 
the emphasis of consumer sovereignty. 

One final extremity of this category can be men- 


tioned. An influential position in behavioristic 
ethics holds that all humans are subject to im- 
portant uniformities in basic needs. Codes of social 
norms come into being as formulas by which indi- 
viduals can efficiently realize these needs in a 
social setting. Thus, behavior that violates these 
codes—“immoral” behavior—really represents “un- 
wise” behavior, behavior that is poorly aimed to 
meet the actor’s own abiding ends. While the source 
of the adverse judgment seems to come from out- 
side the individual and be at odds with him, the 
logic of the approach stresses that the ends, in 
terms of which the judgment is made, are the 
individual’s own. 

Thus, immaturity, internal conflict, and even 
“immorality” provide criteria that can be used in 
evaluating the accuracy of free choice. Each of 
these dimensions provides grounds for asserting 
that a given actor may not know what he truly 
wants, 

The set of intrapersonal corrective qualifications 
has an important similarity to the previous set of 
supportive qualifications. Both involve individual 
responses to new information. It is not only de- 
cisions about means that are affected by new in- 
formation; ends are affected as well. Accumulating 
experience with commodities influences specific 
and even general preferences: it influences wants. 
The three grounds for corrective criteria—imma- 
turity, internal conflict, and immorality—are all 
subject to the accumulating experience of the indi- 
vidual, the first almost by definition. Thus, the 
problem of “incorrect ends” is partly a problem of 
uninformedness, A distinction remains, however, 
in that uninformedness about means enables us to 
specify far more accurately both the relevant miss- 
ing information and the effect on choice of supply- 
ing it. We know far less about dependably inducing 
desired changes in values. 

The practical difficulty of distinguishing between 
the learning components of means and ends is sug- 
gested by a brief look at advertising. Existing os- 
tensibly to inform, advertising in fact attempts 
to persuade as well. The dividing line between the 
two is almost nonexistent. How, for example, is one 
to interpret the sheer repetition of advertising con- 
tent, or simply of brand name, except as persua- 
sion? The implications of advertising for consumer 
sovereignty are serious and perplexing. Do buyer 
responses to advertising represent a correction of 
means or of ends? In other words, when there is a 
divergence between the arrow and the target, does 
advertising have the pure information-feedback 
effect of getting the arrow to move or the per- 
suasive effect of getting the target to move? Insofar 
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as it is the latter, and the latter is a significant and 
pervasive impact, the whole force of the doctrine 
of consumer sovereignty (especially when joined 
to freedom of choice) diminishes appreciably. If 
wants are mercurial, trivial, easily manipulable 
from without, there ceases to be much justification 
in orienting the whole engine of production and 
distribution to their precise satisfaction. Resources 
might then more efficiently be used to produce 
what can be produced cheaply and then to per- 
suade consumers that this is exactly what they 
want. 

Interpersonal. An individual’s choices may be 
“wrong” from the point of view of consumer sov- 
ereignty, not because they reflect ignorance about 
means or confusion about ends, but because they 
have effects that diminish the possibility that other 
individuals will be able to satisfy their wants. The 
actions of one individual sometimes spill over to 
affect other individuals directly, not simply through 
influencing relative prices on the market. These are 
so-called external effects. External effects can be 
adverse or favorable, and both directions raise the 
possibility of pertinent qualifications on free choice. 
Actions with adverse reverberations need discour- 
aging; those with favorable reverberations deserve 
encouraging. Thus, from the point of view of con- 
sumer sovereignty—involving consideration of the 
wants of the entire population—if a certain indi- 
vidual can serve himself equally well by actions A, 
B, or C, but A has adverse effects on others, B is 
neutral toward others, and C enhances goal-fulfill- 
ment by others, then both A and B are wrong, and 
only C is the right choice. 

Most decisions have spillover effects, but they 
are generally minor. The issue of corrective quali- 
fication is involved only when the external com- 
ponent of the decision is major, as for example, 
the planting of ragweed, the failure to dispose of 
garbage, the failure to treat or isolate a contagious 
disease, the reckless use of an automobile, the use 
of a ladder to commit burglary, or the use of a gun 
to hunt the neighbor's children. Nontrivial favor- 
able spillovers are involved in the development of 
a beautiful garden, the immunization against con- 
tagious disease, or the shoveling of snow from the 
sidewalk in front of one’s house. Somewhat more 
complex cases concern drug addiction, education, 
and medical care. We have already suggested an 
intrapersonal qualification for drug addiction. In 
addition, it often leads to “antisocial” consequences: 
decreasing the productivity of the afflicted person 
and therefore of the total resource stock available 
to the population, increasing crime and “pushing,” 
adding to the burden of governmental welfare serv- 
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ices by requiring care for the person’s family. Edu- 
cation has opposite effects of the same sort: it 
enhances the individual's productivity and there- 
fore the community’s total effective resource stock, 
and it decreases the probability of antisocial be- 
havior and the burdening of the government's wel- 
fare load. Medical care decreases direct contagion 
externalities. It also has a positive effect on human 
productivity and on independence from government. 

In principle, interpersonal corrective qualifica- 
tions are very difficult to handle, because they force 
one to confront the problem of interpersonal com- 
parisons’ of worthwhileness—comparing the gain 
to some and the loss to others of particular actions 
having spillover effects. Thus, they are embedded 
in the very same morass of controversy that we 
discussed in connection with a criterion of aggre- 
gate want satisfaction. In practice, fortunately, the 
problem has been kept within manageable bounds. 
For the most part, only actions with substantial 
aversive spillover effects have been considered. The 
interpersonal comparison problem is thereby more 
straightforwardly resolvable on ethical grounds, 
rather than in terms of any nice balancing of 
effects under the principle of consumer sovereignty. 
The ethical principle involved is simply that an 
individual ought to be prevented from harming 
others. The exercise of freedom does not consist in 
allowing anyone to abridge the freedom of others. 
Thus, we shall see in the next section that this type 
of qualification has been socially interpreted for the 
most part as calling for a set of injunctions against 
harmful acts. 

To conclude this section, consumer sovereignty 
must be spelled out in order to have practical rele- 
vance. Externally unencumbered consumer choices 
do not invariably constitute accurate evidence about 
the wants they ostensibly serve. Such choices may 
be critically uninformed about alternative oppor- 
tunities; they may stem from internal goal con- 
flicts; or, while accurately designed to satisfy the 
wants of their agent, they may adversely affect the 
want satisfaction of other consumers. These factors 
must be considered in judging the degree of want 
satisfaction that inheres in different social situa- 
tions and, thus, in judging the consistency of dif- 
ferent public policies with consumer sovereignty. 

It is one thing to indicate that these factors must 
be considered; it is quite another to delineate ex- 
actly how they should be considered. How much 
information is necessary for specific choices; how 
much is inadequate? How does one discover a 
consumer’s—every consumer’s—true underlying 
wants? How does one trace the spillover effects of 


different actions? It is fortunately beyond the scope 
of this article to try to answer. But it is instructive 
of the content of consumer sovereignty to indicate 
briefly how a relatively free market society like the 
United States has attempted to spell out these 
qualifications in public policy. 


Institutional qualifications 


Dedicated to free markets, to freedom of choice, 
and to consumer sovereignty simultaneously, Amer- 
ican public policy can be interpreted as showing 
a consensual belief in the respects to which the 
first two must be compromised in order to be con- 
sistent with the third, Clearly, not all public inter- 
ventions in the market have been motivated by an 
attempt to improve the degree of consumer sov- 
ereignty, but a surprising number fit into the classi- 
fication of qualifications given above. We shall 
indicate some of the ways that the American public 
has been willing to compromise the working of 
free markets. 

Supportive policies. In general, Caveat emptor 
has been the dominant policy, but with a series of 
exceptions that have become more important only 
in recent years. Traditionally the consumer was 
protected at law, only after the fact, by the com- 
mon law of fraud, warranty, and negligence. 
(Much of the material in this section is from Wil- 
cox [1955] 1960, chapters 8, 12.) This essentially 
meant that consumers could sue for damages actu- 
ally suffered if due to misrepresentation or negli- 
gence. But even early practice, recognized an ex- 
ception. Some transactions could cause massive 
irreversible damage to life and health. If, in addi- 
tion, the consumer was technically incapable of 
judging in advance the quality of the commodity 
to be obtained, then he needed advance protection. 
Medical services were clearly a case in point. The 
solution hit upon was legal licensing, by which 
acceptable quality standards are guaranteed to the 
consumer in advance by the conditions that prac- 
titioners must meet in order to be granted a license 
to practice. 

Licensing spread to many services that involve 
personal health and safety, such as nursing, phar- 
macy, and dentistry. But it also spread considerably 
beyond, to occupations only remotely, if at all, 
resembling these. Examples are dry cleaning, bar- 
bering, and photography. Since licensing is a meth- 
od of legally limiting supply, it represents a profit- 
able collusive agreement for insiders and, thus, 
attractive public policy for them. It is interesting 
that in some cases of licensing that are inappro- 
priate from the present point of view the same 


language of public health and safety is nonetheless 
used to justify its legal status as under the appro- 
priate cases. 

The same advance protection was not given con- 
sumers for tangible commodities until the federal 
Pure Food and Drug Act of 1906 and its subsequent 
amendments, Food and drugs qualify as potential 
factors in one-shot massive or irreversible damage 
to life and health. Adulteration and use of toxic 
ingredients are prohibited. In more recent legisla- 
tion, the burden of proof of nontoxicity is largely 
shifted from government to the producer, espe- 
cially with respect to drugs, and must be satisfied 
before the commodity can be marketed. Informa- 
tional requirements are also imposed on sellers of 
these commodities, in the form of honest and com- 
plete labeling. Subsequent legislation, involving the 
Federal Trade Commission, extends informational 
constraints to advertising. Thus, for a particular 
subset of commodities uniquely “affected with the 
public interest,” the seller is legally required to fur- 
nish certain information, and this, as well as in- 
formation voluntarily advanced, must not be 
misleading. 

Informational constraints have been imposed on 
other commodities quite remote from this concept 
of “public interest.” Compulsory labeling exists for 
commodities made of wool, for example. As with 
licensing, many of these represent regulations in 
the interest of collusive competitive advantage for 
producers, rather than consumer-oriented support. 
There is a general constraint against misrepresen- 
tation in advertising, but this too seems to stem 
from, and be largely administered in the context 
of, protection of producer rather than of direct 
consumer interests. This has focused largely on 
the protection of sellers from unfair methods of 
competition. 

There is a further class of informational con- 
straints, which is in the spirit of consumer sov- 
ereignty. It is in the area of financial securities 
transactions. Here too the justification for interven- 
tion seems to be the possibility of massive single 
transaction damage; but here the damage is to the 
wealth position of transactors, rather than to life 
and health. Elaborate requirements for full and 
honest disclosure by sellers, as well as behavioral 
regulations on brokers and organized exchanges 
themselves, are imposed to prevent fraud. 

Supportive qualifications have thus tended to be 
concentrated on commodities where single trans- 
actions could produce important harm to life, 
health, or wealth. No serious attempt is made to 
expand information available to consumers on con- 
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sumption generally. (I neglect specific free infor- 
mation programs by the federal government for 
child care, agricultural and homemaking tech- 
niques, and other miscellany.) A minor attempt is 
made to control the substance of advertising excess 
(misrepresentation), but not the deeper problems 
involved in pervasive persuasion. 

This pattern of intervention is not hard to under- 
stand. In fact, most public intervention arose not 
out of theoretical analyses of subtle consumer sub- 
optimization but as a response to concrete national 
traumas—dramatic scandals concerning damage 
from specific foods and drugs, scandals concerning 
massive fraud and irregularity in financial markets, 
etc. Thus, laissez-faire was cast aside not on the- 
oretical grounds but on the evidence of acute mal- 
functioning of markets. 

Corrective policies. The most widespread form 
of government intervention against free choice is 
in the regulation of antisocial externalities. The 
almost uniform definition of aggression against per- 
son or property as crime, and hence illegal, is obvi- 
ous. But the consensus extends to less obvious 
forms as well. Thus, government enforces quaran- 
tine regulations and sanitary codes for dwellings; 
it imposes compulsory smallpox immunization; it 
outlaws transactions in narcotics. At a third level, 
taxation is sometimes employed with sumptuary 
goals, for example, the high excises on liquor. 
(This should not be thought to apply to special 
excises generally. Gasoline taxes, for example, are 
typically justified on a modified benefit argument: 
they are designed to pay for roads and other motor- 
ist benefits. ) 

Desirable externalities receive government at- 
tention as well. The public school system, free 
medical care for the poor, public housing, social 
welfare services, and many others can be under- 
stood as subsidization of forms of consumption that 
are believed to have substantial desirable spillover 
effects. Since most of these represent rectification 
of dismal alternatives, they can also be seen as 
measures to prevent certain adverse spillover ef- 
fects, The difference between the treatment of 
positive and negative spillovers is that for the latter, 
injunction generally suffices (plus the deterrent 
effect of possible punishment), whereas for the 
former, actual subsidization of consumption is 
necessary. 

Two types of regulation that are on the border- 
line are censorship and the regulation of “immor- 
ality” (in a broad sense, including divorce law and 
“blue laws”). The illegality of homosexuality and 
fornication, for example, is sometimes justified in 
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terms of preventing the spread of practices that 
could undermine fundamental institutions of the 
society, such as the family. Sometimes, however, 
it is justified as protecting possible participants 
from corruption—an example of intrapersonal cor- 
rection, Similarly, censorship is sometimes justified 
on the grounds that exposure to censorable mate- 
rials may provoke crimes of violence. Sometimes it 
too receives its major justification as protection of 
the individual from himself. 

These borderline cases are significant, Govern- 
ment in the United States undertakes almost no 
intervention based on intrapersonal correction. This 
sphere has been left to private responsibility, and 
governmental concern has typically been treated as 
unwarranted invasion. One would therefore pre- 
dict that cases partaking of intrapersonal correc- 
tion would be controversial. This is exactly so of 
censorship and immorality regulations. There exists 
no real consensus supporting them: divorce laws 
are widely assailed in reputable quarters; blue laws 
are kept on the books only if they are not seriously 
enforced; the illegality of sexual “immoralities” is 
highly controversial; every act of censorship is pro- 
tested by important segments of the population; 
and even the illegality of narcotics transactions 
(insofar as it is designed to protect an addict from 
himself) is responsibly controversial. One might 
imagine that heavy taxation of tobacco represented 
an exception, This is not so. Ironically, the sumptu- 
ary background of tobacco taxation stems from the 
notion that it is a quasi-luxury item. It is taxed as 
an example of pleasurable, not deleterious, con- 
sumption, 

A major public action that seems on the surface 
to fit this category is the government's massive 
intervention in the market during national emer- 
gencies, resulting in price control, rationing, and 
direct control over the allocation of resources. This 
would appear to constitute a radical abrogation of 
consumer sovereignty. In a deeper sense, however, 
it is not so, It is fruitful to consider the situation 
as one in which everyone's basic interests in the 
maintenance of the social pattern require a coordi- 
nated mobilization of resources. No one individual 
or small group can achieve such coordination. Gov- 
ernment, as the instrumentality of consensual 
wants, is required to bring it about. It acts as the 
agent of each consumer, just as we noted the phy- 
sician does in the individual case. If the political 
system is truly representative, the government's 
coordinative acts can be considered consistent with 
consumer sovereignty. Thus, this case does not in- 
volve any new qualifications on individual choice. 


It simply represents an unusual instrumentality 
through which individual choices are realized. A 
number of other general governmental activities 
can be similarly interpreted. 

In sum, abrogation of free choice for corrective 
reasons essentially leaves the individual's percep- 
tion of his own ends sacrosanct; it is only where 
individual choices have important spillover effects 
that government intervenes, and here preponder- 
antly to prevent aversive spillovers. The few cases 
where government seems to be legislating private 
norms for individuals are cases in which some spill- 
over elements are present as well, and these cases 
are highly controversial. 


Consumer sovereignty, insisting on the want- 
fulfillment of the ultimate consumer population as 
the proper end of economic activity, is an influen- 
tial but ambiguous and incomplete normative prin- 
ciple. It is often incorrectly limited to a version 
that strongly supports laissez-faire in a free enter- 
prise economy, That version represents a particular 
composite of one interpretation of consumer sov- 
ereignty with other value judgments. Consumer 
sovereignty can be interpreted in many ways and 
be joined to many distinctive normative packages 
(for example, to the goals of a socialist economy), 
no one of which has earned the consensual sup- 
port of economists as the authoritative package. 

Analysis of the principle itself indicates that 
neither the wants that are the presumed goals of 
economic activity nor their reflection in the con- 
crete choices of individuals are unproblematic. The 
degree of informedness about alternatives, the co- 
herence of the whole structure of individual wants, 
and the significance of spillover effects on others 
are all variables relevant to the interpretation of 
the principle. Moreover, its persuasiveness depends 
strongly on the stability of wants and on the depth 
at which they are integrated within the personality 
of each individual. Extreme suggestibility and vola- 
tility of wants seriously undermine the principle’s 
force. 

In the American version of a free enterprise 
society there appears to be a consensus that con- 
sumers should be given public protection in the 
form of regulations about quality and information 
when making what are potentially momentous mar- 
ket decisions. Public intervention should also be 
invoked to prevent private actions that have dele- 
terious effects on others. There is no consensus that 
public power should be used to protect individuals 
from their own confusions about means and ends. 
In this interpretation of consumer sovereignty the 
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individual is still free to be his own worst enemy— 
or his own best friend. The range between them 
still allows life to be a great adventure. 


JEROME ROTHENBERG 


[See also ADVERTISING, article on ECONOMIC ASPECTS; 
CONSUMERS, article on CONSUMER BEHAVIOR; LI- 
CENSING, OCCUPATIONAL; WELFARE ECONOMICS.] 
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I 
CONSUMPTION LEVELS AND STANDARDS 


Consumption is defined as the use of goods and 
services. Differences in consumption among indi- 
viduals at a given time and between points in time 
are called differences in consumption levels. Con- 
sumption standards are goals to be striven for, 
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maintained, or regained. These standards influence 
the ways in which families spend their income, 
that is, consumption patterns. Standards also can 
cause feelings of discontent that may result in ac- 
tion when consumption is below the standard— 
when, for example, people consider that the dwell- 
ing they occupy is overcrowded, the food they eat 
is inadequate, or the quality of schooling available 
for their children is poor. Consumption standards 
are part of a larger set of standards pertaining to 
work, leisure, social relationships, and living in 
general. Consumption is influenced by standards 
and by income, and income in turn is influenced by 
standards of living insofar as standards affect in- 
centives (Davis 1945; Pipping 1953). 

The goals of life are many, so that choices must 
be made among them. These choices are influenced 
by the hierarchy of preferences expressed in the 
standard of living and by the costs of the things 
desired. The availability of resources of several 
types acts as a constraint on total achievement. 
Earning additional income, obtaining more unpaid 
services from family members, and making wiser 
choices among products are all important paths to 
higher consumption. A consumption standard that 
is low relative to current money income tends to 
weaken the motives to earn and to economize in 
the use of income. 

Consumption is overt behavior, and standards 
are among the forces shaping it. Consumption data, 
therefore, indirectly provide some information 
about standards; the strength of preferences is im- 
plied by what is chosen. Attempts have also been 
made to gauge more directly the intensity of desire 
for things not currently being consumed. The no- 
tions of “poverty,” “comfort,” and “affluence” are 
found to be related to earlier experiences, espe- 
cially to recent changes. However, our knowledge 
of standards comes largely from observing behavior 
under varying conditions. 

Differences within and among countries. About 
a century ago Engel formulated a very important 
law of consumption, namely, that as total con- 
sumption increases, the percentage of expenditures 
going to food tends to decrease (Stigler 1954). 
This percentage is referred to here as the food rate. 
Historically, Engel’s law seems to apply even 
though many changes have accompanied the in- 
crease in total consumption. For example, in the 
United States the food rate of urban families of 
two or more persons whose head was a wage earner 
or clerical worker was 41 per cent in 1888-1891 
(Williams 1958); 38 per cent in 1918-1919 (U.S. 
Bureau of Labor Statistics 1924); and 35 per cent 
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in 1934-1936 (U.S. Bureau of Labor Statistics 
1941). For all urban consumer units of two or 
more persons the food rate was 24 per cent in 
1960-1961 (U.S, Bureau of Labor Statistics 1964). 
Changes in prices as well as in incomes may have 
had an effect on this rate. The Consumer Price 
Index shows that between 1913 and 1957 rents in 
large cities increased 77 per cent and food prices 
increased 187 per cent (U.S, Bureau of the Census 
1960). 

Relationships between the food rate and income 
similar to those indicated by the secular change in 
the food rate of urban families in the United States 
are apparent in comparisons among countries that 
obviously differ in real income as well as in other 
ways (Bennett 1951; Williams 1958). For exam- 
ple, in 1962 the food rate of countries ranged from 
51 and 50 per cent for Ghana and Ceylon to 22 and 
21 per cent for Canada and the United States, re- 
spectively (United Nations 1964). It seems prob- 
able that among countries the food rate is a suit- 
able proxy for total consumption: the lower this 
rate, the higher is total consumption. 

The food rate is correlated with a number of 
conditions closely related to total consumption. 
Among 24 countries investigated around 1960, the 
coefficient of correlation between the food rate and 
the infant mortality rate was approximately 0.74. 
At the same time among 23 countries the coef- 
ficient of correlation between the food rate and the 
percentage of households with 1.5 or more persons 
per room was 0.86. Similar relationships have been 
observed among states and cities of the United 
States (U.S. Bureau of Labor Statistics 1964; U.S. 
Bureau of the Census 1963; 1964). 

Expenditures as measures of consumption. Ex- 
penditures may be viewed as numbers of units of 
consumer goods and services multiplied by their 
prices. The prices of products of specified quality 
differ from time to time and place to place, and 
differences in expenditures may therefore not rep- 
resent differences in consumption. Many countries 
have developed price indexes that permit at least 
crude intertemporal comparison of real consump- 
tion, that is, of the quantity of consumer products 
measured in dollars of constant purchasing power. 
Differences in expenditures among places within 
countries may also reflect differences in prices as 
well as in the quantity of products purchased. Food 
prices probably vary with the distance between con- 
sumers and primary producers; in other words, 
they are a function of the regional specialization 
of agriculture. The price of residential housing 
probably increases with urban congestion. The costs 
of services are much influenced by the availability 


of job opportunities outside the service industries 
and hence tend to be positively correlated with 
local incomes. Little progress has so far been made 
in providing interplace indexes of consumer prices 
(Brady & Hurwitz 1957). If consumer prices are 
correlated with regional and local income, as seems 
probable, then the variation in expenditures among 
the consumers of a country overstates the variation 
in consumption. 

Expenditure data seldom distinguish precisely 
between consumption expenditure and outlays 
made to provide income. With the growth of cities 
and suburbs and the increased mobility of the labor 
force, job expenses have increased. In the United 
States from 1940 to 1963 the share of transporta- 
tion in total expenditure increased from 9.7 to 12.6 
per cent (U.S. Bureau of the Census 1964). Some 
of this increase was caused by the greater distances 
traveled to work and thus represents an increase 
in the cost of production rather than an increase 
in the consumption of transportation service for 
personal or recreational purposes. There are still 
few estimates of job expenses, and even these are 
limited in their coverage (Holmes 1962). It seems 
reasonable to expect that such expenses tend to 
increase with earnings. 

The boundaries between consumption and sav- 
ings are also blurred. Expenditures for term life 
insurance are often classified as savings, although 
outlays for other forms of current insurance against 
risk are not. Capital investments in owner-occupied 
housing are classified as savings, whereas the pur- 
chase of an automobile is not. The purchase of 
land to establish a son in farming is counted as 
savings, but expenditures for the professional edu- 
cation of a son are not, although both are invest- 
ments designed to augment future income. 

Expenditure data understate consumption if 
some products or services consumed are excluded. 
Among the items usually not represented in survey 
data are (a) products withheld from sale by pro- 
ducers for their own use, such as food and fuel 
produced and used by the same farm families; 
(b) the services of owner-occupied housing in ex- 
cess of current expenditures for them; (c) the 
services of the stock of consumer durables, such 
as automobiles and household equipment, in excess 
of expenditure for its maintenance; (d) nonmat- 
ket services used, such as those of family members; 
and (e) products or services provided from public 
funds, 

Income in kind. Consumer products used but 
not purchased are commonly referred to as income 
in kind. Some income in kind, notably items (a) 
and (b) above, are included in national estimates 
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of personal consumption expenditures and oc- 
casionally in survey data. The other types of income 
in kind are seldom if ever included in data on per- 
sonal consumption. Many types of income in kind 
are negatively correlated with the money income of 
consumers, and their omission therefore tends to 
increase the inequality of consumption as it is 
inferred from expenditures. 

Investment in owner-occupied housing is an im- 
portant source of income in kind. At about 1950 
in the United States such income was approximate- 
ly $6 for every $100 of expenditures of nonfarm 
consumers. Income in kind from this source tends 
to increase with the age of the head of the con- 
sumer unit, since older heads are more likely to 
have full equity in their homes (U.S, Bureau of 
Labor Statistics 1964; Reid 1962; U.S. Bureau of 
the Census 1963), In addition, it tends to increase 
with average lifetime income, and in this respect 
differs from many other types of income in kind. 

The omission of income in kind results in con- 
siderable understatement of the consumption of 
farm families, For example, the income in kind of 
farm families, largely in the form of food and hous- 
ing, in a low-income area of the United States dur- 
ing 1957 was $52 for every $100 of expenditures 
(Pennock 1964). In 1955, despite a general de- 
cline in subsistence farming, food produced on the 
farm and valued at the prices that would have been 
paid had it been purchased accounted for about 40 
per cent of the food consumption of all farm house- 
holds (U.S. Agricultural . . . 1956-1961). 

Current expenditure overstates consumption when 
a stock of consumer durables is being increased. 
A stock of consumer capital, such as furniture, 
equipment, an automobile, and clothing, is a source 
of income in kind. The time of purchase of these 
products tends to differ with the age of the family 
head. In the United States in 1960-1961, expendi- 
tures for furnishings, equipment, and transporta- 
tion were 25 per cent of consumption expenditures 
for consumer units whose head was under 25 years 
of age and 11 per cent for those whose head was 
75 years of age or more (U.S. Bureau of Labor 
Statistics 1964), This difference reflects different 
stages in the cycle of building up consumer capital 
and allowing it to depreciate, as well as some dif- 
ferences in income and consumption. 

Expenditure data do not include the value of 
services to the household by its own members. 
There has been a downward trend in the percent- 
age of households with a full-time homemaker. 
Consequently, time series of expenditure data are 
likely to overstate the increase in consumption. In 
addition, consumption seems likely to exceed ex- 
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penditure by larger amounts where there is a full- 
time rather than a part-time homemaker (Clark 
1958; Holmes 1962; Rowntree & Lavers 1951). 
Since earning by wives tends to increase with 
family income, the higher the family income, the 
less important, in general, are the unpaid services 
supplementing expenditure. 

Consumption wholly or partly financed by pub- 
lic funds has become an increasing proportion of 
total consumption. In the United States public ex- 
penditure for health, education, recreation, food, 
and housing was $7 per $100 of personal expendi- 
ture in 1950 and $9 in 1962. Public expenditure 
differs by type of product. During 1962 private 
expenditure for education and related research in 
the United States was $1.46 per $100 of total pri- 
vate expenditure. When public expenditure for edu- 
cation is included, this relation increases to $7.50. 
However, private expenditure for medical care was 
$6.20 per $100 of total private expenditure, and 
when public expenditure for “health and medical 
services” is included, this relation increases only 
to $7.63 (Social Security Bulletin . . . 1961; US. 
Bureau of the Census 1964). 

Publicly provided products or services available 
only to low-income families, such as public housing, 
subsidized food, medical care, and legal services, 
tend to decrease the inequality of consumer real in- 
comes. This also occurs if the product publicly pro- 
vided is one much used by low-income families or 
if its quality is too poor to be acceptable to high- 
income consumers. In some communities of the 
United States expenditures for tuition in private 
high schools increase markedly with income, al- 
though publicly financed high school education is 
available without charge to all persons. Public 
financing of colleges probably benefits families of 
moderate to high incomes more than those of low 
incomes; this is probably also true of subsidies to 
encourage owner occupancy of housing. The extent 
to which the inequality of real income is affected 
by public financing of consumption is almost a 
virgin field of inquiry (Peacock 1954), 

Expenditure patterns. Three sets of conditions 
cause consumption to differ among families: needs 
and preferences for various consumer products, the 
resources available, and the cost of things desired. 
Each of these sets includes a vast array of factors. 
Intercorrelations among them and incomplete in- 
formation about them have been formidable ob- 
stacles to obtaining definitive estimates of the 
effect of any one factor. A large stock of survey 
data describes expenditure in relation to income, 
the size and composition of consumer units, the 
occupation of the household head, tenure of dwell- 
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ing, place of residence, and other conditions. The 
effect of any one factor, such as prices, income, or 
the age of the head, appears to differ among sets 
of data (Friedman 1957; Houthakker 1957; Reid 
1962; Reid & Dunsing 1956). Understanding the 
cause of such differences among survey findings 
is essential to knowledge of the structure of con- 
sumption, and inquiry along many lines is under 
way. 

Engel curves and permanent income theory. 
The allocation of increased income between ex- 
penditure and saving and among the various types 
of products has important implications for eco- 
nomic development. Engel curves describe the aver- 
age expenditures of consumers by income levels. 
The ratio of the rate of increase in expenditure to 
the rate of increase in income is called the coef- 
ficient of income elasticity of expenditures, It is 
common to describe as necessities products with an 
income elasticity of less than 1.0 and as luxuries 
those with an income elasticity greater than 1.0. In 
comparing the Engel curves of broad consumption 
categories such as food, clothing, housing, and 
transportation this distinction has little merit, since 
income elasticities as great as 1.0 are seldom ob- 
served. Each of these categories includes some 
necessities and some luxuries. 

As the stock of survey data has grown, it has 
become obvious that the slopes of Engel curves 
differ from group to group and time to time. Such 
differences have been observed for total expendi- 
tures (Friedman 1957; Houthakker 1957) and for 
food and housing (Reid 1962; Reid & Dunsing 
1956). The permanent income theory of consump- 
tion has done much to rationalize these differences. 
It views annual incomes as having two compo- 
nents, one positive component representing expect- 
ed income and a second, which may be negative 
or positive, representing deviations from this ex- 
pected income. These deviations are referred to as 
transitory income [see CONSUMPTION FUNCTION]. 
The theory postulates that consumption is highly 
correlated with expected income but not with tran- 
sitory income, Although the theory is by no means 
fully tested, there seems to be no doubt that it 
explains much of the variation in Engel curves for 
a single type of product and that transitory income 
is more highly correlated with expenditures for 
durable goods than with expenditures for housing 
and food, 

The mixture of transitory and expected income 
represented by income distributions is not con- 
stant. With the decrease in the proportion of con- 
sumer units whose head is self-employed (a group 
for which transitory income is common) and with 


the increase in income made available through 
transfer payments during periods of unemploy- 
ment, sickness, and disability, the relative size of 
the transitory component of consumer income 
seems likely to decrease, Such a decrease will tend 
to increase the slope of Engel curves. On the other 
hand, an increase in the progressive taxation of 
incomes, which makes the normal income of con- 
sumer units less unequal, is likely to increase the 
proportion of the income variation that represents 
transitory income. This will tend to reduce the 
slope of Engel curves. 

Engel curves are useful chiefly for indicating the 
short-run response to changes in income. They do 
not describe the differences in consumption be- 
tween the rich and the poor or the effect on con- 
sumption of secular changes in real income. Ob- 
servations describing the average consumption and 
income of subsets of consumers by the occupation 
of the head have considerable merit for this pur- 
pose. The designing of consumer surveys suitable 
for testing the effect on consumption of differences 
in permanent, or normal, income is still in a pre- 
liminary stage. 

Role of assets. The growth of data on assets 
has lagged behind that of data on expenditures and 
income. A comprehensive survey of the assets of 
consumer units in the United States was made for 
1962 (Survey of the Financial . . . 1964). Marked 
increases in the value of assets were reported with 
increases in the age of the head of consumer units 
and with increases in income. Even so, units with 
incomes under $3,000 had a mean net worth of 
about $9,000. This figure reflects in part a concen- 
tration at incomes under $3,000 of consumer units 
with an elderly head. Short-run declines in income 
also result in some large holdings of assets being 
reported among units with low current incomes. 
Better data on assets will aid greatly in explain- 
ing the deficit spending observed at low annual 
incomes. 

Role of time. The role of time in decisions con- 
cerning earning and consumption is receiving in- 
creasing attention (Clark 1958; Kriazhev & Marko- 
vich 1959; Scientific Conference . . . 1962; Moscow 
1959-1962). Time must be allocated among earn- 
ings to provide current income, formal training 
to provide future income, unpaid productive serv- 
ices to supplement expenditure, and leisure or 
consumption activities as such. Mincer (1963), in 
his consideration of the gainful employment of 
women, notes that the time costs incurred by indi- 
viduals in earning their money income must be 
considered if the effects of money income and 
prices of products on consumption are not to be 
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misunderstood. Income foregone while attending 
college is widely recognized as a cost of college 
education. Becker (1965) extends the considera- 
tion of time cost to the purchase of consumer prod- 
ucts in general and to use of leisure time. For ex- 
ample, the cost of attending the theater includes 
the time entailed as well as the price of tickets and 
of transportation. The cost of commuting to work 
involves time as well as money. In this frame of 
reference leisure is assessed in terms of consump- 
tion foregone because of reduced income. 

Household composition. The economic aspects 
of household membership are also receiving con- 
siderable attention (Brady 1958). The rise in real 
income and the growth of programs providing re- 
tirement income have been accompanied by a 
reduction in the number of three-generation house- 
holds, At what income does preference for separate 
households offset the economies of scale of house- 
holds of three generations? How does this un- 
doubling change the correlations between income 
and household size and composition, and hence the 
observed expenditure—-income patterns? It seems 
likely, for example, that undoubling increases the 
proportion of low-income consumer units that have 
an elderly head. 

Expenditure data tell us a great deal about con- 
sumption. Their usefulness for this purpose would 
be increased if allowance were more fully made 
for differences in the cost of living due to inter- 
spatial differences in prices and in climate, for job 
expenses, and for income in kind. Without such 
allowance, the measurement of consumption is still 
in a very elementary stage. 

Consumer budgets. Budgeting by consumers 
for the most part appears to be very informal. What 
are commonly referred to as consumer budgets are 
plans prepared by specialists used for consumer 
education and for the administration of public pro- 
grams, Among the best known consumer budgets 
are the low-cost and moderate-cost food plans of 
the U.S. Department of Agriculture (Cofer et al. 
1962) and the City Workers Budget of the U.S. 
Department of Labor (Lamale & Stotz 1960). 
These budgets combine data on the purchases of 
consumers as reported in occasional surveys with 
the recommendations of specialists as to physical 
and social needs. For example, the recommended 
dietary allowance of the Food and Nutrition Board 
(Cofer et al, 1962) are intended to represent phys- 
ical needs, and the standard that there be at least 
one room per person in the family is intended to 
represent social needs. Budgets are developed for 
consumer units of specified size and composition, 
and scales have also been developed that permit an 
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adjustment in the cost of the budget to the size of 
the consumer unit. 

The recommendations of specialists involve arbi- 
trary judgments, as does the use of data on pur- 
chases. The low-cost food plan of the U.S. Depart- 
ment of Agriculture tends to represent the dollar 
value of the food consumption of families at the 
first quartile of the income distribution of the 
group referred to in preparing the plan; the moder- 
ate-cost plan tends to represent that of families at 
the median income, A new national survey is fol- 
lowed by a revision of the food plans with purchase 
patterns reflecting a new income distribution. 
Hence, secular change in the food plans tends to 
approximate secular change in the expenditure for 
food at the first quartile and the median of the in- 
come distributions, when account is taken of the 
size and composition of consumer units. A lag 
exists between the consumption represented by the 
food plans and that of families surveyed because 
the collection, tabulation, and analysis of the data 
are time consuming. 

The City Worker’s Budget purports to represent 
a modest but adequate level of consumption. It is 
less explicitly related to a particular position in the 
income distribution than are the food plans dis- 
cussed above. The recommendations of many sets 
of specialists are utilized. When purchase data 
from surveys are used, they tend to represent the 
expenditure of median-income families with a hus- 
band, a wife who is not in the labor force, and two 
children living at home. The mean City Worker's 
Budget for 20 large cities for the autumn of 1959 
was $6,083. The median income in 1959 for urban 
families with a husband who worked and a wife 
who did not was $6,283 (U.S. Bureau of the 
Census 1961—). Thus it seems reasonable to con- 
clude that the City Worker's Budget in 1959 tended 
to approximate the consumption of median-income 
families. Successive revisions of this budget have 
brought it to higher levels. For 20 large cities the 
mean budget for 1959 exceeded the mean budget 
for 1947 by 88 per cent. Over the same period na- 
tional per capita personal income increased only 
65 per cent. The “modest but adequate” City Work- 
er’s Budget has at times been interpreted as a mini- 
mum budget. In such terms, about half of the 
population is below the minimum, and the secular 
rise in the budget makes any change in this pro- 
portion unlikely. 

Inequality and poverty. In the antipoverty pro- 
gram launched by the federal government of the 
United States in 1964, an annual money income of 
$3,000 was used as a dividing point between 
poverty and not-poverty (U.S. President 1964). 
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According to this definition, a startling amount of 
poverty was found to exist in the midst of general 
affluence. The use of an annual income of $3,000 
as the test of poverty has been much debated. What 
the criterion should be is not an issue that scien- 
tific procedure can resolve. It is, however, obvious 
that a yardstick of this type, with certain modifica- 
tions, can serve useful purposes. For example, it 
can be used to determine what progress has been 
made in reducing the frequency of income that is 
low in relation to need. 

In 1961 urban consumer units with a money in- 
come of $3,000 had a per capita income of $1,224 
(U.S. Bureau of Labor Statistics 1964). Thirteen 
per cent of the urban population was in consumer 
units with incomes of less than $3,000. A per 
capita income of $1,224 in 1961 had the same pur- 
chasing power as $984 in 1950, and 24 per cent of 
the urban population of 1950 was in consumer 
units with incomes below this level (Study of Con- 
sumer Expenditures . . . 1957), An income of 
$1,224 in 1961 had the same purchasing power as 
$602 in 1941, and 40 per cent of the urban popu- 
lation of 1941 was in consumer units with incomes 
below this level (Hanson 1945). These changes 
imply a secular decline of more than one per cent 
a year in the percentage of the urban population 
receiving purchasing power of less than 1,224 
1960 dollars per capita. 

After a study in Great Britain of the income of 
wage earners and families receiving assistance in 
1936 and 1953, Schulz concluded that “poverty, in 
the sense in which the term was understood before 
the war, has been abolished for the families of the 
type considered” (1955, p. 232). Rowntree identi- 
fied five comparable consumption standards for 
1936 and 1950, and for the city of York, England, 
found 18 per cent of the population in the two 
lowest classes in 1936 and 2 per cent in 1950 
(Rowntree & Lavers 1951). 

A yardstick of poverty can also be used to dis- 
cover the people who are most likely to be poor and 
the conditions that contribute to their poverty. To 
be suitable for this purpose a yardstick must repre- 
sent the same level of consumption wherever it is 
applied. To consider only money income is to ig- 
nore income in kind and important interspatial 
differences in prices and other conditions that 
affect the cost of living, such as climate. How the 
size and composition of consumer units affect 
needs is also ignored when a single yardstick is 
used for all units. However, many descriptions of 
“who are the poor” have taken into account varia- 
tion in need that depends on the number of per- 
sons per consumer unit. The other shortcomings of 
the single money measure of poverty have tended 


to be neglected. As a result, descriptions of poverty 
in the United States have tended to overstate the 
frequency of low consumption in farm and other 
rural populations and in warmer regions where the 
cost of shelter is low; they have tended to under- 
state the frequency of poverty in large cities where 
the costs of housing and of transportation to work 
are high. 

Relative as well as absolute poverty is a matter 
of considerable importance. [See INCOME DISTRIBU- 
TION, article on s1zE.] Data on the size distribution 
of money income have been the most important 
basis for estimates of the degree of inequality. For 
industrialized countries, a secular decrease in in- 
equality is indicated (Kuznets 1965). However, the 
biases of money income as an index of consump- 
tion and the failure to consider the changing mem- 
bership of consumer units greatly limit the relia- 
bility of such estimates. These biases have not been 
constant through time and differ greatly among 
populations. The limiting conditions include in- 
come taxes, job expenses, correlations between cur- 
rent income and prices, income in kind, the char- 
acteristics of consumer units, and the extent to 
which income distributions represent transitory 
conditions. Systematic exploration of many. of 
these biases has yet to be undertaken. 

Data available leave no doubt that the inequality 
of expenditures is appreciably less than the in- 
equality of money incomes (Snyder 1960). When 
urban consumer units of two or more persons are 
ranked by 1961 income, those in the lowest fifth 
of the income distribution are found to have re- 
ceived 6.9 per cent of the money income and 7.6 
per cent of the disposable income and to have 
made 9.5 per cent of all consumer expenditures. 
Measurement in terms of the number of persons 
in the consumer units further reduces the apparent 
inequality of consumption. The lowest fifth of the 
urban population (ranked by the income of con- 
sumer units) received 8.8 per cent of the money 
income and 9.6 per cent of the disposable income 
and made 11.9 per cent of all consumer expendi- 
tures, So far expenditure data have been little used 
to provide insights into the inequality of consump- 
tion and of normal income. 


Viewed historically, scientific discoveries and 
new technology stand out as the main factors 
bringing about changes in consumption. They have 
resulted in new products, greater efficiency of pro- 
duction, higher real income, and changes in the 
relative cost of various products. During much of 
the twentieth century the tempo of change has 
been very rapid—within a few decades transform- 
ing rural, rather static, societies into urban, highly 
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dynamic societies. In general, current changes sus- 
tain an expectation of further increases in con- 
sumption, year-by-year improvement in old prod- 
ucts, and a continuing flow of new products. 
Changes in the structure of market economies, 
in the relation of private to public economies, and 
in the welfare objective of societies create a need 
for increased knowledge of consumption and of the 
forces shaping it and for a greater understanding 
of its implications for social goals. Considerable 
progress has been made in measuring the total out- 
put of consumer products of the market economy. 
Only moderate advances have been made in under- 
standing the response to changes in costs as re- 
flected in money outlays, in consumer or home 
production, and in the use of the products of the 
public economy. Nor has much progress been made 
in measuring changes in the inequality of con- 
sumption. 
MARGARET G. REID 


[See also the biography of ENGEL.] 
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Il 
CONSUMER ASSETS 


In a general sense, consumer assets encompass 
all tangible goods or intangibles which consumers 
own or on which they have claims. This includes 
not only household goods but also most of the 
assets of business corporations and of government, 
as evidenced by consumer holdings of stocks and 
bonds. 

It is customary to classify consumer assets in 
various ways, roughly analogous to the definition 
of national wealth [see NATIONAL WEALTH]. Per- 
haps the most frequently encountered of such clas- 
sifications are the following. 

Tangibles vs. financial. Tangibles include all 
material possessions owned directly by the con- 
sumer, e.g., household goods, cars, homes, jewelry, 
and ownership of a farm. Intangibles, or financial 
assets, include cash and claims on money or goods, 
such as checking and savings accounts, common 
and preferred stock, bonds, cash value of life insur- 


ance, mortgages held on property of others, and 
loans to others. 

Financial assets may be subdivided into fixed- 
dollar and variable-dollar holdings. The former rep- 
resent assets for which the consumer has a claim 
on a fixed number of dollars, such as cash, a check- 
ing account, or a mortgage. In contrast, variable- 
dollar holdings are those for which the return is 
dependent on the price level or on other economic 
conditions. Thus, the current value of 100 shares 
of common stock originally purchased at $25 per 
share will depend on the performance of the com- 
pany since that time, and may be substantially 
more or less than $2,500, Other examples of vari- 
able-dollar holdings include debentures and the 
variable annuity. 

Liquid vs. nonliquid. Liquid assets are those 
which are immediately convertible into cash, into 
a fixed number of dollars. Hence, a liquid asset is 
a fixed-dollar asset, but not all fixed-dollar assets 
are liquid assets. For example, U.S. government 
Series E bonds are fixed-dollar assets, immediately 
convertible, and therefore liquid assets. Series H 
bonds, however, which are also fixed-dollar assets, 
are not immediately convertible into cash and are 
therefore nonliquid assets. The principal forms of 
liquid assets in the United States, other than money 
itself, are savings accounts and U.S. government 
Series E bonds. 

Gross vs. net. The total wealth of the con- 
sumer, without deduction for debts or liabilities, is 
known as his gross, or total, assets. Deducting his 
debts, or liabilities, from this total yields net assets. 
The gross assets of a consumer are equal to his net 
assets only if he possesses no debts. 

Holdings of consumer assets. Accurate estimates 
of consumer assets, or wealth, are as yet obtainable 
only for national totals and only for certain aggre- 
gates and certain countries. The reason is that, 
with a few exceptions, such data can be obtained 
only from consumers, and consumers are notori- 
ously unreliable reporters on such subjects. Finan- 
cial holdings tend to be understated and distorted 
in various ways. Nonfinancial holdings, such as the 
value of one’s appliances or of clothing, cannot be 
estimated with much reliability because of the diffi- 
culty of ascertaining such holdings and because of 
the very poor basis for estimating the resale value 
of such goods. The estimation of consumer finan- 
cial holdings from institutional records is usually 
hardly feasible either, partly because of the diffi- 
culty of collating holdings in different institutions 
for the same consumer unit and partly because few 
institutions distinguish between consumer hold- 
ings and holdings of businesses or of institutions. 


The latter problem makes it difficult to estimate 
even aggregates of consumer assets from institu- 
tional data. 

For these reasons, the consumer asset, or wealth, 
estimates that have been made should be treated as 
rough approximations and not as exact figures. 
For the United States, estimates of total household 
wealth have been made as part of the national 
wealth estimates of Raymond Goldsmith. These 
estimates are discussed and presented for the end 
of 1958 in the article NATIONAL WEALTH. As 
shown in Table 1 of that article, household net 
wealth at that time in the United States amounted 
to just under $1.5 billion, or over 60 per cent of the 
national total. Roughly three-fifths of this house- 
hold wealth was in financial form, with the rest in 
physical assets. 

Detailed estimates of the aggregate financial 
wealth of individuals are published by the U.S. 
Securities and Exchange Commission, based on 
data obtained from financial institutions, and are 
shown in Table 1 for selected years between 1950 
and 1964. These data include not only consumers 
but also nonprofit institutions, and encompass such 


CONSUMERS: Consumer Assets 343 


peripheral items as private insurance and pension 
reserves. Therefore, not all of these holdings can 
be attributed to consumers or to households. Never- 
theless, these data suggest that the great bulk of 
consumer financial wealth is in the form of pre- 
ferred and common stock, insurance, and savings 
deposits of one form or another. The table also 
brings out the dramatic increase in the post-World 
War u period in mortgage and consumer debt, on 
the liability side, and in equities and savings shares 
(mostly in savings and loan associations), on the 
assets side. 

Another source of estimates of the financial 
wealth of consumers is sample surveys of saving 
or of components of wealth made periodically by 
governmental and other agencies. The annual Sur- 
vey of Consumer Finances of the University of 
Michigan Survey Research Center provides infor- 
mation on the liquid asset holdings of U.S. con- 
sumers as well as on ownership of homes, cars, and 
selected durable guods and on different types of 
consumer debt. The most comprehensive study of 
this nature so far is the “Survey of Financial Char- 
acteristics of Consumers” conducted for the Federal 


Table 1 — Financial assets and liabilities of individuals, United States, selected years 
(in billions of dollars as of end of year)* 


Financial assets: 
Currency and deposits 
Currency and demand deposits 
Time and savings deposits 
Savings shares 
Securities 
U.S. savings bonds 
Other U.S. government 
State and local government 
Corporate and other* 
Bonds and notes 
Investment company shares 
Other preferred and common shares 
Private insurance and pension reserves 
Insurance reserves 
Insured pension reserves 
Noninsured pension reserves 
Government insurance and pension reserves 


Total assets 


Liabilities: 
Mortgage debt 
Consumer debt 
Securities loans 


Total liabilities 
Net equity (assets — liabilities) 


1. Includes Alaska and Hawaii. 
. Not available. 


anga 


|. Rough estimates of market value. 


1950 1955 1960 1964 
130 154 181 243 
(73) (80) (80) (97) 
(57) (75) (101) (146) 
15 34 66 109 
b 397 505 730 
50 (50) (46) (49) 
18 (20) (27) (29) 
13 (23) (31) (36) 
b (304) (401) (615) 
b (21) (22) (22) 
b (12) (23) (39) 
b (271) (357) (554) 
68 106 152 206 
(57) (77) (96) (118) 
(6) an ay) (25) 
(6) (18) (37) (63) 
40 57 71 84 
b 749 975 1,372 
38 78 130 187 
18 34 51 70 
3 5 5 8 
58 117 185 265 
b 632 790 1,107 


Beginning 1955, includes nonguaranteed federal agency issues. In 1950, such issues were included with corporate and other bonds. 


Source: U.S. Securities and Exchange Commission, Annual Report. 
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Reserve Board by the U.S. Bureau of the Census 
in the spring and summer of 1963 (Projector 
1964). This study sought to obtain from a national 
sample of roughly 3,600 families information on 
ownership of liquid assets, common and preferred 
stock, marketable bonds, life insurance, annuities 
and investment in retirement plans, the value of 
ownership of a business (both farm and nonfarm), 
ownership of homes and automobiles, and owner- 
ship of different forms of debt. Reinterviews with 
this sample were conducted in 1964. 

The survey-based estimates of consumer finan- 
cial wealth tend to complement the aggregative 
estimates, because only the survey data provide 
extensive information on the extent to which asset 
holdings are distributed among different types of 
consumer units, These and other data indicate that 
consumer wealth is highly concentrated [see Na- 
TIONAL WEALTH, article on DISTRIBUTION]. Some 
idea of the extent to which wealth is concentrated 
among consumer units is given by Table 2. 


Table 2 — Estimated distribution of U.S. families in 1963 
by net worth and the average net worth of 
the members of each group 


Net worth Per cent Average net worth 

(in dollars) of families" (in dollars) 
Negative 8 —538 
0-999 7 302 
1,000-4,999 17 2,809 
5,000-9,999 14 7,305 
10,000-24,999 24 16,281 
25,000—49,999 LAJ 35,309 
50,000-99,999 5 67,042 
100,000—199,999 1 129,958 
200,000-499,999 1 293,655 
500,000 and over b 1,176,281 


a. Per cents do not add to 100 because of rounding. 
b, Less than half of one per cent. 
Source: Projector 1964, pp. 289-293. 


As a rule, net worth, as well as total assets, tend 
to be higher for older families, for families with 
above-average incomes and savings rates, and for 
the self-employed. 

Perhaps equally striking is the extent to which 
average holdings of different types of assets vary 
by family characteristics, as is shown in Table 3. 
Lower-income families have most of their assets in 
tangible goods or liquid form (mostly checking 
and savings accounts); this is much more so than 
the table indicates because most widely owned 
durable goods are not included. At higher income 
levels, and also among older people, holdings tend 
to be concentrated increasingly in a business and 
in investment assets (primarily common and pre- 
ferred stock). Particularly noteworthy is the fact 


that among the wealthiest families in the United 
States in 1962—the fraction of 1 per cent having 
a net worth of $500,000 or more—over 90 per cent 
of these assets were in variable-dollar form (mostly 
a business, stocks, and marketable bonds), and only 
a very small fraction was in liquid assets or in 
other fixed-dollar investments. 

These data have to be treated as approximations 
only, because consumer financial survey data are 
subject to substantial biases. These biases may not 
only lead to substantial underestimates of the 
aggregative value of consumer financial holdings 
but may also distort the pattern of these holdings. 
In particular, tendencies exist for small holdings to 
be overstated, for large holdings to be understated, 
and for the degree of understatement to increase 
with the sensitivity of the respondent about the 
asset. 

Consumer durables. Much less is known about 
consumer holdings of durable, or tangible, goods 
than of intangibles because of the many problems 
involved in locating and valuing such goods. For 
the United States, the Goldsmith estimates indicate 
that roughly a fourth of consumer wealth is in land 
and housing, and about 10 per cent is in other 
consumer durables. 

Only for the two most expensive such goods— 
houses and automobiles—are many data available. 
The Federal Reserve survey data cited earlier indi- 
cate that at the beginning of 1963 roughly three- 
fifths of all U.S. families owned the house in which 
they lived. Summary data on the characteristics of 
these homes are shown in Table 4, relating to 
one-family homes with Federal Housing Authority 
insured mortgages. The table shows how the value 
of these homes rose between 1950 and 1964 and 
suggests that further increases are in store because 
of the higher costs and values in 1964 of new 
homes than of existing homes. The table also illus- 
trates the relative growth during this period of 
mortgages in relation to home values and of land 
in relation to building costs. 

Automobiles were owned by nearly four-fifths of 
all U.S. families in 1965, as shown in Table 5. 
Nearly a fourth of the families owned two or more 
automobiles. The table also brings out the fact that 
just under a third of all cars in 1965 were eight 
years old or more and slightly more than another 
third were less than four years old, and that most 
cars, especially new ones, have been purchased 
through the use of credit. 

Extensive data on the value and characteristics 
of other durable goods owned by U.S. consumers, 
as well as of houses and cars, were obtained as 
part of the 1950 and 1960-1961 Consumer Ex- 
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Table 4 — Median cost and value of new and existing one-family homes in the United States with 
Federal Housing Authority insured mortgages, selected years (in dollars) 


NEW HOMES 

1950 1960 1962 
Estimated value 8,286 14,607 15,151 
Mortgage amount 7,101 13,569 14,195 
Ratio: Mortgage to value 88.0 93.5 94.4 
Price of site * 2,404 2,649 
Ratio: Site to value * 16.1 17.0 


* Not available. 


penditures Studies of the Bureau of Labor Statistics. 
More recent estimates of the frequency of owner- 
ship of different appliances are shown for the U.S. 
in Table 6 for 1963. It is interesting to note that 
by that year only one appliance (electric refriger- 
ators) was owned more widely than television sets. 

Assets and consumer behavior. A number of 
studies have been made of the factors that influ- 
ence consumers to hold particular combinations of 
assets, some dealing with financial assets and 
others with durable goods. Still other studies have 
sought to ascertain the effect of asset holdings on 
consumer spending and saving. 

Influencing factors. Reasons for consumer hold- 
ing of financial assets have been discussed in the 


EXISTING HOMES 


1964 1950 1960 1962 1964 
16,063 8,865 13,043 14,082 14,614 
15,118 6801 11,978 13,100 13,725 

94.5 77.8 92.6 94.4 94,8 

2,990 ` 2,285 2,653 2,824 

17.9 . 16.8 18.1 18.7 


Source: U.S. Housing ond Home Finance Agoncy 1964. 


theoretical literature for some time (for example, 
Keynes 1936, chapter 16; Katona 1951, pp. 98- 
107). However, relatively few empirical studies 
have been made on this subject, to a large extent, 
no doubt, because of lack of data. The findings of 
the principal such studies may be summarized, 
briefly, as follows: 

(1) The composition of consumer portfolios 
is influenced by numerous social, economic, and 
environmental variables (Watts & Tobin 1960). 
In particular, holding other variables constant in 
each case: 

(a) Financial assets rise in importance and 
the value of durables declines as a percent of the 
total as household income rises. 


Table 5 — U.S. automobile ownership, age, and financing: selected years® 


Unit 1950 
Total number of families in U.S. millions 45.2 
Total owning automobiles per cent 59 
Owning 1 automobile per cent 52 
Owning 2 or more automobiles per cent 7 
Automobiles owned, by age: 
Less than 2 years old per cent 17 
2 and 3 years old per cent 19 
4, 5, 6, and 7 years old per cent 6 
8 years old and over per cent 58 
Total automobiles purchased millions b 
New car millions b 
Average price paid dollars b 
Used car millions b 
Average price paid dollars b 
Method of financing purchases:* 
All passenger cars: 
Full cash (including trade-in allowance) per cent 51 
Installment credit and other borrowing per cent 49 
New passenger cars: 
Full cash (including trade-in allowance) per cent 56 
Installment credit and other borrowing per cent 44 
Used passenger cars: 
Full cash (including trade-in allowance) per cent 47 
Installment credit and other borrowing per cent 53 


a. Excludes Alaska and Hawaii. 
b. Not available. 
c. Percentages may add to less than 100 per cent because of not ascertained cases. 


1955 1960 1961 1962 1963 1964 1965 
49.3 53.4 54.2 54.9 56.5 56.8 58.4 
70 77 76 74 80 78 79 
60 62 58 57 58 56 55 
10 15 18 17 22 22 24 
12 14 13 12 13 13 16 
22 20 17 21 18 20 21 
43 41 42 38 36 33 33 
23 25 28 29 33 34 30 
13.6 14.3 16.4 15.6 18.9 17.3 18.3 
4.4 5.2 5.4 46 59 6.0 7.2 
2730 3,140 3,010 2,830 2,990 3130 3140 
9.2 91 11.0 11.0 13.0 11.3 11.1 
780 980 800 800 840 920 920 
37 38 47 43 45 47 48 
61 61 52 56 55 53 52 
38 33 39 37 38 39 40 
6 66 61 62 6 60 60 
37 4) 52 44 48 51 53 
62 57 47 53 50 44 “M 


Source: Survey of Consumer Finances. 


Table 6 — Number and per cent of U.S. households with 
specified household appliances, 1963° 


HOUSEHOLDS® 
Number, in thousands Percent 
Major appliances: 


Air conditioner: 


Room unit 7,240 13.2 

Central 962 18 
Clothes dryer: 

Electric 8,210 14.9 

Gas 3,523 6.4 
Combination washer—dryer 1,119 2.0 
Dishwasher 3,920 7.1 
Freezer 11,197 20.3 
Garbage disposer 4,789 8.7 
Range, electric 19,047 34.6 
Range, gas 33,914 61.6 
Refrigerator, electric 53,063 96.4 
Refrigerator, gas 1,313 2.4 
Sewing machine, electric 25,408 46.2 
Television: 

Black and white 50,001 90.8 

Color 915 17 
Washing machine; 

Automatic 26,194 47.6 

Wringer 14,535 26.4 
Vacuum cleaner 39,803 72.3 

Small electrical appliances: 

Blanket 13,758 25.0 
Blender 4,484 8.2 
Broom, upright 1,045 1.9 
Can opener 5,973 10.9 
Coffeemaker, automatic 27,165 49.4 
Clock (excluding radio) 34,968 63.5 
Floor polisher 5,076 9.2 
Frying pan 21,673 39.4 
Hair dryer (hat box) 8,719 15.8 
Mixer 35,781 65.0 
Phonograph: 

Portable 11,576 21.0 

Other 15,524 28.2 
Radio (including clock) 26,665 48.4 
Radio, pocket/purse 15,089 27.4 
Radio, other portable 21,247 38.6 
Shaver 24,705 44.9 
Steam iron 35,754 65.0 
Toaster 42,356 76.9 


a, Excludes Alaska and Hawaii. 
b. Total number of households = 55,050,000. 
Source: Look Magazine 1963. 


(b) Financial assets rise in importance and 
debt declines with increased education of the 
household head. 

(c) Older households have fewer holdings in 
durables and less debt. 

(d) Larger households tend to have more of 
their holdings in durables and fewer in liquid 
assets. 

(e) Pronounced, though erratic, differences 
in asset holdings exist by occupation, by region, 
and by city size. 

(2) Holdings of variable-dollar assets are also 
influenced by numerous factors, principally age, 
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home ownership, total assets, and occupation 
(Claycamp 1963). 

(3) Holdings of liquid assets are influenced pri- 
marily by total wealth rather than by income 
(Lydall 1958; Claycamp 1963; Butters et al. 1953). 
Liquid assets as a proportion of total assets tend 
to decline as the amount of wealth rises. Liquid 
assets in relation to income tend to rise with age, 
to fall as the size of the consumer unit rises, and 
to be higher for those with permanently depressed 
incomes (Guthrie 1960). 

(4) Holdings of specific assets such as life in- 
surance and common stock are also influenced by 
a large number of different variables, though in- 
come and occupation emerge frequently as dom- 
inant factors. Of particular interest is the finding 
that price expectations and attitudinal variables 
are related to ownership of common stock (Kreinin 
1957; 1959; Kreinin et al. 1961). 

Factors influencing the ownership of durable 
goods have been studied largely in terms of the 
purchase of such goods, but also in terms of the 
acquisition patterns of such goods. Thus, attempts 
have been made to derive so-called saturation 
curves, showing how ownership of particular goods 
spreads through a population. Such information 
has been used as a basis for forecasting durable 
goods purchases as well as for general analytical 
purposes (Brown, Buck, & Pyatt 1965; Pyatt 1964). 
Various sociological studies have explored the role 
of the innovator in durable goods purchases and 
the manner in which ownership of a good spreads 
through a population. These studies bring out the 
importance of the group, and of group leaders, in 
influencing such purchases (Bourne 1957; Katz & 
Lazarsfeld 1955; Whyte 1954). These innovators 
appear to be scattered through the population but, 
judging by one such study, are generally found 
among younger, better-educated people as well as 
among those who already own similar products 
and are financially optimistic (Mueller 1958). 

Other studies have shown that many factors 
enter into the durable goods purchase decision, 
some economic, some sociological, and some psy- 
chological. The socioeconomic factors include in- 
come, income change, assets, age, family size, and 
occupation (Ferber 1955; Crockett & Friend 1960; 
Morgan 1958). The life cycle of the family has 
been found to be particularly relevant (Consumer 
Behavior 1955), with purchases increasing and 
accumulating as a family begins to raise children, 
and then declining as the children grow up and 
leave home. These studies also suggest that al- 
though many durable goods purchases are planned 
and carefully thought out in advance, many others 
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“loyalty” (Tate 1961), the opportunity generally to 
obtain better and fairer prices (Caplovitz 1963), 
and a lesser susceptibility to advertising (Smith 
1965). The same selection factors apply to style of 
life. Here, too, early exploration may produce 
bizarre or “silly” effects and combinations, but 
ultimately, with experience, comes more mature 
behavior. 

Other social scientists who would seriously study 
the consumer have been held back by a distaste for 
what seemed to be tendencies toward local con- 
formity in style, not realizing, for example, that 
the ability to flock together into homogeneous and 
“conforming” communities may itself be an expres- 
sion of economic freedom. Furthermore, as changes 
in style are required by further changes in the cycle 
of family development, these same consumers flock 
easily and freely into other but different types of 
again homogeneous communities. That certain 
types of life style are now made more visible by 
the greater freedom for similar people to congre- 
gate may suggest an appearance of strong group 
conformity that persists even in the face of the 
individual consumer’s easy, though less noticed, 
ability to shift from one style to another. 

It is not enough to say that people have more 
time and more choice. The striking thing is that 
they are tending not to buy more and more goods 
as a form of conspicuous consumption, but to buy 
time to experiment more with their choices. The 
most conspicuous form of time consumption is the 
vacation trip. At the same time employees seem 
eager to receive leisure hours as a substitute for 
wage dollars. They bid against their employers for 
their own time, and their time budgets are becom- 
ing more important and more worthy of study than 
their money budgets [see WacEs]. 

While these tendencies have appeared primarily 
in those areas of Western affluence which initially 
made the new consumer innovation possible, their 
study is also of importance to the less affluent lands 
of the East. That consumer freedom happened first 
in the “rich” West does not make it irrelevant to 
the East. Hopefully, the spirit of that freedom may 
be built into economic advances in the East and 
still newer and equally stimulating styles of life 
may emerge to compete creatively with those of the 
West. If anticipated, it may not be necessary to 
attain Western levels of affluence before the equiv- 
alent ideas take hold. For example, it may be pos- 
sible during the stage of first or early affluence to 
leapfrog, or at least to inhibit, the consumer tend- 
encies to spend more in terms of class and status 
considerations and to arrive more quickly at a point 


where spending reflects the greater freedom of 
individual style. In the West, it has happened with- 
out forethought or warning and even while some 
poverty remains. The release from serious eco- 
nomic anxiety has permitted many consumers, al- 
though not swamped with wealth, to look around 
for the first time with a sense of real choice about 
how they might spend their time, what kind of life 
they would make for themselves, and what kind of 
people they might be. 

Conceptions of the consumer. Professional or 
official definitions of the study of consumer behav- 
ior are somewhat less focused. For example, the 
division of consumer psychology of the American 
Psychological Association defines its purpose as the 
study of human behavior as it relates to the con- 
sumption of goods and the uses and acceptance of 
services. The definition is broad enough to serve as 
well for a sociological or anthropological associa- 
tion. This breadth may reflect the newness of the 
concern with professional study of the consumer. 
That is, behavioral scientists, especially psychol- 
ogists, have entered into the study of matters re- 
lated to commerce and industry only in this cen- 
tury, a century which has seen the economy of the 
United States move from an orientation based pri- 
marily on production to one that is based as well 
on sales and distribution, and finally, and still in 
the process, to one that is also concerned with 
consumer needs. 

In between the two world wars there developed 
what might be called the older orientation to con- 
sumer psychology, one that was based primarily 
on an identification with the businessman, the seller 
of goods and services. With this identification came 
early emphasis on sales research and studies of 
advertising effectiveness. To be sure, the latter co- 
incided well with the psychologist’s special ability 
to conduct experimental research on the learning 
of information, and, by so doing, an enormous 
research literature developed in advertising psychol- 
ogy (Lucas & Britt 1950). Nevertheless, the new 
definition of consumer psychology is based not so 
much on an identification with the manufacturer 
(as is common in industrial psychology), not so 
much on an identification with the salesman (as is 
common in advertising psychology), but more on 
an identification with the changing consumer 
himself. 

As the consumer begins to innovate in life style, 
a change is also taking place in how he is viewed 
by social scientists. The old way was to view him, 
even define him, predominantly through the eyes 
of economics. The term “consumer” was not then 


a social science term, but a focus for concern over 
efficiency and for a harmonious balance of forces in 
the free market place. In response to the new afflu- 
ence the old way of thinking was reflected in a 
swing from concern with alleged consumer waste- 
fulness to concern with advertising’s alleged over- 
persuasiveness. However, once Packard (1957) and 
similarly inclined critics had clearly and widely 
raised the alarm that consumers were being made 
to buy more than they actually needed, the older 
type of thinking had to be abandoned as a serious 
approach, that is, it had no further comments to 
make or questions to ask about what was happen- 
ing to consumers as people. 

While proponents of the older view may acknowl- 
edge that the purchase of household appliances 
saves the time and attention of family members for 
performance of higher level activities and that con- 
sumers spend more on such living aids as they 
become more affluent, it comes as something of a 
surprise for them to discover that the proportion 
of consumer incomes which is spent after taxes, 
at least in the United States and except for a post- 
World War buying spree, has remained relatively 
constant at 92 to 94 per cent (White 1961). Great 
changes in consumer behavior have occurred, there- 
fore, during a period when, strictly speaking, 
changes in economic behavior did not. The more 
serious concern, then, of the social sciences is to 
chart the social consequences of consumer access 
to these new living aids and resources. 

Theories of consumer behavior. Ordinarily, 
when an area of human behavior is marked off for 
study by social scientists, one may expect much 
employment of comparative or experimental anal- 
yses to uncover and identify regularities in behav- 
ior within the area. These then provide a basis for 
understanding, as well as for testing the under- 
standing. The tests usually are set up in the form 
of prediction studies. This testing is efficiently 
accelerated by a clear outline and definition of the 
area to be studied. One of the functions of theory 
is to pose a persistent set of questions and thereby 
to define what is considered important. However, 
there is not now a general theory of consumer be- 
havior, although Katona (1951) has developed a 
theory of the role of consumer expectations or con- 
fidence upon the levels of spending and saving in 
the economy, A general theory of consumer behav- 
ior has probably not existed since Veblen examined 
the psychological pressures of social class for their 
impact on the efficiency of human consumption of 
goods and resources. From Veblen to Packard, a 
half century saw no marked change in this eco- 
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nomic-psychological approach, except that Packard 
identifies the cause of consumer inefficiencies not 
so much in social class pressures per se but in their 
exploitation and manipulation by advertising and 
business interests. Both Veblen and Packard are 
essentially social critics using psychological insights 
to advance a theory which is basically economic, 
wherein utility and efficiency are defined as the 
important qualities. 

Any theory based on economics might have de- 
served to be considered quite seriously in a day 
when economic indicators were the best predictors 
of human behavior. Today, this is less so. Income 
is no longer the best predictor of consumer purchas- 
ing; for example, a time series correlation between 
the data of 17 University of Michigan consumer 
surveys from 1952 to 1960 and the level of national 
expenditures for durable goods showed that esti- 
mates based on both recent disposable income and 
attitudes approximate future durable goods expend- 
itures much more closely than the estimates based 
on disposable income alone, An analysis of vari- 
ance indicated that consumer demand was more 
a function of willingness to buy than of ability 
to buy. In effect, Katona (1960) has turned the 
tables on economics by showing how consumer 
attitudes of optimism or pessimism initiate changes 
in economic indicators. The causal flow has been 
somewhat reversed, therefore, first in observation 
and then in theoretical formulations. If this is so, 
if economic predictors are no longer all powerful, 
what new indicators should we be looking at more 
closely? 

Predictors of consumer behavior. Perhaps stages 
in the family life cycle are the best predictors of 
consumer behavior (Clark & Foote 1955-1961). 
Is the family a childless couple? Is there a young 
baby in the family? Have the children grown and 
left? [See LIFE CYCLE.] 

In addition there are the many household buying 
decisions made within one particular stage of the 
cycle. What predictors seem to work here for the 
shorter run? One set seems to involve the career 
roles brought into the marital relationship by the 
two partners. This is partially a matter of relative 
buying influence within the pair, as affected by 
such simple considerations as whether or not the 
wife earns income outside the home. More impor- 
tant however is the matter of what particular occu- 
pations and specific career commitments may be 
brought together within the pair. Every occupation, 
especially if it has the lifelong involvement of a 
career, sets forth certain style of life requirements 
based on where the work is performed, the stand- 
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ards for success, who are the co-workers (clients, 
colleagues, supervisors), etc. Next to family cycle 
it is probably industry itself which has the most 
pervasive, though less direct, influence on style of 
life. It is important to note, however, that in this 
sense occupation and profession are not economic 
but social variables, involving different types of 
networks of interpersonal relationships much more 
than different levels of income. 

Family and jobs make good starting places 
for understanding consumers as whole people. 
Strangely, industry does not yet seek this whole 
view but tends to spend vast sums on the study of 
particular consumer functions (for example, food 
preparation) and closely related products. Yet some 
sociologists have looked at the new consumer life 
styles, have asked what products consumers are 
buying, and have come up with a whole view of 
the consumer which has relevance to all industry. 
Riesman and Roseborough (1955), for example, 
have said that today’s young people learn to want 
a recognizable complex of goods and services which 
they call “the standard package.” This may include 
a college education, car, gadget-equipped home, and 
annual travel vacations as base elements. The pack- 
age may come in a series of standard through 
de luxe grades, and with the aid of credit progress 
through those grades can be made easy. Such a 
view does not, as some have inferred, suggest in- 
creased conformity, for obtaining the elements of 
the package is intimately related to the acquisition 
of mobility and freedom, that is, greater job choice, 
reduction of household chores, and increased facil- 
ities for and use of transportation. It would seem, 
however, that as something like a standard package 
becomes universally wanted, especially for the free- 
dom that is seen in it, then deprivation becomes 
less tolerable for those without jobs and/or educa- 
tion. On the level of individual protest it may lead 
to delinquency and crime, but on the level of mass 
protest it may lead to an insistence on educational 
and job rights for underprivileged or minority 
groups. Thus do matters of consumer style shade 
over into social and political issues. 

The matter of social issues raises questions of 
intensity and amount. For example, to what extent 
are consumers motivated by the standard package? 
And how de luxe or how large may this package 
become? At present we can only cite Chinoy’s 
(1955) study of the American automobile worker, 
which concludes that satisfactions are reported in 
terms of a small package of independence repre- 
sented in home and car ownership, with aspirations 
for further advancement focused on the children 


and their chances for social mobility through edu- 
cation. While this modest version of the dream may 
apply only to industrial workers, the emphasis is 
more on having some of the good things of life and 
the time to use them and less on the ideologically 
distant goals. While it is true that in the United 
States a blue-collar majority is giving way rapidly 
to a “middle-class” white-collar majority, a similar 
evaluation has not yet been made for the latter 
group. This latter group, of course, is exposed to a 
different side of American industry and is more 
likely to work in office buildings than in factories, 
The standards for home ownership and decor of 
this group may therefore be different and possibly 
higher than that of the blue-collar group. 

The question of what kind of houses people want 
and what they want in them is a good over-all way 
of assessing the consumer market. For example, 
the number of new homes for which construction 
has started each year is the best single indication 
of whether or not it will be a good year for business. 
The question of what kinds and numbers of homes 
to build, and where to build them, becomes a crit- 
ical determiner of both national economic well- 
being and of the consumer's physical environment, 
complete with built-in limitations and opportunities 
for style development. Thus, the home architects, 
the appliance designers, the federal housing laws, 
and the bank credit policies all have significant 
effects on the potentialities of this environment. 

It should also be noted that new houses reflect 
consumer demand and initiative, or put another 
way, consumer investment. As Katona (1960) em- 
phasizes, the growth and expansion of the economy 
are dependent on consumer investment as well as 
business investment, and these are equally impor- 
tant forces in the extent to which they can stim- 
ulate or retard the economy. 

Research approaches. With the consumer now 
seen as so “powerful,” it is to be expected that 
attempts to study him have become intensified. One 
reviewer (Halbert 1965) has identified six different 
general approaches: 

(1) General studies of consumer behavior: the 
role of consumers and consumption, consumer 
spending as investments, consumption in economic 
development. 

(2) Income studies: changes in income distribu- 
tion, credit, and expenditures by various classes. 
This category accounts for a large part of past 
studies. 

(3) Population studies: population trends and 
shifts, life-cycle influences, various demographic 
classifications, 


(4) Life-style factors: studies of factors influ- 
encing styles of living, such as purchases, status 
symbols, social class, images, needs, leisure, conven- 
ience. This area is receiving increasing emphasis. 

(5) Consumers as decision makers: studies of 
who makes what decisions, what decisions are 
planned, who is influential in various situations, 
the household as a decision unit. 

(6) Specific purchase and consumption studies: 
studies of purchases and determinants of purchases 
for specific products such as coffee, cleansers, 
gasoline. 

The first three of these approaches are depend- 
ent upon availability and analysis of mass data. 
While the federal and state governments have long 
made such data available, it is only recently that 
correspondingly massive tools of analysis have be- 
come available (for example, the computer). This 
means that research in these areas was until re- 
cently primarily reportorial. With so much detailed 
data to comprehend it was all that scholars could 
do to keep up with descriptive identifications of 
major historical trends. The computer brings a new 
type of inquiry to such data—an inquiry that seeks 
to build dynamic theory and is not daunted by com- 
plexities in the manipulation of data. One may ex- 
pect therefore to hear more in future years, from 
investigators with a primarily methodological orien- 
tation, about the science of retailing, of marketing, 
and of other fields. 

The fourth and fifth approaches seem to repre- 
sent more clearly the opening up of consumer study 
to sociologists and psychologists. Traditionally, the 
former have studied the ills of society, such as 
divorce and unemployment. Psychologists, too, have 
been involved, if not with society’s ills, then with 
the emotionally ill. However, as the current gen- 
eration of social scientists has put behind it de- 
pression and World War 11, and has also partici- 
pated in the new affluence, there has gradually 
developed a new curiosity about the nature of the 
normal personality, a shift away from concern with 
irrational processes to rational thought, and an 
interest in theories about how various everyday de- 
cisions are made. 

All of these areas of research involve a move 
away from descriptive or reportorial research to 
inquiry guided by the use of theory and undertaken 
for the purpose of further theory development. This 
means that the study of the consumer has now 
finally linked up with the main body of the be- 
havioral sciences. 

As for the consumer himself, we may expect 
further inroads on his ancient image of himself as 
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a person in want and as one who is preoccupied 
with the allocation of scarce means to satisfy basic 
needs. Foote (1963) proposes that as the propor- 
tion of household expenditures devoted to food con- 
tinues to fall, today’s somewhat obese consumer 
may well develop wants that do not require pro- 
duction or consumption, such as stimulating con- 
versation or the creation of music. The constraint 
on wants such as these will not be income but 
learning ability. In short, consumers may yet out- 
grow consumption. 

HERBERT E. KRUGMAN 


[See also ADVERTISING, article on ADVERTISING RE- 
SEARCH; MARKET RESEARCH, article on CONSUMER 
RESEARCH. Other relevant material may be found in 
CONFORMITY; IDENTITY, PSYCHOSOCIAL; LEISURE.] 
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CONSUMER'S SURPLUS 


Consumer's surplus denotes the difference be- 
tween the maximum amount of money a consumer 
would be willing to pay for a product or service 
and the amount he actually pays. The term was 
first introduced into economics by Alfred Marshall 
in his Principles of Economics, but the first person 
to enunciate the idea in a precise way appears to 
have been Jules Dupuit, a French engineer, in a 
paper published in 1844. Rudolf Auspitz and 
Richard Lieben (who knew of Dupuit’s work but 
not Marshall's) gave an account of consumer's 
surplus in 1889 in their book Untersuchungen über 
die Theorie des Preises (“Investigations on the 
Theory of Price”); but their work was neglected, 
and it was Dupuit, and above all Marshall, who 
influenced modern economists. 

The importance of consumer's surplus is that 
it provides a monetary measure of the benefit that 
a consumer derives from the supply of a product 
given the terms on which it is made available. It 
seems therefore to offer the possibility of assessing 
the net effect on welfare of policies that alter the 
terms on which different products are supplied. 
Economists have used the concept to argue that 
some systems of taxation are worse than others 
because they lead to a greater loss of consumer's 
surplus. It has also been proposed that in decreas- 
ing cost industries in which consumers’ expendi- 
tures for a product would not cover total costs— 
if the product were sold on the market at a uniform 
price or, in most modern formulations, at a price 
equal to marginal cost—the state should make 
possible production of the product by means of a 
subsidy when the gain in consumer’s surplus would 
justify this. In effect, what consumers would be 
willing to pay but do not (or, more exactly, the 
state’s estimate of this) should be treated as an 
auxiliary factor in addition to what they do pay, in 
determining production, because it indicates as 
much as what they do pay the worth of the product 
to them. 

Recent work, particularly that of J. R. Hicks, to 
be discussed below, not only has thrown much 
light on the meaning of the concept of consumer’s 
surplus but has also revealed more clearly than 


before the ambiguities and complexities involved 
in its definition and measurement. It cannot be 
said that a consensus has yet emerged in the eco- 
nomics profession concerning the validity and use- 
fulness of the concept of consumer’s surplus. 

Jules Dupuit. Dupuit was concerned with a 
double question: how to measure the utility derived 
from a public works (such as a bridge) in order to 
decide whether it ought to be constructed and how 
the charges should be set for the services it pro- 
vides. Dupuit explained his general approach in his 
1844 paper “On the Measurement of the Utility of 
Public Works.” He elaborated his views on pricing 
in a paper published in 1849, “On Tolls and Trans- 
port Charges.” 

Dupuit argued that the amount of money paid 
by consumers for a product was not an accurate 
measure of the utility derived from its consump- 
tion. It was what a person would be willing to 
sacrifice to obtain a product that indicated what 
the product was really worth to him. More con- 
cretely, it was the maximum amount of money 
that a consumer would be willing to pay that meas- 
ured its absolute utility or, as it has been commonly 
termed by later economists, its “total utility.” The 
difference between the maximum a consumer would 
pay and what he actually paid (what is now called 
consumer’s surplus), Dupuit termed “relative 
utility.” 

Dupuit pointed out that the absolute utility de- 
rived from any given quantity of a product would 
differ among consumers; and, consequently, if it 
were made available to all consumers at a uniform 
price the relative utility (consumer's surplus) for 
that given quantity would also differ, Furthermore, 
for any one consumer, the absolute utility derived 
from an additional quantity of the product, and 
consequently the maximum he would pay for it, 
would vary with the amount of the product that 
he was already consuming. Dupuit took as an ex- 
ample the demand for water, A fall in price would 
open up more uses in which the monetary value 
of the absolute utility of a unit of water exceeded 
the price, and the amount demanded would there- 
fore expand. The difference between the sum of 
the maximum amounts of money a consumer 
would pay for the various units used for these 
different purposes and the amount actually spent 
(the price multiplied by the total number of units 
consumed) is his consumer's surplus. 

On the basis of his analysis, Dupuit argued that 
in deciding whether to construct a public works 
it is the monetary value of the absolute utility that 
ought to be compared with cost. He advocated 
recouping this cost by a system of discriminating 


prices in which the price charged to different 
groups of consumers varied with the maximum 
that each group would pay (the greater the value 
of the absolute utility the greater the charge). It 
should be noted that Dupuit did not advocate a 
free service when marginal cost was zero. He left 
this question open. He pointed out that if he were 
to consider whether a toll should be established or 
not he “would have had to examine by what new 
tax or what increase in taxation tolls could be 
replaced and what would be the effects of these 
taxes” [see DUPUIT]. 

Alfred Marshall. Alfred Marshall's treatment 
of consumer's surplus is, in essentials, similar to 
that of Dupuit although it shows a greater aware- 
ness of the underlying difficulties. But Marshall's 
practice of indicating, but at the same time glossing 
over, these difficulties made his exposition less 
clear than that of Dupuit, and there has been much 
discussion about what Marshall meant by his vari- 
ous statements, questions that probably can never 
be resolved beyond all doubt. Marshall's analysis 
seems to have been developed without knowledge 
of Dupuit’s work. His views on consumer's surplus 
were first published in the privately printed Pure 
Theory of Domestic Values (1879) as well as in a 
more extended form in the first edition of his 
Principles of Economics (1890). 

Marshall pointed out that when a consumer pur- 
chases something he is normally better off, that is, 
he derives a “surplus of satisfaction.” The eco- 
nomic measure of this surplus satisfaction is con- 
sumer’s surplus, which Marshall defined as the 
“excess of the price which he would be willing to 
pay rather than go without the thing, over that 
which he actually does pay” ({1890] 1961, vol. 1, 
p. 124), Marshall explained his meaning by exam- 
ining the demand for tea, Suppose a consumer 
would just be induced to buy one pound per annum 
if the price were 20 shillings per pound but that 
if the price fell to 14 shillings per pound that he 
would just be induced to buy a second pound. Then, 
according to Marshall, the consumer would be will- 
ing to pay 34 shillings for two pounds per annum 
(20 shillings for the first pound plus 14 shillings 
for the second), But if the price were 14 shillings 
per pound, he would pay 28 shillings for the two 
pounds and thus would obtain a consumer's sur- 
plus of 6 shillings. Marshall put this argument in 
diagrammatic form, as had Dupuit before him (see 
Figure 1). If the price were OC, the quantity de- 
manded would be OH. The maximum amount of 
money that the consumer would be willing to pay 
for the quantity OH is represented by the area 
OHAD. At price OC, the amount of money that the 
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QUANTITY 
DEMANDED 


Figure 1 


consumer would actually pay is represented by the 
area OHAC. Consumer's surplus is represented by 
the triangular area CAD. Marshall confined his 
analysis to cases in which expenditure on the prod- 
uct represented a small part of the consumer's 
total expenditure. The purpose of this restriction 
appears to have been to ensure that the amount 
that a consumer would pay for additional units was 
not affected by the amount that he had paid for 
the units already demanded. Marshall's failure to 
deal explicitly with how he would handle the ques- 
tion if this restriction were removed undoubtedly 
contributed to a feeling that consumer's surplus 
was a concept of limited usefulness. 

In applying his concept to actual problems, Mar- 
shall himself was suitably, and characteristically, 
cautious. He warned against using the loss or gain 
of consumer's surplus in situations in which dif- 
ferent income groups were affected differently. He 
emphasized that consumer's surplus was a “rough 
economic measure” and suggested that the calcu- 
lation should be confined to relatively small changes 
around the customary price, particularly when 
dealing with necessaries. Marshall seems originally 
to have had high hopes of being able to make 
statistical estimates of consumer’s surplus, and in 
one case he actually made such an estimate: the 
loss of consumer's surplus due to the British Post 
Office’s not allowing cheap local postal rates. But 
gradually Marshall became aware of the problems 
posed by complementarity and substitution and he 
became disillusioned about the possibility of mak- 
ing such statistical estimates, although he never 
revised the rather optimistic remarks included in 
the Principles. 

Marshall illustrated the use of consumer's surplus 
by means of examples taken from the field of taxa- 
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tion. If, to refer back to the diagram, a tax of CT 
per unit is placed on the product and this results in 
a rise in price of the same amount (from OC to 
OT) and a reduction in the amount demanded 
(from OH to OM) the amount that the consumer 
would pay in taxes is the sum represented by the 
area CRBT. But the consumer is not only worse off 
because he has suffered a reduction in the amount 
available for spending represented by the area 
CRBT. The consumer would cease to demand the 
quantity MH for which he would have been willing 
to pay the sum of money represented by the area 
MHAB. He would gain the sum of money repre- 
sented by the area MHAR (the money he previous- 
ly spent to secure MH). The money gained would 
be less than the value to him of the product lost. 
Consequently, in addition to the loss suffered as a 
result of paying over a certain amount in taxes, 
there would be a loss of consumer's surplus repre- 
sented by the area RAB, Marshall suggested that 
a government, having to obtain a given sum by 
taxation, should choose objects of taxation with a 
view to minimizing the loss of consumer's surplus. 

Marshall’s most celebrated application of the 
concept of consumer's surplus was his suggestion 
that it might be advantageous to tax industries 
subject to diminishing returns (in which, owing to 
a fall in the supply price due to the decrease in 
output, the loss of consumer's surplus could well 
be less than the proceeds of the tax) and to sub- 
sidize industries subject to increasing returns (in 
which, owing to the reduction in supply price 
brought about by the increase in output, the gain 
in consumer's surplus could well exceed the 
amount of the bounty). Thus the combination of 
tax and bounty might, on balance, lead to a net 
gain for consumers. The proposal to make price 
equal to marginal cost in industries in which aver- 
age cost is greater than marginal cost—the amount 
by which consumer expenditures fall short of total 
costs being made up by a government subsidy 
(which has had the support of many modern 
economists )—may be regarded as a variant of the 
Marshallian suggestion. 

Although Marshall originated the term con- 
sumer’s surplus and although it did appear in the 
first edition of the Principles, the term that Mar- 
shall normally used until the fourth edition, in 
1898, was “consumer's rent.” In view of this, it is 
hardly surprising to find that Marshall also intro- 
duced into economics the analogous concept of 
“producers surplus,” the difference between the 
amount of money a producer receives for a prod- 
uct and the minimum for which he would be will- 


ing to supply it. It corresponds to one of the mean- 
ings of the term “rent” [see RENT]. Perhaps because 
the problems associated with producer's surplus 
are more commonly considered under the heading 
of rent, the separate discussion of producer’s sur- 
plus has been less extensive than that of con- 
sumer’s surplus. It has, however, been argued that 
in computing the gain from the supply of a prod- 
uct it is necessary to include not only the con- 
sumer's but also the producer's surplus since it is 
the minimum for which a producer would be will- 
ing to supply a product that represents the value 
of what is lost through the diversion of factors to 
produce it. However, a diversion of factors on any 
considerable scale is likely to lead to a loss of 
consumer's surplus elsewhere and this also needs 
to be taken into account. The proper although in 
practice extremely difficult procedure, if the addi- 
tional output is to be valued including consumer's 
surplus, would seem to be to compare this with 
the value of the output lost through the diversion 
of the factors, including in this case the loss of 
consumer's surplus. 

Recent developments. Marshall’s analysis gained 
many adherents; but some economists remained 
unconvinced. J. S. Nicholson expressed skepticism 
(1894) and was answered by F. Y. Edgeworth 
(1894); Edwin Cannan (1924) was also skeptical 
and was answered by D. H. Macgregor (1924) and 
A. L. Bowley (1924). Doubts persisted. In recent 
times J. R. Hicks has attempted to rehabilitate the 
concept of consumer's surplus. In Value and Capi- 
tal (1939), he suggested that the way to measure 
consumer's surplus was to discover the reduction 
in income that would just offset the gain brought 
about by a fall in price, thus leaving the consumer 
no better off than before. Such a definition was not 
subject to the Marshallian restriction that expendi- 
ture on this item be a small proportion of the total. 
The marginal utility of money need not be con- 
stant and it could therefore be applied to products 
that took an appreciable part of consumer expendi- 
tures. A. M. Henderson (1941) argued, however, 
that what was described by Hicks’s new definition 
was not the Marshallian consumer’s surplus, which 
could be regarded as the amount a consumer would 
pay for the opportunity of buying at the existing 
price the amount he was in fact buying (the alter- 
native being to give up consuming the product 
entirely). Hicks’s consumers surplus was the 
amount a consumer would pay for the opportunity 
of buying at the existing price whatever quantity 
he wished. Since Hicks allowed the consumer to do 
something that according to Henderson was ruled 


out by the Marshallian definition (it allowed him 
to adjust his consumption as the amount he had 
available for spending was reduced), it was to be 
expected that the monetary value of Hicks’s con- 
sumer’s surplus would, in general, be greater. 
Henderson also showed that the monetary value of 
the consumer's surplus derived from a given 
change in price would differ according to whether 
the higher or the lower price was taken as the start- 
ing point. Hicks accepted Henderson's analysis and 
elaborated on it in subsequent articles and in A 
Revision of Demand Theory (1956). Four meas- 
ures of consumer's surplus were derived, depend- 
ing on whether the consumer is assumed to pay to 
obtain or be paid to forgo the opportunity under 
consideration and on whether the opportunity is 
one in which the consumer is or is not able to 
adapt his consumption to take account of the pay- 
ment made or received. Hicks pointed out, how- 
ever, that in the cases in which Marshall made use 
of consumer’s surplus—those in which the mar- 
ginal utility of money could be assumed to remain 
constant—all four measures had the same value. 
While Hicks’s analysis clarified the problems 
that arise when an attempt is made to measure 
consumer's surplus, the formidable nature of these 
problems and particularly the existence of differ- 
ent (but equally valid) measures of consumer's 
surplus understandably have strengthened the 
tendency to regard consumer's surplus as a con- 
cept of limited usefulness. A, P. Lerner (1963) has 
argued that such an attitude is unwarranted, main- 
taining that there is no need for a consumer's sur- 
plus that assumes that the consumer is not allowed 
to adjust his demand to the price, a category that, 
according to Lerner, arose as a result of too literal 
a reading of Marshall. Only one consumer's sur- 
plus is needed: “the tax or the subsidy that would 
have the same effect on real income if it took the 
place of the price change considered.” He argued 
that consumer's surplus could help in the analysis 
of such problems as taxation, trade restrictions, 
and monopoly—the kinds of problems, in fact, for 
which Dupuit and Marshall devised the concept. 
There is no question that many economists have 
found the concept of consumer's surplus useful in 
their thinking on such questions. But until satisfac- 
tory methods have been devised to estimate in prac- 
tice the gains and losses of consumer's surplus 
resulting from any given change in policy, other 
economists are likely to continue to regard the con- 
cept of consumer's surplus as, in Little’s words 
(1950), “a theoretical toy.” 
R. H. CoasE 
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[See also the biography of AUSPITZ AND LIEBEN.] 
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CONSUMPTION FUNCTION 


Economists have long been interested in the 
factors determining how a society divides its in- 
come proportionally between consumption and 
saving. In the past thirty years theoretical and 
empirical investigation of these factors has been 
focused by the concept of the consumption func- 
tion—a list of the variables that influence consump- 
tion, together with the direction and magnitude of 
their effects. Income itself is, of course, high on 
any such list; and much of recent investigation has 
concerned the nature, reliability, and measurement 
of the dependence of consumption on income. 

Since saving is by definition the difference, posi- 
tive or negative, between income and consumption, 
whatever relationships are summarized in a con- 
sumption function can be summarized equally well 
in a saving function. It is a matter of indifference 
whether attention is directed to the determinants 
of consumption or to the determinants of saving; 
the results of one approach can always be translated 
into the other. For example, the proposition that 
U.S. households in the aggregate spend for con- 
sumption 95 per cent of their income after taxes 
can also be stated by saying that they save 5 per 
cent of the income at their disposition. In either 
form, incidentally, the statement is no better than 
a good first approximation. 

Why have economists been interested in the di- 
vision of social income between consumption and 
saving? There are two main reasons. 

The first principal reason is the importance of 
saving for accumulation of the wealth of nations 
and for growth in their capacity to produce goods 
and services. Broadly speaking, consumption uses 
productive resources in the. present, while saving 
enlarges the resources available for production and 
consumption in the future—by increasing stocks 
of finished goods or materials, productive plant and 
equipment, and net claims on foreign countries. 

In practice it is difficult to measure these con- 
cepts of social accounting—net national product 
or income, consumption, saving, and wealth—in 
ways that correspond to the fundamental distinc- 


tion between provision for the present and provision 
for the future. Very likely, saving, as measured in 
our national accounts, understates our society's 
provision for the future. One problem is to measure 
the value of the services rendered by existing stocks 
of producers’ and consumers’ durable goods, and 
to allow for their depreciation and obsolescence. 
This is difficult enough for business plant and 
equipment. Housing is the only consumers’ durable 
asset for which such accounts are estimated; for 
other consumers’ durable goods, consumption is 
simply equated to new purchases, And no attempt 
is made in many national accounts (including 
those of the United States) to estimate the ac- 
cumulation of real wealth by governments, even 
before allowance for depreciation. 

An even greater practical and conceptual prob- 
lem, which has attracted considerable research in- 
terest recently, is to identify and to measure the 
saving embodied in human beings in the form of 
increased education, new skills, and greater ca- 
pacities. The national accounts do not now count 
any educational outlays as saving or associate with 
them any increase in national wealth [see CAPITAL, 
HUMAN]. 

In spite of these and other difficulties, there can 
be no doubt of the importance for economic growth 
of the processes of individual and social decision 
that determine the share of saving, as convention- 
ally measured, in net national product. This has 
been underscored in recent years by the shortages 
of capital confronting the less developed countries. 

The second principal reason for concern with 
the consumption function is an outgrowth of the 
depression of the 1930s and of the revolutions in 
economic thought and policy to which it led. An 
economy will not produce at the rate that its man- 
power and capital resources permit unless total 
effective demand for goods and services suffices to 
purchase its capacity, or “full employment,” out- 
put. If private consumption demand falls short of 
capacity output, the difference must be made up 
by nonconsumption spending, that is, private in- 
vestment at home or abroad and government ex- 
penditure. If these sources of demand do not absorb 
the saving that the economy would perform at full 
employment, then output, employment, and the 
use of industrial capacity will all fall short of their 
full employment levels. The national propensity to 
save will not be realized in additions to national 
wealth but wasted in unemployment and idle ca- 
pacity. 

The observation that saving is not always an 
unmixed blessing has a long history, associated 
with “underconsumption” explanations of the busi- 
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ness cycle. But it received its most sophisticated, 
convincing, and influential expression in J. M. 
Keynes’s General Theory of Employment, Interest 
and Money (1936). Keynes emphasized that con- 
sumption is on the whole a predictable and reliable 
component of aggregate demand. He saw invest- 
ment spending, in contrast, as inherently volatile 
and unstable. And in the periodic failure to use 
the potential saving of the economy he found the 
main reason for the repeated lapses of capitalist 
economies into recession and depression, He feared, 
further, a chronic tendency in wealthy countries 
for full employment saving to outrun investment, 
leading in the absence of corrective policies to 
chronic unemployment. 

The same theoretical apparatus can be applied 
to the opposite problem, excess demand and in- 
flation, although this was not the economic disease 
originally at the center of Keynes’s attention. In- 
flation occurs when nonconsumption spending, by 
governments or private enterprises, more than fills 
the gap between full employment output and the 
consumption expenditure that it normally induces. 

In recent decades Keynes’s theory has been the 
major impetus to research on the consumption 
function. This theoretical impetus was reinforced 
by the simultaneous development of national ac- 
counts providing the statistical raw material for 
empirical research on the subject. Note that the 
Keynesian motivation for understanding the social 
choice between consumption and saving does not 
require, to the same degree as the growth motiva- 
tion first mentioned, close identification of con- 
sumption with the present and saving with the 
future. What is more important for Keynes’s pur- 
pose is identification of consumption with the pre- 
dictable, and investment with the volatile, elements 
of nation expenditure. 


Keynes's “propensity to consume” 


Keynes’s consumption function is a simple rela- 
tionship between national consumption—and ac- 
cordingly national saving—on the one hand, and 
national income on the other. He called this rela- 
tionship “the propensity to consume” and derived 
certain conclusions as to its form from what he 
asserted to be a “psychological law”—that the com- 
munity will divide an increase in income in some 
regular proportion between an increase in con- 
sumption and an increase in saving. That is, both 
the “marginal propensity to consume” (mpc) and 
the “marginal propensity to save” (mps) are be- 
tween zero and one. (By definition the two marginal 
propensities sum to one.) This is all his theory 
required, but Keynes went further and speculated 
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that the “average propensity to consume” (apc), 
the share of national income consumed, would be 
found to decline with increases in total income. 
This decline could reflect either or both of the fol- 
lowing: (a) the mpc falls with income; (b) a 
certain component of consumption expenditure is 
independent of income. This means that the apc 
will be lower for higher incomes even if the mpc is 
constant, as is illustrated in Figure 1. Evidently 
Keynes believed in both. 

Figure 1 displays a Keynesian consumption func- 
tion and in the lower panel the corresponding sav- 
ing function. Net national income, Y, is measured 
along the horizontal axes. The vertical axis repre- 
sents consumption, C, in the upper panel and saving, 
S, in the lower. The 45° line from the conventional 
origin in the upper panel is a reference line, show- 
ing how much consumption would be if it were 
always exactly equal to income. The other line, 
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Figure 1 — The Keynesian consumption function 


360 CONSUMPTION FUNCTION 


more gently sloped, is the consumption function. 
It shows that below a break-even level of national 
income, Y,,, the nation will wish to consume more 
than its income, drawing on past savings. Above 
Y,,, however, consumption falls short of income, by 
an increasing margin. The mpc is the slope of this 
line, m, positive but less than one. Given a one- 
dollar increase in income, the community will in- 
crease its consumption by $m and increase its sav- 
ing, or diminish its dissaving, by $(1 — m). In the 
diagram the mpc is constant; a diminishing mpc 
would be pictured by a consumption function 
curving away from the 45° reference line, and by 
an upward-curving saving function. But even along 
the linear consumption function in Figure 1, the 
ape declines with income, Below Y, the ratio of 
consumption to income, C/Y, exceeds one; at Y, it 
is exactly one; above Y, it is below one. 

The multiplier. The use that Keynes made of 
the consumption function in explaining unemploy- 
ment can also be indicated in Figure 1. Net na- 
tional product or income, Y, is three things at once: 
(a) The sum of the incomes earned for produc- 
tive services (wages and salaries, interest, rents, 
profits), plus taxes paid by businesses prior to the 
payment or calculation of these factor incomes. 
(National accounting practice distinguishes net 
national product and national income by deducting 
these “indirect business taxes” from net national 
product in order to arrive at national income. This 
means that net national product measures the 
market value of net output, at prices that include 
these taxes, whereas national income values pro- 
duction at prices corresponding to the income pay- 
ments to productive factors. This distinction is not 
made in this article; the two terms are used inter- 
changeably to refer to the market value of net 
output.) (b) The total market value of goods and 
services produced. (c) Total purchases of goods 
and services, the sum of consumption expenditure, 
private investment, and government purchases. To 
sustain any given level of production and income, 
say Y;, total spending must add up to Y;. Since 
consumption spending will amount only to Cy, 
nonconsumption spending must make up the differ- 
ence. If Y, represents the full employment potential 
of the economy, failure of nonconsumption spend- 
ing to reach I; would mean failure of production 
and income to attain Y;. The result would be un- 
employment and excess capacity. (Likewise, should 
nonconsumption spending exceed I, , total demand 
would tend to exceed Y,, causing inflation. ) 

Further, the consumption function enabled 
Keynes to say by how much national income would 
fall short for a given shortfall in nonconsumption 


expenditure. A difference of a dollar in noncon- 
sumption expenditure means a difference of more 
than a dollar in national income. The multiplier, 
which expresses this relationship, depends on the 
mpc. In Figure 1, for example, if nonconsumption 
spending is equal to zero instead of to I; , income 
will be at the break-even level, Y, , instead of at Y,. 
The difference in income, Y;—Y,, is 1/(1 — m) 
times the difference in nonconsumption spending 
I,. The multiplier is 1/(1 — m). Thus, if the na- 
tional mpc is 3, the multiplier is 4; if the mpc is 3, 
the multiplier is 3; and so on. 

The multiplier is a measure of the response of 
total production, income, and employment to 
changes in nonconsumption spending, whether 
these changes reflect natural volatility of private 
investment or conscious policy in public expendi- 
ture. It was in the latter context that the concept of 
the multiplier was first advanced, by Kahn (1931), 
as a means of estimating the response of employ- 
ment to public works expenditure during the de- 
pression. 

The dynamics of the multiplier can be exhibited 
by assuming, along with Kahn and many other 
writers, a lag in the adjustment of consumption to 
income. Suppose nonconsumption spending was 
initially zero and income and consumption were in 
equilibrium at C,=Y,. Now nonconsumption 
spending rises to I; and stays there. Initially, in- 
come and production rise by a like amount, that is, 
to Y, =C, + I;. But then consumption adjusts to 
Y, along the consumption function and rises to C,. 
This raises income to Y,= C, + Ip. The process 
converges to income Y;, where the consumption 
function indicates consumption of C; , leaving just 
enough room for the new level of nonconsumption 
spending I. 

For an example with numerical concreteness, 
take m= and suppose the sustained increase 
AI in nonconsumption spending to be 10. Then 
the process is as presented in Table 1, where AY, 
AI, and AC are the increases in these variables 
over their initial levels. 


Table 1 
“Round” Ar Ac=4y, Ay=AI+AC 

0 o (0 0 

1 10 0 10 

2 10 6 16 

3 10 9.6 19.6 
4 10 11.76 21.76 
5 10 13.06 23.06 
6 10 13.83 23.83 
7 10 14.30 24.30 
o 10 15 25 


The dynamic formulation serves to make two 
points. First, stability requires that the national 
mpc be smaller than one. Otherwise the multiplier 
process would never converge. An initial stimulus 
would keep spending and income rising indefinitely. 
(It is a mistake, therefore, to conclude from the 
formula, multiplier = 1/(1 — m), that the multi- 
plier is negative if m exceeds one. Rather the whole 
multiplier concept is inapplicable in that case.) 
Second, assuming the mpc to lie in the normal 
stable range, as Keynes assumed, a one-shot in- 
jection of investment or government expenditure 
cannot lift income more than temporarily. A sus- 
tained higher level of income requires a sustained 
higher level of nonconsumption spending. Multi- 
plier theory offers no comfort for “pump primers.” 
A temporary stimulus can work only if there are 
some unstable elements in national spending that 
can be set into sustained motion by an initial surge 
in economic activity. According to Keynes, con- 
sumption is not an unstable element of this kind. 

Price and population adjustments. The con- 
sumption-income relation is intended to link real 
per capita magnitudes, measured in constant 
prices. An increase in national income in Figure 1, 
for example, represents an increase in production 
and purchasing power rather than an increase in 
the prices at which production is valued. The com- 
munity cannot be expected to react to a doubling 
of money income in the same way when it reflects 
a doubling of prices and wages (with real output 
constant) as when it reflects a doubling of output 
(with prices constant). Creating a new franc by 
deleting two zeros from the old one should not 
change anyone’s basic behavior. Of course, actual 
price movements are not so uniform, universal, 
and neutral as de Gaulle’s decimal reform. Con- 
sequently both the level of prices and their rate of 
change may affect the relation between real con- 
sumption and real income; some of these possible 
effects will be mentioned below. 

For somewhat similar reasons, the variables in 
the consumption function must also be corrected 
for changes in population. The logic of the argu- 
ment that the average propensity to consume is 
lower when society is better off implies that an 
increase in real national income accompanied by 
an equal proportionate rise in population would 
leave the average propensity to consume un- 
changed. Consequently the variables in Figure 1 
should be regarded as per capita measures. (The 
theoretically appropriate correction for population 
change is somewhat more complicated. Presumably 
persons of different age, sex, and family status 
have different consumption requirements and 
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should receive different weights, as in the “equiva- 
lent adult” procedures used in studies of food con- 
sumption. ) 

However, this gross correction for population 
change is not the end of the story, any more than 
simple “deflation” for price changes disposes en- 
tirely of price effects. As noted below, both price 
and population changes play important roles in 
recent theories of the consumption function. 

The reliability of the multiplier. Although the 
propensity to consume and the multiplier have be- 
come standard items in the tool kits of economists, 
these concepts have not gone unchallenged. When 
quarterly income and consumption data became 
available in the United States after World War 1, 
they showed erratic fluctuations in the response of 
consumption to quarter-to-quarter changes in in- 
come. And at times, notably during the Korean 
War, there have been wide swings in the pro- 
pensity to consume. Some critics have questioned 
the view that consumption is a particularly stable 
or predictable element in national expenditure, 
that consumers respond passively to shocks origi- 
nating in business investment and government ex- 
penditure. In their view the multiplier is not a 
reliable tool of analysis and prediction, certainly 
not one that can be applied mechanically. It should 
be noted that the multiplier multiplies error as well 
as truth in translating fluctuations of nonconsump- 
tion expenditure into predicted changes in national 
product. 

In a series of statistical studies, Friedman and 
Meiselman (1963) have argued that multiplier 
analysis performs less well in explaining fluctua- 
tions of national product than an alternative model 
of equal simplicity relating national product to the 
quantity of money. In a different vein, Katona 
(1960) has contended that consumers in a modern 
rich society possess sufficient discretion and au- 
tonomy to originate, not just respond to, fluctua- 
tions in over-all economic activity, Consequently he 
has pioneered in conducting surveys of consumer 
intentions and attitudes, which can be used to 
improve the short-run forecasts of consumer be- 
havior made from conventional economic variables 
(see Katona & Mueller 1953; 1956). The record 
suggests that these surveys contain useful fore- 
casting information, at least for the short-run 
timing of the durable goods component of con- 
sumer expenditures. [See SURVEY ANALYSIS, article 
on APPLICATIONS IN ECONOMICS.] 

A doubt of somewhat different import concerns 
a possible ambiguity in the direction of causal in- 
fluence between income and consumption. Keynes 
and virtually all subsequent writers on the con- 
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sumption function take income to be the determin- 
ing variable and consumption to be the determined 
variable. The income of an individual or house- 
hold or corporation is assumed to be outside its 
own control, at least in the short run. It is a datum, 
determined by market forces, constraining con- 
sumption and saving decisions. But there are many 
opportunities in an advanced economy for house- 
holds to adjust their incomes, by working more or 
less, or by movement of wives and other secondary 
earners in or out of the labor force, A study by 
Rosett (1958), for example, shows the influence of 
the financial position of the household on the labor 
force participation of wives. It is possible that 
many households adjust their incomes to their 
consumption standards, rather than vice versa. An 
analogous opportunity is presumably available to 
some corporations; that is, if they are in need of 
profits to pay dividends or to make investments, 
they can take steps to increase them. 

National versus household propensities. Keynes 
was concerned with the propensity to consume of 
a whole society, the relation of its consumption to 
its net national product. The difference between 
these magnitudes takes various forms: household 
saving, the retained earnings of enterprises, the 
receipts of governments. Accordingly, the national 
consumption function reflects a variety of social 
institutions and patterns of behavior—a mixture 
of family, business, and political habits and de- 
cisions. If it is based on any psychological law, the 
law must be founded in social psychology rather 
than in the psychology of individual consumers 
and savers. 

Personal saving proper is defined as the differ- 
ence between private consumption and personal 
disposable income—the income that individuals 
and households have wholly free disposition of. 
This differs from net national product by the sum 
of (a) taxes and other government receipts, net of 
government “transfer payments,” which are in the 
nature of negative taxes (pensions, benefits, sub- 
sidies, etc., for which no current productive serv- 
ices are rendered), and (b) the net earnings, 
after taxes, retained by corporations. The national 
mpc depends not only on the response of con- 
sumers to an increase in disposable income, but 
also on the response of personal disposable income 
to an increase in net national product. The latter 
response, in turn, reflects (a) the sensitivity of 
government tax receipts (net of transfer pay- 
ments )—state and local as well as federal—to in- 
creases in national product and (b) the share of 
corporate profits after taxes in such increases in 
national product, together with the “propensity” of 


corporate directors to retain these earnings rather 
than to disburse them as dividends. In the analyt- 
ical framework of Keynes’s General Theory, a 
change in any of this complex of relationships 
would shift the consumption function, changing 
its level and perhaps its slope and shape. For ex- 
ample, tax rate reductions, increases in public 
welfare payments, increases in corporate divi- 
dends, reductions in the share of profits in national 
product would all raise the national propensity to 
consume. This could happen even though the 
thriftiness of households, as measured by their 
propensity to consume from disposable income, 
remained unchanged. 

The practical quantitative importance of this 
observation can be illustrated from recent experi- 
ence in the United States, In the short run at least, 
both the ape and mpc from disposable income 
appear to be between .92 and .96. But the ape with 
respect to net national product is about .70 and the 
mpc about .55, Thus the gap between disposable 
income and net national product accounts for the 
major part of the gap between consumption and 
net national product. 

Of course, not all the difference between net na- 
tional product and private consumption is saving 
in the sense of augmenting national wealth and 
future consumption possibilities. On the expendi- 
ture side, the counterpart of this gap includes gov- 
ernment purchases of goods and services as well 
as net private investment. Some government ex- 
penditure is investment in the future, and some is 
collective consumption. Unfortunately, our present 
government accounting techniques do not permit 
a useful quantitative classification. But, as ob- 
served above, this failure is not crucial for the pur- 
poses for which Keynes designed the consumption 
function. 

Subsequent theoretical and empirical investiga- 
tion has not followed Keynes in postulating a single 
stable relationship linking consumption to national 
product. Instead “the consumption function” has 
come to have a narrower meaning: the relationship 
of consumption to disposable income and other 
variables, The linkages between disposable income 
and national product have been set to one side for 
independent investigation. Despite the fact that 
they are of greater quantitative importance than 
household saving proper, less theoretical and sta- 
tistical effort has been expended on them (see, 
however, Ruggles & Ruggles 1949). 

Although this division of labor is probably an 
advance over Keynes’s global approach, the pro- 
pensity to save from disposable income is surely 
not entirely independent of corporate saving or of 


certain tax payments. Retained earnings do not 
enter disposable income, but they are reflected in 
the value of corporate stocks and thus in the 
wealth of households and individuals. Accumula- 
tion of wealth by this route may substitute for per- 
sonal saving; evidence that wealth affects the 
household propensity to save is noted below. Tax 
payments connected with social security programs 
are excluded from both disposable income and per- 
sonal saving, even though private withholdings 
from wages and salaries for similar purposes are 
counted as both income and saving. One may sus- 
pect that socialization of saving for retirement, 
medical emergencies, and unemployment would 
lower the apparent propensity to save from per- 
sonal disposable income. But it is hard to support 
this suspicion empirically. 

What follows will concern the consumption 
function in the narrower sense, the division of dis- 
posable income between consumption and personal 
saving. 


The postwar reappraisal 


The Keynesian consumption function, typified 
in Figure 1, fits admirably two kinds of empirical 
data. First, economy-wide time series of consump- 
tion and income for the period between the two 
world wars lie along a function of the Keynesian 
type; there are even observations of negative pri- 
vate saving in the depths of the great depression. 
Second, any cross-section survey of household 
budgets appears to confirm the “psychological law” 
at a microeconomic level. When households are 
classified into income brackets, and the average 
consumption for a bracket is plotted against in- 
come, the scatter of points traces out a path (Engel 
curve) like the consumption function of Figure 1. 
Dissaving in low brackets gives way to positive 
saving at higher levels. Indeed some survey evi- 
dence suggests that the mpc falls in high brackets. 

After World War i, however, several pieces of 
evidence combined to cast doubt on the Keynesian 
consumption function. The reappraisal that fol- 
lowed led to new theories and deeper empirical 
investigation of the determinants of household 
consumption. 

First, extrapolations of statistical consumption 
functions based on prewar United States data to 
potential postwar income levels greatly under- 
estimated the postwar propensity to consume. 
These extrapolations led some analysts to pessi- 
mistic views of postwar economic prospects in the 
United States, which were in the event quite un- 
justified. These extrapolations were based either 
on interwar time series or on Engel curves relating 
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household consumption and income in prewar 
budget studies of 1935-1936. 

The Keynesian propensity to consume is pri- 
marily a tool for the short run, for analysis of the 
determination of income and employment during 
a business cycle or a period of underemployment. 
The failure of the postwar forecasts did not impair 
its usefulness for this primary purpose. Moreover, 
Keynes certainly did not exclude shifts in the con- 
sumption function, and no doubt he himself would 
have understood the significance of the artificial 
shortages of consumers’ goods and the abnormal 
accumulations of liquid savings that a great war 
leaves in its aftermath. At the same time, many of 
his obiter dicta suggest that he expected rich capi- 
talist societies to face in the long run a chronic and 
increasing excess of potential saving at full em- 
ployment. Experience since the war, persisting be- 
yond its immediate legacy of backlogs and liquidity, 
compels a considerable modification of this ex- 
pectation. 

Second, just as the interwar Keynesian con- 
sumption function forecasts too much saving after 
World War n, it “backcasts” too little saving before 
World War 1. Indeed it would not indicate any 
positive saving until about 1908, when per capita 
income reached the break-even point observed dur- 
ing the depression. One does not require data to 
know that U.S. economic growth in the nineteenth 
century and the early twentieth century was not 
generated from dissaving. But a statistical study by 
Kuznets (1946) provided the data, showing that 
the share of capital formation in U.S. output, aver- 
aged for overlapping decades, has been roughly 
constant since the Civil War. Moreover, Brady and 
Friedman (1947) took the trouble to look at earlier 
household budget studies, going as far back as 
1901. They found that while any one survey indi- 
cates the same kind of Keynesian consumption— 
income relationship observed in the 1935-1936 
study, the relationship shifts upward in successive 
surveys, A family with, say, an income of $3,000, 
corrected for price changes, will in general be ob- 
served to save much less in 1918 than in 1901, 
still less in 1935, and, as it has later turned out, 
less in 1950 than in 1935 or 1941. This finding, of 
course, undermines the simple use of the Engel 
curve of a cross-section survey for prediction or 
aggregation. The hypothesis that Brady and Fried- 
man offered to explain the finding anticipates that 
of Duesenberry, which will be discussed below. 

Recognition of these facts led a number of in- 
vestigators to formulate and test hypotheses that 
would explain them—broadly speaking, hypotheses 
that would reconcile the short-run, or cyclical, suc- 
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cess of the Keynesian consumption function with 
its long-run, or secular, failure. Indeed in the early 
1940s Samuelson (1943) proposed a “ratchet” 
model: Consumption grows in the long run roughly 
in proportion to income; but during cyclical inter- 
ruptions of long-run growth, consumers defend 
living standards already attained, and consequently 
consumption follows a flatter (lower mpc) Keynes- 
ian path. In independent but similar contributions, 
Duesenberry (1948) and Modigliani (1949) for- 
malized the ratchet idea and tested it statistically, 
making the ape depend inversely on the ratio be- 
tween current income and previous peak income. 

These formulations do the trick in a statistical 
sense. They do it by eliminating absolute real in- 
come from the determination of the long-run aver- 
age propensity to consume, which becomes a con- 
stant. Why should this be so? 

Relative income. Duesenberry (1949) offered 
one explanation, his “relative income” hypothesis. 
Consumer utility depends, he reasoned, not on ab- 
solute amounts of consumption but on the relation 
of these amounts to the consumption of others with 
whom the consumer feels in social competition or 
under pressure to conform. This hypothesis has 
obvious support in many findings of modern soci- 
ology and psychology, and in some older ideas of 
Veblen. Duesenberry’s hypothesis rationalizes the 
Brady-Friedman findings, in much the same way 
they themselves suggest. The pattern of any one 
cross section reflects the fact that the consumption 
leaders, in the upper ranks of the income distribu- 
tion, can “afford” high propensities to save, while 
the followers in the lower ranks respond to social 
pressures by high propensities to consume. A gen- 
eral increase in absolute incomes, leaving the rela- 
tive income distribution unchanged, will leave 
unchanged both these social pressures and the re- 
sponses to them in terms of shares of income con- 
sumed. Most of the differences between successive 
budget surveys disappear when consumption ratics 
are plotted not against absolute incomes but 
against relative income positions, The broad con- 
clusion is that while substantial changes over time 
in the inequality of incomes might alter the aggre- 
gate consumption ratio—contrary to the usual 
view, equalization would tend to increase the sav- 
ing ratio—sheer growth in per capita income will 
not. 

Permanent and lifetime income. In independ- 
ently developed formulations, Friedman (1957) 
and Modigliani and Brumberg (1954) offered 
alternative explanations of the same phenomena. 
Students of household consumption behavior have 
long observed that incomes other than current in- 


comes affect current consumption, Consumers suf- 
fering income declines resist departures from their 
previous consumption standards; and those enjoy- 
ing income gains generally consume less than 
other households who long ago achieved the same 
income level. Similarly, the investigations of Ka- 
tona and his colleagues (Katona & Mueller 1953; 
1956) at the Survey Research Center at the Uni- 
versity of Michigan indicate that optimistic expec- 
tations of future incomes encourage current con- 
sumption, while pessimistic expectations have the 
reverse effect. 

According to Friedman’s “permanent income 
hypothesis,” the consumption of a household is pro- 
portional to its permanent income, that is, the 
average income it expects to earn over its planning 
horizon. Friedman is not definite about either the 
factor of proportionality—which might vary with 
the household's stage in the life cycle, its wealth, 
the interest rate, and other variables—or about the 
length of the planning horizon. On these matters 
the lifetime income hypothesis of Modigliani is 
much more explicit, as will be seen below. In any 
case, Friedman employs his hypothesis to explain 
both the evidence of cross-section budget surveys 
and the ratchet effect observed in aggregate time 
series. 

In budget surveys the low-income brackets are 
bound to be abnormally loaded with families tem- 
porarily below their permanent incomes, while the 
high brackets naturally have more than their share 
of families temporarily above their permanent 
status. This is the reason that the low brackets in 
any cross section exhibit a higher propensity to 
consume, relative to measured current incomes, 
than the high brackets—even though the propen- 
sity to consume from permanent incomes, unob- 
served, may be the same for all groups. When the 
households of a cross section are classified, not by 
current incomes but by other observed attributes— 
occupation, educational attainment, residence, age 
—better correlated with permanent income status, 
the scatter of points relating average consumption 
to average income gives a better approximation of 
the long-run consumption function. It starts from 
the origin—after all, groups with low permanent 
income status cannot do much dissaving—and 
shows a higher marginal propensity to consume 
than the usual Engel curve (see Watts 1958). 
Whether this mpc is a constant or diminishing 
function of income is, however, an open empirical 
question. 

As for the ratchet effect, many households are 
below their permanent incomes in recessions and 
depressions; therefore their consumption is high 


relative to their current incomes. Like the “previ- 
ous peak” hypotheses discussed above, this ex- 
planation is more satisfactory for recessions clearly 
believed to be temporary than for prolonged de- 
pression periods like the 1930s when the memory 
of the preceding peak grows faint. 

The Modigliani-Brumberg model is in the same 
spirit, but by being bolder in its assumptions it is 
more specific in its conclusions. In its most stark 
formulation, the planning horizon of the individ- 
ual consumer is his whole lifetime. And the factor 
of proportionality between consumption and per- 
manent income is simply one. Individuals are 
assumed to plan no net lifetime saving; they trans- 
fer to their heirs no more and no less than they 
inherited. Subject to this constraint, they try to 
spread their lifetime consumable resources evenly 
over their lives. In particular, they seek to accumu- 
late enough savings during their earning years to 
maintain the same consumption standard during 
their years of retirement. The division of life be- 
tween work and retirement is taken, somewhat im- 
plausibly, as an institutional fact, a constant in- 
dependent of national and individual incomes. The 
model has several interesting implications: 

(a) In a society with stationary population and 
income, aggregate net personal saving would be 
zero. The dissaving of the retired would exactly 
offset the saving of the workers, whose only pur- 
pose in saving is to provide for their own future 
retirement, 

(b) In a society with growing population or 
growing per capita income or both, aggregate net 
personal saving will be positive. Indeed, the higher 
these rates of growth, the higher will be the ratio 
of saving to aggregate income. For in a growing 
economy, the retired of the future always exceed— 
in number or in lifetime income, or in both—the 
retired of the present. Consequently the saving 
necessary to provide for future retirements always 
exceeds the dissaving of the currently retired. (See 
Modigliani & Ando 1963 for precise calculations. ) 

(c) Changes in this year’s income affect this 
year’s consumption only to the extent that current 
consumption benefits, equally with consumption 
in all future years, from a general recalculation of 
lifetime consumable resources. This effect will not 
be large unless current income changes are re- 
garded as permanent. This is the counterpart of 
Friedman’s more extreme contention that all tem- 
porary income gains will be saved and none con- 
sumed, and symmetrically that temporary income 
losses will be wholly offset by dissaving. 

This implication of the two models has attracted 
widespread attention and controversy. One reason 
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is that it casts doubt on the efficacy of temporary 
changes in tax rates or income transfers as meas- 
ures of economic stabilization. Some caution is 
required in drawing this inference from the perma- 
nent income or lifetime income hypotheses. Strictly 
speaking, these theories concern consumption 
rather than expenditures on consumers’ goods, for 
example, the use of an automobile rather than its 
purchase. It may be that temporary windfalls are 
used to purchase durables but do not appreciably 
increase their use; the purchaser uses the newer 
product with the same intensity. But from the 
standpoint of economic stabilization, the stimulus 
of such expenditure is equally good whether it is 
properly classified as saving or as consumption. 
The pure theories assume, moreover, that there are 
no obstacles in financial markets to borrowing 
against expected future incomes. In actual fact 
many households are at the limits of their lines of 
credit and are therefore prevented from adjusting 
current consumption fully to their permanent in- 
comes. Temporary windfalls may increase their 
consumption expenditures simply by adding to 
their liquid resources. 

Empirical measurement of the mpc from tem- 
porary income has not been conclusive. But Bodkin 
(1959) observed that the households of veterans 
who received unexpected national service life in- 
surance dividends during the 1950 budget survey 
spent significantly more than households similar 
to them in income and in other respects. Using 
different methods and data, Watts (1958) esti- 
mated the mpc from temporary income to be sig- 
nificantly positive, although perhaps only half of 
the mpc from permanent income. Watts’s results 
also indicated that household reaction to temporary 
income changes may not be symmetrical for posi- 
tive and negative changes. 

(d) An individual whose life proceeds accord- 
ing to plan will gradually build up his wealth, so 
that at any age his wealth plus his remaining ex- 
pected earnings during working years just suffice 
to maintain his consumption throughout his life- 
time, including his years of retirement. To every 
age there corresponds a normal ratio of wealth to 
income. But unexpected income changes, con- 
sumption emergencies, or capital gains and losses 
on past savings may cause an individual’s wealth 
to deviate from its normal relation to income. A 
high wealth—income ratio permits the individual to 
consume more, both now and in retirement, while 
a low wealth-income ratio requires him to con- 
sume a lower fraction of his current income in 
order to provide for undiminished consumption in 
retirement. Extended to the whole population, this 
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argument suggests that the aggregate saving ratio 
depends inversely on the wealth-income ratio. 
Along with the effects of growth rates mentioned 
in b, this is the major aggregate implication of the 
lifetime income hypothesis. 

Wealth and the propensity to consume. The 
importance of wealth in the consumption function 
had been urged by several earlier writers. They 
based their view on more general considerations 
than those involved in the Modigliani-Brumberg 
model. In particular, saving may be motivated by 
the desire to make bequests or other transfers of 
wealth to the next generation, as well as by the 
need to even out consumption over the savers’ life- 
times. The adequacy of wealth to meet a bequest 
target—which may itself be a function of lifetime 
income, of course—will then be one of the de- 
terminants of current saving and consumption. 

Pigou (1947) emphasized the wealth effect in 
the course of an abstract theoretical attack on the 
whole structure of Keynes’s General Theory. Keynes 
had contended that in certain circumstances no 
amount of wage and price deflation could restore 
aggregate real demand to full employment levels. 
Pigou pointed out that the real value of private 
wealth can be indefinitely increased by price de- 
flation. The reason is that deflation would increase 
the purchasing power of gold, government-issued 
currency, and interest-bearing government debt, all 
of which are fixed in money value. As the owners 
of these assets become saturated with real wealth, 
their propensity to consume is bound to increase. 
Via what has come to be known as the “Pigou 
effect,” the absolute price level, because of its 
effect on the real value of private wealth, has a 
bearing on the propensity to consume. 

Quite apart from the theoretical context of 
Pigou’s argument, a general wealth effect is an 
alternative explanation of some of the same phe- 
nomena explained by the other hypotheses under 
review. Over the long run disposable income and 
private wealth tend to grow in step, with wealth 
five to six times income. Recent empirical calcula- 
tions (Modigliani & Ando 1963) suggest a mar- 
ginal propensity to consume from income of .5 to 
-7 and from wealth of .05 to .07. When income and 
wealth grow in step, estimates in these ranges ex- 
plain how consumption can consistently take about 
90 per cent of increases in disposable income. But 
the apparent mpc from income can be quite differ- 

„ent when cyclical fluctuations break the normal 
long-run linkage of wealth and income. Assuming 
that the real value of wealth fluctuates less than 
real income, consumption will appear less sensitive 
to income in the short run than in the long run. 


At the level of individual households in budget 
surveys, the dissaving recorded for the lower 
brackets is of course supported by past savings. As 
pointed out by Tobin (1951), differences in the 
wealth available to households at a given real in- 
come can help to explain the differences observed 
in different surveys in their propensities to save or 
to dissave. This explanation is not inconsistent 
with those by which Duesenberry, Friedman, and 
Modigliani explain the same phenomenon. 

Sometimes the wealth effect on saving and con- 
sumption has been attributed to a particular kind 
of wealth, liquid asset holdings, rather than to total 
net worth. In principle net worth is the relevant 
variable. This is the constraint that the past and 
the market impose on consumers, a constraint that 
they can change only slowly and to which they 
must meanwhile adjust. In contrast, many house- 
holds can, within wide limits, decide for them- 
selves at each moment of time how much of their 
wealth they wish to hold in liquid form; and they 
can implement such decisions very quickly by 
purchases or sales of other assets and debts, But 
for many households below the top brackets, liquid 
assets (bank deposits, savings accounts, savings 
bonds) are virtually the only assets held; their total 
serves as a good proxy for total wealth. In other 
cases, the remaining constituents of net worth are 
not easily convertible into current purchasing pow- 
er—home real estate, consumers’ durable goods, 
pension rights. Consumption functions fitted to 
survey data indicate a positive effect of liquid asset 
holdings on consumption, especially for house- 
holds suffering income reverses (see Michigan, 
University of . . . 1954). 

Income and wealth distribution. Aggregate con- 
sumption can be affected not only by changes in 
aggregate income and wealth but also by changes 
in the distribution of a given aggregate income or 
wealth. However, these distributional effects are 
commonly exaggerated by observers who are struck 
by the wide discrepancies in average propensities 
to consume among different economic and social 
groups. Redistribution will affect total consumption 
only to the extent that the marginal propensities of 
the affected groups differ. And these appear to dif- 
fer much less than average propensities. Neverthe- 
less, the prevailing evidence is that certain redis- 
tributions of income and wealth would increase 
saving; for example, from low or middle to top in- 
come brackets, from employees to self-employed, 
from urban residents to farm operators. 

It is necessary to beware of another frequently 
heard argument, namely that reduction in taxes 
levied on individuals with high saving propensities, 


in particular those in the upper brackets of income 
and wealth, will by itself increase saving. This in- 
crease in disposable income will doubtless increase 
personal saving and probably by more than would 
equal tax reductions benefiting others. But it will 
increase consumption too, By raising disposable 
income relative to net national product, it raises 
the propensity to consume with respect to national 
product and leaves less room in the economy for 
nonconsumption expenditures. 

The hypothesis that “capitalists” save while 
“workers” spend has a long history, and it has re- 
cently been revived in connection with certain 
analyses of economic growth and development. 
Kaldor (1955-1956) has gone so far as to build a 
theory of the functional distribution of income on 
the differences between capitalists and workers in 
their marginal propensities to save. Houthakker, 
who has conducted some pioneering studies of 
intercountry differences in saving propensities, 
finds (1957) that the saving ratio appears to be 
higher in countries where the property share of 
income is higher. This statistical result can, how- 
ever, be alternatively rationalized. 

This hypothesis may be more relevant to under- 
developed economies with a well-defined division 
of the population between high-income property 
owners and low-income workers than to advanced 
economies like the United States, where ownership 
of property is widely diffused and some rentiers are 
poor while some “workers” are rich. But if “capital- 
ists” are identified with corporations and their 
saving with retained corporate earnings, the prop- 
osition has some applicability to the United States. 

The importance of the distribution of income has 
also been stressed in the analysis of “forced saving” 
during inflation. If wages lag behind prices and the 
marginal propensity to consume from profits is 
less than from wages, the process of inflation sup- 
presses some demand for consumption goods, 
which would occur if prices were stable. The same 
effect can occur without a change in income dis- 
tribution if consumers are slow to adjust their 
money expenditures to rising prices and money 
incomes. 

The rate of interest. Prior to Keynes the eco- 
nomic variable that economists usually stressed in 
analyzing the choice between consumption and 
saving was the interest rate. The classic exposition 
of the theoretical relation of saving decisions to the 
rate of interest is Fisher’s (1930). Conard (1959) 
provides a good review of doctrine on this subject. 
An increase in the rate of return on saving was 
expected to tip the balance of choice in favor of the 
future. But it was recognized that for many posi- 
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tive savers, an increase in interest rates means an 
over-all increase in consumable resources, from 
which current as well as future consumption might 
benefit. 

Whether because of this real ambiguity or be- 
cause of the narrow range of variation of observed 
interest rates, econometricians have not been able 
to detect significant interest rate effects on saving. 
Perhaps these effects would be more evident if it 
were possible to relate net saving not merely to 
rates paid on savings accounts and bonds, but to 
the effective rates at which consumers can borrow 
and the rates of return on business capital, corpo- 
rate equities, houses, other real estate, and con- 
sumer durables. These assets, after all, absorb the 
bulk of national saving; but their returns are dif- 
ficult to measure. The positive correlation inter- 
nationally between saving and the property share 
of income cited above could be interpreted to mean 
that saving is higher in those countries where its 
yield is greater. 

Other variables. Observed differences among 
households in consumption and saving behavior 
are, of course, attributable to a long list of differ- 
ences in their circumstances, habits, and prefer- 
ences. Some of these are, like income and wealth, 
variables whose influence is the major interest of 
economists. Others are, like demographic char- 
acteristics, variables that can differ widely among 
households even though their distribution over the 
population changes only very slowly. It is nonethe- 
less important to measure their effects, if only to 
disentangle them from the measurement of the 
influence of variables more important in economic 
fluctuations and economic policy. Considerable sta- 
tistical effort has been devoted to the “life cycle” 
variables—age, marital status, family size and 
composition—and other demographic characteris- 
tics, such as educational attainment, occupation, 
race, and geographical location (see, for example, 
Michigan, University of . . . 1954; Lydall 1955). 

A set of variables of a different nature are the 
“psychological” ones—attitudes, intentions, expec- 
tations, personality attributes—which Katona and 
his associates seek to measure. Unlike demographic 
variables, the distribution of some of these psycho- 
logical variables in the population may change 
radically in the short run, in ways that can be 
ascertained in our present state of knowledge only 
by new surveys. If household surveys are to con- 
tribute further to our understanding of the pro- 
pensity to consume and make possible more power- 
ful tests of competing theories, they will have to 
take a longer perspective. To measure the effects 
of past and expected levels of income and wealth 
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and of retirement and bequest objectives, it is 
necessary to observe not only the current status of 
households but their lifetime histories, plans, and 
aspirations. 

JAMES TOBIN 


[See also INCOME AND EMPLOYMENT THEORY.] 
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CONTAINMENT 


Containment has been the primary strategy of 
the United States in its conduct of foreign policy 
in the postwar era. 

The term defines the objective: to “contain” 
communism within the spheres of influence which 
it controlled effectively in 1947. It was then that 
American leaders first fully perceived the extent to 
which communist expansionism threatened the se- 
curity and freedom of the noncommunist world. 

In the opinion of some observers, the concept of 
containment has guided the foreign policies of the 
Soviet Union and the Western powers, although 
the ground rules have differed for each side. For 
the West, containment of communism signified a 
status quo policy which did not envisage any at- 
tempt to regain any territory absorbed by the Sino— 
Soviet bloc. The communist powers, however, while 
insisting on the inviolability of their domain behind 
the iron and bamboo curtains, considered the lands 
within the West’s defense perimeter fair game for 
aggression—if not by direct military means, then 
by indirect methods hardly less effective (Strausz- 
Hupé et al. 1959; Stillman & Pfaff 1961). 

Evolution. The United States policy of contain- 
ment evolved during that unhappy period, from 
about 1945 to 1947, when the United States was 
forced by ineluctable realities to reassess Soviet 
intentions. Repeated shocks were necessary in order 
to impress the hard fact of Soviet hostility upon the 
American consciousness, so long conditioned dur- 
ing the war years to regard the U.S.S.R. as a gal- 
lant ally, As the implications of Soviet dominion 
over eastern Europe began to dawn on Western 
leaders, their resistance to further Soviet encroach- 
ment began to stiffen. When Soviet troops failed to 
evacuate Iran by the established deadline in March 
1946, American and British pressure was brought 
to bear, forcing their withdrawal. That same month, 
Winston Churchill helped to arouse Americans to 
the growing communist threat with his famous 
“iron curtain” speech at Fulton, Missouri. 
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The communist-inspired civil war which erupted 
in Greece that fall hastened the formation in the 
United States of a wide consensus on policies de- 
signed to halt communist aggression. In March 
1947, Britain asked the United States government 
to take over the support she had been providing to 
Greece and Turkey. President Truman and his ad- 
visors responded with a proposal to Congress to 
provide substantial aid to these two countries. The 
policy outlined by the president, which became 
known as the Truman Doctrine, promised Ameri- 
can aid to any country whose freedom and inde- 
pendence were threatened by external aggression. 
The Truman Doctrine proved to be the inspiration 
for the American concept of containment. From it 
issued the Marshall Plan, NATO, Point Four, and 
other programs and alliances designed to build up 
the economic, political, and military strength of the 
“free world.” 

The containment policy, with its rationale and its 
objectives, was articulated most clearly by George 
F. Kennan (1947), a career diplomat, who organ- 
ized and served as the first head of the Policy Plan- 
ning Staff of the Department of State, created in 
1947. He specified that “the main element of any 
U.S. policy toward the Soviet Union must be that 
of a long-term, patient but firm and vigilant con- 
tainment of Russian expansive tendencies” (1951, 
p. 119). The communist threat, Kennan main- 
tained, “can be contained by the adroit and vigilant 
application of counter-force at a series of con- 
stantly shifting geographical and political points, 
corresponding to the shifts and maneuvers of 
Soviet policy . . .” (1951, p. 120). 

Kennan developed his thesis, in somewhat gen- 
eral terms, from an analysis of Soviet ideology and 
society. It was his contention that if the West could 
successfully block the communists’ “unceasing 
constant pressure” toward world dominion for a 
period of time—perhaps 10 to 15 years—the result- 
ing frustration would so heighten the internal 
strains of Soviet society that the Kremlin regime 
would collapse or, at the least, seek a realistic 
accommodation with the West. 

The policy of containment has been executed by 
a variety of means. One of the chief architects of 
the policy, Dean G. Acheson, who served as under- 
secretary of state during the crucial spring of 1947 
and later succeeded George C. Marshall as secre- 
tary of state, saw the problem as primarily one of 
converting situations of weakness in the free world 
jnto “situations of strength.” This meant, first of all, 
the building of military strength but, in addition, 
“the buttressing of all the other forms of power— 
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economic, political, social, and moral—and the 
utmost resolution and unity among the free nations 
of the world” (Acheson 1950, p. 965). Embracing 
a policy of containment, the United States clearly 
repudiated such alternative policies as isolationism, 
appeasement, and preventive war. 

The principal tools which the United States has 
employed to carry out its policy have been economic 
assistance and a system of military alliances. Per- 
haps the most important objective of the United 
States’ alliance diplomacy has been to secure over- 
seas bases along the perimeter of Sino-Soviet bloc 
territory in return for military assistance and 
mutual security pacts. 

Critique. The critics of the containment policy 
have found it wanting on several accounts. Per- 
haps the least serious of the charges leveled against 
it has been the argument that American resources 
are not equal to the task of policing the entire 
frontier of the communist empire. Defenders of 
containment counter with the assertion that in time 
our European allies, whom we assisted in regaining 
economic strength, will help shoulder a larger 
portion of the burden. 

More telling has been the contention that the 
supporters of containment as the most rational and 
only possible policy of the United States toward the 
Soviet Union have failed to take into account the 
transformation in Soviet society which has occurred 
since Kennan published his celebrated analysis in 
1947. Soviet economy and technology have out- 
paced rates of growth predicted 15 years ago. Fur- 
thermore, the Soviet Union has thus far overcome 
the difficulty of the transfer of supreme power, 
a hurdle which many Western Kremlinologists 
thought Soviet leadership could not surmount. 

A corollary to the criticism that containment has 
been too inflexible is the charge that the policy has 
been, from the beginning, wholly unsuitable for 
application in Asia, Those who hold this view are 
willing to concede that the strategy of containment 
has been reasonably successful in Europe, where 
the crisis was of long standing, where the remedies 
were fairly obvious, and where a shared cultural 
heritage facilitated political and military coopera- 
tion. As these critics see it, the mistake was in 
attempting to extend the European formula to 
other areas of the world where similar conditions 
did not exist. 

The sharpest critics of containment have been 
those who felt that the policy was too passive, too 
Static, too apt to concede victory to the enemy by 
default. This school of thought contends that the 
free world, simply by maintaining the status quo, 


leaves the communist camp free to consolidate its 
gains. Thus, the hope for a break-up or a mellow- 
ing of the Soviet regime in some far-distant future 
is considered both erroneous and dangerous. Ac- 
cording to this view, by accepting the communist 
ground rules, which forbid waging the cold war on 
any territory already under Sino-Soviet control, the 
West commits a folly that is all the more unpar- 
donable because it is, besides being dangerous, 
exceedingly costly. 

Despite such adverse criticisms, nearly all Ameri- 
cans conversant with international affairs are will- 
ing to concede that containment has achieved some 
success in imposing on the Soviet Union, in Ken- 
nan’s words, “a far greater degree of moderation 
and circumspection.” In fact, even the severest 
critics of the policy of containment agree tnat it 
provides the basis upon which must rest any future 
foreign policy adopted by the United States, To be 
sure, the United States must “contain” before it 
can assume the initiative, Generally, those who be- 
lieve that containment can be no more than a stop- 
gap argue that it should be superseded by a more 
positive policy. Such a policy should place greater 
emphasis on political realities, especially the need 
for the political reformation of the western alliance; 
persist in the maintenance of military defense; and, 
in addition, concentrate greater efforts on the non- 
military aspects of the conflict; i.e., means must 
be found to carry the contest beyond the bounds of 
the free world and to exploit the divergencies and 
vulnerabilities of the communist camp itself. 

History has yet to render a final verdict on the 
success or failure of containment. Given the exist- 
ing political and intellectual climate of the United 
States in 1946-1947, it seems reasonable to assert 
that containment was probably the one and only 
effective foreign policy that did not exceed the 
physical and psychological capabilities of the 
United States at that time. How the American 
people can put to best use the experience derived 
from containment in order to develop a foreign 
policy adequate for the world of today—this is a 
matter which still demands their and their leaders’ 
most thoughtful attention. 

ROBERT STRAUSZ-HUPÉ AND 
Davip EDWIN LEBBY 


[See also DisENGAGEMENT; FOREIGN POLICY; STRAT- 
EGY; and the guide under INTERNATIONAL RELA- 
TIONS.] 
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CONTENT ANALYSIS 


Content analysis is used in the social sciences as 
one means of studying communication—its nature, 
its underlying meanings, its dynamic processes, 
and the people who are engaged in talking, writing, 
or conveying meaning to one another. Although not 
a research method sui generis, content analysis is 
roughly distinguishable from other methods by two 
characteristics. First, its data—in contrast to ethno- 
graphic reports, for example, or census enumera- 
tions—are the verbal or other symbols which make 
up the content of communications (letters, books, 
sermons, conversations, television programs, thera- 
peutic sessions, paintings, and the like). Second, 
its procedures differ in emphasis from those of the 
historian or literary critic: they aim to be exact 
and repeatable, to minimize any vagueness or bias 
resulting from the judgments of a single investi- 
gator. Thus, each content analysis employs an ex- 
plicit, organized plan for assembling the data, 
classifying or quantifying them to measure the con- 
cepts under study, examining their patterns and 
interrelationships, and interpreting the findings. 

Within these broad limits the techniques of con- 
tent analysis are diverse, and the objectives range 
from mapping propaganda campaigns, for exam- 
ple, to explaining international conflict and inte- 
gration; from abstracting the ideas and beliefs 
expressed in folklore or movies of a given period to 
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tracing the epochal alternations in societal values 
over many centuries; from charting the interaction 
between patient and therapist to assessing the psy- 
chological states of great men in the past. 

No general theory of communication is yet in 
common use among the several social sciences to 
guide these varied analyses. Implicit in each inves- 
tigation is a special conceptual model, or set of 
ideas and assumptions, about the nature of the 
particular communication process under study. To 
test this conceptual model or to add new ideas to it, 
the researcher uses the concrete data of communi- 
cation. In the empirical phase of the research he 
is led by his model to select particular communica- 
tions and to search for order among them by adapt- 
ing certain conventional procedures of sampling, 
measurement, and analysis. In the interpretative 
phase, in comparing his findings with his initial 
conceptions, so as to understand their broader sig- 
nificance, he encounters certain special problems 
and possibilities. 


Historical background 


The use of content analysis in the social sciences 
today—its methods and its problems of interpreta- 
tion—has been affected both by related develop- 
ments in other fields and by historical demands for 
certain practical applications. Early in the twen- 
tieth century students of journalism began to count 
the newspaper linage devoted to foreign affairs or 
to sports, comparing one newspaper with another, 
for example, and later comparing newspaper con- 
tent with the content of other media. In literary 
criticism such devices as type of rhyme or ratio of 
adjectives to verbs were tabulated, as a means of 
differentiating the styles of writers or of literary 
periods or to settle disputes about authorship or the 
chronology of an author’s work. Meanwhile, educa- 
tors were constructing formulas for the readability 
of printed materials, utilizing proportions of easy 
and hard words, length of sentences, and the like. 

During the 1930s certain applications of such 
techniques began to be made in the social sciences. 
Sorokin’s monumental study of social and cultural 
changes in western Europe over the entire course 
of history rests in part upon an analysis of works 
of art, music, literature, and philosophy according 
to their central meanings (1937-1941). Lasswell 
developed a scheme for categorizing the content of 
patients’ responses in psychiatric interviews as 
pro-self, anti-self, pro-other, or anti-other, and for 
counting the frequency with which such categories 
occurred (1938). Lasswell also, with a number of 
associates, pioneered the application of content 


872 CONTENT ANALYSIS 


analysis to the study of public opinion and propa- 
ganda, an effort immensely stimulated by the de- 
mands of the United States government during 
World War u. Mass communication was concep- 
tualized, within a political framework, as “who says 
what to whom, how, with what effect,” and large- 
scale analyses were made, for example, of the fre- 
quency with which key symbols (democracy, com- 
munism, England, Hitler) were given indulgent, 
deprecatory, or neutral presentation (e.g., Lasswell & 
Leites 1949). These wartime efforts encouraged 
content analyses in other areas—focused on the 
intentions of particular communicators, the kinds 
of material brought to the attention of particular 
audiences, or the cultural values underlying the 
communicator’s assessment of what the audience 
wants. 

When Berelson (1952) made his critical survey 
of the applications of content analysis methods, he 
found several books and articles reporting the use 
of various techniques, e.g., techniques of sampling 
the content of newspapers by successively selecting 
specific newspapers, issues, and relevant content 
within each issue; techniques of categorizing and 
counting key words, themes, or whole documents; 
techniques of increasing the reliability of classify- 
ing and counting. To guide the application of such 
techniques, however, Berelson found only one con- 
ceptual model in widespread use: the Lasswellian 
model of the purposive one-way communication 
intended to influence a mass audience on contro- 
versial public issues (Berelson 1952, p. 57 and 
passim). Thus, broader scientific utilization of the 
available techniques waited upon a growing inter- 
disciplinary understanding of the many-faceted 
communication process and upon the closer fitting 
of techniques to theory. 


Examples of uses of content analysis 


Important developments in the application of 
content analysis to social science models may be 
illustrated by a few examples from the profuse 
literature of the 1950s and 1960s (see also Work 
Conference on Content Analysis 1959). 

Interaction process. Bales and his associates 
have developed one of several procedures for ana- 
lyzing the content of communication observed in 
small groups (e.g., Bales 1952), Observers sitting 
behind a one-way screen categorize each of the 
remarks and gestures (acts) directed by each group 
member to other members as the group attempts 
to solve an assigned problem. Bales’s standard set 
of 12 categories (shows solidarity, shows tension 
release, agrees, gives suggestions, etc. ) indexes cer- 


tain sociological properties of the interaction of a 
group: positive or negative direction, instrumental 
or expressive character, and the focus on such sys- 
tem problems as control, tension management, or 
integration. Thus categorized, data from many 
groups are used (with the aid of statistical devices 
and mathematical models) to describe the group 
process—the patterning of content, phasing over 
time, group structure. From these descriptions in- 
ferences are drawn about the underlying nature of 
this process. For example, the findings may show 
that typically a group leader emerges who both 
initiates and receives more communications than 
any other member; or that the process of problem 
solving goes through phases emphasizing, first, 
orientation; then, evaluation; and, finally, control 
[see INTERACTION, article on INTERACTION PROCESS 
ANALYSIS], 

Studies of therapy. Such analyses of the con- 
tent of interaction, made at the time of observation 
or, later, through recordings or transcripts, are also 
applied to the therapeutic process in social work, 
counseling, and psychiatry (e.g., the review by 
Auld & Murray 1955). Category systems based on 
psychological theories of behavior combined with 
principles of client-centered therapy are used to 
trace the interview process, the client—therapist 
relationship, changes in predominant content over 
time, or differences between types of treatments. 
Standard measures employed in content analysis of 
psychotherapy include Bales’s categories (1952) 
and the Discomfort-Relief Quotient (D.R.Q.), de- 
veloped by John Dollard and O. H. Mowrer (see, 
e.g., Auld & Murray 1955, pp. 379-380) to show 
the ratio between the client’s discomfort responses 
(reflecting tension, unhappiness, pain) and his 
relief responses (reflecting satisfaction, comfort, 
enjoyment). In Japan (Shiso... 1959) content 
analysis has been applied to the exchanges of letters 
published in life-counseling columns of newspapers 
and magazines. [See MENTAL DISORDERS, TREAT- 
MENT OF, article on CLIENT-CENTERED COUNSEL- 
ING.] 

Psychological state of the communicator. Anal- 
ysis of an individual’s communications as an index 
of his underlying motives is exemplified in a study 
of suicide letters by Osgood and Walker (1959). 
Within the framework of stimulus—response theory 
these researchers formulated a number of predic- 
tions about the structure and content of suicide 
letters in contrast to ordinary letters and to simu- 
lated suicide notes: the letters of bona fide suicides 
are characterized, for example, by greater stereo- 
typy; more evidences of conflict; more construc- 


tions of the demand, command, and request type 
that express needs of the speaker and require some 
reaction from another person to satisfy these needs. 

To test these predictions, the researchers use 16 
different measures—some already standard and 
some specially designed—for the analysis and com- 
parison of the letters. As measures of the stereo- 
typy of each letter, for instance, they divide the 
number of different words by the total words, count 
repetitions of phrases, or take the ratio of nouns 
and verbs to the number of adjectives and adverbs. 
To measure conflict, they determine the degree to 
which assertions are qualified, the number of syn- 
tactical constructions expressing ambivalence (such 
as “but,” “if,” “however”), or the extent to which 
both positive and negative assertions are combined 
in the same letter. They also employ Osgood’s eval- 
uative assertion analysis, a standard procedure that 
isolates from their context all terms by which the 
communicator evaluates an object, rates these 
terms as favorable or unfavorable, and then com- 
bines these ratings to index the communicator’s 
over-all attitude toward the object (see, e.g., Work 
Conference on Content Analysis 1959, pp. 41-54). 
Comparisons of such measures for suicide letters 
and ordinary letters support the researchers’ hy- 
potheses in most instances, although the results for 
suicide versus simulated suicide notes are less clear. 

Historical personalities. In similar fashion, con- 
tent analysis seems potentially useful to the his- 
torian or biographer who is seeking to understand 
the personalities of great men through their writ- 
ings and speeches. For solving questions about 
authorship of documents, rigorous procedures have 
been developed (as in the study of the disputed 
Federalist papers: Mosteller & Wallace 1964). For 
examining motives, attitudes, and psychological 
states of historical persons, however, content anal- 
ysis of their communications has been less widely 
used, despite suggestive studies of Goebbel’s diary 
or the autobiography of Richard Wright (see Gar- 
raty in Work Conference on Content Analysis 
1959, pp. 171-187). 

Culture and society. In their study of popular 
religion, Schneider and Dornbusch (1958) illus- 
trate the use of content analysis to reflect, not the 
psychological states of single persons, but the val- 
ues of an entire society. These researchers selected 
46 representative works of American inspirational 
literature, published over an 80-year period, choos- 
ing best sellers to assure that the books were read. 
They classified each, paragraph by paragraph, 
according to its main themes (“religion brings 
physical health,” “happiness can be expected by 
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most men”) and then ascertained what proportion 
of the total paragraphs is devoted to each theme. In 
their report they described the changes in these 
themes within the wider context of historical trends 
in American religion and culture and used a socio- 
logical model to interpret the functions of popular 
religion in society. 

A considerable tradition of such analyses rests 
on the assumption that cultural values which have 
been institutionalized in certain segments of the 
society are represented in the communications of 
individuals from these segments. Some analyses 
stress the social determinants of the ideas or values 
expressed in folklore and sermons, for example; 
some emphasize the cultural determinants of such 
expressions. Thus, public communications of Amer- 
ican business leaders are taken to reflect their 
business creed; Brazil’s riddles and myths, to reflect 
the didactic aspects of its religion; Japan’s popular 
songs, to reflect the loneliness and helplessness of 
its postwar era (Shiso . . . 1959, p. 122). 


Empirical methods 


Content analyses are conducted by selecting and 
adapting certain empirical procedures used in social 
science generally: typically, the methods of using 
available data (although new materials for special 
objectives may occasionally be acquired by ques- 
tioning or observing) and the methods of meas- 
urement combined with sampling and statistical 
analysis. 

Use of available data. Most commonly, the con- 
tent analyst chooses from the vast store of com- 
munications already available in libraries, clinics, 
archives, records, and family attics. Thus, he must 
know how to utilize the benefits of available data, 
while avoiding their pitfalls. 

Advantages, Several advantages accrue to the 
student of communication who decides to use ma- 
terials that already exist rather than to elicit new 
ones. (1) Time, labor, and expense can often be 
saved when the researcher can go directly to the 
heart of his analysis, bypassing preliminary field 
work, experimentation, or commissioning of docu- 
ments. (2) When massive data are required, be- 
yond the scope of a single new study, existing 
content materials frequently afford wide ranges of 
potentially relevant variables and of refinement in 
the measurement of each variable. (3) Most impor- 
tant, the available data afford the only means of 
studying certain kinds of communication problems. 
Past events cannot be observed directly by the re- 
searcher, nor can events beyond the recollection of 
respondents living today be reached through ques- 


374 CONTENT ANALYSIS 


tioning. Thus, the analysis of historical situations 
or of long-term trends—the important study of 
social change—depends upon the prior existence of 
relevant materials, Similarly, study of cross-cultural 
communications from remote places (e.g.,of world- 
wide tastes in movies or folklore or of similarities 
and differences in attention to major political sym- 
bols in different countries) may require materials 
that cannot be elicited by the researcher directly. 
Communication contents in technical fields that are 
beyond the competence of the researcher may have 
been originally assembled in usable form by an 
expert such as a psychiatrist, a socia] worker, or an 
ethnographer, Sometimes, as in letters or diaries, 
existing materials may provide deep insights into 
intimate feelings or personal relationships; and 
sometimes, as in Sorokin’s analysis, they may widen 
the investigator's focus to include macroscopic 
social or cultural systems. 

Pitfalls. Against such impressive assets must 
be set certain basic problems to be overcome in the 
utilization of data not originally assembled for the 
present purposes. (1) The materials are often in- 
complete. The content analyst must attempt to 
discover any absences of letters from a file of cor- 
respondence or of speeches from a set, which may 
mean that the data lack representativeness. (2) The 
data may lack reliability or validity. An isolated 
record of a historical event, for example, cannot 
be checked through comparison of different ac- 
counts or through direct observation or question- 
ing by the researcher. Clues to validity can often 
be obtained, however, by comparing two sets of 
data believed to reflect the same concept, as does 
Sorokin (1937-1941) when he shows the parallel- 
ism between trends in scientific discoveries and the 
trends in empirical thought derived from content 
analysis. (3) Data from differing sociotemporal 
contexts may not be directly comparable, as sources 
of information may themselves change over time 
or from one country to another, or the same cate- 
gories may take on different meanings. This diffi- 
culty requires careful documentation and the 
search for linguistic equivalences. (4) Finally, the 
data that come to the researcher in a form he does 
not fully understand may not fit his definitions of 
the concepts under scrutiny. Unlike the researcher 
who handles data he himself has collected, he is 
often unfamiliar with the circumstances under 
which the communications originally took place. 
Yet the content of a diary may depend upon 
whether it was written for public or private con- 
sumption, and the answer to an open question may 
be affected by interviewer bias. Here the important 
caveat is to attempt to reconstruct the process by 


which the data were produced, spelling out and, 
insofar as possible, offsetting any limitations and 
biases and recasting the data in a form suitable for 
the new problem. 

Although the researcher may on occasion have 
to reject given data because he cannot adequately 
assess their limitations or find suitable means of 
compensating for them, the great variety of avail- 
able data which may in some sense be classified as 
communications constitutes a highly valuable re- 
source for the further application of content 
analysis. 

Use of measurement, The content analyst 
makes use of his data to measure his concepts, 
rather than to describe them in discursive language. 
His data consist of certain concrete communica- 
tions of certain concrete individuals (the cases). 
His conceptual model contains corresponding defi- 
nitions of particular types of orientations, actions, 
or characteristics (the properties) of particular types 
of persons or collectivities. What he does, in effect, 
is to treat the sense data (the written or spoken 
words, the gestures or pictures which he observes 
as manifestations or indicants of these properties 
(the ideas which he holds in his mind). Measure- 
ment is defined here, then, as the classification of 
cases (persons, groups) in terms of a given prop- 
erty, according to some rules for selecting and 
combining appropriate communications data as 
indicants. 

Composite measures. The measurement rules 
followed in content analysis vary in detail with the 
study; but in general they are characterized by a 
two-stage procedure that results in a composite, 
rather than a simple, measure. The researcher does 
not simply classify (code) each case as a whole. 
Rather, he breaks down the total communication 
into a set of constituent units (e.g., words, asser- 
tions, articles, books); he first codes each of these 
content units separately, and then he recombines 
the coded units to provide the composite measure. 
Bales, for example, in classifying his cases (groups) 
according to various dimensions of interaction, 
might well have observed an entire small-group ses- 
sion and then assigned over-all ratings (simple 
measures) to indicate the extent to which, for ex- 
ample, solidarity was expressed or tension-manage- 
ment activities had occurred. Instead, he broke 
down the property (interaction) into small content 
units (acts), categorized the behavior act by act, 
and then counted the number of acts in each cate- 
gory. This composite measure gives a group profile 
—a distribution of the total number of acts among 
code categories—by which groups are classified ac- 
cording to the extent to which members show 


agreement, engage in tension-management activi- 
ties, and so on. 

Coding. ‘At the first stage the coding process 
involves a measuring instrument for assigning to 
each content unit certain code designations that 
indicate how much of (or which attributes of) the 
property it possesses. This instrument consists of 
(1) a code, or set of code designations. Made up 
of numerals, symbols, or names of categories, the 
code lists all the categories marked off on each 
dimension of each property. (Properties are con- 
ceived of as having one or more main dimensions, 
or aspects; for multidimensional measures, the 
measures of single dimensions—whether simple or 
composite—must ultimately be combined to reflect 
the property as a whole.) The instrument further 
contains (2) coding instructions, which, on the one 
hand, define each dimension and its categories in 
terms of the conceptual model and, on the other 
hand, specify the kinds of data to be taken as indi- 
cants under each category. The coding instrument 
for a particular study is sometimes taken from an 
existing body of theory (such as Riesman’s “inner- 
direction” versus “other-direction”); it may be a 
standard code developed by other researchers (such 
as the D.R.Q. or the verb—adjective ratio); or it 
may be developed from the empirical data of the 
study. 

Combining. At the second stage—combining 
the content units to refer to the communication as 
a whole—the content analyst may simply count the 
number of units in each category (e.g., to show the 
number of favorable and unfavorable assertions or 
to arrive at the mean percentage of paragraphs 
devoted to dogma in a sample of religious books). 
Such frequency counts of similar units have the 
effect of weighting the category to show how pre- 
dominant or pervasive that category is within the 
communication as a whole. Sometimes the units 
are given equal weight (e.g., Bales 1952); or dif- 
ferent weights may be assigned, e.g., for different 
degrees of attitude intensity, as assessed by judges, 
or for differing degrees of impact upon an audi- 
ence; Sorokin (1937-1941) weighted the influence 
of great thinkers according to the number of spe- 
cial monographs devoted to each. 

Alternatively, the content analyst may adapt 
various available procedures to uncover the em- 
pirical patterning among different types of units, 
to show how specific acts, attitudes, or character- 
istics may fit together within a single communica- 
tion process. For instance, factor analysis may be 
used to isolate broad dimensions (as of mental 
health attitudes expressed in mass media or of sen- 
sationalism in the handling of news), or Guttman 
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scaling may be used to uncover cumulative patterns 
(as of the various duties and functions assigned 
in state laws to boards of education). Such proce- 
dures have the virtue of containing built-in tests of 
the correspondence between the researcher’s con- 
ception of the property and the rules that he follows 
in making his measurements. These tests obviate 
the necessity of relying entirely upon the investi- 
gator’s arbitrary judgment for the combining and 
weighting of indicants. 

Such patterning is further disclosed, of course, 
when the investigator, having completed his meas- 
urement of single properties, proceeds to examine 
the positive or negative correlations between prop- 
erties. References to the devil and to writing may 
be found to go together in certain folk tales; or a 
psychotherapy patient may tend to dissociate his 
thoughts of mother from his thoughts of homosex- 
uality (e.g., Osgood’s contingency analysis in Work 
Conference in Content Analysis 1959, pp. 54-78). 
Some content analysts apply statistical tests to 
estimate the likelihood that such correlations are 
due entirely to chance, although there are often 
problems of appropriateness of the particular tests 
(as when the several communications of selected 
individuals may not meet the assumptions of statis- 
tical independence). 

Utility of composite measurement. The charac- 
teristic two-stage measurement procedure of first 
coding and then combining content units often 
enhances the precision possible in simple over-all 
measurement, while intercoder reliability is re- 
portedly high. Coding rules can be defined more 
specifically and coding decisions made more easily 
for a small content unit than for an entire com- 
munication (see Schneider & Dornbusch 1958, pp. 
165-169, for a comparison of global ratings of 
entire books with paragraph-by-paragraph ratings 
of the same books). Although the detailed proce- 
dure is typically more time consuming and labori- 
ous, computer programs will without doubt be 
increasingly used to expedite a number of these 
operations (e.g., the General Inquirer system for 
content analysis, Stone et al. 1962). Mathemati- 
cally, the composite measure is quantitative, a 
characteristic which often facilitates its use in rela- 
tion to other measures. Even though each act or 
unit of meaning may be coded on a nominal scale 
(described in words as favorable or unfavorable, 
showing or not showing solidarity, etc.) at the 
first stage of the procedure, at the second stage, 
when all these codes are combined, the resultant 
measure consists of numbers or proportions (of 
remarks that are favorable or of group activities 
that show solidarity). Such numerical data are 


376 CONTENT ANALYSIS 


used to classify the individuals or groups along 
scales that are at least at the ordinal level. 

Nevertheless, the quantitative, precise appear- 
ance of many composite measures used in content 
analysis may be deceptive. Without a clear corre- 
spondence between measurement operations and 
the communication process, serious problems of 
interpretation arise. 

Sampling. Just as content analysis requires 
measurement, it also requires rigorous procedures 
for sampling. The procedures generally employed 
by social scientists refer to the selection of both 
the concrete cases to be studied (the communi- 
cating persons or groups) and the communications 
to be used as indicants. Some content analyses deal 
with only a single case (e.g., Wilson as a single 
historical figure or western Europe as a single 
society). When many cases are studied, so as to 
separate common properties from those peculiar to 
exceptional cases, samples are often chosen by 
standard probability procedures that aim to repre- 
sent the conceptual universe through the sample 
selected. A second important aim is to select a 
sample of cases that will facilitate the analysis— 
as Osgood chooses samples for comparative analy- 
sis of ordinary persons and suicides. 

Similar sampling procedures are applied to the 
determination of which communications will be 
examined, since it is by no means always necessary 
to analyze all the writings of a given man, all the 
meetings of a given group, or all the propaganda 
of a given country. Selections are often made by 
stratifying or classifying the major items, such as 
books, prayers, pictographs, records of single meet- 
ings, paragraphs, and then taking a probability 
sample from each stratum. 


Interpretations 


Just as each piece of content analysis uses cer- 
tain empirical methods to arrive at its findings, it 
also employs procedures for interpreting the scien- 
tific and theoretical significance of the findings by 
comparing them with the conceptual model. The 
methods for arriving at such interpretations have 
been less clearly codified than the empirical methods 
—many of which have been taken over wholesale 
from other applications—and there is much dis- 
cussion and some confusion about the kinds of in- 
ferences which are appropriate or valid. Never- 
theless, the accumulating body of interpretations 
derived is now beginning to explain the relation- 
ships between communicators, recipients, and the 
patterned content of the communications them- 
selves. These interpretations shed light on historical 
changes and dynamic processes of communicative 


interaction. They often go behind the meanings of 
the language to the underlying social structure of 
the group or the psychological state of the indi- 
vidual. Content analysis may show, for example, 
that—quite apart from the content of communica- 
tion—in a task group a leader tends to emerge who 
initiates and receives about half the communica- 
tion. Or such nonlexical aspects of speech as stut- 
tering or hesitation may reveal the anxiety of a 
patient in an interview. 

Exploration. The content analyst whose main 
objective is exploratory makes his interpretations 
by working primarily from data to model, adding 
new ideas to his theories after he has completed 
the empirical phase of the research, Here the spe- 
cial character of the composite measurement pro- 
cedure can be a notable asset. The careful handling 
of details and the search for patterning among 
them often serves to clarify the concepts with which 
the inquiry began, and to uncover latent relation- 
ships and processes not immediately apparent to 
an investigator. Thus, the sociologist can reveal the 
balance between instrumental communications, 
through which a group may pursue its goals but 
which place strains upon its members, and expres- 
sive communications, which ease such strains and 
tend to re-establish the equilibrium (Bales 1952). 
Or the historian, through composite measures based 
on subject matter and key words in the clauses of 
the Grand Remonstrance, enacted by the British 
Parliament in the seventeenth century, can expose 
its character as a propaganda vehicle rather than 
a constitutionally important document (Knight 
1960). In such exploration the methods of interpre- 
tation, though rarely explicit, require creative effort 
—a jump from evidence to ideas, a sensitivity to 
potential linkages between empirical clues and ex- 
isting theory and knowledge. 

Inadequate use of theory. The fruitless char- 
acter of content analysis without careful reference 
to adequate theory is, unfortunately, all too often 
overlooked. Complex techniques of measurement 
and analysis may be applied blindly, without ques- 
tioning their theoretical relevance. Content may be 
arbitrarily broken down into units that distort mes- 
sages by wrenching them from their setting. One- 
to-one inferences may be drawn, from content 
descriptions to states of the communicator or his 
social system, without recognition or assessment 
of the isomorphism implied (as discussed in George 
1959). Little consideration may be paid to the 
meaning of a particular frequency count, as this 
might refer to the intensity of an individual's atti- 
tude or to the consensus with which several indi- 
viduals hold the same attitude or to the calculated 


impact of repetition upon an audience. Yet such 
oversights in connecting techniques with theory can 
yield meaningless—even misleading—results. 

Hypothesis testing. Errors of sheer empiricism 
are less likely when the researcher, instead of ex- 
ploring, can use content analysis to test hypotheses. 
This is often feasible when the conceptual model 
is highly enough developed to suggest an interpre- 
tation in advance of the content analysis (e.g., 
Osgood & Walker 1959). In hypothesis testing the 
researcher cannot avoid an explication of the pre- 
sumed relationship between theory and operations. 
Here he uses logical or mathematical reasoning to 
specify what the expected findings would be if the 
assumptions of the model were in accord with the 
facts. Again, of course, any evidence derived from 
testing the model can only be as good as the model 
itself; the importance of the evidence is bounded 
by the imagination and the theoretical grasp with 
which the research begins. 

Supporting analyses. Just as the model of com- 
munication includes not only the message unit but 
also underlying attitudes, behavior patterns, values, 
and social structures, so content analysis alone can- 
not provide a full understanding of communica- 
tion. However ideally executed and interpreted and 
however widely replicated, the approach must often 
be supplemented by other approaches, which focus 
more widely upon the several aspects of the com- 
munication process. Thus, precise estimates of the 
intended meaning of a communication depend on 
knowledge of the situational and behavioral con- 
text (George 1959); the content of therapy cannot 
be assessed without an outside measure of the re- 
covery of the patient; the interpretation of a suicide 
letter requires the identification of the writer as 
actually suicidal or not; the presumed appeals of 
propaganda or advertising must be checked against 
the responses of the audience. The very ability of 
the language to communicate must often be tested 
—for instance, by Taylor’s “cloze” procedure, in 
which sample recipients are given a message in 
mutilated form and asked how far they can recon- 
struct it (see Work Conference on Content Analysis 
1959, pp. 78-88). 

A full understanding of communication will rest 
ultimately, of course, upon accumulation of ideas 
and facts from many related studies. Among these, 
the findings of content analysis can make a special 
contribution because of their objectivity. The con- 
tent analysis of letters by Osgood and Walker, for 
example, is more open to evaluation and replication 
by other scholars than the less systematic handling 
of Polish peasant correspondence by Thomas and 
Znaniecki; the content analysis by Schneider and 
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Dornbusch is more open than Max Weber's insight- 
ful construction of ideal types from the writings of 
a Benjamin Franklin or a Jonathan Edwards, 
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[Other relevant material may be found in Factor 
ANALYSIS and SCALING.] 
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COOLEY, CHARLES H. 


Charles Horton Cooley (1864-1929), American 
sociologist, was born in Ann Arbor, Michigan, and 
spent almost his entire life there, His father, 
Thomas McIntyre Cooley, was the first dean of the 
University of Michigan Law School, a justice of 
the Michigan Supreme Court, first chairman of the 
Interstate Commerce Commission, and the author 
of several famous legal treatises. The younger 
Cooley, who was delicate in health and painfully 
shy, early discovered the joys of reading. From 
the age of 18 until the end of his life he kept a 
journal in which he set down his ambitions and 
cares, his reactions to the books he read, his com- 
ments on the world around him, and the first in- 
timations of ideas that he later developed in his 
sociological works. This journal was the source of 
the pensées that make up his last book, Life and 
the Student (1927). 

Emerson, Goethe, and Thoreau were among 
Cooley's early mentors, From Emerson he drew 
moral inspiration and a love for democracy, from 
Goethe the notion that one’s life should be a work 
of art, and from Thoreau a taste for the simple 
life and the willingness to be a nonconformist. 

Cooley’s interest in sociology was first aroused by 
reading Spencer, but he soon concluded that 
Spencer's thought was too full of mechanistic anal- 
ogies to be of much value. Albert Schiffle’s 
Bau und Leben des socialen Kérpers he found 
more satisfactory, although still too analogical. He 
read other early sociologists—Comte, Gumplowicz, 
Maine, Lewis Henry Morgan, Tarde—without en- 
thusiasm. Tocqueville he found stimulating, but 
he did not share the view that democracy tends, 
by reducing everyone to the same level, to hinder 
distinctive achievement. Cooley met both Ward 
and Giddings early in his career and was grateful 
for their encouragement. 

It was, however, to three men outside the field 
of sociology that he felt his greatest scientific debt. 
Foremost among these was Darwin, whose com- 
bination of patient empirical study and imagina- 
tive theoretical speculation Cooley greatly admired. 
Later in his life he was to regret that although he 
might claim to have accomplished something on 


the theoretical side, he had never matched it with 
the painstaking empirical work exemplified by Dar- 
win’s. To Darwin, perhaps more than to anyone 
else, he owed his appreciation of the interrelated- 
ness of the elements of life and his conviction that 
parts are never separable from wholes. 

James Bryce was another scholar Cooley re- 
vered. His careful analysis of American democracy 
and his belief in its strength appealed both to the 
sociologist and the optimist in Cooley. Finally, 
deeply impressed by William James's Principles of 
Psychology, he sought to emulate James's willing- 
ness to look at every new event without precon- 
ceptions, and he accepted James's belief that 
nature progresses through a trial-and-error process. 
He also greatly admired the beauty of James's 
literary style. t 

Cooley did not decide to devote himself to soci- 
ology until he was 28. He had prepared himself 
at the University of Michigan to be an engineer, 
and he worked briefly as a draftsman. But his 
heart was not in this work, and he accepted an 
opportunity to go to Washington, D.C., in prepara- 
tion for graduate work in political economy. There 
he collected and analyzed statistics on street rail- 
ways for the Bureau of the Census—his first re- 
search undertaking. Returning to the University of 
Michigan in 1890, he became an instructor while 
still a graduate student in economics. He had al- 
ready begun to read Spencer and to become inter- 
ested in sociology. It was arranged for him to take 
a minor in the subject for his doctorate (the ex- 
amination was to be set by Giddings). He was 
awarded his PH.D. in 1894 and taught his first 
course in sociology that fall. His thesis, “The 
Theory of Transportation” (1894), is a pioneering 
study in human ecology, still highly regarded. But 
he had already begun to be dissatisfied with analyz- 
ing the external, ecological features of society and 
decided to attempt instead to search out the under- 
lying sociopsychological facts. One of the works 
that stimulated him to turn his energies in this 
direction was James Mark Baldwin’s Social and 
Ethical Interpretations in Mental Development. 

There are two principal themes in Cooley's great 
trilogy, Human Nature and the Social Order 
(1902), Social Organization (1909), and Social 
Process (1918). One is his organic view of society; 
the other is the central role he gave to mind. 

Society as an organism. Cooley was fully aware 
of the pitfalls in the analogy between society and 
the biological organism. By the organic view he 
meant merely that the student of society must 
avoid all particularistic theories. He must realize 
that neither the individual nor the group is basic, 


that both are aspects of the same whole. Different 
aspects of life are systematically related, so that 
an economic interpretation of history is no more 
plausible than a religious one. All elements of life 
interact with one another. For a particular purpose 
one can concentrate on the study of a single insti- 
tution, but one must view it in the context of the 
whole and never forget that the special approach is 
just a convenience. Other starting points are 
equally valid; the ideal would be to take a number 
of standpoints until a rounded view is obtained. 

Cooley called the dynamic aspect of the organic 
view the “tentative process.” By this he meant that 
all forms of social life—whether persons, small 
groups, role-constituted institutions like schools, or 
abstract institutions like languages—are constantly 
growing and changing, attempting to push out into 
new territory. This thrusting forward, however, is 
a selective process. Some attempts succeed, some 
fail; the test is a pragmatic one, Those that are 
adapted to the surrounding situation survive be- 
cause they “work.” This is as true of conscious try- 
outs—for example, applications for jobs—as it 
is of unconscious processes by which a language 
evolves. Cooley saw this tentative process as one 
of continual organization and reorganization, since 
the various elements are always developing new 
relations to one another as they move forward. 

Consistent with this position, Cooley emphasized 
the dialectic or dramatic aspect of social life. He 
saw the thrust and counterthrust of forces in 
any social whole as being like the “conversa- 
tions” among the various sections of a symphony 
orchestra. 

Although Cooley did not use the term “func- 
tionalism,” his view is suggestive of it. If “to work” 
is to survive, then all existing culture traits and 
institutions must be functional. ` 

Primacy of mind. The second principal theme 
in Cooley’s work is that the essence of social life 
is its mental character, Mind is all-pervasive; it is 
the medium in which thought develops and through 
which it is communicated. Human action, so far 
as it is superior to the action of animals, flows 
from mind. Thus, there is a sort of universal stuff, 
mind, that becomes nucleated and organized at 
particular points—persons, groups, institutions, 
publics. The history of a language and the history 
of a human being are equally instances of mind in 
process. 

Cooley was not merely saying that the mind of 
the child is developed socially, which is universally 
acknowledged, but something much more radical— 
that the theater of all interaction is the mind. 
Persons and groups exist for each other only as 
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they are conceived in the mind, and thus their 
interaction takes place only within the mind. This 
extremely mentalistic view undoubtedly sprang 
from Cooley’s own inclination toward introspective 
analysis. He sought to encompass society in his 
own mind, believed that others did the same, and 
made this activity a scientific principle. 

It would be a mistake to suppose that Cooley's 
mentalistic emphasis is an emphasis on rationality. 
He believed that the child learns such sentiments 
as justice and kindness by experiencing them in 
his relations with others, by the same kind of inter- 
course through which thought is learned. The 
basic process in learning sentiments is what he 
called sympathy and what we would now call 
empathy. Since sentiments are symbolized in the 
process of learning, they can, like thought, be 
communicated. 

In Human Nature and the Social Order, Cooley 
concentrated on the person and the subtleties of 
his relations with others. A central concept is the 
self, which to Cooley is composed of whatever the 
person appropriates from the general flow of life, 
in contrast to that which belongs to others, The 
self can be discovered by examining the referents 
of the word “I.” Although the physical body is in- 
cluded among them (except in the case of ascetic 
extremists), it is only a minor part of the self be- 
cause men cherish their values, ideas, achieve- 
ments, and possessions more than they do their 
bodies. Cooley worked out his conception of the self 
by studying the social development of his own 
children, Perhaps the best piece of empirical re- 
search he did was a verification (1908) of the 
hypotheses set forth earlier in the 1902 chapter, 
“The Meaning of T?” 

To Cooley the sense of self arises partly from 
experience, as in the child’s competitive appro- 
priation of toys, but also from imagining others’ 
conceptions of oneself. A person tends to accept 
the view of himself held by those whom he ad- 
mires. In a brilliant analysis of various phases of 
the self he drew upon the works of Shakespeare, 
Goethe, Montaigne, Meredith, Stevenson, Thoreau, 
Thomas Aquinas, Disraeli, Pascal, Lecky, and 
George Eliot to discuss such matters as vanity, 
pride, humility, self-respect, withdrawal, and self- 
transformation. Although he did not use the term 
“reference group,” he clearly used the idea of ref- 
erence persons in his treatment of emulation and 
fame and the idea of reference groups in his dis- 
cussion of the need for setting of standards in 
society. 

Conscience, for Cooley, was the expression of a 
mature participation in social life. “To violate con- 
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science is to act under the control of an incom- 
plete and fragmentary state of mind; and so to 
become less of a person, to begin to disintegrate 
and go to pieces” ({1902] 1956, p. 329). What is 
right is not necessarily what is altruistic, since one 
individual may be able to make a greater contribu- 
tion with a scarce resource than another. Con- 
science is strongly affected by group norms and by 
ideal persons, but in the end the person has to 
synthesize all these influences and thus develop 
his conscience. In discussing morally deviant con- 
duct, Cooley pointed out that the organic view 
diffuses responsibility but by no means exonerates 
the actor. 

Conception of social structure. In Social Or- 
ganization Cooley shifted the focus from the per- 
son to the social structure, but he did not abandon 
the mental emphasis. Society is a mental complex; 
it is tied together by communication. For Cooley 
the social mind is molded in the intimacy of the 
primary group—the family, the children’s play 
group, the neighborhood or local community, Ex- 
perience in the primary group underlies both the 
development of the person and the development of 
the larger organization of society. From this ex- 
perience the person receives his fundamental 
orientation to life, and the society receives its 
model for integrated living. 

Cooley did not define his concept of human na- 
ture until he discussed primary groups, although 
he had used the term in the title of his previous 
book. By human nature he did not mean original 
biological nature but that nature which every- 
where and always develops in the intimate contact 
with others that is characteristic of primary groups. 
Beneath cultural differences there is a common 
core of experience in primary groups which is so 
similar around the world that it produces a com- 
mon human nature. 

The connection between primary groups and the 
wider society is made through what Cooley called 
primary ideals. He believed that life in the primary 
group tends to be satisfying and that it generates 
an ideal of moral community. For analytical pur- 
poses this ideal can be broken up into loyalty, 
truth, service, and kindness, but actually it is the 
broad notion of living in a moral whole that is 
idealized. Indeed, Social Organization can perhaps 
best be seen as an attempt to analyze the difficul- 
ties in and opportunities for achieving some kind 
of moral wholeness in a complex industrialized 
society. 

For Cooley, a truly democratic society—one in 
which the self-expression of all leads to the de- 
cisions in accordance with which controls are exer- 


cised—is an embodiment of primary ideals. He 
conceived of public opinion not as the aggregated 
individual judgments of a majority of the people 
but as the crystallized judgment of all persons, 
whether they are in the majority or in a minority. 
“It is not at all necessary that there should be 
agreement; the essential thing is a certain ripeness 
and stability of thought resulting from attention 
and discussion” ([1909] 1956, p. 122). Thus, pub- 
lic opinion is a stage in a process, The masses con- 
tribute to it sentiment and general tendency; the 
leaders, the formulation of alternatives for policy. 
Public opinion can judge only broad questions of 
change and readjustment; the details of public 
business must generally be handled by specialists, 
But Cooley realized that democracy will not work 
unless there is a latent moral unity—a tradition 
of justice, freedom, and humanity—beneath the 
differing class views expressed in public opinion. 

Since Cooley saw wealth as generalized power, 
he tended to see social classes as economically 
based. He discriminated, however, between a caste 
system and a system of open classes. The caste sys- 
tem tends to be fostered by cultural and racial 
differences in the population, by the absence of 
social change, and by a low level of education and 
enlightenment. Because a democracy enlists the 
energies of all and educates its citizens, an open- 
class system is suitable to it. To the degree that 
public opinion operates successfully, hostile feeling 
between classes is kept in check. 

Consistent with his general view of social organ- 
izations, Cooley defined an institution as “a definite 
phase of the public mind”; it is the organization of 
human thought around a certain task, The state, 
for example, represents the practical working out 
of men’s ideas, in the field of government. A social 
structure is developed within which individuals 
participate, but they participate with only a spe- 
cialized part of themselves. An institution repre- 
sents the accumulation of human experience, but 
it does not have the wholeness and humanness of 
the person. Institutions all too readily become for- 
malized, since formalism represents the line of 
least resistance for the participants. If formalism 
goes so far that participants lose all interest, it may 
result in decay and disorganization; or a reform 
movement may successfully spur reorganization. 

Cooley's ideas about institutions may be illus- 
trated by his discussion of pecuniary valuation in 
Social Process. He demonstrated that economic 
value is not merely the result of free market 
choices of autonomous individuals; it also results 
from a special structure of thought and action that 
has to be studied in its own right if current values 


are to be understood. Thus, each rising generation 
is conditioned to want certain things and not 
others—breakfast cereals, perhaps, but not bird’s- 
nest soup. Further, those of great wealth have a 
disproportionate amount of influence in setting 
standards of taste because they consume more 
goods and services. In addition, in the great cor- 
porate structures of our day the salaries of high 
officials are not set by a market mechanism; hence, 
these officials determine their own value. In short, 
economic institutions, like all others, are not mere- 
ly the result of great impersonal forces but are in 
part the result of a special stream of thought and 
practice. When an institution becomes too diver- 
gent from the general norms of the society, critics, 
inspired by human values, lodge protests and 
attempt to initiate reform. 

Social philosophy. Cooley mixed his aspira- 
tions for democracy with his sociological analysis. 
He defined success as self-development in social 
service; he had great confidence in the moral in- 
tegrity of the common man; and he thought that 
there is a natural affinity between democracy and 
art, that communication fosters humanitarianism, 
and that there is an onward and upward tendency 
to life. Behind his sociology, and combined with it, 
was a social philosophy. He believed that as com- 
munication improved life was organized humanely 
in wider and wider circles. As a result, national 
democracies were already in being, and some sort 
of democratic world order was bound to come. 

Although Cooley did not regard himself as more 
than incidentally a social critic, his views on the 
world around him are clear in his work. He ap- 
plauded differentiation as the main guarantee of 
democracy, and at the same time he was an enemy 
of particularism either in thought or in the guise 
of simplistic reforms. Provided that the system is 
an open one, he believed in altering it through 
gradual, not revolutionary, change. Although he 
found much that was crude and selfish in the capi- 
talism of his day and much that was admirable in 
socialism (especially its humane spirit), he was in 
no sense a radical. He believed too strongly in a 
ramified tentative process to favor monolithic or- 
ganization and too strongly in the need of guidance 
and control to embrace anarchism. 

Assessment of Cooley. It is not accurate to label 
Cooley an “armchair” sociologist. It is true that he 
drew much from the great observers of life, such 
as Goethe and Bryce, that he did little systematic 
research, and that his students were not trained to 
become conventional researchers. But he did have 
an empirical bent. In his essay “Genius, Fame and 
the Comparison of Races” (1897), he marshaled 
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both historical and contemporary evidence to refute 
those portions of Galton’s Hereditary Genius that 
belittle the influence of social factors in producing 
great men. He also used empirical materials closer 
at hand. Among the important sources of his 
knowledge were self-analysis, the study of his 
children, novels about contemporary life, and the 
autobiographies that he encouraged students to 
write. 

Again, Cooley was not a systematic theorist. His 
organic view postulates so seamless a web of social 
life that no emphasis on structure is made. Rather 
than constructing a general scheme and then suc- 
cessively dealing with each of its features, he pre- 
ferred to dip into life at the points where he had a 
flash of insight, let his thought mature, express it 
beautifully and illustrate it convincingly, and then 
pass on to something else. He left it to the reader 
to fashion a general scheme. 

Cooley was distrustful of reliance upon mensura- 
tion in sociology. He did not believe that units for 
measuring the subtleties of relation and process in 
social wholes had been devised, “The Roots of 
Social Knowledge” (1926) is a plea for using one’s 
own experience as a means of interpreting the ex- 
perience of those under study. Statistics are only 
as good as their basic units of measurement and 
are therefore of secondary importance in socio- 
logical research. Only mind can predict what mind 
will do. Since Cooley’s day sociologists have tried 
to overcome the obstacles he saw by asking open- 
ended questions and coding the answers afterward 
and by social experiments in the laboratory, which 
include the simulation of lifelike situations. They 
have also greatly increased the use of his favored 
method, participant observation. 

Throughout his career, Cooley lived a quiet, un- 
complicated life. Reading, writing, and teaching 
were his daily round. Personal conflict was dis- 
tasteful to him, and he had no fondness for ad- 
ministration. During his lifetime sociology at the 
University of Michigan remained under the wing 
of the economics department. 

Soon after Cooley’s death, George Herbert Mead 
(1930) wrote an appreciative critique of his work. 
In it, however, he disagreed with Cooley's view that 
the differentiation of self and other takes place 
wholly within mind. Whereas Cooley saw com- 
munication arising within mind, Mead saw mind 
arising in communication; he held that communi- 
cation begins in gestures and preverbal sounds, to 
which words are gradually added to create mind. 
He also pointed out that it is very difficult to give 
adequate recognition to the nonmental side of the 
person and the technological side of the society if 
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one views society as wholly mental. He admitted 
that the immediate process of most adult inter- 
action is mental but not that mind is the ultimate 
social reality. 

A second but more limited criticism was made 
by another admirer of Cooley, L. L. Bernard 
(1936). He doubted the validity of Cooley's belief 
that the larger relations of society could and should 
be controlled by the “extension of primary ideals.” 
He took the position that the problems of large- 
scale organization are so different from the prob- 
lems of primary groups that special ideals (which 
he called derivative) would have to develop from 
the broader context to control social processes at 
this level. 

A third criticism was leveled by both Jandy 
(1942) and Rieff (see Cooley 1902, “Introduc- 
tion” to the 1964 edition). Although laudatory on 
many points, they took Cooley to task for his exces- 
sive optimism—for believing that American society 
was what he wanted it to be. They charged him 
with ignoring the deep schisms that are glossed 
over by the organic view, for believing that the 
majority of men are unselfish and capable of as- 
cetic motivation, and for seeing modern communi- 
cation as a means of humanizing urban life. 

Perhaps these and other criticisms might be 
summed up by saying that Cooley's society, which 
he offered as the real world, is in fact a selective 
model, an ideal type. In it the mental aspect of life 
is salient, and within this aspect connection and 
balance are emphasized. The significance of crude 
power is not appreciated, since it is neither a men- 
tal phenomenon nor a reciprocal one. And Cooley 
did not deal adequately with social structure: there 
are no sharp edges in the mind, only the flow of 
messages through more and less permeable filters. 

Despite these deficiencies, Cooley’s work stands 
as a monumental achievement. Indeed, his ideas 
seem so simple and so true that the reader may not 
appreciate their originality. His empirical work on 
the sense of self, for instance, was ground break- 
ing. With his conception of primary groups he 
crystallized prior thought and added to it the sig- 
nificant idea that human nature is developed in 
such groups. His organic view of public opinion 
was a fresh insight, and his treatment of public 
will foreshadowed information theory and cyber- 
netics. He anticipated Durkheim in his under- 
standing of the role of festivals in revitalizing the 
spirit of large groups. His analysis of pecuniary 
valuation was a significant contribution to institu- 
tional economics. His treatment of social classes in 
Social Organization was vastly superior to any- 
thing in the American sociological literature that 


preceded it. Cooley did not create a system of soci- 
ology, but he greatly enriched the discipline in 
many areas. 

ROBERT COOLEY ANGELL 


[For the historical context of Cooley's work, see INTER- 
ACTION, article on SOCIAL INTERACTION; and the 
biographies of BALDWIN; BRYCE; DARWIN; GIDDINGS; 
JAMES; SPENCER. For discussion of the subsequent 
development of his ideas, see Creativity, article 
on SOCIAL ASPECTS; GROUPS, article on THE STUDY 
OF GROUPS; REFERENCE GROUPS; TRANSPORTATION, 
article on SOCIAL ASPECTS; and the biographies of 
BERNARD; Mean; Ross; THOMAS; WALLER.] 
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COOPER, JOHN M. 


John Montgomery Cooper (1881-1949) was 
an American ethnologist, ethnographer, and priest 
who also made important contributions to the fields 
of education and social welfare. 

Although his father’s family was Quaker, his 
mother was of French Catholic stock, and at an 
early age Cooper felt the desire to enter the priest- 
hood. His academic work’ at St. Charles College 
near Baltimore, Maryland, was followed by six 
years of study in Rome and travel on the Conti- 
nent, At the American College in Rome he received 
a PH.D. in 1902 and a degree of doctor of sacred 
theology in 1905, the year in which he was or- 
dained. On returning to the United States, he was 
appointed to St. Matthew’s church in Washing- 
ton, D.C. 

In 1909 Cooper became associated with the 
Catholic University of America as part-time in- 
structor in religious education and was soon re- 
lieved of parish duties. In 1923 he became an 
associate professor in the department of sociology 
and a full professor in 1928. When the depart- 
ment of anthropology was established in 1934, he 
became its chairman, a position he filled until his 
death. 

From 1917 to 1925 Cooper's attention was turned 
to subjects broadly sociological in nature. Practi- 
cally all his writings concerned group work, social 
hygiene, and related topics. His association with 
the National Catholic Welfare Council, 1918-1920, 
brought him to international prominence as a 
leader in social group work. His last contribution 
to applied sociology was his book Children’s Insti- 
tutions (1931a). 

Although subsequently he made original contri- 
butions to several fields of anthropology, such as 
linguistics (1945), and to the psychiatric approach 
to certain ethnographic phenomena (1934), Cooper 
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was primarily an ethnologist. His vacation trips to 
northern Canada led to an interest in American 
Indians, which was encouraged by John Reed 
Swanton and Frederick W. Hodge, ethnologists at 
the Smithsonian Institution. His first notable con- 
tribution to ethnology, which preceded his sociolog- 
ical work, was his Analytical and Critical Bibliog- 
raphy of the Tribes of Tierra del Fuego (1917). 
This study revealed in high measure the qualities 
that marked all of his work—craftsmanship, lucid- 
ity of expression, and critical judgment. His recur- 
rent visits to the Algonquian-speaking Indians of 
northeastern Canada and the Great Plains resulted 
in many papers on various aspects of their culture 
—material, social, and magico-religious. His last 
full-length monograph, the second part of The Gros 
Ventres of Montana (1957), on religion and ritual, 
was published posthumously. 

As a theorist Cooper was particularly intrigued 
with questions of distribution and historical recon- 
struction, His paper on the areal and temporal 
distribution of culture in South America (1942) 
inspired the over-all arrangement of the Handbook 
of South American Indians (Steward 1946-1959). 
To this large compendium he contributed ten arti- 
cles, all of which exhibit his complete mastery of 
the often widely scattered sources. In his Temporal 
Sequence and the Marginal Cultures (1941) he 
presented evidence for the hypothesis that the non- 
literate peoples of the ethnographic present repre- 
sent “tarriers” with relatively unchanged cultures 
from prehistoric times, and he worked out several 
canons to be applied in historical reconstruction. 
One of these canons is as follows: 


(a) Where a given cultural phenomenon is consistently 
found among present-day marginal peoples it was prob- 
ably present in early prehistoric human culture as 
such. (b) The probability of such early prehistoric 
presence is augmented where the factors underlying 
the given phenomenon can be assumed on probable 
grounds to have been continuously operative among 
the cultural ancestors of the present-day marginals and 
to have been operative in early prehistoric times. 
(Cooper 1941, p. 64) 


Cooper was aware that the applicability of his can- 
ons was distinctly limited. “As is obvious they can- 
not yield a total all-embracing reconstruction of 
prehistoric culture . . .” (1941, p. 66). Exhibiting 
proper scientific caution, he drew conclusions when 
the evidence warranted but never subscribed to 
what he considered the overbroad generalizations 
of the Viennese Kulturkreis theory, which claimed 
by means of “the culture-history method” to be able 
to reconstruct original primeval human culture. 
Lowie (1949, p. 291) stated that Cooper’s 
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essay “Andamanese—Semang—Eta Cultural Rela- 
tions” (1940), “which presents a powerful argument 
for the pristine unity of the Asiatic Pygmies,” was 
possibly “the most superb example of what sound 
judgment coupled with control of the material can 
accomplish.” His interpretative paper “The Rela- 
tions Between Religion and Morality” (1931b) was 
selected for inclusion in The Golden Age of Ameri- 
can Anthropology. In this paper Cooper called at- 
tention to a certain fundamental order and uni- 
formity underlying the extremely varied moral 
standards that prevail among the nonliterate peo- 
ples of the world and showed that while among 
some of the marginal peoples religious sanctions 
are clearly operative, among a good number of 
them social sanctions alone back up the moral 
code. He concluded therefore that neither the clas- 
sical theory, which maintained an early dissocia- 
tion of religion and morality, nor the theory holding 
that in earlier times both duties to God and to 
neighbor were looked upon as the will of God can 
be maintained in the light of the anthropological 
evidence. 

In addition to his purely academic activities 
Cooper evidenced extraordinary organizing skill. 
He was largely responsible for founding the Catho- 
lic Anthropological Conference in 1926. He initi- 
ated and edited several valuable publications, in- 
cluding the periodical Primitive Man (title changed 
to Anthropological Quarterly in 1953). He never 
shirked professional responsibility and participated 
actively in numerous organizations. His religious 
beliefs and his priestly vocation completely pene- 
trated his life. He never felt that they interfered 
in any way with his scientific attitude as an anthro- 
pologist. 

REGINA FLANNERY 
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COOPERATION 


Cooperation is joint or collaborative behavior 
that is directed toward some goal and in which 
there is common interest or hope of reward. Co- 
operation may be voluntary or involuntary, direct 
or indirect, formal or informal, but always there 
is a combination of efforts toward a specific end 
in which all the participants have a stake, real or 
imagined. At its higher intellectual levels coopera- 
tion involves reciprocity of intent as well as joint- 
ness of behavior, and it may even become an end 
in itself. There is no limit to the potential range 
for cooperation; it is to be found in groups as small 
as the dyad and as large as leagues of sovereign 
states. 

It is possible to regard cooperation as an ethical 
norm, as a social process, or as an institutional 
structure. In ethics and religion, cooperation has 
been among the most honored of values throughout 
human history. Indeed, some philosophers and 
religious teachers have made cooperation synon- 
ymous with the whole fabric of morality. Coopera- 
tion is stressed in all of the major religions and 
moral systems of the world. It is at the very heart 
of Hinduism and Confucianism and has a hallowed 
place even in such relatively individualistic reli- 
gions as Christianity. 

Considered as a process, cooperation is central 


f 


to the formation of types and to changes in types. 
Studies by modern naturalists have shown this to 
be as true in the plant and animal world as in the 
human-—cultural. Closely related to competition, as 
will be emphasized below, cooperative behavior is 
one of the central mechanisms of the evolutionary 
process; it is to be observed in conditions leading 
to change as well as to stability. 

As a social structure, cooperation is manifest in 
countless organizations created expressly by man 
for the purpose of joint behavior toward a given 
goal. Such structures range in size from primitive 
hunting groups to modern insurance companies 
and in kind from criminal conspiracies, at one 
extreme, to the World Health Organization, at the 
other. They are as often religious, political, and 
cultural in character as they are economic. Only 
the fact that the modern renewal of interest in 
cooperation as a process and structure occurred 
in the nineteenth century, a century overwhelm- 
ingly preoccupied by the impact of laissez-faire 
capitalism on the social order, can account for the 
distressing tendency among social scientists even 
today to conceive of cooperation and competition 
as processes primarily economic in significance. 


Types of cooperation 

Five types of cooperation may usefully be dis- 
tinguished: automatic, traditional, contractual, di- 
rected, and spontaneous. Each can be found in all 
spheres of human society—political, religious, eco- 
nomic, cultural—and while it is important to dis- 
tinguish the types from one another, it is equally 
important to emphasize that rarely, if ever, does 
any one of them exist in isolation. 

Automatic, Automatic cooperation refers to the 
varied types of impersonal coordination, jointness 
of behavior, and mutuality of interest that arise 
directly from ecological position. Within this type 
fall most of the instances of cooperation stressed 
by naturalists (see, for example, Allee 1938). Such 
cooperation in the plant and animal world is al- 
most largely sexual or commensal in nature, al- 
though among the higher orders it is related to 
security. It is commonly instinctual in basis. De- 
spite traditional emphasis on the struggle for exist- 
ence, observable patterns of plant and animal order 
would be incomprehensible apart from the premise 
of vital cooperative processes. 

Automatic cooperation is also observable in the 
human world. To the eye of the social ecologist, 
human populations, both urban and rural, appeat 
as directed in considerable part by processes that 
are unplanned and usually unnoticed by partic- 
ipants. Among these processes is cooperation: 
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jointness of behavior toward a common end that 
arises solely from the fact of strategic location in 
the larger ecological pattern. Unplanned coopera- 
tion between two groups (whether national, eco- 
nomic, religious, or racial) may exist simply by 
virtue of an independently perceived threat to the 
security of each; such cooperation may be stim- 
ulated by action or threat of action by an outside 
group, but it is predicated upon a pre-existing com- 
patibility of norms and aspirations. Defense by 
each group of its own position (normative or spa- 
tial) can, without any conscious effort, produce a 
considerable degree of cooperation that can only 
be called automatic. Automatic cooperation is a 
cardinal feature of the complex relations among 
the primary; secondary, and tertiary orders of the 
modern economy, and it is no less a feature of 
relations among ethnic and religious groups and 
of sovereign states. 

Traditional. Cooperation of the traditional type 
is regulated neither by instinct, volition, nor simple 
location, but rather by traditional social norms. 
Prime examples are the joint family of India, the 
Chinese clan, the village community of Asia and 
medieval Europe, and craft and merchant guilds 
the world over in ancient and medieval times. The 
Hindu family, “joint in food, worship, and estate,” 
is based upon the inviolable morality of mutual 
aid among the generations and branches, however 
remote in space or time; the family in perpetuity 
is conceived as a cooperative relationship among 
the dead, the living, and the unborn. Similarly, in 
the agricultural village community, which is one 
of the most universal of institutions, such matters as 
the planting, cultivation, and harvesting of crops 
are dealt with cooperatively by the villagers. In the 
medieval guild, prices, techniques of craftsman- 
ship, and standards of work were all established 
cooperatively by the guild masters, and direct com- 
petition was forbidden. In all such instances, coop- 
eration—however it may have been started in the 
first place—is one of the mores, as binding upon 
the participants as any other part of morality. 

Contractual. In modern industrial society co- 
operation is more likely to be contractual than 
traditional in character. Terms of cooperation are 
specific and conditional upon the will of the par- 
ticipants or governed by legal sanctions, and they 
are precise both in terms of length of cooperation 
and of what is specifically required by the relation- 
ship. Contractual cooperation commonly increases 
sharply in historical periods during which the close 
ties of the traditional community are supplanted 
by the more individualistic and utilitarian ties of 
an open society. Contractual cooperation is one 


386 COOPERATION 


of the most basic patterns of contemporary West- 
ern society. The whole vast array of consumer and 
producer cooperatives, credit unions, cooperative 
apartment houses, profit-sharing plans, and the like 
make plain how dependent contemporary capital- 
ism is on this type of cooperation. It is sometimes 
thought that informal contractual cooperation 
tends to wane in mass society, but the profusion 
of baby-sitting and car pools suggests that con- 
tractual cooperation at the grass roots is far from 
moribund. 

Directed. Military organization is probably the 
oldest and most universal form of directed cooper- 
ation, but, in the modern world, the large-scale 
business enterprise, labor union, school system, 
and even religious and recreational organizations 
could hardly survive without the form of coopera- 
tion that arises from command or direction. The 
source of cooperation here is only incidentally or 
derivatively a common recognition of goal or a 
clear norm. It would be difficult to find a more 
dramatic instance of directed cooperation than in 
the great atomic bomb project of World War 11; 
tens of thousands of persons, at all levels of skill 
and responsibility, cooperated under remote direc- 
tion in the making of a product known to but the 
tiniest handful of top leaders in the project. Such 
an instance, although dramatic, is far from unique 
in the contemporary age. What Max Weber called 
“rationalization” is a process in modern society that 
has had the effect of converting a great deal of 
human cooperation into the directed type. It is 
inconceivable that complex, large-scale organiza- 
tions could operate today without directed coop- 
eration, and we are witnessing an ever increasing 
amount of this type of cooperation as a result of 
the intricate and far-reaching operations of data- 
processing and computer systems. 

Spontaneous. Spontaneous cooperation is the 
oldest, most natural, and widespread form of co- 
operation. Unprescribed by tradition, contract, or 
command, it is situational in character and prac- 
tically constitutes the essence of relationships 
within the family, neighborhood, play group, and 
other close, personal forms of association. Every- 
thing we know about this type of cooperation sug- 
gests that it is most common when there is a prior 
basis of amity. Thus, irrespective of the type of tasks 
assigned, spontaneous cooperation is least likely 
when friendly acquaintance has not had an oppor- 
tunity to arise. This type of cooperation is by no 
means absent from even the most regimented of or- 
ganizations. In army, corporation, or government, 
types of spontaneous cooperation arise, given pro- 
pitious conditions that may support (or negate) the 
planned, directed type; as every military observer 


has emphasized, spontaneous cooperation is the 
ultimate requisite of victory in battle. No matter 
what the sphere of activity, not even the most 
rationally and meticulously planned operation will 
altogether obviate spontaneous cooperation among 
participants. 


History of the concept 


In one form or another, speculative or reflective 
interest in cooperation is as old as human thought. 
Some of the most ancient proverbs and legends 
embody man’s realization not only of the impor- 
tance but also of the elements of cooperation. From 
Confucius, Lao-tzu, and Gautama in the Far East, 
as from the prophets of the Old Testament, the 
centrality of the ethic and psychology of coopera- 
tion may easily be inferred. For both Plato and 
Aristotle, cooperation was the keystone of the good 
state, and what Aristotle called stasis, or political 
factionalism, was the infallible sign of civic degen- 
eration. In the writings of the Christian fathers, 
the imperative of cooperation was based in part on 
their organismic image of the world and society, 
but also on the remembered reality of early Chris- 
tian cooperative communities—often, indeed, com- 
munistic—and on the kinds of brotherhood and 
interest associations that flourished throughout the 
Middle Ages. The theme of cooperation was a pow- 
erful one in medieval religious thought, and a great 
deal of modern thought on cooperation is but a 
secularization of this theme. 

From a sociological point of view early modern 
theories of society may be seen as falling within 
either a cooperation or conflict orientation. On the 
one hand, there were those who, like Thomas 
Hobbes (1588-1679), saw the natural state of man 
as characterized by conflict and war of all against 
all, with the life of man left “solitary, poor, nasty, 
brutish, and short”; the absolute state was seen as 
man’s only refuge from this conflict. There were 
others, however, like the great Johannes Althusius 
(1557-1638), who believed that amity and co- 
operation were basic in the human species and 
that the social ties of family, community, and asso- 
ciation had long preceded the rise of the state. 
Down through the nineteenth century, conflict 
theories of society vied with those based upon 
cooperation; Ludwig Gumplowicz (1838-1909) 
was perhaps the most notable exponent of the 
former. 

Nineteenth-century thought. The nineteenth 
century witnessed a major renewal of interest in the 
ethic and nature of cooperation. This renewal must 
be seen as a profoundly important reaction against 
three major tendencies of the age. The first, he- 
donistic utilitarianism, declared all human behavior 


to be the product of self-interest and to be governed 
solely by drives leading to the maximization of 
pleasure and minimization of pain. Similarly, clas- 
sical economic theory made the units of society 
discrete, self-interested individuals, united chiefly 
by ties of impersonal competition. Finally, the 
doctrine of natural selection affirmed the primacy, 
and even the exclusiveness, of competition and 
conflict in the whole evolutionary process. The 
term “social Darwinism” was used to cover views 
that insisted that only through competition and 
struggle could society progress, although Darwin 
himself had reservations about this, especially as 
applied to human society. 

The rise of the theory of cooperation is to be 
seen in an attack on all three of these propositions, 
which by mid-century had become significant in 
European thought. In the writings of the historical 
jurists Henry S. Maine, Otto von Gierke, and Léon 
Duguit, the ethnologists Lewis Henry Morgan and 
George Laurence Gomme, and the political philos- 
ophers Francis H. Bradley and Thomas H. Green, 
a very different view of man and society emerged: 
one in which the social group, not the individual, 
was the element of society and man was under- 
stood in terms of his interactive ties with others 
rather than through instinct forces presumed to be 
resident within the solitary individual. 

The rise of systematic sociology played a major 
role in disabusing men’s minds of the priority of 
economic values and of the universality of com- 
petition. What began in a quasi-utopian and even 
apocalyptic context in the work of August Comte 
and Charles Fourier became, by the latter part of 
the century, an ever more rigorous scientific dem- 
onstration of the irreducibility of the social bond, 
including cooperation, into presocial individual 
“atoms.” Durkheim's great work emphasized not 
only the long history of cooperation that was em- 
bodied in what he called “mechanical solidarity” 
but also the more ecological type of cooperation 
reflected by the restitutive sanctions of modern 
“organic solidarity.” He was also keenly concerned 
with the needs of modern society with respect to 
the creation of large-scale cooperative occupational 
associations, which, he thought, would rescue 
worker and citizen from the anomie and isolation 
of modern urban and industrial life. 

Undoubtedly the most influential single work 
specifically directed to cooperation was Mutual Aid: 
A Factor of Evolution (1890-1896), by P’etr Kro- 
potkin, the founder of anarchism. Written with 
extraordinary insight and based upon a vast knowl- 
edge, this book quickly achieved the status of a 
classic. Kropotkin addressed himself critically to 
the doctrines of natural selection and competitive 
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individualism, and his book is filled with numerous 
instances of behavior in both the animal and 
human worlds that is based upon cooperation 
rather than conflict and struggle. It gives examples 
of mutual aid at all stages of evolution: in the 
ecological relationships of plants, the behavior of 
subhuman species, and in such universal human 
institutions as the commune, the village com- 
munity, the guild, the cooperative, and the trade 
union. 

In terms of direct human action, as well as 
theory, cooperation was a significant force in the 
nineteenth century. There were the great consumer 
cooperatives, of which the Rochdale Equitable 
Pioneers’ Society in England was perhaps the best 
known, although it was far from exceptional in 
either size or influence. Consumer cooperatives, 
both rural and urban, were founded in Europe and 
the United States; producer cooperatives also arose, 
although these were fewer and had their chief 
impact on the utopian movement. Literally hun- 
dreds of colonies were founded on the principle of 
producer, as well as consumer, cooperation. Al- 
though most of these were defunct by the end of 
the century, they were an important part of the life 
of that period. Furthermore, there was a profusion 
of mutual-aid societies and self-help and friendly 
associations, which dealt cooperatively with such 
matters as education, life insurance, and burial 
expenses. In sum, despite the common conception 
of the nineteenth century as wholly individualistic 
and competitive, we have to conclude that coopera: 
tion, both in thought and practical affairs, was an 
important aspect of this period. 


Contemporary research 

Since the nineteenth century the behavioral sci- 
ences have steadily expanded the investigation of 
the processes of cooperation and their varied con- 
texts. In reference to the schema of types of co- 
operation described above, it can be said with only 
a few reservations that the earliest scientific in- 
vestigations were studies of the first three types. 
Interest in automatic cooperation was manifest 
early in the century in the works of the human 
ecologists who followed Robert E. Park and the 
Chicago school. Traditional cooperation among pre- 
literate and folk peoples was studied by ethnol- 
ogists and in agricultural areas by rural sociologists. 
Types of contractual cooperation and the social 
norms underlying them constituted the subject mat- 
ter of some of the works of Roscoe Pound and other 
legal sociologists. 

More recent have been the studies of directed 
and spontaneous cooperation. The field of indus- 
trial relations, which burgeoned as an academic 
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discipline in the United States during the 1940s, 
contains most of the research of the former type; 
specifically, studies of patterns of cooperation in 
industry and bureaucracy followed the style of the 
earlier and justly heralded work of Elton Mayo, 
Fritz Roethlisberger, and their colleagues (see 
Roethlisberger & Dickson 1939). Until the 1950s 
virtually all study of spontaneous cooperation was 
in the field of child psychology or psychology of 
learning, with small schoolchildren the inevitable 
subjects. 

Two areas of research have cut across those 
listed above and appear to be the most relevant to 
present and emerging tendencies: research in 
social interaction and comparative-historical re- 
search. Studies of interaction are undoubtedly the 
most valuable and suggestive investigations of co- 
operation to be found anywhere at the present 
time in the behavioral sciences: investigations that 
seek the molecular or microcosmic elements of the 
process, its sources of motivation, and the effects 
upon it of changes in the quality and intensity of 
interaction. It is unfortunate that progress in micro- 
social analyses of cooperation is not matched by 
that in studies of the macrosocial: that is, the 
actual structures through which the process of 
cooperation has revealed itself in such diversity 
throughout the ages and in all cultures. This im- 
balance, it is worth noting, has its parallel in the 
biological sciences, where interest in microevolu- 
tion has virtually displaced older but still needed 
inquiries into macroevolution. 

Studies of social interaction. Two recent studies 
best portray the kind of analysis of cooperation 
that occurs in the field of social-interaction re- 
search. The first is Morton Deutsch’s notable study 
(1953) of the effects of cooperation and competi- 
tion upon group process. Fifty college men from 
an introductory psychology course were given per- 
sonality tests, and on the basis of the results they 
were divided into ten groups of five men each; the 
groups were matched according to their members’ 
range of ability and personality. On the basis of a 
further test involving a problem in human rela- 
tions, the ten groups were divided into two sets of 
five groups, with each set containing the same 
number of high-scoring and low-scoring individuals. 
All of the groups within the two sets, meeting each 
week, received a logical puzzle and a problem in 
human relations to work on. One of the sets was 
given “competitive treatment,” and the other set 
received “cooperative treatment.” In the former set, 
each student was made to understand that his 
grade would depend strictly upon how well he did 
as an individual in solving the given problems; in 


the latter, each student’s personal grade was made 
contingent on how well the group as a whole did 
in meeting the problems. Differences in both group 
and individual results between the two sets of 
groups were carefully noted by trained observers, 
and questionnaires about results, spaced a week 
apart, were given to the students themselves. 

Given the difference in rewards promised to the 
participants in each set, there is nothing surprising 
in the finding that there was a manifestly greater 
cooperativeness and sense of group cohesion in the 
one set and a greater degree of competitive indi- 
vidualism in the other; the same is true of the 
differences in the degree of friendliness within 
groups of the two sets. However, there were more 
striking conclusions. The cooperative groups met 
the puzzle problem more efficiently and contributed 
in more detail to the analysis of the human-relations 
problem, although not necessarily with more in- 
sight. Within the cooperative groups there was 
more differentiation of individual function, that is, 
more division of labor; in the competitive groups, 
on the other hand, duplication of effort was con- 
siderable, for all were equally on their own to do 
all that was required. Communication was smoother 
in the cooperative group; there was, however, as 
much interaction in the competitive group, a con- 
dition resulting from the fact that merely in order 
to demonstrate individual ability on the human- 
relations problem, spirited contribution to a dis- 
cussion was mandatory. Despite the contrasting 
environments of the two sets of groups, no signifi- 
cant differences were discernible ih the amount of 
psychology learned by individuals in the two sets. 
Such are a few of the conclusions; obviously they 
say nothing, nor are they intended to, about the 
ultimate values of competition and cooperation in 
a society, for such values are related to the entire 
normative order [see Groups, article on GROUP 
BEHAVIOR]. 

The Robbers Cave experiment. The second 
social-interaction study is that of Muzafer Sherif 
and his associates (see Oklahoma, University of 
. . . 1961). The setting was a summer camp, and 
the subjects were twenty boys of homogeneous 
background. The boys were divided into two groups, 
and the first stage of the experiment consisted of 
welding each of the groups into a unity. This was 
done by separating the groups and ensuring that 
each group had experiences of a kind that would 
bind its members into prideful unity. The second 
stage of the experiment was designed to intensify 
group cohesiveness by putting the two groups into 
situations where sharp competition would develop: 
athletic contests, raids on each other’s camp areas, 


slogans of hostile character. The result was a rising 
tension between the two groups, combined with 
strong “we group” loyalties in each. In the third 
stage of the experiment, an effort was made to heal 
the breach between the two groups. First, they 
were brought together in a series of purely passive 
encounters, such as eating or watching a movie 
together; however, competitive friction only in- 
creased. Then, in the final and crucial stage of the 
experiment, members of the two groups were 
brought together for the performance of tasks that 
clearly required cooperation between the two groups 
for successful completion: repairing a damaged 
water tank, fund raising, and so on. The changes 
were quite striking. Whereas near-hostility had 
been the relation between the groups by the end of 
the third stage, in the final stage a majority of 
cross-group judgments by members were favorable, 
and the number of friendships went up correspond- 
ingly. Such experiments as Sherif’s and Deutsch’s 
give microcosmic sanction to what (in military 
combat or professional sports, for example) has 
long been clear: that the most effective way of re- 
ducing intergroup tensions lies in mobilizing in- 
dividuals into activities where cooperation is 
absolutely vital to success—where, in short, it is 
functional. 

The comparative-historical approach. The value 
of social-interaction studies would be greatly en- 
hanced if their results could be placed against 
studies that deal, not with contrived or experimen- 
tal situations, but rather with actual, institutional- 
ized groups in all ages and cultures. Unfortunately, 
the comparative-historical approach to cooperation 
is not well developed in terms of methodology, nor 
is it represented by a significant number of studies. 
Kropotkin’s great work at the beginning of the cen- 
tury pointed to some of the social structures 
(guilds, cooperatives, and similar associations ) 
through which human cooperation has been car- 
ried on historically, and he offered many valuable 
insights into their functioning and context. His 
objective, however, was more moral than scientific. 
A few followers of the French sociologist Frédéric 
Le Play, most notably Patrick Geddes and his stu- 
dents, in England, also did some work in the field. 
One should also note the too little known field 
studies of Demetrius Gusti (1941) in Rumania, 
whose investigations of rural villages included in- 
formal systems of cooperation among villages as 
well as internal patterns. In India the early work of 
Radhakamal Mukerjee (1923) and his students 
gave much attention to processes of cooperation 
through which interfamily and intervillage projects 
were carried on in such areas as irrigation. 


COOPERATION 389 


In the United States it is chiefly in the works of 
the earlier rural sociologists that any attention to 
cooperation is to be found in significant degree. 
Scattered throughout historiography and ethnology, 
there are, of course, specific references to the vari- 
ous cooperative enterprises that have marked man’s 
struggle for existence. But nowhere have these been 
brought together, and nowhere has a systematic, 
scientific effort been made to deal with cooperation 
as a cultural and historical force. Margaret Mead, 
in her notable work Cooperation and Competition 
Among Primitive Peoples (1937), showed what 
could be done among preliterate peoples, but few 
have chosen to follow. In a remarkable monograph 
Mark A. May and Leonard W. Doob (1937) made 
a distinguished effort to summarize existing knowl- 
edge on the subject and to arrive at central 
principles. This work provided an analysis of ex- 
perimental approaches to cooperation and com- 
petition as well as an evaluation of existing 
anthropological and sociological studies; and it 
covered a wide range of types of cooperation in 
military organizations, utopian communities, and 
varied clubs, associations, and economic enter- 
prises. However, what was chiefly lacking in this 
pioneering work was a sense of history. 

Nothing of comparable stature has been done 
since May and Doob’s contribution. Given the 
seemingly assured place of cooperation as the sub- 
ject of social-interaction studies, it may therefore 
be said without reservation that the most urgent 
sphere of research in the future must be the com- 
parative-historical. Types of cooperation drawn 
from all ages and social orders must be distin- 
guished, their structures and internal processes 
analyzed, their relations to the surrounding scene 
investigated, and their impact upon members 
assessed. Moreover, such studies should be coordi- 
nated with inquiries into the dynamics of social 
development. One of the major aspects of the rise 
of new nations is the impact—often dislocative— 
of the emerging administrative structures of na- 
tional authority upon cooperative systems that have 
been in existence for centuries. 


Cooperation and competition 

Although cooperation is commonly contrasted 
with competition (a process in which efforts toward 
a common objective are separate and in rivalry 
with one another), it must be emphasized that the 
two rarely, if ever, occur separately. Indeed, each 
may have a contributory relation to the other. Com- 
petition requires at least the degree of prior co- 
operation that is necessary for the setting of rules 
and imposing of sanctions without which competi- 
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tion would dissolve into open war. Conversely, it is 
doubtful that cooperation would be the major force 
it is were it not for pressures of competition that 
spur some to cooperate with others as a means of 
enhancing their effectiveness in the struggle for 
existence. In any event, a purely cooperative or 
purely competitive relationship would be hard to 
imagine. 

In each of the types of cooperation outlined 
above—ranging from the automatic to the spon- 
taneous—elements of competition are to be found 
side by side with, indeed often embedded in, those 
of cooperation. However dependent two orders or 
types may be upon one another in natural or hu- 
man ecology, competition is always a latent pos- 
sibility. Within traditional cooperation, role rivalries 
are legion; contractual cooperation draws its very 
efficacy from the fact that it represents legally 
specifiable surcease from normal competition; 
within directed (and especially bureaucratic) co- 
operation, competition of roles and strata goes on 
constantly, Even in the most elemental, spontane- 
ous types of cooperation—that of the small, con- 
jugal family, for instance—there cannot help but 
be competition between members: between parents 
for affection of children, between children for re- 
wards from parents, The same is true in the largest 
of political and economic systems. Capitalism, 
commonly thought of as competitive, would founder 
if it were not for vast systems of economic and 
legal cooperation within it that help give it direc- 
tion and stability; and, at the opposite extreme, all 
that we have learned of socialism in practice indi- 
cates that competition can be a powerful force in 
bringing the various sectors of society to desired 
levels of performance. 

ROBERT A. NISBET 


[See also COOPERATIVES; ECOLOGY; ĪNDUSTRIAL RELA- 
TIONS, article on HUMAN RELATIONS; INTEGRATION; 
SOCIOMETRY; and biographies of DURKHEIM; GED- 
DES; KROPOTKIN; LE PLAY; PARK; TOCQUEVILLE.] 
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COOPERATIVES 


The beginnings of the cooperative movement are 
usually associated with the name of Robert Owen. 
The real association lies, however, more in Owen's 
ultimate ideals than in his immediate plan of or- 
ganization. Owen’s central plan was for a self- 
supporting community, such as Fourier’s phalan- 
stére, where producers and consumers were one 
and the same people, not for an organization either 
of producers or of consumers. In contrast to some 
utopians, however, Owen knew by practical experi- 
ence the consequences of the industrial revolution 
for working-class producers and consumers, and he 
urged production, by voluntary associations, for the 
use of consumers and not for profit. Although he 
regarded them as mere steppingstones to his com- 
munities, Owen approved of the few short-lived 
cooperatives that existed prior to the foundation of 
the Rochdale society, which is still alive today. 
These societies claimed his authority for their acy 
tivities and were, together with the Rochdale pio- 


neers, composed mostly of his adherents. Owen 
appears to have been the first (Digby 1948, p. 15) 
to apply the term “cooperative” to these activities. 
So Owen may be accepted as the father of the co- 
operative movement, if not of its precise type of 
organization. 

The characteristic trait of the cooperative as a 
form of organization of economic activity is the 
absence of any special capital-providing class. In 
producer cooperatives, the workers put up the capi- 
tal; in consumer cooperatives it is the consumers. 
Apart from farm marketing cooperatives, producer 
cooperatives have not proved successful unless 
some outside individuals or some outside organiza- 
tion, such as a consumer cooperative, helps sub- 
stantially in providing capital and in supervising 
production. In practice, workers in industry have 
been found to be too unwilling, or else unable, to 
put up and risk the amount of capital required for 
modern equipment. They have not always been the 
best judges of an efficient manager, and in spite 
of enthusiasm and high morale, they often have 
not exercised sufficient discipline over their own 
operations. 

Consumer cooperatives have, on the other hand, 
proved very successful in many countries, notably 
Great Britain and Sweden. British cooperative or- 
ganization will, in the following paragraphs, be 
taken as the prototype. Like other cooperatives, the 
consumer societies rely on no special capitalist 
class. In addition they have certain particular eco- 
nomic and organizational characteristics. 


British cooperatives 

The main economic characteristic of the British 
consumer cooperative is the payment (after pro- 
vision for depreciation and a fixed interest on a 
limited amount of the shares held by consumer 
members) of a dividend on member purchases. Its 
main organizational characteristic is control by the 
consumer in place of the nonexistent capitalist. 
All consumers, even members who have paid only 
a small deposit on their shares, have the right to 
vote at the quarterly or half-yearly meeting. One 
man, one vote is the rule, The members vote for a 
Management committee, or board of directors, 
which is usually unpaid. This committee appoints 
the paid manager and, usually meeting every week, 
exercises general supervision over the conduct of 
business. In form, the organization is thus ex- 
tremely democratic. The existence of a number of 
autonomous retail societies of widely varying size 
is a further manifestation of democracy. The com- 
plete independence of the locally organized soci- 
eties distinguishes the cooperatives in free enter- 
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prise and mixed economies from the communist 
cooperatives, which are not covered in this article. 

Organization and growth. In Great Britain the 
retail societies formally control the two wholesale 
societies, English and Scottish, each of which con- 
ducts large-scale production. In appointing the 
wholesale societies’ full-time directors, one society, 
one vote is not the principle, however; voting is 
weighted according to the value of the goods that 
the retail society has bought from the wholesaler. 

For purposes other than trading, such as settle- 
ment of disputes or legal and financial advice, the 
British retail societies have formed the Cooperative 
Union. The final authority of the union is the 
annual congress, to which societies send a number 
of delegates roughly proportionate to their member- 
ship and which, in its discussion of policy, has 
been described as the focus of cooperative democ- 
racy. Most societies are affiliated with the Coop- 
erative party, which, in 1964, had 19 members in 
Parliament—normally voting with the Labour party. 

The opportunities for democratic control offered 
by cooperative constitutions are, however, not fully 
exercised, particularly as societies grow larger. 
Only a minute fraction of the members attend their 
retail society meetings. An inquiry in 1955 (Oster- 
gaard 1963) revealed that only 0.5 per cent attend 
business meetings and only 1.65 per cent vote for 
the management committee. If the actual informal 
practice of consumer cooperatives is less demo- 
cratic than its formal constitution presupposes, it 
is, however, more democratic in worker participa- 
tion. In theory, the position of the labor employed 
is no different from that in capitalist enterprises. 
But when few consumers attend meetings, a small 
group of members, who are also employees, can 
elect workers or committeemen favorable to the 
workers’ interests. About one-third of the places on 
the committees of the larger societies are now 0C- 
cupied by cooperative employees or former em- 
ployees (Ostergaard 1963, p. 60). Democracy is 
also manifest in the emphasis on education in co- 
operative principles, particularly at the Cooperative 
College, in the many publications discussing all 
aspects of cooperatives, and in the organization of 
local discussion groups such as the “women’s 
guilds.” 

The growth of the British consumer cooperatives 
is usually traced from 1844, when the Rochdale 
pioneers adopted the principle of dividend on pur- 
chase. During the next half century, the develop- 
ment was rapid, aided by the Industrial and Provi- 
dent Societies Act of 1852. This act allowed the 
privileges of incorporation to cooperative societies, 
provided no individual member held more than 
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£200 in shares. The limit is now £1,000. When the 
act was amended in 1862 to permit one society to 
hold unlimited shares in another society, the retail 
societies founded the two wholesale societies, By 
1900, these wholesale societies had over 12,000 
workers engaged in production. Roughly the same 
number were employed in production by the larger 
retail societies together with the local federations 
of societies. 

British consumer cooperatives continued to de- 
velop well into the twentieth century. Between 
1901 and 1931, membership in the retail societies 
rose from 1.75 million to 6.5 million. During the 
same period, the total value of sales of both the 
retail and the wholesale societies quadrupled, while 
the retail price of food rose by only one and a half. 
By 1931, the English wholesale society owned 171 
factories, employing 40,000, and the Scottish so- 
ciety had 56, employing 9,000. Both societies 
singly, or jointly, produced a wide range of goods, 
conducted integrated warehousing, provided ship- 
ping and business services (such as building the 
retail societies’ premises), were involved in bank- 
ing and insurance, and even owned tea and other 
plantations abroad. 

Throughout the history of the consumer coop- 
eratives in Great Britain, success has been greatest 
in Scotland and in the North. In 1939, for instance, 
annual cooperative retail trade per head of popula- 
tion was £8.8 in Scotland, £7.2 in the North, and 
£6.4 in the Midlands, as against £4.5 and £4.0 in 
the southwest and south, and this in spite of the 
much higher per capita income in the London area 
of the south. 

The causes of this cooperative success must be 
sought in the emergence, which occurred first in 
Great Britain (particularly in Scotland and north- 
ern England), of an industrial and mining wage- 
earning proletariat. This group was cut off, in 
contrast to the village community, from the middle 
classes and developed a strong sense of class soli- 
darity. The more intensive development of coop- 
eration in the North, away from the “squirearchy,” 
is some indication of this. Until the Education Act 
of 1902, education in England beyond the age of 
13 was a rarity for working-class children, and, in 
the absence of opportunities for moving up any 
education ladder, potential leaders were lost to their 
class. 

The British cooperatives were in fact firmly 
based on the same class solidarity as the British 
trade unions and the political labor movement. And 
the membership of the cooperatives had the same 
cautious, solid financial approach. To this funda- 
mental condition of success of the cooperatives— 


their solidarity in two senses of the word—must 
be added two specific economic “techniques,” the 
attraction of the dividend on purchase and the 
branch store. 

The first British Census of Distribution, taken 
in 1950, showed that retailing as a whole was op- 
erated by 403,839 organizations with 528,450 es- 
tablishments, an average of 1.3 stores per organiza- 
tion. Among retail cooperative societies, however, 
some 1,000 organizations owned 27,263 establish- 
ments, an average of 27.3 stores per society. 

The efficiency of the multiple stores is attested 
to by the present rapid increase in their sales, to 
be indicated shortly, The British cooperatives were 
first in this field, and for a considerable period 
they were without large rivals. 

Recent stagnation. At first sight the continued 
progress of the British cooperative societies seems 
to be indicated by the further rise in their member- 
ship from 6.5 million in 1931 to almost 11 million 
in 1952, and almost 13 million in 1962, But sales 
per member have fallen, and total sales have, since 
1950, remained fairly constant in volume with a 
definite fall relative to the sales of the multiple 
(chain) stores. 

The British Board of Trade now publishes an 
index number of the value of retail sales of coop- 
erative and other forms of retail organization. Tak- 
ing sales in 1950 as 100, the cooperative index 
was 176 in 1962. But although higher than the 
corresponding general department store index of 
160 and the index of 165 for independent retailers, 
it was much lower than the index of 239 for mul- 
tiple stores and indeed lower than the general re- 
tailing index of 181 for all organizations. 

A part of this relative decline is due to the rela- 
tive decline in population and income of the re- 
gions where cooperation was most successful— 
Scotland and the North. But several more general 
causes are at work, and to predict the future of 
consumer cooperatives in Great Britain and, as we 
shall see, in other countries too, an analysis of 
these causes is essential. 

Fortunately analysis can be firmly based on two 
very detailed inquiries into the British consumer 
cooperatives, which, while sponsored by the coop- 
eratives themselves and so allowed full access to 
their records, were conducted by independent au- 
thorities. The first inquiry, an “examination” by 
Sir Alexander Carr-Saunders, Professor Robert 
Peers, and the author (1938), dealt with all the 
activities of the movement. The second inquiry 
was by the Co-operative Independent Commission 
(1958), which had Hugh Gaitskell as chairman, 
and which had terms of reference limited to “a 


survey of cooperative production and marketing.” 
These two inquiries will be referred to as the 1938 
examination and the Gaitskell report. 

Up to 1938, consumer cooperatives were almost 
unreservedly praised by English economists. Pro- 
fessor Alfred Marshall served for a year as presi- 
dent of the Cooperative Congress, and Sidney and 
Beatrice Webb were enthusiastic (1921). The 1938 
examination, however, was critical in several re- 
spects and foretold possible stagnation, while the 
Gaitskell report had to take account of some actual 
stagnation and foretold possible decline. Both re- 
ports tried to analyze stagnation and possible de- 
cline in terms of the cooperatives’ own (1) financ- 
ing, (2) management, and (3) local organization. 
A further internal factor (4) may be the weak 
incentive for growth. Both reports also stressed the 
external factor (5) of changes in the society that 
the cooperatives are trying to serve. 

(1) Compared to large-scale capitalist trading, 
the cooperatives’ investment in both fixed capital 
and research is low. This is partly due to fear of 
financial insecurity expressed, as the 1938 exami- 
nation says, in such slogans as “no speculation 
with depositors funds” and “owe no man any- 
thing” (1938, p. 456). In consequence, the coop- 
eratives have invested heavily in supposedly safe 
liquid funds, such as long-term government stock. 
Low fixed investment is also due to the high rates 
of dividend on purchases. In 1935 societies with 
80 per cent of the retail trade paid dividends of 
between one shilling and two shillings on the 
pound, that is, 5 per cent to 12.5 per cent on turn- 
over. This range is higher than the average capital- 
ist dividends on sales turnover and meant less of 
the surplus was available for development. 

(2) The cooperatives do not secure managers of 
the necessary ability and enterprise for growth 
under competition. This is due partly to the low 
scale of salaries they offer compared to their capi- 
talist competitors and partly to the bias toward 
appointing persons from within the movement. The 
independent small retail entrepreneur was not par- 
ticularly efficient, but the large British capitalist 
retail firms now displacing him are increasingly 
recruiting university graduates as future managers, 
while the British cooperatives still consider it un- 
democratic to recruit even grammar school gradu- 
ates over the heads of those working their way up 
after leaving school at the minimum age. This view 
of democracy is becoming more and more damag- 
ing to efficiency because so many more university 
places are now available to the more intelligent 
children of the working class in England. Those 
now without degrees are likely to be unintelligent. 
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This was not necesarily so previous to the Educa- 
tion Act of 1902 but has become particularly so 
since the act of 1944, which increased so greatly 
the number of “free places” in secondary schools. 

The quality of management depends not only on 
the principles of appointment but also on those of 
discharge. The British working-class tradition, re- 
acting against ruthless capitalist practice, seldom 
allows the removal of cooperative managers or in- 
deed the unseating of elected directors, however 
inefficient they may prove to be. “In some societies,” 
to quote from the 1938 examination (Carr-Saun- 
ders et al., p. 87), “it seems to be assumed that the 
only cause which can lead to a member leaving the 
committee is death.” Once the managers are ap- 
pointed, their actions are likely to be cramped by 
the elected directors, who tend, because of the 
principle of no unseating, to be old. This cramping 
varies from society to society, according to the 
personalities of the directors, but will probably be 
greatest wherever the directors are paid and are on 
on the job full-time, as are the wholesale societies’ 
directors. 

(3) A criticism in the Gaitskell report con- 
cerned the uneconomically small scale of the non- 
food sales, which is a result of the strictly local 
organization and control of retail societies not spe- 
cializing in particular trades. The Gaitskell com- 
mission advocated (1958, p. 252) a national retail 
development society which would, among other 
activities, set up chains of specialized shops, no- 
tably in clothing and footwear, “in which the ad- 
vantages of multiple shop organization are well- 
proved and in which local societies restricted by 
boundary agreements are in no position ever to 
attain to these advantages.” 

(4) Looking into deeper economic causes of the 
relative cooperative stagnation, it is possible that 
the incentives to growth are weak. When income 
and prestige are in the form of a profit on capital, 
a strong motive exists to use that capital to capac- 
ity and to spread the overhead over the maximum 
possible output. But if income and prestige are in 
the form of a margin or dividend on sales, the 
motive is to cut costs per unit of output (even, 
possibly, by doing less business) so as to increase 
that margin. Empirical evidence to sustain this 
hypothesis can be found in the comparatively low 
distributive costs of cooperative retailing shown in 
the British Census of Distribution. 

Cooperative shops achieve this low cost by trying 
to spread the load of shoppers. Some societies even 
avoid heavy peak loads by closing shops on Satur- 
day afternoons, when the load would be greatest! 
By this means, the cooperative shops need not keep 
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so large a sales staff; and sales staff, not capital 
overhead, is the heaviest item of cost in retail- 
ing. Since market prices are charged, this policy 
achieves a high margin of profit on the goods ac- 
tually sold, but it does not stimulate growth as 
directly as does the increasing of profit on capital 
by the sale of more and more goods. 

(5) With the distinct rise in the standard of 
living since World War u and the wider variety of 
goods demanded by the working class, the coop- 
eratives have met much enterprising competition. 

In the food trades, capitalist multiple stores have 
paid increasing attention to the tastes of con- 
sumers. When the cooperatives were first founded, 
the cheap shops where the working class traded 
offered foodstuffs of inferior quality. The coopera- 
tives, accordingly, stressed the “purity” of the foods 
they sold and produced—often to the neglect of 
their taste. Indeed, the 1938 inquiry organized tast- 
ing panels for identically priced foodstuffs, the 
makers of which could not be identified, and found 
a preference even among cooperative society mem- 
bers for a majority of capitalist food products. 

Engel’s law, moreover, has operated to make ex- 
penditure on foods a lower proportion of the work- 
ing-class budget. The cooperatives, with their con- 
centration on foods, do not seem to have risen to 
the occasion. While the British Census of Distribu- 
tion for 1961 gives 14.8 per cent as the coopera- 
tives’ share of the total food trade (including as 
high or higher a percentage for the simpler wants, 
for example, around 35 per cent for milk), the 
cooperative share of the total dry goods trade is 
only 5.8 per cent (including 4.3 per cent of ap- 
parel and 4.5 per cent of furniture). 


The pattern outside Great Britain 


In countries other than Great Britain, consumer 
cooperatives started later and, judging by the per- 
centage of membership to total population, still lag 
behind. In 1961, these percentages were Great 
Britain 24.7, Finland 23.0, Sweden 16.0, Switzer- 
land 14.2, and Denmark 12.8. Lower down the 
percentage scale were France 7.1, Australia and 
Austria 5.8, Canada 4.7, West Germany 4.6, Italy 
3.9, Argentina 1.8, and the United States 0.8. Un- 
derdeveloped countries—except Israel 6.9, Burma 
3.2, Ceylon 3.0, Mauritius 2.2, and Pakistan 1.6 
—all have a cooperative membership to total popu- 
lation below 0.7 per cent, in spite of government 
encouragement in many of them (Statistical State- 
ment No, 1 in International Co-operative Alliance 
1963). 

In total value of cooperative retail trade, Great 
Britain again comes first with about four times that 


of Finland or of Sweden, which have the next 
highest trade. But if the trade per head of popula- 
tion is used as an index, Finland ranks first, Switz- 
erland second, and Great Britain is bracketed third 
with Sweden. The most striking impression in re- 
viewing the activities and structure of consumer 
cooperatives throughout the world is the similarity 
of the pattern of organization and achievement 
(although deviations appear here and there): 

(1) The local retail society is the original and 
basic unit; federations of retail societies, and 
wholesale societies controlled by retail societies, 
follow later and undertake production. The Swedish 
wholesale cooperative, the Kooperativa Förbandet, 
which was founded in 1899, has gone further in 
production activity than the two British wholesale 
societies. Its products form a higher proportion of 
the sales of the retail societies, and by price reduc- 
tions it has successfully fought monopolies or near- 
monopolies in such products as electric lamps, 
detergents, and rubber overshoes. 

(2) The wide variation in the size of the retail 
societies is pervasive. The largest own a great num- 
ber of branch stores and undertake production for 
local consumption, such as baking. The smaller 
societies are being amalgamated and, with the en- 
largement in size, difficulties are encountered in 
maintaining a real democracy. In Sweden, mem- 
bers of large societies meet by districts to elect a 
local committee for the supervision of shops in that 
district. Delegates from the local committees form 
the general meeting of the society. 

(3) Some income is spent on education and 
propaganda. Journals are published and discussion 
groups organized. Economic activities are not con- 
fined to distribution and production but have spread 
into insurance and banking, on which the French 
societies concentrate. But the main activity remains 
the sale of foodstuffs and household consumables, 
particularly the more standard goods such as milk, 
margarine, flour, and bread. 

(4) The proportion of cooperative society mem- 
bership to the total population is increasing, but 
the sales per member are diminishing. The rates of 
dividend on purchase are becoming lower, and al- 
though a rapid growth in business occurred at first, 
an approach to a ceiling of 12 per cent to 15 per 
cent of national sales is indicated in most coun- 
tries. Finland, however, has broken through this 
ceiling. 

Comparing the relative strength of consumer 
cooperation in different countries, the greatest con- 
trast is seen to occur between the old (European) 
developed countries where cooperatives are strong 
and the underdeveloped and the new developed 


countries, such as the United States, Canada, and 
Australia, where cooperatives are weak. Weakness 
in the underdeveloped countries is due to the con- 
sumer's lack of spendable income. A high propor- 
tion of the population is engaged in agriculture, 
and most of that proportion are subsistence farm- 
ers with no cash income to spend. 

The weakness of consumer cooperation in the 
new developed countries has no such simple ex- 
planation. It is partly due to the higher standard 
of living, which has, as we have seen, checked 
cooperative progress in Great Britain, checking food 
consumption relative to other consumption. It is 
also partly due to the relative absence of the class 
solidarity so characteristic of nineteenth-century 
Great Britain. Indeed, the few consumer coopera- 
tives that exist in the United States are largely 
found on university campuses, not in industrial 
sections. Cooperative weakness may also, perhaps, 
be due to the greater efficiency of capitalist retail 
management. Certainly the specialized chain store 
that has proved so successful was introduced into 
America under capitalist, not cooperative, auspices. 

As among the old developed countries, in all of 
which class solidarity counts, the more classless 
ones, such as the Scandinavian countries and 
Switzerland, have an advantage because the class 
outlook does not preclude educated management 
and risk taking. In proportion to total population, 
Germany and Italy have a smaller number of co- 
operative members because of Nazi and fascist 
suppression in the 1930s. 


Farmers’ purchasing cooperatives 


The cooperatives that have been discussed so far 
were formed by final, “household” consumers to 
provide for their own consumption. An important 
branch of the consumer cooperative movement, 
however, is engaged in the purchase of materials 
and supplies consumed in further production. In 
contrast to the cooperatives of producers formed 
to market their products, these purchasing associa- 
tions are, logically, consumer cooperatives. They 
answer the same need for good quality commodi- 
ties at reasonable prices. This need arises particu- 
larly among producers who operate on a small 
scale and are unable independently to buy in suf- 
ficient bulk to realize large-scale economies. Co- 
operatives have thus been organized among the 
small traders and craft industries in Germany and 
among fishermen, particularly in Spain (Digby 
1948, pp. 76-77, 126-127). But farmers are the 
outstanding example of such small-scale producers, 
and cooperation has become firmly established in 
supplying requirements for agricultural production. 
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These requirements include not only materials such 
as feedstuffs for livestock and seed and fertilizer 
but also petroleum products for operating tractors, 
electrical power equipment, and various farm im- 
plements. To this list, facilities for insurance and 
credit might also be added. 

The use of cooperatives for obtaining these sup- 
plies varies among countries, largely because of 
different economic and technological circumstances. 
The cooperative provision of the mainly imported 
feedstuffs is important in Great Britain, and the 
provision of electrical power is important in the 
rural areas of the United States. The methods of 
organization also vary from country to country. In 
Great Britain, large purchases are made from the 
powerful consumers’ wholesale societies already 
mentioned. In Sweden and Norway, the supply as- 
sociations have their own wholesale organizations. 
Sometimes, as in Denmark, the cooperatives’ sup- 
ply organizations specialize in one commodity, 
such as feeds or fertilizers; sometimes, as in Fin- 
land, the organization does a general trade, both 
in farmers’ supplies and household goods. 

Beyond the main function of these various 
institutions—the function of supplying “reliable 
goods at reasonable prices”—Margaret Digby points 
out (1948, p. 95) that the farmers’ societies “raise 
the standard of farming by keeping in touch, in a 
way which is sometimes impossible for the individ- 
ual farmer, with developments in agricultural sci- 
ence and technology, and transmitting them to him 
by means of the improved seeds, chemicals and 
implements which it is able to put at his disposal.” 

In the underdeveloped countries where primitive 
small-scale agriculture is the rule, supply societies 
should thus play a large role and are, in fact, sub- 
stantially helping development. However, because 
of the almost universal shortage of enterprise and 
saving and the rate of population increase in these 
countries, considerable government subsidies have 
been required in order for the cooperatives to pro- 
vide land, capital, organizers, and expert techni- 
cians adequate for economic growth. 


P. SARGANT FLORENCE 


[See also AGRICULTURE, article on MARKETING; CREDIT. 
Other relevant material may be found in the biog- 
raphy of OWEN.] 
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CORPORATION 


The corporation, the central economic institution 
of modern society, is, in its most general definition, 
an association of individuals united for a common 
purpose and acting in a common name. With this 
definition, it is an institution of unknown antiquity. 
Certainly corporations organized for business and 
other purposes were well known to the Romans, 
but there is evidence that such institutions, inter- 
mediate between the individual and the state, are 
much older than this. Other characteristics that 
particularize the corporation as we know it have 
been present or absent throughout history, depend- 
ing on the purposes of the association, the charac- 
ter of institutional innovation, and the permissive- 
ness of the state. 


Early history 


The law is prone to emphasize that the corpora- 
tion is a body chartered or recognized by the state; 
that it is a formal agreement, in the nature of a 
contract, among people joined in a common pur- 
pose; that it can hold property, contract, and sue 
and be sued in a common name; and that it has a 
length of life not subject to the lives of its members. 
These formal attributes, however, tend to compress 
the corporation as a historical and developing in- 
stitution into too narrow a mold. Roman corpora- 
tions were not chartered by the state before the 
third century A.D., and in seventeenth-century Eng- 
land there were many associations doing business 
on a joint-stock basis that were not chartered by 
either Crown or Parliament. Indeed, alarm at the 


activities of these unchartered associations was an 
important reason for the Bubble Act of 1720. 

The significance of the agreement of the several 
hundred thousand stockholders of a large American 
corporation and the commonness of their purpose 
is a matter of some doubt. The holding of property, 
contracting, and suing in a common name are 
highly significant attributes of the business corpo- 
ration; but they may have little importance to an 
association incorporated for the improvement of 
the breed of Airedales, Even a life beyond the lives 
of its members is not a necessary attribute of the 
corporation as a historical entity. 

Likewise, most of the elements that are empha- 
sized as essential attributes of the modern business 
corporation are the product of social invention and 
have not been characteristic of business corpora- 
tions from their beginning. Roman business corpo- 
rations traded on a joint stock; but prior to the 
development of double-entry bookkeeping in the 
Italian city-states, the distinction between common 
and individual stocks tended to be somewhat fuzzy. 
The early English overseas trading companies also 
traded on a joint stock, but normally the capital 
was redistributed to the stockholders at the expira- 
tion of a voyage and reconstituted only for the 
next. The Dutch East India Company, founded in 
1602, is supposed to be the first corporation estab- 
lished with a permanent capital stock. The practice 
of issuing transferable shares developed on the 
Continent and was brought to England in the six- 
teenth century. It was, and still is, the Continental 
practice to issue bearer shares, a device that has 
not been used in Anglo-Saxon jurisdictions. Limi- 
tation of the liability of stockholders for the debts 
of the corporation, characteristic of most modern 
business corporations, is a privilege that, on occa- 
sion, has been granted or withheld. In England it 
was not generally made available until 1855. 

The delegation of authority to manage the cor- 
poration’s daily affairs to a board of directors has 
developed slowly, This delegation is of no—or little 
—importance to the small, privately owned corpo- 
ration and, in the large corporation, it has been 
subject to limitations that have varied considerably 
among countries and over time. The proliferation 
of participation rights in capital and income repre- 
sented by different classes of securities is largely 
the product of American railway finance in the 
nineteenth century. And the various devices for as- 
suring and perpetuating minority ownership oF 
management control, such as voting trusts, non- 
voting and multiple-voting stock, pyramiding via 
the holding company, and use of the proxy, are 
largely the invention of our own epoch. 


In other words, the corporation is an evolving 
entity, and the end of its evolution is by no means 
in sight. There is every reason to believe that the 
business corporation a century hence will be a 
rather different institution from the one we now 
behold. The relations between the corporation and 
the state, between management and the various 
participants in capital and income, and between 
the corporation and its suppliers, customers, and 
labor force are in process of change. 

If we consider the long sweep of corporate his- 
tory, it is useful to distinguish the typical forms 
and uses of the corporation before and after the 
industrial revolution, Although the corporation was 
early put to business use, in the early period it was 
established predominantly for other purposes. Uni- 
versities, monasteries, guilds, bishoprics, boroughs, 
were familiar types of corporations. Typically, the 
emphasis was on association for a common pur- 
pose rather than for utilization of a common stock 
of capital. The incorporators were a group of men 
related to each other by the place where they lived 
and the things they did. In the period since the 
industrial revolution, the business corporation has 
become the overwhelmingly important form. The 
emphasis has come to be placed on the manage- 
ment of a stock of capital rather than on the co- 
operation of a group of individuals. In the words 
of one observer, the “associative elements have 
been refined out” of the corporation, and in law it 
has become the expression “of a complicated rela- 
tionship between men and things.” 

In England the corporation was known as early 
as the Norman period; and from the fourteenth 
century on, incorporation by the Crown of bor- 
oughs, guilds, and ecclesiastical bodies was used 
as both an administrative device and a method of 
extending royal power against the baronage. Inso- 
far as business activities were involved, the empha- 
sis was not on a common stock, but rather on the 
regulation of the affairs of a group of craftsmen 
or tradesmen, each operating with his own capital. 
To this end the incorporated guilds developed ef- 
fective internal legislative, judicial, and executive 
instrumentalities. More often than not, a functional 
or territorial monopoly was claimed for its mem- 
bership. 

The use of the corporation as both a business 
device and an arm of the state is clearly evidenced 
in the operations of the large overseas trading and 
colonizing companies of the sixteenth and seven- 
teeth centuries. The British East India Company, 
the Levant Company, and others were as much 
governments as they were business enterprises. 
The American colonies were established by char- 
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tered corporations that governed as well as traded 
in their respective areas. Here again a grant of 
monopoly privileges was a common feature of in- 
corporation, The colonists had every reason, from 
their experience, to associate monopoly with in- 
corporation; and this view continued to influence 
American attitudes toward the corporation until 
well into the nineteenth century. 

This customary relation between monopoly privi- 
lege and incorporation contributed in a major way 
to making the right of incorporation one of the 
issues in the struggle between Crown and Parlia- 
ment in the seventeenth century. Parliament first 
sought the right to confirm such grants and later 
the independent power to grant charters. During 
the eighteenth century, corporations were chartered 
by both the Crown and Parliament; this issue was 
not finally settled until the enactment of general 
incorporation laws in the nineteenth century. 

If we consider the transition from the older busi- 
ness or nonbusiness corporation to the modern, 
predominantly business corporation to be princi- 
pally a shift from the emphasis on association, 
with all the problems of governance of men in as- 
sociation for a common purpose, to an emphasis 
on the administration of a joint stock of capital for 
profit, it is probable that the British joint-stock 
company is the closest forerunner of the modern 
business corporation. Before the industrial revolu- 
tion, the sole proprietorship and partnership were 
overwhelmingly the most important types of busi- 
ness organization, both in numbers and in assets 
owned and controlled. But the seventeenth century 
witnessed an increasing number of business asso- 
ciations, both incorporated and unincorporated, 
trading on a common stock and acting in a com- 
mon name. Participation in a joint stock recom- 
mended itself particularly as a device for spreading 
the risk in hazardous overseas trading operations. 
Before the end of the seventeenth century, many 
joint-stock companies had been formed in England 
for domestic operations as well. 

Early in the eighteenth century, in both England 
and France, the formation of such companies as- 
sumed boom proportions. The two companies that 
achieved particular prominence in the ensuing de- 
bacle, the South Sea Company in England and 
John Law’s Compagnie de la Louisiane ou d'Occi- 
dent in France, were both chartered—the South 
Sea Company in 1711 and John Law’s company 
in 1717. But in England, at least, many of the 
joint-stock companies that fell into disrepute were 
not chartered. The Bubble Act of 1720 struck di- 
rectly at these unchartered companies, but its ef- 
fects seriously inhibited the formation of chartered 
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companies as well. This piece of legislation de- 
scribed itself as “An Act to Restrain the Extrava- 
gant and Unwarrantable Practice of Raising Money 
by Voluntary Subscriptions for Carrying on Projects 
Dangerous to the Trade and Subjects of this King- 
dom.” It prohibited: 


the acting or presuming to act as a corporate Body or 
Bodies, the raising or pretending to raise transferable 
Stock or Stocks, the transferring or pretending to 
transfer or assign any Share or Shares in Such Stock 
or Stocks without Legal Authority either by Act of 
Parliament or by any charter from the Crown to war- 
rant such acting as a Body Corporate. ... (quoted 
in Davis [1905] 1961, pp. iii-iv) 


Although the Bubble Act left the door open for 
duly chartered corporations, very few business cor- 
porations were in fact chartered. During the eight- 
eenth century the corporation was in eclipse. It 
was not that businessmen ceased to understand 
the merits of the corporation as a business form. 
Applications for the privilege of incorporation dur- 
ing the eighteenth century continued to stress the 
advantages of the holding of property in a common 
name; the legal right to contract, sue, and be sued; 
the effective delegation of power to a directorate; 
life extending beyond the lives of members; and 
limited liability. But the sentiments aroused by the 
crash of 1720 were deep and long-lasting, Adam 
Smith, writing on the eve of the industrial revolu- 
tion, expressed these sentiments firmly. In his view, 
the joint-stock company, without monopoly privi- 
leges, was likely to be a successful enterprise only 
in conducting such routine operations as banking 
and insurance, or the management of canals and 
urban water systems. 

Such enterprises usually required more capital 
than could be provided by a sole proprietor or 
a partnership. With the advent of the industrial 
revolution, the number of business opportunities 
that could be effectively exploited only by sizable 
amounts of capital increased rapidly. The joint- 
stock company was frequently the most effective 
way of assembling the required quantities of capi- 
tal, and the management of such companies rather 
quickly demonstrated that the effectiveness of this 
type of business enterprise was not, in fact, limited 
to routine operations. With the industrial revolu- 
tion the business corporation came into its own. 
Its subsequent development falls roughly into two 
periods. The first saw the spread of free incorpora- 
tion under permissive general laws. The second 
has witnessed the growth of the large, multiplant 
corporation, frequently through the amalgamation 
of small local and regional concerns. 


Nineteenth-century growth 

France took the leadership in liberalizing corpo- 
ration laws. The Code de Commerce of 1807 rec- 
ognized three classes of business associations, of 
which one—the société anonyme—was the busi- 
ness corporation with transferable shares. The 
formation of such companies required special gov- 
ernment authorization until the law of 1867, which 
permitted corporations to be formed on the basis 
of public registration. In England the Bubble Act 
was repealed in 1825, but incorporation continued 
to be permitted only by special grant until 1844. 
Limited liability was not granted as a matter of 
course until 1855. In Germany freedom of incor- 
poration was not established until 1870. 

The terms used to characterize the business cor- 
poration in different countries emphasize different 
aspects of the institution. The German Gesellschaft 
mit beschränkter Haftung stresses limited liability. 
The French société anonyme suggests the juristic 
as against the real person—a “person” whose life 
is independent of the lives of the incorporators. 
The British “limited joint-stock company” empha- 
sizes the common capital, with participating rights 
and limited obligations. The American “corpora- 
tion” refers back to the older meaning of an asso- 
ciation of individuals united for a common purpose. 

In the American colonies the only business cor- 
porations important enough to justify the invest- 
ment in royal charters were the colonies themselves. 
And the extension to the colonies of the principles 
of the Bubble Act in 1741 effectively precluded 
the formation of unincorporated joint-stock com- 
panies. Thus, the United States came into being 
with a population highly suspicious of the business 
corporation as an instrument of royal prerogative 
and royal favor. It was not long, however, before 
the business corporation was domesticated to the 
American environment. The state legislatures, con- 
ceived as heirs to the sovereign power, took it upon 
themselves to grant charters, and a sizable number 
of businesses were incorporated before the con- 
stitution was ratified. Despite the fact that a power 
of the federal government to incorporate was de- 
liberately omitted from the constitution, Congress 
chartered a corporation as early as 1791; the Su- 
preme Court, in McCulloch v. Maryland, ratified 
this use of power. Federal incorporation, however, 
has been relatively rare; and corporate policy very 
early in the nineteenth century became a preroga- 
tive of the several states, Ñ 

Some 335 businesses had been incorporated dn 
the United States by 1800, mainly banks, in- 


surance companies, and canal, turnpike, and bridge 
companies. Although special charters were re- 
quired, these were readily granted for almost any 
business purpose; in 1811 New York enacted the 
first law permitting extensive self-incorporation. 
This act covered the principal areas of domestic 
manufacture and permitted the self-incorporation 
of companies with an authorized capital of less 
than $100,000. Larger companies required special 
charters, and such charters continued to be sought 
even by a number of small companies. The Con- 
necticut Act of 1837 was still more general, and 
before the beginning of the Civil War a dozen 
states had enacted general incorporation laws. Most 
of these states, however, still left the way open 
for special charters. In fact, the general laws 
tended to be relatively restrictive and limited to 
smaller companies. Larger companies frequently 
attained a substantially greater freedom of action 
by special legislative enactment. 

Competition among the states for the business 
of incorporating led, after the Civil War, to a rather 
rapid relaxation of legislatively imposed limita- 
tions. This movement was favored by a business 
environment in which incorporation came to be 
regarded as a “right” rather than a “privilege”; a 
“right” that was conceived to be an essential ele- 
ment of the doctrine of individual liberty. The New 
Jersey Act of 1875, which was highly permissive, 
was perhaps the first of the modern incorporation 
laws; it was followed over the next two decades 
by similar laws in other states. This process led 
to a substantially more lax regulation of business 
corporations than characterized the company law 
of most European countries. 


The modern corporation 


The large majority of business corporations 
formed in any economically advanced country are 
hardly to be distinguished from other types of busi- 
ness organization. They normally enjoy limited lia- 
bility and a different tax status, but in essential 
respects function like other small firms. Economic 
theory has tended to see entrepreneurial activity 
as unaffected by the particular organizational form 
with which it happens to be clothed. But while the 
small, private corporation may be economically— 
and in other respects—undistinguishable from 
other types of firms, this is clearly not true of the 
large, quasi-public corporation. The large firm, 
because of its capital requirements, is almost in- 
evitably a corporation; most of the problems 
commonly associated with corporate activity are 
problems of size and dispersion of ownership. The 
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small, private corporation and the large, quasi- 
public corporation as legal persons may be equal 
in the eyes of the law, but they are so in no other 
respect. 

The business area in which capital requirements 
first demanded the large corporate organization 
was railway transportation. Since individual for- 
tunes were quite inadequate to meet these financial 
requirements, means were sought to enlist the 
savings of both public bodies and small private 
contributors. American railways were typically un- 
dercapitalized, and, in search of funds to meet 
underestimated financial requirements, a long se- 
ries of financial instruments was devised. Preferred 
stock was issued first in 1830, when the Baltimore 
and Ohio ran out of funds and appealed to the 
Maryland legislature. The state sought to protect 
itself by asserting a first claim on earnings. Early 
preferred stock was cumulative, participating, vot- 
ing, and temporary. All these characteristics were 
later varied and combined in different relation- 
ships. Debt financing was typical from the begin- 
ning of the railway era; and a wide variety of 
forms, sound and unsound, was evolved during the 
century. This variety of financing techniques de- 
veloped by the railways became available to the 
growing industrial corporations after the Civil War. 
Still, as late as 1900, three-quarters of the corpora- 
tions whose securities were listed on the New York 
Stock Exchange were railways. 

The wide variety of financial instruments in com- 
mon use created a similar variety in the participa- 
tion rights of various types of security holders to 
capital and earnings. In theory, participants, at 
least in equity, were all bound together in a com- 
mon interest and all working toward a common 
end. In fact, their interests and purposes tended to 
differ substantially. With this proliferation of 
securities, management acquired powers to appor- 
tion earnings and assets among various partici- 
pants that were in addition to and distinctly dif- 
ferent from older powers to manage the enterprise. 
Corporate law has slowly taken cognizance of this 
difference and, to prevent or limit the possibilities 
of exploitation inherent in the ability to influence 
participation rights in earnings and assets, has 
been developing the doctrine that all such powers 
have been granted only in trust, to be used equi- 
tably among the participants. 

The great merger movement of 1897-1903 
created in the industrial area corporations of a size 
that had formerly been by and large limited to 
the field of railway transportation. It was during 
this period that the big “twos,” “threes,” and “fours” 
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characteristic of so many American industries were 
largely established. These large manufacturing 
and mining corporations, together with the older 
railway enterprises and the rapidly growing cor- 
porations in the newer utilities, were organizations 
substantially different from private businesses in- 
corporated for tax reasons or to secure the advan- 
tages of limited liability. Their market position, 
absolute size, and internal organization created a 
set of problems essentially unknown to the world 
of the early nineteenth-century, family-type cor- 
poration. [See MERGERS.] 

Typically, these large corporations possess assets 
of $250 million or more, employ 5,000 or more 
workers, count their stockholders in thousands or 
hundreds of thousands, account for a relatively 
large share of output in their respective industries, 
and operate establishments in a number of differ- 
ent locations. A relatively small number of such 
corporations account for a large percentage of the 
economic activity in manufacturing, transportation 
and utilities, mining, banking and insurance, and 
in certain branches of commerce. A recent compi- 
lation of corporate income tax returns revealed 
525,000 active nonfinancial corporations in the 
United States with assets of $413,000 million. The 
202 largest of these, all with assets of $250 million 
or more, owned 40 per cent of total corporate 
assets. 

This is not a recent development in the United 
States, nor is it limited to the United States. Al- 
though the evidence is far from complete, it sug- 
gests no significant increase during the last four 
or five decades in the share of business activity 
(by various measures) accounted for by the largest 
corporations in the areas of manufacturing, mining, 
and public utilities. Such increase in concentration 
as has occurred in the economy as a whole during 
this period is mainly to be attributed to the decline 
in the relative importance of agriculture, a typi- 
cally small-enterprise sector. And with the relative 
growth of service industries, it is possible that 
concentration in the economy as a whole may even 
decrease. Recent studies in Canada, Britain, and 
western Europe suggest a degree of concentration 
that, by certain measures, is higher than in the 
United States. The large corporations in the United 
States are much larger than elsewhere, but so also 
is the economy. 

Nevertheless, the facts indicate a situation in 
which a few very large corporations account for a 
high percentage of business activity in a number 
of important sectors. And there is, moreover, a 
high correlation between absolute size and relative 


size within the different sectors and the industries 
constituting the manufacturing sector. The “twos,” 
“threes,” and “fours” in most industries tend also 
to be very large corporations. It is the relation 
between relative and absolute size that occasions 
most of the concern for what is commonly called 
“the problem of the large corporation.” [See Econ- 
OMIES OF SCALE; INDUSTRIAL CONCENTRATION.] 


The controversy 


A high degree of concentration in a particular 
industry, i.e., a measure of relative size, is com- 
monly supposed to indicate a substantial degree of 
monopoly power. The difficulties, however, of as- 
sessing the “degree of monopoly” are so formidable 
that it is doubtful whether this is a useful concept. 
An industry, however defined, inevitably differs so 
substantially from a market, within which it is 
appropriate to discuss the relations of monopoly 
and competition, that the one substitutes only very 
imperfectly for the other. Products of one industry 
compete with products from another, and firms in 
one industry are sheltered by transport costs from 
the competition of other firms in the same industry. 
It may be possible through the study of the struc- 
ture of particular industries and of the behavior 
of firms within this structure to come to a more 
or less objective judgment that the area of choice 
or scope of decision making of large firms is sig- 
nificantly greater than that of small firms or of 
firms whose behavior is compatible with the com- 
petitive model, but this is about as far as agree- 
ment among different students of the problem can 
be pushed. [See the articles MARKETS AND INDUS- 
TRIES; MONOPOLY. ] 

The area of choice or scope of action open to 
the large corporation has a good deal to do with 
the significance of the “corporate problem.” If large 
corporations were as circumscribed in their behav- 
ior as firms operating within the context of the 
competitive model, most of the aspects of this 
problem, as elaborated in the literature of the 
subject, would tend to disappear. In an atomisti- 
cally competitive environment, firms—whether 
incorporated or not—have no alternative but to 
buy and sell at established market prices; and a 
failure to maximize profits is tantamount to a 
failure of the firm. Under these circumstances, the 
legal characteristics or the internal organization 
of the firm is irrelevant to its behavior. Economic 
analysis has, in fact, tended to neglect questions 
of internal organization and to assume that entre- 
preneurial behavior is unaffected by the absolute 
size of firms. There is reason to believe that the 


95 per cent of American corporations employing 
fewer than 20 workers do indeed behave much as 
do small proprietorships and partnerships. Insofar, 
however, as large corporations are able to escape 
from competitive market restraints, they command 
an area of decision making within which behavior 
may be significantly affected by absolute size and 
the character of their internal organization. [See 
CoMPETITION.] 

The aspects of size and internal organization 
most often stressed in the literature on the large 
corporation are the separation of ownership from 
control, the growth of a managerial class with 
motivations allegedly different from those of the 
classical nineteenth-century entrepreneur, and a 
changing relationship of the corporation to govern- 
ment and to the local community. The separation 
of ownership from control is alleged to be pro- 
ducing, on the one hand, a change in the meaning 
and significance to be attached to private property 
and, on the other, a group of manager-controllers 
no longer owing exclusive responsibility to owners 
but also to other groups, such as suppliers, workers, 
and customers, who are attached in one way or an- 
other to the corporation. The management groups 
in large corporations are said to be self-selected and 
their activities to have the customary bureaucratic 
characteristics: hierarchical organization, profes- 
sionalization of function, formality of recruitment 
and promotion procedures, and a proliferation of 
written rules, orders, and record keeping. The 
changing relation of the large corporation to gov- 
ernment is described by some as an increasing 
vulnerability of size to public opinion and govern- 
mental action or threats of action, and by others 
as increasing opportunity for manipulation of 
public policies by large business as a “system of 
power.” The changing relation to the local com- 
munity is supposedly the product of multiestab- 
lishment corporations whose local installations are 
run by hired managers with only tenuous roots in 
the community. Obviously, some of these assertions 
are more speculative than others. z 

Separation of ownership and control. An in- 
creasing separation of ownership from control of 
the large corporation was clearly portrayed in the 
classic study of Berle and Means, The Modern 
Corporation and Private Property (1932). Nothing 
has appeared since then to deny this thesis, and 
much to confirm it. Very few among the largest 
corporations in the United States could be correctly 
described as majority-controlled. A larger number 
are controlled by discernible minority interests but, 
under the influence of estate duties, this group 1$ 
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declining relatively. A still larger number can only 
be described as management-controlled, and this 
number is increasing. The dominant influence at 
work, of course, is the increasing percentage of 
equity holdings in the hands of those concerned 
with returns rather than control. Management con- 
trol has been assisted by such devices as voting 
trusts, nonvoting and multiple-voting stock, and 
pyramiding; but the trend to management control 
would exist without these devices. Nor has the 
securities legislation of the 1930s in the United 
States, which sought to support “shareholders” 
democracy” by regulating the system of proxy vot- 
ing, had significant effect on this trend. 

Much less clear, however, are the effects, if any, 
on the controllers and on the owners of the sepa- 
ration of ownership from control. Wide dispersion 
of shareholdings has accentuated certain charac- 
teristics of corporate ownership that have been 
there from the beginning. The shareholder of today 
has typically become a rentier concerned only with 
dividends and capital gains, but this is implicit in 
an organization that puts many capitals together 
to form a joint stock. In Berle’s words, “The capital 
is there, and so is capitalism. The waning factor is 
the capitalist” (1954, p. 39). The capitalist as 
owner—manager of a local grist mill or corner garage 
is indeed hard to discover in the owner of 100 
shares of Standard Oil. Eighteenth-century justifi- 
cations of private property become somewhat obso- 
lete in this context. But whether the stock-and- 
bond owner of the twentieth century is any less 
devoted to the institution of private property than 
the “full-blooded capitalist” of the nineteenth cen- 
tury is not obvious. 

Nor is it clear to what extent the separation of 
ownership from control has affected the behavior 
of the firm. Managements of large firms typically 
and increasingly recruit their own replacements, 
taking account frequently of the interests of large 
lenders, investors, and possibly suppliers and cus- 
tomers; but without consulting the amorphous mass 
that represents majority stockholdings. This pro- 
cedure of self-recruitment and self-replacement of 
management, together with a certain vagueness as 
to whom management owes responsibility, does 
raise questions concerning the “legitimacy” of 
managerial power. But it does not follow that this 
power, whether legitimate or not, is used very dif- 
ferently than it would be if it were in the hands of 
owners. There is considerable evidence that, with 
respect to policies affecting dividends, retention of 
earnings, and relative reliance on outside financing, 
separation of ownership from control has not had 
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much influence. In other areas the supposed influ- 
ence is rather speculative. Furthermore, there are 
influences other than the separation of ownership 
from control that impinge on the behavior of the 
large corporation. 

Role of size. Absolute size, particularly if meas- 
ured in terms of numbers of employees, regardless 
of the location of ultimate control, creates a man- 
agerial problem for which bureaucracy is the only 
answer. Major decisions tend to be group decisions 
in which the divergent interests of separate func- 
tional or area divisions of the corporation are rep- 
resented by established suborganizations with pro- 
cedures and expectations of their own. Authority 
that would be centralized in a small firm is dele- 
gated, but within the limits necessary to the main- 
tenance of cohesion in the organization as a unit. 
The major as well as the minor decision makers 
are hired employees whose principal remuneration 
is by salary and whose expectation of economic 
improvement is through advancement in the hier- 
archy. 

Insofar as absolute size is correlated with size in 
relation to the markets in which the corporation 
purchases inputs and disposes of output, the possi- 
bility exists of escape from the profit-maximizing 
behavior necessary to the continued existence of 
the competitive form. This is true whether the cor- 
poration is owner-controlled or management-con- 
trolled. But whether a management-controlled firm 
is more prone to take one or another course of 
action alternative to profit maximization than an 
owner-controlled firm is not clear. 

The large corporation is likely to be confronted, 
on both the buying side and the selling side of the 
market, with units that are also large and well 
organized. In the capital market it confronts large 
banks, insurance companies, and other institutional 
investors well able to look after their own interests 
as capital suppliers. In the labor market it typically 
deals with large trade unions. The distribution of 
its product is frequently through dealers who are 
well organized and often protected by antidiscrim- 
inatory legislation. Finally, the large corporation 
acts within a political and social environment in 
which its size and conspicuousness make it a legit- 
imate object of political inquiry and of public 
interest. 

The large corporation is, then, typically manage- 
ment controlled; it is bureaucratically organized; 
it is usually able to escape to some degree from 
the competitive restraints that would compel action 
within a narrow range of alternatives; it is fre- 
quently confronted on the buying and selling sides 


of the market with large and well-organized units 
or groups; and it operates in a political and social 
environment in which its size makes it conspicu- 
ous. How these various characteristics act and react 
on each other and what influence, in concert, they 
have on the behavior of the large corporation is a 
subject on which there exists a large body of litera- 
ture but little hard evidence. There does seem, how- 
ever, to be substantial agreement that the “man- 
agerial capitalism” of the twentieth century is a 
different entity from the “classical capitalism” of 
the nineteenth and that the emergence of the large, 
multiproduct corporation, with widely dispersed 
shareholding, lies at the heart of the difference. 

Profit maximization. There is strong emphasis 
in the literature of managerial capitalism on the 
proposition that profit maximization, as it was 
understood in the nineteenth-century context, does 
not now govern the behavior of the large corpora- 
tion. By some this is interpreted to mean that long- 
run considerations rather than considerations of 
immediate gain are determining. Some hold that 
there are groups in addition to the owners whose 
interests are considered in the distribution of the 
gross proceeds of the corporation. Still others main- 
tain that in a management-controlled corporation 
there are managerial preferences, running counter 
to profit maximization and concerned with the 
security and continued growth of the firm, that 
significantly influence corporate behavior. It must 
be said that whether corporations do or do not 
maximize long-run profits is not a proposition sub- 
ject to statistical demonstration. Profit maximiza- 
tion can be precisely defined only in a static context 
with known cost and revenue functions. Any dem- 
onstration that corporations do not follow profit- 
maximizing procedures must produce weighty evi- 
dence of behavior clearly incompatible with any 
plausible concept of profit maximization. 

That large corporations shape their price, output, 
and other policies with a relatively long time hori- 
zon in view does not require demonstration. But 
such behavior is not limited to large firms, nor is it 
dependent on particular forms of business organi- 
zation. It is, rather, technologically determined and 
is dependent on the length of life of various inputs 
and on the possibility of cultivating persistent cus- 
tomer attachment. 

Assertions that management is no longer exclu- 
sively responsible to ownership but owes obligations 
to labor, suppliers, customers, and others abound 
in writing on corporate management. But a distinc- 
tion must be drawn between changes in the envi- 
ronment external to the corporation, to which 


management may react in ways that are compatible 
with sensible conceptions of profit maximization, 
and internal changes in corporate organization, 
creating obligations to interests other than owner- 
ship and incompatible with any principle of profit 
maximization, The twentieth-century corporation 
confronts an external politico-economic environ- 
ment substantially different from that of its nine- 
teenth-century counterpart. It would be strange if 
profit-maximizing behavior did not adapt itself to 
this environment. Increased emphasis on institu- 
tional advertising, expenditures devoted to improv- 
ing the “corporate image,” grants for university 
scholarships, contributions to local community 
chests, the lending of executive personnel to vari- 
ous types of government service, are all evidence 
of an adaptive reaction to the changed environ- 
ment within which the corporation operates; but 
they are not necessarily evidence of departure from 
profit maximizing. 

Somewhat more persuasive is the argument that, 
in managerially controlled corporations in which 
the principal decision makers are not profit re- 
ceivers, managerial preferences are likely to run 
counter to sensible interpretations of profit maxi- 
mization. The obligations that management recog- 
nizes are not so much to various interest groups 
as to itself, Management compensation tends to 
be highly correlated with the size of the firm. So 
likewise are power, prestige, and status. It is sug- 
gested, therefore, that management's concern is 
centrally with the growth of the enterprise at rates 
and in ways that do not endanger the continued 
viability of the firm or the position of the existing 
officeholders. This may or may not be compatible 
with a sensible interpretation of profit maximiza- 
tion. There is a certain plausibiilty to, and some 
evidence for, this proposition. It would be rather 
strange if the relaxation from immediate competi- 
tive pressures attendant on relative size, the admin- 
istrative changes attendant on absolute size, and 
the expansion of managerial control had not had 
some significant influence on managerial prefer- 
ences, Nevertheless, it must be said that these 
suggestions have not as yet been shaped into a 
clear and persuasive account of managerial moti- 
vation and behavior. 

Epwarp S. MASON 
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CORPORATISM 
See Fascism. 
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See PENoLocy and the biographies of BECCARIA 
and BENTHAM, 


CORRELATION 
See under MULTIVARIATE ANALYSIS; see also 
STATISTICS, DESCRIPTIVE, article on ASSOCIATION, 


COST 


In economics, the cost of an event is the highest- 
valued opportunity necessarily forsaken. The use- 
fulness of the concept of cost is a logical implica- 
tion of choice among available options. Only if no 
alternatives were possible or if amounts of all re- 
sources were available beyond everyone's desires, 
so that all goods were free, would the concepts of 
cost and of choice be irrelevant. If choices are 
made on anything other than a random, purpose- 
less basis, a criterion of choice is implied. What- 
ever the criterion, the chosen option will involve 
a loss of the highest-valued forsaken option. This 
implies that only if one chooses actions so as to 
maximize the value realized will cost be covered. 

Failure to appreciate the purpose of the concept 
of cost can lead to confusing the concept of cost 
with the undesirable attributes of some event. For 
example, when one builds a swimming pool, the 
toil and trouble of digging it and the nuisance of 
noisy, disobedient neighborhood children and un- 
invited guests who use it are undesirable attributes 
of the pool. They are not the costs of creating and 
having a pool. This distinction between (a) unde- 
sirable attributes inherent in some event and (b) 
the highest-valued forsaken option necessary to 
realize that event is fundamental, for only the 
latter is cost as the term is used in economics. 

We can illustrate. The construction and posses- 
sion of the pool involve an amalgam of undesirable 
and desirable attributes. But if, in some sense, the 
desirable exceed the undesirable, it does not follow 
that one would choose to have the pool. One might 
choose something else instead, say having an extra 
car, and that too would involve desirable and unde- 
sirable attributes. The decision maker must choose 
among events that are amalgams of “goods and 
bads.” He cannot choose all events whose desirable 
features more than offset their undesirable ones, 
given the limited resources at his disposal. A com- 
parison among all the available options (each con- 
sisting of an amalgam of good and bad) yields for 


each option a rank-indicating measure of value. 
The cost of one amalgam is the best of the for- 
saken amalgams. It is not necessary that for each 
event the good and bad attributes be separated and 
that there be assigned a measure of the undesirable 
attributes and also a measure of the desirable. 
Such a procedure would indicate only that many 
events are desirable on net, and a criterion of 
choice among these would still be needed. 

We can illustrate with the person deciding 
whether or not to have a swimming pool. He de- 
termines that the “good” consequences of a pool 
are worth what we shall call “100 units,” while the 
“bad” are equivalent to the loss of “70 units.” The 
best alternative to having a pool is, let us say, to 
take action “A,” with “good” attributes valued at 
50 and “bads” valued at a loss of 10. The pool has 
a net value of 30, while event A is worth 40. The 
cost of the pool is 40 (not 70), while the cost of 
A is 30 (not 10). What is lost if the pool is se- 
lected is the 40 units of value otherwise available 
by opting for A. 

The temptation to think that because events are 
valued by comparing the good attributes with the 
bad, cost must be the bad attributes is encouraged 
by business usage. Businessmen weigh revenues 
(as good consequences ) against expenses or costs. 
Considering these costs as the bad attributes over- 
looks the distinction between valuation and cost- 
ing. The value of a given event is obtained by 
weighing its good and bad consequences against 
each other—if one wants to think in terms of 
good and bad rather than less or more desirable— 
but the cost of that event is still not revealed. The 
highest-valued forsaken option must still be ascer- 
tained in order to determine the cost. Even in the 
businessman’s calculation, what his cost really 
measures, as shown below, is not the bad conse- 
quences of an action but the highest-valued for- 
saken opportunity. 

It is sometimes fallaciously thought that if 
building a pool involved even more pain or other 
undesirable consequences, its costs surely must be 
higher. But the costs of the pool are not higher 
unless the best alternative is affected. More pain 
in building a pool may or may not affect my alter- 
native opportunities. If an extra hour of work is 
involved, then my alternatives are changed be- 
cause I lose another hour of other desirable uses. 
The definition of cost does not deny that the pain 
and time and trouble of producing some event are 
influential in the measure of cost. But it does show 
that these aspects enter into costs only by affecting 
the value of the best forsaken opportunities. 

This can be seen more clearly if we consider the 


following situation in which the alternatives are 
not affected. Suppose that in building the pool, the 
pain to be suffered during a given time was to be 
more intense—but not longer-lived. In this case, 
the increase in intensity of pain (assuming that 
recovery is immediate upon the cessation of the 
work) does not affect the alternative opportunities. 
These stay the same. What this more intense pain 
does is reduce the value of the pool, not raise its 
cost. 

Another example of an increase in undesirable 
attribute that does not increase costs is one that in- 
creases that attribute uniformly for all opportuni- 
ties. In this case, the feature cannot be avoided no 
matter what one does. A uniform reduction in the 
value of all options reflects the lower level of 
“utility” now generally available. One could even 
call this effect a decrease in costs—since the best- 
valued options are now lower valued. The costs are 
lower because the values are lower, for that is what 
cost reflects. 

Clarification of the logical role of the concept 
of cost in order to explicate clearly the distinction 
between the two ideas—the value of forsaken 
alternatives and the so-called undesirable attributes 
—was begun by the Austrian school of economics 
in the nineteenth century and was further devel- 
oped by Frank Knight (1924). [See ECONOMIC 
THOUGHT. | 

Money costs in a society. The preceding is rela- 
tively unambiguous for choices or selections of 
options in a one-person world. But in a society, 
selection among options involves not only different 
options for the same person but also different 
options available to different people. Therefore an 
interpersonal value measure is necessary. A society 
in which choices are made in accord with a single 
dictator’s preferences resembles the one-person 
world. In a pluralistic (individualist) society, an 
interpersonal value measure can be based on inter- 
personal exchange rates. Voluntary market ex- 
changes among individuals reveal the highest values 
of available options and, hence, their costs in terms 
of values of forsaken options. These market prices, 
to which all people can adjust their choices, pro- 
vide a common measure of the value of increments 
of one event relative to others. 

For example, a market exchange rate of 1 Coca- 
Cola for 2 ounces of chocolate indicates the rela- 
tive value of each. The optional event—having 1 
more Coke—is compared with the option of having 
2 more ounces of chocolate. In an open market— 
one in which all people have access to all goods— 
the exchange rate, or price, of Cokes must at least 
equal the highest-valued alternatives to 1 more 
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Coke. If the price does not equal the highest-valued 
alternative, those who value a Coke at more than 
the market exchange price will prefer, and will be 
able, to enter the market and offer more for a Coke. 
And this will raise the exchange rate to at least the 
highest-valued alternative, Rather than expressing 
the values of alternatives to 1 Coke in terms of the 
amounts of chocolate, beer, or other individual 
goods that are as much desired as 1 Coke, con- 
venience dictates agreement on a common meas- 
ure of value, Since almost all formal contractual 
exchanges are conducted with the medium of 
money, all exchange rates typically are measured 
in units of money—as so many dollars or cents 
per Coke. The use of money prices does not mean 
that money is all that counts, or that people love 
money. It means simply that money is the medium 
of exchange and therefore is the convenient de- 
nominator of interpersonal exchange values of 
events or options. 

In sum, because goods are substitutable sources 
of utility, and because substitution is facilitated by 
exchange via money, it is possible to measure the 
value of a forsaken option in money terms. When 
goods can be obtained not only by interpersonal 
trade but also by production, at the “cost” of other 
things that could have been produced, the costs 
incurred in production choices will be related to 
the market prices of interpersonal exchanges if 
producers have access to markets in which to offer 
their products. 

Market prices and cost. The preceding discus- 
sion implies that cost in an exchange economy is 
based on market-revealed values. If some produc- 
tive resources are used in ways that yield less than 
their highest achievable alternative, or “oppor- 
tunity,” values, these uses will not cover cost. The 
incentive to increase one’s wealth induces shifts of 
resources to their higher-valued use until their cost 
is at least matched by the value of their currently 
yielded product. As the output of the service now 
being produced at a higher rate increases, the 
value of additional increments will fall until there 
is no further shifting of resources from other uses. 
By drawing resources from lower-valued to higher- 
valued uses, the value of a producible good or 
service influences the allocation of resources and 
so the rate of output of the good or service itself, 

This adjustment or reallocation of resources 
among various uses is often expressed less rigor- 
ously. For example: (a) “Lower cost resources are 
shifted to their higher valued uses.” (b) “If costs 
of production are less than potential values of out- 
put, low cost resources will be shifted to increasing 
the output of the higher priced goods, thus induc- 


408 COST 


Table 2 

Purchase price $3,000 

Resale value 1,652 (present value of $2,000 deferred 
two years) 

Depreciation $1,348 

Taxes and insurance 150 (first year) 

Taxes and insurance 136.35 (present value of second year’s 
payment) 

Present-value cost $1,634.35 


Events are rarely indivisible; instead, the magni- 
tude of the event can be varied. Thus, in the auto- 
mobile example, we could consider a set of alterna- 
tive output programs, e.g., running the car zero 
miles in two years, one mile in two years, two miles 
in two years, etc., up to, say, 20,000 miles in two 
years. Suppose that for each of these we can deter- 
mine the costs. The differences in costs between ad- 
jacent alternative programs is the incremental or 
marginal cost for the mileage increment. That is, 
the difference in cost between two-year programs of 
19,999 miles and 20,000 miles is called the mar- 
ginal cost of a mile of travel at 20,000 miles: it is 
the increment in cost for a 20,000-mile program 
over the cost for a 19,999-mile program. If we com- 
puted the cost for one mile of distance, for two 
miles, etc., up to 20,000 miles, we could compute 
a series of marginal costs at, or associated with, 
one mile more than no miles, one mile more than 
one mile, one more than two, etc. The sum of all 
these (including the cost of zero miles with two- 
year ownership) will total to the cost of ownership 
and 20,000 miles of travel. The concept of mar- 
ginal cost is relevant for deciding among available 
programs because it tells by how much the cost of 
one program differs from that of adjacent available 
programs. 

In the comparison of mileage programs we do 
not mean that one performs the program of, say, 
10,000 miles of travel and then, after completing 
that program, asks how much one more mile would 
cost. Instead, initially one considers the cost of a 
proposed 10,000-mile program and the cost of 
a proposed 10,001-mile program. The difference in 
cost between the proposed programs is the mar- 


Table 3 
Beginning Beginning End of 
of year 1 of year 2 yeor2 
Purchase price $3,000 $- $- 
Taxes and insurance 150 150 - 
Gas, oil, and maintenance - 500 400 
Resale value - - —1,700 
$3,150 $650 —$1,300 
10% present-value factor (1.00) (.909) (.826) 


Present-value cost: $2,667.05 = $3,150.00 + $590.85 — $1,073.80 


ginal cost. (To run one more mile as the result 
of a last-minute decision may involve a higher 
extra cost than if one had planned for that extra 
mile from the beginning. In the extreme situation 
one might have to buy another car in which to 
do it.) 

For any event there are two associated concepts 
of cost—total and marginal (the latter referring 
to a comparison between one particular event and 
another differing by one unit in some dimension 
of the event). For every alterable dimension there 
is a marginal cost of increments in that dimension, 
Two important dimensions in most output pro- 
grams are the rate or speed of output and the total 
volume to be produced. We shall confine our subse- 
quent discussion to changes in these rate and vol- 
ume dimensions. 

The present-capital-value measure of total (or 
of marginal) cost can be converted into a variety of 
other equivalent measures for expressing that cost. 
For example, the present capital value can be re- 
expressed as a future capital value with the future 
value measure (t units of time in the future) in 
the ratio (1 +i)’ to the present value, where i is 
the rate of interest. Alternatively, the present capi- 
tal value can be converted to a rate of costs over 
some interval. For example, a present-capital-value 
cost of $1,000 is, at 10 per cent per year, equiva- 
lent to a perpetual rate of cost of $100 per year, or 
to a rate of $263 per year for 5 years. 

If the event being costed consists of a group or 
collection of homogeneous units, e.g., the produc- 
tion of pianos, or the production of miles of service 
from a car, or the production of bushels of wheat, 
the cost can be prorated or expressed as an aver- 
age cost per unit of each item. In the automobile 
example, the event consisted of owning and driv- 
ing a car 20,000 miles in two years, the cost of 
which was $2,667.05. This can be expressed as 
$2,667.05/20,000 = 13.3 cents per mile of dis- 
tance. This is the prorated amount that, if re- 
ceived now for each future mile of service, will 
enable the receipts to cover the cost. 

Sometimes the rate at which revenues must be 
received in order to cover costs is measured not by 
dividing the capital-value measure of costs by the 
total volume of output, but instead by dividing an 
annual rate of costs by an annual rate of perform- 
ance or output. For example, the present-value 
measure of cost in the above illustration was 
$2,667 (for a two-year program of 20,000 miles of 
travel at the rate of 10,000 miles per year). The 
$2,667 present value can be re-expressed as an 
equivalent-valued continuous-flow annuity for two 


years, at 10 per cent per year compounded con- 
tinuously. This steady-flow or rate measure is 
$1,479 per year for two years (and since there are 
8,760 hours in a year, this is equivalent to $1,479 
per year/8,760 hours per year = 16.9 cents per 
hour), The speed of service, at the rate of 10,000 
miles per year for two years, is equivalent to 
10,000 miles per year/8,760 hours per year = 1.14 
miles per hour. If we divide one annual (or hourly) 
rate of costs by the other annual (or hourly) rate 
of service (i.e., $1,479 per year/10,000 miles per 
year, or 16.9 cents per hour/1.14 miles per hour), 
we get 14.8 cents per mile. Therefore, if costs 
are to be covered by revenues received concur- 
rently with the service performed, the receipts 
must be 14.8 cents per mile of distance. (This dif- 
fers from the earlier cost measure of 13.3 cents 
per mile paid at the beginning of the entire two- 
year program because the 14.8 cents is paid later 
and includes interest on the average delay. ).. 

Extreme care must be taken to ens rates 
are divided by rates or that present-cap! faltvalue 
measures are divided by volume measures of the 
output, Confusion will result if rate (flow) meas- 
ures of output are divided into capital-value 
(stock) measures of cost. That would yield cost 
per unit of speed of output (e.g. miles per year), 
not per unit of output (e.g. miles). Since outputs 
are usually sold or priced in units of output or vol- 
ume, rather than in units of speed of service, it is 
more useful to consider the covering of costs by 
receipts per unit of volume of output rather than 
per unit of speed or of rate of production. 

Fixed and variable costs. The preceding discus- 
sion distinguished among events being costed ac- 
cording to whether they involved (a) the ownership 
of some good, (b) the operation of that good to pro- 
duce some service, or (c) a unit expansion of the 
event, giving a marginal cost. For some purposes 
a classification of costs may be useful. It may be 
relevant to know, for a chosen output program, 
what costs have been incurred even if we were, at 
some subsequent moment, to abandon the pro- 
gram. As was seen in the automobile example, at 
the moment of purchase we have incurred some 
loss of resale value, e.g., $3,000 — $2,500 = $500. 
That “cost” is “sunk” or “historical.” Once we pur- 
chase the car, it cannot be escaped. It should play 
no role (except as a help in forecasting costs of 
similar future events) in any subsequent decision, 
for regardless of what we do, that historical “cost” 
has been incurred, and is inescapable and unaf- 
fected, For any ensuing decision only the escap- 
able, or “variable,” costs are relevant. 
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Having separated sunk or historical “costs” 
(which really are no longer costs) from future 
costs, we can proceed to classify future costs into 
invariant and variable costs. Suppose a person can 
choose among a restricted set of output programs 
but that associated with all those options there is 
a common set of activities or inputs, the cost of 
which is therefore common to each option in the 
subset. The cost of these common activities is 
sometimes called a “fixed” cost. Regardless of 
which option in the subset he chooses, he cannot 
avoid those “fixed” costs. But since the real range 
of options is greater, he really can escape that 
“fixed” cost by choosing an option outside that 
subset. Therefore that “fixed” cost is not a “sunk” 
cost. Fixed cost is a useful concept, for example, 
in situations in which there can be delegation of 
authority to choose within some subset. So long as 
the selection is to be made within that subset, only 
the costs other than"the “fixed” costs are relevant. 
But for the larger range of options, the “fixed” 
costs are not “fixed” and are relevant for compar- 
ing options. To avoid the impression that “fixed” 
costs are fixed upon a person as an inescapable 
loss, it seems appropriate to use the name “invari- 
ant” rather than “fixed,” but this is not yet a gen- 
erally accepted terminology. (Fixed or invariant 
costs would be “sunk” if and only if the subset was 
in fact the entire set of possible options, for then 
regardless of what one did, one could not avoid 
the sacrifice of those alternatives. ) 

Law of costs. So far, we have classified costs 
according to differences in the event being costed, 
and also in terms of various ways of expressing 
the costs of a specified event or output program. 
The question to which we turn now is whether 
there are any laws or general propositions that relate 
the magnitude of costs to the characteristics of 
output programs. But first it is pertinent to identify 
the relevant characteristics or dimensions of an 
output or production program. As suggested earlier, 
the total volume and the rate or speed of produc- 
tion are two important dimensions of such a pro- 
gram. A third is the timing of the output. We may 
denote these three variables as follows: V is the 
volume of output, v(t) is the rate of output at 
moment t, T, is the present moment, and Tm is the 
terminal moment. An increase in v(t) will either 
increase V or, for fixed V, will decrease Tm (move 
is closer to T,). Let C denote the capital-value 
measure of cost of the entire program. Several laws 
can now be stated in terms of these symbols. 

(1) It is a well-recognized and validated law 
that cost is larger the larger V is (whether V is 
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increased by increasing v(t) or by increasing Tm). 
Simply put, a bigger output costs more than a 
smaller one. Symbolically this means C/V is 
positive, even for fixed or unchanged v(t). The 
expression JC/dV is called the marginal cost with 
respect to volume. 

(2) Another proposition is that C/V is smaller 
(but always positive) the larger V is (again with 
the rate of production held constant and with the 
increased V being obtained by increasing T,,). In 
symbols, @°C/dV? is negative. This effect is some- 
times referred to as the lower costs effect of mass 
or large-volume production. A larger output can 
always be produced by replicating the technique 
for a smaller output. However, sometimes a larger 
output can be produced at lower cost through the 
use of different techniques (e.g., metal dies in- 
stead of sand casting for forming metal), but this 
cheaper method cannot be subdivided proportion- 
ately for smaller volume. It follows that larger 
volume will at most involve proportional increases 
in total cost (by replication of the cheapest meth- 
ods for small volumes) and may permit utilization 
of lower-cost methods. Learning and improvement 
in methods with a larger volume of output are also 
predictable. Both effects, substitution of cheaper 
methods for larger volume and learning, contribute 
to the decrease in increments of total cost for in- 
crements in volume. 

The two laws relating costs to volume of out- 
put imply that (3) the average cost per unit of 
volume of output decreases, the larger the volume 
—a widely recognized phenomenon. This lower 
unit cost with larger volume is manifested in the 
extensive standardization of products, in contrast 
with the less common individually styled, custom- 
built goods, which would be preferable if the costs 
were no higher. This lower cost with larger volume 
(along with the gains from specialization in pro- 
duction resulting from the greater heterogeneity 
of productive resources) is one reason why larger 
markets and population areas permit lower costs 
per unit. 

(4) A law relating cost to the rate (not volume) 
of output is that the cost, C, is a positive function 
of u(t) for any given V; that is, 0C/dv is positive. 
The more rapidly a volume of output is produced, 
the higher its cost. 

(5) Another, possibly less general, law is that 
the marginal cost with respect to rate, 9C/dv, while 
always positive, increases for larger v (that is, 
8°C/dv* is positive). This law is possibly less gen- 
eral because the evidence is contradictory for “very 
low” rates of output, at which it is sometimes 
claimed that increases in the rate might lead to 


decreasing increases in total cost. Nevertheless, 
a general and universally valid law is that for 
every volume of output there exists an output rate 
beyond which the marginal cost with respect to 
rate always increases. This is commonly called 
the law of increasing marginal costs and reflects 
the well-known law of diminishing marginal re- 
turns with respect to rate of output. If expressed 
in terms of average costs per unit of volume of 
output, the effect of higher rates of production of 
that volume is persistently to raise the average 
cost—after a possible initial fall in average cost 
for very low output rates. 

(6) Instead of increasing the rate at which 
some constant volume is produced, output pro- 
grams can be different in that both the rate and 
the volume are proportionally larger over a speci- 
fied interval of time. Joint proportional increases 
in both the rate and the volume (over the given 
interval of production) will of course raise total 
costs. The effect on the cost per unit of product is 
not predictable except for “high” rates of output. 
Unlike proposition (3), concerning per-unit cost, 
proposition (6) involves an increase in the rate of 
output as well as in the volume. These two work 
in opposite directions on the per-unit cost, with 
the higher rate increasing unit costs while the 
larger volume decreases them. The rate effect ul- 
timately will dominate as programs with higher 
rates are considered. For production programs ar- 
rayed according to the rate and volume of output 
(both varying strictly in proportion to each other) 
it follows that the average cost per unit of volume 
of output can be decreasing for small outputs. But 
as larger outputs are considered, the average cost 
will, beyond some output rate, begin to rise per- 
sistently and with increasing rapidity until a lim- 
iting rate of production is realized—at which all 
the resources of the world are devoted to this one 
program over that given time interval. 

Short-run and long-run costs. We are now in 
position to examine another classification—short- 
run and long-run. Although it is common to see 
references to the short-run and long-run costs of 
some production program, there is in fact only one 
cost for any program. The short-run—long-run 
cost distinction rests on two concepts that are 
sometimes confounded with each other. A short- 
run cost is sometimes used to refer to a short, as 
contrasted with a long, program of production. At 
other times it is used to refer to the cost of doing 
something more quickly rather than less quickly. 
Yet in each case the shorter output and the quicker 
output both involve higher per-unit costs than do 
the longer output and the later output. Sometimes 


the higher per-unit short-run cost (no matter in 
which of the two different senses) is attributed 
to an alleged fixity in some of the productive 
units. In fact, of course, no producer is stuck with 
literally fixed inputs (except in the sense that 
momentarily it is hardly possible to increase any- 
thing). What is true is that it is more expensive 
to vary some inputs in any given interval than to 
vary others. That differential cost of adjusting var- 
ious inputs is often oversimplified into an extreme 
bipolar classification of fixed and variable inputs. 

The purpose of the long-run-short-run distinc- 
tion is to note the differences in cost between 
different output programs, those achieved in the 
more immediate future in contrast with those un- 
dertaken later, when one can get the advantage 
of less expensive, less hasty adjustments. For ex- 
ample, if the demand for some good increases, 
producers will be able to respond immediately, but 
at a higher cost than for less hasty revisions of 
output. Although the “same” good is being pro- 
duced (except, of course, for the important differ- 
ence in the time of its availability), the cost is 
lower for the later output. To trace the impact of 
a demand change on output and prices, one will 
want to recognize the difference in the output and 
price with the passage of time. Instead of tracing 
out a continuous history or sequence of subsequent 
developments, it is convenient to divide the history 
arbitrarily into two episodes: the relatively im- 
mediate response (the short-run) and the limiting 
ultimate response (the long-run). The difference 
between these two “runs” indicates the path and 
direction of effects subsequent to the initial event. 

While the long-run-short-run distinction serves 
as a convenient two-stage analysis of a sequence 
of effects, obviously there are as many “runs” as 
one wishes to consider. However, in analyzing 
total effects, three states or runs are usually con- 
sidered: the “market period” (referring to that 
period of adjustment in prices which occurs before 
there is any change in output), and the afore- 
mentioned short-run and long-run, during both of 
which output is changed. 

Joint products and unallocable costs. Suppose 
that an output program yields several joint prod- 
ucts, e.g., wool, meat, and leather from sheep; 
or gasoline and kerosene from crude oil; or heat 
and light from electrical energy; or passenger-miles 
and freight-miles from an airline. What is the cost 
of each of the joint products? Depending upon 
which one is called the residual, or by-product, a 
different allocation of costs can be obtained. By 
calling meat the “basic” product and attaching most 
of the costs to it, the costs of wool can be made 
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small, and conversely. It is tempting to jump to 
the conclusion that something must be wrong with 
the concept of cost or with the economic system 
if such indefiniteness can result. After all, if costs 
cannot be uniquely allocated, how can one tell 
what prices are right? How can one tell on which 
of the joint products he is making a profit? If 
costs cannot be assigned, how can one tell which 
to produce or what prices to charge? In fact, how- 
ever, the presence of cost that cannot be allocated 
uniquely among the joint products does not upset 
anything or prevent unique prices. 

If we recall the purpose of the cost concept— 
that of enabling choices among alternatives ac- 
cording to some criterion of preference—we see 
that what is required is a way of assessing the 
consequences of changes in the output. If the air- 
line program is revised to transport more passengers 
and less freight, or revised so as to transport more 
passengers with the same amount of freight, what 
happens to costs? Comparing the costs of alterna- 
tive programs gives marginal costs, which with 
the marginal value of the revised output give a 
basis for a decision. There is no possibility and no 
necessity for allocating costs into uniquely identi- 
fiable parts for each product in order to determine 
what to produce and what prices to ask. The prices 
set will be those which allocate the amount pro- 
duced among the competing claimants and yield 
a maximum wealth to the producer of the joint 
products. His power to maximize his wealth will 
of course depend upon competitors’ access to the 
market. The function of inducing output does not 
require an assignment of portions of total cost to 
each of the joint outputs. What is necessary is a 
comparison of the total cost of the set of joint 
products with its value. If the market value of the 
set does not cover the cost, in an open market, the 
loss of wealth will induce reduced production (of 
some or all the joint outputs) and higher prices, 
until the value of the set of joint products covers 
the costs. (If joint products can be produced only 
in fixed combinations, then not even marginal 
costs of each output can be ascertained; neverthe- 
less, everything said in the preceding two sen- 
tences is still valid and applicable.) 

Private and social allocations of costs. Through- 
out the preceding discussion the costs of a choice 
were assumed to be borne by the chooser; none 
of the forsaken options are forsaken by anyone 
else. If Smith builds a swimming pool, the for- 
saken options—the costs—are all borne by him. 
The options open to the rest of the community 
or to any of its members are in no way reduced. 
So we assumed. If, however, Smith builds a pool 
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and in doing so creates a “nuisance” for his neigh- 
bor, Jones, Smith has taken away Jones’s peace 
and quiet. If Smith’s pool overflows and harmfully 
floods Cohen’s land, Cohen has had options re- 
moved from his range of choice. Being less careful 
and thereby letting water run over into a neigh- 
bor's land, or having a more riotous time and dis- 
turbing the peace, is less costly for Smith if he 
dees not incur the costs of being more careful in 
watching the water level or in soundproofing his 
play area. 

The situation is similar to that of the factory 
owner who “dumps” smoke, waste, smells, noises, 
and night lights on other people’s land. By doing 
so he keeps his land in better condition and avoids 
the costs of filtering his smoke, collecting and 
disposing of his own garbage, etc. He makes others 
bear some of the costs, instead of bearing them 
himself. His actions involve a sacrifice of alterna- 
tive uses of goods, which sacrifice, instead of being 
borne by the decision maker, is in part borne by 
or imposed on other people. 

“Property rights are not private” is another way 
to express this situation. The use of “one’s” re- 
sources is not subject solely to the owner's volun- 
tary control, but is in fact and de jure controllable 
in part by other people. This ability to “use” other 
people’s resources for one’s benefit, and thereby 
remove their options, enables one to make other 
people bear part of the costs of one’s decisions. The 
costs are divided between the decision maker and 
outsiders. This division or separation is called a 
divergence between private and social costs— 
where social costs are treated as the whole of costs 
as defined in the earlier portions of this discussion, 
with private costs being the portion of those costs 
borne by the decision maker or owner of the re- 
sources directly concerned. Social and private costs 
are not two different costs—they are merely classi- 
fications according to the bearer of the cost. If 
there is no divergence, so that ali social costs are 
private costs, then all the costs of use are borne 
by the person choosing or authorizing the choice 
of action. The divergence between private and 
social costs is also characterized as the presence 
of “external” costs. [See EXTERNAL ECONOMIES AND 
DISECONOMIES.]| 

Parallel reasoning is relevant on the side of 
benefits. The value of a resource in this use may 
be incompletely revealed or have incomplete in- 
fluence on decisions if that value is dispersed so 
that only a part of it accrues to the decision 
maker. This is a divergence between private and 
social value in this use. If the value measure 
assigned to any particular potential use by the 
chooser is less than the total value in that use, then 


there will be a divergence between his private 
valuation and the social valuation. In this case, 
values of some uses of resources are not as fully 
revealed and available as inducements to the com- 
peting resource users as are the values of other 
uses. As a result, the values of some uses will be 
understated, which encourages more of other kinds 
of use by leading to an underestimation of their 
cost. Thus the analysis of external versus private 
or of social versus private values or costs is an 
essential part of the analysis of the meaning and 
role of costs. 

But whatever they are called, such effects are 
commonplace and well-nigh universal. For exam- 
ple, every voluntary act of exchange involves a 
choice of use of resources that benefits the other 
party as well as oneself. However, the external 
effect is “internalized” as an inducement on the 
acting agents. If you give me “that,” I will do, or 
give you, “this”; and what you give me reflects the 
gains you will get from what I do, The external 
effects of my actions are made internal or effective 
by your ability to offer me a gain reflecting the 
value to you. The external costs of my acts are 
internalized or made effective in controlling my 
behavior by laws prohibiting my imposing any 
such costs on you unless I pay you an acceptable 
amount for the right to do so. Our laws of prop- 
erty and the right to engage in exchange help to 
make private costs also contain the social costs, 
and to make private gain reflect social gains. In 
other words, external effects are usually inter- 
nalized. 

In every society the extent of a divergence be- 
tween private and social costs (or the presence 
of external effects) for some resource use depends 
upon the technological facts and upon the legal 
structure of property rights. The costs of defining, 
policing, and enforcing various types of property 
rights vary. Private property rights, defined as 
those in which external physical effects are not 
permissible, may be too expensive to enforce with 
respect to some effects. But if there is a cheap way 
to internalize external effects or to make the pri- 
vate costs equal the social costs, then the use of 
resources will respond more fully to the cost or 
values of use. If there were some cheap means of 
excluding other people from enjoyment of some 
use I may make of my resources, then I could 
charge them for the availability of that enjoyment 
and thereby make that value of use effective in 
my decision as to how to use resources. This is 
a means of internalizing external effects or of 
making external effects “inducive” with respect to 
my choices about resource uses. 

Often the costs that must be borne in order to 


internalize external effects exceed the value of 
those external effects but may nevertheless be 
worth incurring if they involve associated revenues 
and a more profitable, larger enterprise. For ex- 
ample, a golf course provides benefits to neighbor- 
ing landowners. A golf course builder could buy 
enough land to build a course and to build homes 
on the surrounding property, thus internalizing 
the higher value from proximity to the golf course. 
Another example is that of the apartment building 
in which the rental includes the cost of main- 
tenance of common gardens and recreation areas, 
rather than having each tenant maintain his own 
area. The purchase of cemetery lots includes a 
payment for upkeep of the whole cemetery. By 
such devices, neighborhood effects are made the 
owners’ effects. 

Another important means of internalizing or 
making external effects “inducing-effects” is the 
development corporation, which enables a larger 
venture to be undertaken so that more of the bene- 
fited resource owners can be included in the unit 
of ownership that provides the benefits. If all the 
land of a suburban shopping center is owned by 
one enterprise, there can be more complete re- 
sponse to the total value of the shopping center, 
which includes values external to the component 
units resulting from their proximity. Similarly, de- 
partment stores with several departments in one 
building are a means of “internalizing” values or 
of making private and social effects converge. 
Signal decoders and wire transmission systems for 
television, fences around athletic pavilions to keep 
out nonpaying spectators, and walls around thea- 
ters are examples of devices (not costless) for 
internalizing and increasing the value of the serv- 
ice to those who provide it, and so are “inducive” 
to that resource use. 

In other cases the value of complete suppres- 
sion of external effects may be less than the cost. 
For example, automobile exhaust suppressors and 
smoke filters are not universally required. As a 
result, those who create smoke and smog thrust 
part of the costs of their actions on other people. 
An especially instructive example is provided by 
the problem of noisy airplanes. If an airport owner 
had to compensate the nearby landowners for the 
noise made by the airplanes using his airport, the 
landowners would in effect be selling rights to that 
particular use of their land, and the airport owner 
could in turn charge the airplane owners. Instead, 
one of the following solutions is usually adopted: 
(1) There is no compensation for the noise. (2) 
The planes are prohibited. (3) The neighboring 
land is bought up and people are prohibited from 
living there—even though many would prefer to 
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do so, if they could buy the land at a low enough 
price to reflect the value of the lost quiet. These 
extreme policies are sometimes explained by an 
incorrect presumption that it is impossible or un- 
desirable to buy the rights to “dump” noise on 
neighboring land; in fact, they are used because 
neighboring landowners do not have a legally rec- 
ognized right to the undisturbed use of their land. 

As the preceding remarks have indicated, often 
our legal structure of property rights is such that 
decisions are made in which only part of the costs 
are operative in affecting the choice. This may be 
a result of a deliberate attempt to attenuate the 
role of costs in decision making or, because of 
technological features, it may be the result of the 
difficulty (cost) of defining, policing, and enforc- 
ing rights to resources in such a way that private 
and social costs do not diverge much, Laws may 
be what they are because those most influential 
in affecting them may want resources to be used 
with less regard to the exchange-value measure 
of costs. It may be thought that the values the 
people of the society would express in the way they 
would use resources are inappropriate or improper 
and therefore should not be so influential in affect- 
ing resource allocations. If so, choices about uses 
of resources should be insulated from those al- 
ternative use values (ie., costs). This can be 
achieved by suppressing a market place in which 
market prices would reveal alternative use values, 
or it can be achieved by not sanctioning private 
property rights, so that no one can negotiate an 
exchange that would reveal alternative use values 
(that is, resources would not be “owned,” in the 
sense of being salable). 

Policing and enforcing of property rights is not 
performed exclusively by the government. In many 
cases other forms of control are effective. Etiquette 
and socially accepted codes act as determiners of 
rights. These institutions serve, in part, to restrict 
the extent to which a person can impose the costs 
of his choices on others, That is, they are often 
means of inducing behavior of a type that would 
occur if resources involved were “privately” owned 
and exchangeable. Custom and etiquette, along 
with property rules, affect the degree of concentra- 
tion of costs on decision makers. 

External value effects and costs. Still another 
source of confusion is the confounding of the ex- 
ternal price effects of some event with its costs. 
Cost has been defined as the highest-valued option 
necessarily sacrificed consequent to action A; 
Suppose that I open a restaurant near yours, and 
by virtue of my superior cooking talents attract 
customers away from you, with a consequent loss 
of wealth by you of, say, $50,000. So far as you 
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are concerned, my effect on you is as bad as if I 
had burned your uninsured $50,000 building for 
the joy and excitement this afforded me. From an 
analytical point of view, the former loss of $50,000 
of value is not a cost, whereas the destruction of 
the building would have been a cost. Why the 
difference? Simply that opening a restaurant does 
not necessarily involve a sacrifice to society at 
large, while the destruction of the building does. 
My superior cooking skills do not involve a sacri- 
fice of $50,000 of alternatively valuable output, 
whereas my enjoying the fire would. My superior 
cooking may impose a loss of wealth on you be- 
cause I outcornpete you in providing services to 
third parties, But the $50,000 loss to you is more 
than matched by the gain in the value of service 
to the third parties who were formerly your cus- 
tomers but who have shifted to me, and by the 
increase in my own Wealth. No formerly available 
options are forsaken by society as a whole. Every- 
thing that could be done before I opened my res- 
taurant still can be done, That $50,000 is not a 
sacrificed opportunity—instead, it is a measure 
of a transfer of wealth from you to two other par- 
ties, me and the customers. The distinction be- 
tween the transfer of rights to uses of resources 
and the costs of use of goods should be kept clear. 
For example, when I open a new restaurant service, 
and the public offers less for your goods and more 
for mine, they are telling you that the exchange 
rights formerly attached to your goods—their mar- 
ket value—is being transferred by them to my 
goods. 

The transfer of rights of choice of use and the 
revision of exchange values consequent to changes 
in offers by competitors, or consequent to changes 
in tastes by customers, do not reduce the total set 
of alternative use options, The transfer changes the 
person authorized to control the decision as to use. 
When my superior culinary talents reduce the ex- 
change value of your services (without affecting 
their physical attributes in any way) and so re- 
duce your wealth, society could in principle take 
away some of my gains and those of my customers 
(who gain by accepting my offers rather than 
yours) and fully reimburse you, while still leaving 
me and the customers better off than before I 
entered the Market. Such compensation is not 
possible for true costs. 

The person who loses wealth either via transfer 
of goods or the reduction of their exchange value 
is suffering areal loss of wealth, but not a cost, 
That loss is different in principle, in kind, and in 
fact from a cost. From the private point of view 
both sources Of loss of wealth are “bad” for him. 


Both are losses of opportunities to him, though 
only a cost is a loss to the community as a whole. 
What he loses in the pure price revaluation case, 
someone else gains. 

There are many examples of the use of public 
policy to reduce such transfers. Taxes have been 
imposed on innovations or on new products in 
order to reimburse owners of resources formerly 
used to produce the displaced products. Sometimes 
laws are passed prohibiting new, cheaper devices, 
in order to preserve the marketable wealth of users 
of older, more costly methods. Sometimes general 
taxes are imposed to aid those whose wealth is 
reduced by new methods, e.g., government financ- 
ing of retraining of displaced workers and low- 
interest loans to business firms in distressed areas. 
Taxes on innovations make the innovators count 
the taxes as part of their costs. The “costs” of 
innovation are thereby biased upward, with a re- 
sultant attenuation of the incentive to introduce 
new methods or products that would produce a 
larger total wealth. 

Although wealth transfers via market revalua- 
tions are not costs, they may influence behavior. 
For example, if such market revisions of wealth 
are (somehow) deemed undesirable, steps can be 
taken to restrain people from taking actions that 
revise the distribution of wealth, or steps can be 
taken to redistribute the wealth again so as to 
restore the status quo ante wealth for each indi- 
vidual. Social policy (laws of property) may be 
evolved to insulate decisions from these effects or, 
conversely, to make them more sensitive. But in 
neither case are these market-price side effects on 
wealth components of costs. 

We conclude by returning to the initial theme. 
The costs of some event are the highest-valued 
options necessarily forsaken. We have seen that he 
who is to forsake those options and he who makes 
the decision about the chosen option may or may 
not be the same person. Furthermore, the privately 
borne costs may be less or greater than the true 
costs, depending upon laws and upon the struc- 
ture of property rights. 

ARMEN A. ALCHIAN 
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COUNSELING PSYCHOLOGY 


Counseling psychology, a specialty within the 
area broadly designated as applied psychology, is 
not primarily an entitative science but draws 
heavily upon the basic and applied fields of psy- 
chology and upon other behavioral sciences for 
its foundations (Berdie 1959). It uses concepts, 
tools, and techniques that are also used by other 
specialty groups—notably industrial and person- 
nel, clinical, and school psychology. It is, however, 
most appropriately viewed as the application of 
psychological and behavioral science knowledge in 
the form of a unique personal service furnished 
by professional practitioners with special qualifica- 
tions. As such a specialty the most important char- 
acteristic of counseling psychology is its focus on 
the decisions and plans that individuals must make 
in order to play productive roles in their social 
environments. It is irrelevant whether the person 
receiving professional assistance is sick or well, 
normal or abnormal, handicapped or whole—that 
is to say he is a client and not a patient. In coun- 
seling psychology the emphasis is on further de- 
velopment as an individual; its concern is the 
identification and enhancement of possibilities and 
potentialities (American Psychological Association 
1961), 

Other important defining characteristics of coun- 
seling psychology include the following: (a) a pri- 
mary focus on normal people; (b) service that is 
available throughout the life span; (c) emphasis on 
the individual's strengths and assets; (d) empha- 
sis on cognitive elements, especially where choice 
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and decision are involved, with rationality and 
reason stressed; (e) the dealing with personality 
difficulties in the context of the total goals, plans, 
and roles of the individual; (f) the giving of full 
consideration to situational and environmental 
factors, with emphasis on the utilization of en- 
vironmental resources and on environmental modi- 
fication where judged necessary. 

Basically, counseling psychology is concerned 
with the choices, decisions, and plans that every 
individual must make, as contrasted, for example, 
with clinical psychology, which is largely concerned 
with the problems and difficulties that some indi- 
viduals face (Sundberg & Tyler 1962). This dis- 
tinction is demonstrated by the historic and con- 
tinuing emphasis on vocational counseling as a 
major element in establishing the unique identity 
of counseling psychology (Brayfield 1963). 

Historical background. Counseling psychology 
(it came to be so designated in the early 1950s) 
evolved historically under a variety of influences. 
The first, vocational guidance, began in this cen- 
tury under the auspices of social welfare agencies 
and took shape around 1909 with the writings of 
Frank Parsons (1909), who described vocational 
guidance as a process of vocational orientation, in- 
dividual analysis, and counseling; primary focus 
was upon the collection and dissemination of occu- 
pational information in the absence of useful 
methods of individual appraisal and theories and 
techniques of interviewing (Brewer 1942). 

At about the same time the scientific study of 
individual differences, sparked by James McKeen 
Cattell, and the development of psychometric de- 
vices got under way. Psychometric efforts yielded 
useful measures of individual differences in occu- 
pationally significant aptitudes and interests by the 
middle 1920s. In the early 1930s large-scale pro- 
grams of vocational guidance—as developed by 
Donald G. Paterson (see Paterson & Darley 1936) 
for the Minnesota Employment Stabilization Insti- 
tute and at the Adjustment Service in New York— 
demonstrated the practicality of professional coun- 
seling based on the available psychological knowl- 
edge. During this period the student counseling 
program established at the University of Minne- 
sota—under the leadership of Paterson, Edmund 
G. Williamson, and later John G. Darley—set a 
pattern which has been widely emulated in colleges 
and universities (Williamson & Darley 1937). 

In the 1940s, personality and learning theories 
and the increasing interest of psychologists in 
psychotherapy and practice, particularly under the 
influence of Carl R. Rogers (1942), helped shape 
the course of counseling psychology. Psychody- 
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namic interpretations of behavior and an accom- 
panying concern with psychopathology contributed 
to the emerging specialty. 

New elements were introduced in the early 
1950s, when the ideas and findings of develop- 
mental and social psychology were recognized as 
directly relevant to counseling. The psychology of 
careers, as formulated by Donald E. Super (1957), 
took on a new dimension when viewed in this con- 
text. Gradually this approach has been extended 
by Milton E. Hahn (1963) and others to include 
the major events of the entire life span—including 
the formation of a family, participation in com- 
munity and leisure-time activities, adaptation to 
cultural changes, and retirement. 

Psychological foundations. The most important 
foundations for counseling psychology, as it has 
evolved under these influences, are found in dif- 
ferential psychology and psychometric theory; basic 
processes of learning, perception, motivation, and 
emotion; personality theory and dynamics; devel- 
opmental psychology; and in social psychology 
along with selected aspects of economics, sociology, 
and anthropology. The selective emphasis and or- 
ganization of concepts and materials from these 
areas is the distinctive contribution of counseling 
psychology as a substantive field of knowledge 
(Pepinsky & Pepinsky 1954). 

Although counseling psychologists draw upon a 
wide range of psychological and behavioral science 
literature, one mark of the development of the field 
is the publication of the Journal of Counseling 
Psychology, which began in 1954. 

Counseling is best understood as a process in- 
cluding assessment and counseling per se. Assess- 
ment includes the systematic collection, organiza- 
tion, and interpretation of information about a 
person and his situation. Counseling is the process 
in which a professionally trained person and his 
client relate the materials brought to light by the 
assessment process to the choices and decisions 
confronting the client in his formation of goals and 
life plans, in essence providing a learning and 
problem-solving setting for the client in which the 
crucial element is the relationship established be- 
tween counselor and client. This constructive re- 
lationship is characterized by such words as friend- 
ly, accepting, sincere, and confidential as viewed 
by the client. The establishment of such a relation- 
ship constitutes a real test of counseling skill; it 
provides the psychological environment in which 
the special knowledge and understanding of the 
counselor can best be utilized by the client in mak- 
ing the choices and decisions that will give full 
play to his potentialities (Tyler 1953). 

Although face-to-face counseling with individu- 


als is the heart of counseling psychology, the coun- 
seling psychologist also plays an active role in 
exploring and coordinating community resources 
such as placement, habilitative and social services, 
and educational opportunities in order to provide 
a comprehensive and effective service for his 
clients. Increasingly the counseling psychologist is 
in demand as a consultant to local and state 
agencies and institutions concerned with human 
welfare and effectiveness. 

Training and qualifications. It is customary to 
define the counseling psychologist as a person who 
has completed training in psychology, with a con- 
centration in counseling psychology, at the level of 
the doctor of philosophy degree. Recommended 
standards for training psychologists at the doctoral 
level were published by the Division of Counseling 
and Guidance (now Division of Counseling Psy- 
chology) of the American Psychological Association 
in 1952 (1952a). Additional reports from the same 
source have made recommendations for practicum 
training and internships (American Psychological 
Association 1952b; 1960). In 1964 the division 
published the report of a conference on professional 
preparation of counseling psychologists, which 
made recommendations for further improvements 
in training (Conference on the Professional Prep- 
aration of Counseling Psychologists 1964). 

In the United States approximately 48 universi- 
ties offer graduate training at the doctoral level in 
what is essentially counseling psychology. Of these 
programs approximately half have met the stand- 
ards required for accreditation by the Education 
and Training Board of the American Psychological 
Association (1958). Although some individual 
changes have occurred, there has been little growth 
in approved programs since the first listing in 1955, 
which included 23 departments. Approved pro- 
grams vary in their administrative sponsorship and 
essential nature: about one-third are in depart- 
ments of psychology and are undifferentiated from 
the clinical psychology program; about one-third 
are in psychology departments but differentiated 
from clinical psychology; and the remainder are 
jointly administered by a department of psychology 
and either a department of education or of educa- 
tional psychology. 

The most distinctive recognition of qualifications 
and competence which can be accorded a counsel- 
ing psychologist is the diplomate status conferred 
by the American Board of Examiners in Profes- 
sional Psychology (1953). These diplomates num- 
ber slightly in excess of 250. 

In 1965, 1,500 was a reasonable estimate of the 
number of doctoral-level counseling psychologists 
employed in the United States. Although counseling 


psychologists work in a variety of settings, approx- 
imately 60 per cent of them are found in colleges 
and universities, where they are engaged in coun- 
seling, teaching, research, and administration. 
About 10 per cent, the next largest group, are 
employed by the Veterans Administration. Smaller 
proportions are found in community counseling 
agencies, public schools, business and industry, 
city, state, and federal agencies including rehabili- 
tation services, and private practice. 

Although the number of counseling psycholo- 
gists is limited—and only 40 to 50 are produced 
annually—counseling psychology has an extensive 
and important impact on the general field of coun- 
seling. The major share of direct counseling in 
public schools, rehabilitation agencies, state employ- 
ment services, and community agencies is offered 
by nonpsychologists (Brayfield 1961). A substan- 
tial share of their formal preparation, however, is 
based on materials developed or made readily avail- 
able by counseling psychologists who provide the 
research undergirdings of practice, the intellectual 
leadership, and frequently the supervision and ad- 
ministration of counseling services. The outlook for 
nondoctoral counselors is the requirement of two 
full years of graduate training that will provide 
substantial amounts and kinds of instruction in 
psychology (Wrenn 1962). 


The major locus of counseling psychology is in 
the United States, particularly with respect to the 
numbers of practitioners. However, there has been 
increasing interest in the field in other countries. 
Great Britain has produced a considerable amount 
of relevant research, especially in the field of voca- 
tional guidance, French psychologists have made 
contributions in recent years, and there are im- 
portant developments in India and Japan. 


ARTHUR H. BRAYFIELD 


[Directly related are the entries CLINICAL PSYCHOL- 
OGY; EDUCATIONAL PSYCHOLOGY; INDUSTRIAL RE- 
LATIONS, article on INDUSTRIAL AND BUSINESS 
PSYCHOLOGY; MENTAL DISORDERS, TREATMENT OF; 
VOCATIONAL REHABILITATION. Other relevant mate- 
rial may be found in INTERVIEWING; OCCUPATIONS 
AND CAREERS; PERSONALITY MEASUREMENT.] 
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COUNTED DATA 


Counted data subject to sampling variability 
arise in demographic sampling, in survey research, 
in learning experiments, and in almost every other 
branch of social science. The counted data may 
relate to a relatively simple investigation, for exam- 
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ple, estimating sex ratio at birth in some specified 
human population, or to a complex problem, inves- 
tigating the interaction among qualitative responses 
of animals to stimuli in a physiological experiment. 
Further, a counted data approach is sometimes 
useful even when the actual data are inherently 
not counted; for example, a classical approach to 
so-called goodness of fit uses the counts of numbers 
of continuous observations in cells or intervals. 
Again, some nonparametric tests are based on a 
related device. [See GOODNESS OF FIT; NONPARA- 
METRIC STATISTICS,] 

Investigations leading to counted data are often 
described by giving percentages of individuals fall- 
ing in the various categories. It is essential that the 
total numbers of individuals also be reported; other- 
wise reliability and sampling error cannot be esti- 
mated. 

The structure of this article is as follows. First, 
simple procedures relating to one or two sample 
percentages are considered. These procedures ex- 
emplify the basic chi-square approach; they may 
be regarded as methods for treating particular con- 
tingency tables in a way falling in the domain of 
the basic chi-square theorem. Second, special as- 
pects of contingency tables are considered in some 
detail: power, single degrees of freedom, ordered 
alternatives, dependent samples, measures of as- 
sociation, multidimensional contingency tables. 
Under the last topic is considered the important 
topic of three-factor interactions. Third, some alter- 
natives to chi-square are briefly mentioned. 


Binomial model 


Consider an experiment in which animals of a 
group are independently subjected to a stimulus. 
Assume two, and only two, responses are possible 
(A and A). Of 20 animals exposed independently to 
the stimulus, responses of type A are exhibited by 
16. Such a count, or the corresponding percentage, 
80 per cent, may be the basis of an estimate of 
the probability of an A response in all animals of 
this kind; or it may be the basis of a test of the 
hypothesis that responses A and A are equally likely. 
The evaluation of either this estimate or the test is 
dependent upon the assumptions underlying the 
data collection. 

One of the basic models associated with such 
experiments is the binomial. The binomial model 
is associated with a series of independent trials in 
each of which an event A may or may not occur 
and for which it is assumed that the probability of 
occurrence of A, denoted p, is constant from trial 
to trial. If the number of occurrences of A among 
n such trials is v, then v/n is the maximum likeli- 


hood and also the minimum variance unbiased 
estimator of p. [For further discussion of the bino- 
mial distribution, see DISTRIBUTIONS, STATISTICAL, 
article on SPECIAL DISCRETE DISTRIBUTIONS. | 

Additional insight as to the reliability of the 
estimator is obtained from a confidence interval 
for p. Tables and graphs have been prepared to 
provide such confidence intervals for appropriate 
levels of confidence. The best known of these is 
the graph, by Clopper and Pearson (1934). This 
graph or the tables that have been computed for 
the same purpose (for example, Owen 1962) deter- 
mine so-called central confidence limits; that is, 
the intervals that are “false,” in the sense that they 
do not include the true parameter value, are 
equally divided between those that are too low and 
those that are too high. [See ESTIMATION, article 
On CONFIDENCE INTERVALS AND REGIONS. ] 

Confidence intervals may also be used to test a 
null hypothesis that p has the value p,. If p, is not 
included in the 1 — a level confidence interval, then 
the null hypothesis p = p, is rejected at level a. 

Equivalently, a direct test may be made of this 
hypothesis by utilizing the extensive tables of the 
binomial distribution. Two of the best known are 
those of Harvard University (1955) and the U.S. 
National Bureau of Standards (1950). 

More usually both confidence intervals and test 
procedures are based upon an approximation to 
the distribution, that is, on the fact that (v — np) 
[np(1 —p)}* has a limiting standard normal dis- 
tribution. [See DISTRIBUTIONS, STATISTICAL, article 
On APPROXIMATIONS TO DISTRIBUTIONS. | 

Denote by Z,a the 100(1 — æ) percentile of the 
standard normal distribution. The null hypothesis 
P= P, tested against the alternative p = p, is re- 
jected at level a on the basis of an observation of 
v successes in n trials if 


[v= np] —4>Z,-sa [np.(1 — po) I's 


in case the alternatives of interest are limited to 
one side of p,, say p> p,, the test procedure at 
level a is to reject H, if 


U—np, — 4 > Z,-a[np (1 — po]. 


The subtracted 3 is the so-called continuity correc- 
tion—useful when a discrete distribution is being 
approximated by a continuous one. 

Thus, in the experiment described above, the 
experimenter might be testing whether the choice 
is made at random between A and A against the 
possibility that A is the preferred response. This is 
a test of the hypothesis p = 4 against the alterna- 
tive p> 4. Corresponding to the conventional 5 
per cent significance level, Zs, = 1.64; then if 


v= 16 (16 A responses are observed), the hypoth- 
esis is rejected at the 5 per cent level since 


16 — 10 — 4 > 1.64 [(20) (2) G). 


The normal approximation to the binomial is 
thought to be quite satisfactory if npo(1— po) is 
5 or more. However, for many practical situa- 
tions the normal approximation provides an ade- 
quate test (in the sense that the type I error is 
sufficiently close to the specified level) for values 
of npo(1— py) well below the bound of 5 men- 
tioned above. 

The simplest confidence limits for p based on the 
normal approximation are 


32%-u/22 (1-2): 
n nn n 


The binomial model requires independence of 
the successive trials. Much sampling, especially of 
human populations, is, however, done without re- 
placement so that successive observations are in 
fact dependent and the correct model is not the 
binomial but the hypergeometric. In sampling 
theory this is taken into account by the finite popu- 
lation correction, which modifies the variance. 
Thus, where the binomial variance is np(1— p), 
the hypergeometric variance for a sample of size n 
from a population of size N is np(1 — p)(1— n/N). 
If n is a small fraction of N, the finite population 
correction is negligible; thus, the binomial model 
is often used as an acceptable approximation. 


Chi-square tests 


For one or two proportions. The statistic 
(|v — npo] —4)[npo(1 — po}, which for suffi- 
ciently large n may be used to test the hypothesis 
p= p, against p#p, yields, when squared, an 
equivalent test procedure based on the chi-square 
distribution with one degree of freedom; this fol- 
lows from the fact that the square of a standard 
normal variable has a chi-square distribution with 
one degree of freedom. [See DISTRIBUTIONS, STA- 
TISTICAL, article on SPECIAL CONTINUOUS DISTRI- 
BUTIONS.] 

Following recent practice, “X?” is written for the 
test statistic, and the symbol “x?” is reserved for the 
distributional form. 


x: — (JU = mpo| — 3)? 


npo( 1 — po) 
—({v— np — 4)? , (Inv 201 = Po)| — 4). 
NPo n(1 — Po) 


This algebraic identity shows that the statistic X? 
may be written (neglecting the continuity correc- 
tion term, 4) as (observed — expected )*/expected, 
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summed over the two categories A and A. Such a 
measure of deviation of observations from their 
expected values under a null hypothesis is of wide 
application. 

For example, consider the counts of individuals 
with characteristic A that occur in two independent 
random samples and suppose that the null hypoth- 
esis at test is that the probability of occurrence of 
A is the same in both populations; call the common 
(but unspecified) probability p. The observations 
may be tabulated as in Table 1. 


Table 1 — Observations in two samples 
NUMBERS OF OBSERVED 


A's as Totals 

SAMPLE TE Vey via m 
Yar Yaa m 

Totals va Vea n 


If p were known, then under the null hypothesis 
the expectation of the number of A’s in sample 1 
would be n,p and in sample 2 the expectation 
would be n-p, where p is the probability of occur- 
rence of A. Since p is unknown, however, it must 
be estimated from the data [see ESTIMATION, article 
on POINT ESTIMATION]. 

If the hypothesis were true, the two samples 
could be pooled and the usual (minimum variance 
unbiased) estimator of p would be v.,/n. With this 
estimator the estimated expected number of A’s in 
sample 1 is n,(v.,/m) and in sample 2 is n:(v.:ı/n). 
Similarly the estimated expected numbers of A’s are 
n,(v.2/n) and n:(v.:/n) in the two samples. These 
estimated expectations are tabulated in Table 2. 


Table 2 — Estimated expected number of observations 
in two samples 


ESTIMATED EXPECTATIONS 
FOR NUMBER OF OBSERVED 


An expression similar to X* can be calculated for 
each sample where now, however, Po is replaced 
by the estimator v.,/n. These expressions are 


[lv — m(v../n)| — 3? 

n(v.,/n)(1 — v.:/n) 
and 

[lva — n2(V../n)| = 47 | 

n2(V.,/n)(1 — v.:/n) 
Since the estimator of p will tend to be close to 
the true value for large sample sizes, it is intuitive 
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to conjecture that each of these are squares of 
normal variables (at least approximately for large 
samples ), The sum does have a limiting chi-square 
distribution but with one degree of freedom, not 
two. The “loss” of the degree of freedom comes 
from estimating the unknown parameter, p. The 
test statistic, which more formally written is 


X= 


nyv.;/N 


2 yes — ny. ;/n| — 3)? g 

A e 

may be simplified to 
N(|VisVer — V 

NNV. V.» 


= 10/2)! 


If [VV — Vis is less than or equal to n/2, 
the correction term is inappropriate and possibly 
misleading. In practice this problem rarely arises. 

Basic chi-square theorem. The above chi-square 
test statistics for one or two proportions may, as 
was seen, be written as sums of terms whose numer- 
ators are squared deviations of the observed counts 
from those “expected” under the null hypothesis. 
(Expected is placed in quotation marks to empha- 
size that the “expectations” are often estimated 
expectations obtained via estimation of unknown 
parameters.) The denominators may be regarded 
as weights to standardize the ratios, This pattern 
may be widely extended. 

For example, consider a questionnaire with re- 
spondents placing themselves in five categories: 
strongly favor, mildly favor, neutral, mildly oppose, 
strongly oppose. The n independent responses 
might furnish data for a test of the hypothesis that 
each of the responses is equally likely. If the prob- 
abilities of the five responses are denoted p, through 
Ps this null hypothesis specifies p, = p. = p, = B= 
Ps = 4 and under the null hypothesis the expected 
number of responses in each category is n/5. The 
appropriate weights in the denominator of the chi- 
Square test statistic are suggested by the expanded 
form of X* given above; each term (observed — 
expected)? is divided by its expected value. Thus 
in this example, 


= Su n/s)? 
X= a 


Ta 


That these weights lead to the usual kind of null 
distribution can be shown by considering the multi- 
nomial distribution, the extension of the binomial 
distribution to a series of independent trials with 
several outcomes rather than just two. If the null 
hypothesis is true, this X? has approximately a chi- 
square distribution with four degrees of freedom. 

More generally, suppose that on each of n inde- 


pendent trials of an experiment exactly one of the 
events E, , ++- , E; occurs. Let p;, depending in a 
given way (under the null hypothesis under test) 
on unknown parameters @,, ---, êm, be the prob- 
ability that E; occurs and suppose there are asymp- 
totically efficient estimators of the 8's, from which 
are obtained asymptotically efficient estimators of 
the p;, denoted p,; thus np, estimates the expected 
frequency of occurrence of E; under the null hy- 
pothesis. Let the random variable v; be the num- 
ber of times E; actually occurs in the n trials. Then 


X= em 
— n 


has, under the null hypothesis for large n and 
under mathematical regularity conditions, approx- 
imately the chi-square distribution with J — m — 1 
degrees of freedom. When the null hypothesis is 
false, X? tends to be larger on the average than 
when it is true, so that a right-hand tail critical 
region is appropriate, that is, the null hypothesis 
is rejected for large values of X°. 

Note that the above “chi-square” statistic is of 
form 


s pie number — “expected” number)". 
“expected” number 


(The quotation marks around expected indicate 
that this is actually an asymptotically efficient 
estimator of the expectation under the null hypoth- 
esis. ) 

The above development can readily be extended 
to I independent sequences of trials, with n; trials 
in the ith sequence, p;; denoting the probability 
under the null hypothesis of event j for sequence i, 
and v;; denoting the number of times E; occurs in 
sequence 7. As before, 

(is = mis )* 


“os niPi; 


is, for large n;, approximately chi-square with 
I(J—1)-— m degrees of freedom, under the null 
hypothesis, and with appropriate regularity con- 
ditions. Note that when 1= 1, J —m — 1 degrees 
of freedom are obtained, as before. 

The primary problem in such tests is the deriva- 
tion of asymptotically efficient estimators. For ex- 
ample, such estimators may be maximum likelihood 
estimators or minimum chi-square estimators. The 
latter are the 6's that minimize X*, the test 
Statistic, subject to whatever functional restraints 
are imposed upon the pj,’s. Neyman (1949) has 
given a method of determining modified minimum 
chi-square estimators, a method that reduces to 


solving only linear equations, as many as there are 
unknown parameters to estimate. A review of the 
methods of generating such minimum chi-square 
estimators for this model, and for a more general 
one, is given by Ferguson (1958). 

It is easily seen that the comparison of two 

percentages is a special case of the general theorem. 

tere I[=J=2 and the null hypothesis can be 
put in the form pı: = Pris = 0; Pa = Pox = 1 — 0. Here 
Pau is the probability of A occurring on a trial in 
the first series, p,» is the probability of A occurring 
on a trial in the second series; Pa, P: are defined 
similarly with respect to A, The maximum likeli- 
hood estimator of @ is v.,/n and the degrees of 
freedom are seen to be one from insertion in the 
general formula. 

Proofs of the basic chi-square theorem and state- 
ments of the mathematical regularity conditions may 
be found in Cramér (1946) or Neyman (1949). 

Power of the chi-square test. The chi-square 
test is extensively used as an omnibus test without 
particular alternatives in view. Frequently such 
applications are almost useless in the sense that 
their sensitivity (that is, power) is very low. It is 
therefore important not only to make such tests 
but also to specify the alternatives of interest and 
to determine the power, that is, the probability that 
the null hypothesis is rejected when in fact such 
alternatives are true. A fairly complete theory of 
the power of chi-square tests has been given re- 
cently by Mitra (1958) and Diamond (1963). 

Because chi-square tests are based upon a limit- 
ing distribution theorem it is necessary to express 
the alternative in a special form, depending on 
the sample size, n, in order to obtain meaningful 
results. Consider first the case where I = 1 and the 
null hypothesis completely specifies the p? as nu- 
merical constants. (In the questionnaire experi- 
ment above, since there are five responses the null 
hypothesis that the responses are equally likely 
specifies p?= 4.) Write an alternative p; in the 
form 

rs peel, 
Vn 
If in fact p; = p4 then the test statistic X? has a 
limiting noncentral chi-square distribution with 
noncentrality parameter 


J 
where >) C; = 0. 


A=LC7/p! 


and with J — 1 degrees of freedom [see DISTRIBU- 
TIONS, STATISTICAL, article on SPECIAL CONTINU- 
OUS DISTRIBUTIONS]. 

The A required to obtain a specified probability 
of rejection of an alternative p! for tests at signifi- 
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cance levels 0.01 and 0.05 has been tabulated; such 
a table is given, for example, by Owen (1962, pp. 
61-62). These tables are useful not only in calcu- 
lating the power function but also in specifying 
sample size in advance. For the example where 
there are five responses and the null hypothesis is 
p? = p? = p? = p? = p? = 0.20, consider the alterna- 
tive pt = p4 = pt = pf = 0.15, pt = 0.40. Then C; = 
vn (0.05), j =1, ++, 4; Cs = — Vn (0.20), so that 
à= n(.25). To achieve a probability of 0.80 of re- 
jecting the null hypothesis for this alternative, it is 
found from the tables that à must be 11.94 (four 
degrees of freedom and 0.05 significance level). 
This requires a sample size of 11.94/.25 or, to the 
nearest whole number, 48. 

For the comparison of two samples, a similar 
power theory is available. Consider two sequences 
of n; trials each of which results in an outcome 
E,, E:,-:+, E,. Here p;; i = 1,2, j =1,---,J) is the 
probability of outcome j on sequence i and the null 
hypothesis of homogeneity is Py, = P21, Piz = Pes» 

++, Py = Pus. Now consider a sequence of alterna- 
tives p4 (n) which for some p; (j= 1, 2, +, J) 
satisfy the equations p{,(n) = pj + Ci;/Vn, where 
n=n,+n. and EC; = ŁC; = 0. Then, for the 
sequence of alternatives, X? has, in the limit as 
n>», a noncentral chi-square distribution with 
J — 1 degrees of freedom and noncentrality param- 
eter A, where 


In actual practice, when the statistician considers 
a specified alternative for finite n, p; is not unique- 
ly defined; it is convenient to define p; = sipi (n) + 
pi (n)] but whether other choices of p; might im- 
prove the goodness of the asymptotic approxima- 
tion to the actual power appears not to have been 
investigated. For the case J = 2, and with n, =, 
a nomogram is available showing the sample size 
required to obtain a specified level of power for 
one-sided hypotheses, that is, for comparison of an 
experimental and a standard group (Columbia 
University 1947, chapter 7). In the general case 
the formulation of A is more difficult. 

Contingency tables. In the example of compar- 
ing two percentages, the observations were con- 
veniently set out in a 2 x 2 array. Similarly, in the 
more general comparative experiment (the power 
of which was just discussed), it would be conven- 
ient to set out the observations in a 2 x J array. 
These are special cases of contingency tables, 
which, in general, have r rows and c columns; 
counted data that may be so represented arise, for 
example, in many experiments and surveys. 
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Such arrays or contingency tables may arise in 
at least three different situations, which may be 
illustrated by specific examples: 

(1) Double polytomy: A sample of n voters is 
taken from an electoral list and each voter is clas- 
sified into one of r party affiliations P, , +, P, and 
into one of c educational levels E, ++, E.. Denote 
by pj; the probability that a voter belongs to party i 
and educational level j, so that ©),);pi; = 1. The 
usual null hypothesis of interest is that the classifi- 
cations are independent, that is, 

Pu = Pi- Pj, 

where p;. = probability a voter is in party i (re- 
gardless of educational level) and p.; = probability 
a voter is in educational level j, again regardless of 
the other classification variable (that is, pi. = Epi; 
and p.; = Epu). In this case both the vertical and 
horizontal marginal totals of the r x c sample array 
are random. 

(2) Comparative trials: Consider instead of a 
single sample from the general electoral roll, r 
samples of sizes n; from the r different party rolls. 
The voters in each sample are classified as to edu- 
cational level (levels E,, : ++ , E.). Denote as before 
by pi; the probability that a voter drawn from party 
i belongs to educational level j (so that Epi = 1). 
The hypothesis of homogeneity specifies that 
Ps) = Pz; =+ + = pu for each j. In this case the row 
totals are fixed (m,,--+,m,), while the column 
totals are random. Into this category falls the two- 
sample experiment discussed earlier; in that case 
r=2, 

(3) Independence trials (fixed marginal totals) : 
Consider a group of n manufactured articles, of 
which fixed proportions are in each of the quality 
categories C,,---,C,. The articles have been di- 
vided into r groups of fixed size n, , -+ , n, for fur- 
ther processing or for shipment to customers. The 
question arises whether the partitioning into the 
T groups can reasonably be considered to have been 
done randomly, that is, independently of how the 
articles fall into the quality categories. Since the 
number of articles in each of the categories C,,---, 
C, as well as the n,, +- , n, are fixed, both marginal 
totals are fixed in this situation. 

For these three cases let v;; denote the number 
of individuals falling into row i, column j, and 
denote by v;. and v.; the row and column totals, 
whether fixed or random. While different proba- 
bility models are associated with the three cases, 
the approximate or large sample chi-square test is 
identical. The test statistic is 


2- 5 ley — (v:.v.;)/n} 


which has, if the null hypothesis is true, an ap- 
proximate chi-square distribution with (r—1) - 
(c — 1) degrees of freedom. 

For the comparative trials case, this is an ex- 
tension of the comparison of two percentages. The 
maximum likelihood estimator of the common 
value of P,;, Pz, ***, Pr; is v.;/n under the null 
hypothesis. The comparative trials model consists 
of r sequences of trials, each of which may result 
in one of c events; c — 1 parameters are estimated. 
Because J$., Pi; = 1, as soon as c — 1 of the proba- 
bilities are estimated the final one is determined. 
Hence the degrees of freedom are r(c—1)— 
(c—1)=(r-1)(c-1). 

In the double polytomy case there are (r — 1) + 
(c— 1) independent parameters to be estimated 
under the null hypothesis: p,., Pa., °°", Pras) Poo 
P-2, ***, P-o, Since again the restrictions $ ;pi. = 
Lip.) =1 provide the last two needed values. The 
maximum likelihood estimators of the p;. are the 
v;./n and of the p.; are the v.;/n, so that the 
estimated expected values are n(v;./n)(v.;/n) or 
v;.U.;/n. The degrees of freedom in this case 
are (re — 1)—(r—1)—(c—1)or(r—1)(c—1), 
since there is only one sequence of trials with rc 
outcomes. 

Like all chi-square tests, these are based upon 
asymptotic distribution theory and are satisfactory 
in practice for “large” sample sizes. A number of 
rules of thumb have been established in regard to 
the acceptable lower limit of sample size so that the 
actual type I error, or probability of rejecting the 
null hypothesis when true, does not depart too far 
from the prescribed significance level. For a care- 
ful discussion of this problem, and of procedures to 
adopt when the samples are too small, see Coch- 
ran (1952; 1954). 

2 x 2 tables. The special case of contingency 
tables with r = c = 2 has been extensively studied, 
and the so-called Fisher exact test is available. 
Given v,., Vs., v.,, v.., under any of the null hy- 
potheses, v,, has a specific hypergeometric distribu- 
tion; hence probabilities of deviations as numeri- 
cally large as, or larger than, the observed deviation 
can be calculated and a test can be made. The ap- 
plication of the test is now greatly facilitated by use 
of tables by Finney et al. (1963). For the compara- 
tive trials model and the double dichotomy model 
this exact test is a conditional test, given the mar- 
ginal counts. 

While the hypotheses associated with the three 
different models in r xc tables, in general, and 
2x2 tables, in particular, can be tested by the 
same chi-square procedure, the power of the test 
varies according to the model. For the 2 x 2 case, 
approximations and tables have been given for 


each of the three models. The most recent of these 
are by Bennett and Hsu (1960) for comparative 
and independence trials and Harkness and Katz 
(1964) for the double dichotomy model. Earlier 
approximations are discussed and compared by 
these authors. 

Single degrees of freedom. The statistic xX? 
used to test the several null hypotheses possible for 
rxc contingency tables can be partitioned into 
(r — 1)(c — 1) uncorrelated X? terms, each of which 
has a limiting chi-square distribution with one de- 
gree of freedom when the null hypothesis is true. 

Planned comparisons. Planned subcomparisons, 
however, can be treated most easily by forming 
new contingency tables and calculating the ap- 
proximate X? statistic. For example, in the com- 
parison of three experimental learning methods 
with a standard method the observations might be 
recorded for each pupil as successful or unsuccess- 
ful and tabulated in a 4 x 2 table. These are four 
comparative trials; and X°, the statistic to test 
homogeneity, has, under the null hypothesis, an 
approximate chi-square distribution with three 
degrees of freedom. 

In this situation, two subcomparisons might be 
indicated: the standard method versus the com- 
bined experimental groups and in the experimental 
groups among themselves. Tables 3a and 3b show 
the two new contingency tables. The X? statis- 
tics calculated from these two subtables may be 
used to make the indicated secondary tests. The 
two X? values (with one and two degrees of free- 
dom respectively) will not sum to the X? calculated 
for the whole 4 x 2 array. Short-cut formulas for 
a partition that is additive and references to other 
papers on this subject are given by Kimball (1954). 


Table 3a — Comparison between standard method and 
combined experimental methods 


Successful Unsuccessful 
Standard (method 1) vu via 
Experimental methods combined Vay + Ysy + Yar vaa + Yaa + Yaa 


Table 3b — Comparison among experimental methods 


Successful Unsuccessful 
Experimental (method 2) Vay Yaa 
Experimental (method 3) Vay Ysa 
Experimental (method 4) Yar Van 


Unplanned comparisons. As in the analysis of 
variance of linear models, distinction should be 
made between such planned comparisons and un- 
planned comparisons. Goodman (1964a) has given 
a procedure to find confidence intervals for a 
family of “contrasts” among multinomial proba- 
bilities for the r x c contingency table in the com- 
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parative-trials model. A “contrast” is any linear 
function of the probabilities p;;, with coefficients 
summing to zero, that is, 


DE bipi 


E b;; = 0 for each j. 
in ja iz 
[See LINEAR HYPOTHESES, article on MULTIPLE 
COMPARISONS. ] 

Thus in the comparison of teaching methods ex- 
periment referred to above, where p; is the proba- 
bility of a pupil being successful when taught by 
method i, the unplanned comparisons or contrasts 
might be Pa — Ps:, Ps: — Pur» Pa — Paun. These rep- 
resent pairwise comparisons of the three experi- 
mental methods. 

Denote a contrast by 0; an estimator of pi; is 
pi; = vi;/vi. and an estimator of @ is 


=Z X byu. 


r 
isi joa 


An estimator of the variance of 6 is 


r e e 2 
so- ġo: [$no - (Ewe) J 
The large-sample joint confidence intervals for 0 
with confidence coefficient 1—a@ have the form 
6 —S(6)L, 6 + S(6)L, where L is the square root of 
the upper 100(1—«)th percentage point of the 
chi-square distribution with (r — 1)(¢ — 1) degrees 
of freedom. An experiment in which one or more 
of the totality of all such possible intervals fail to 
include the true 0 may be called a violation. The 
probability of such a violation is a. 

If instead of all contrasts, only a few, say G, are 
of interest, then L in the last formula may be re- 
placed by Z,-a/xo (the 100 [1-a/2G] percentile of 
the standard normal distribution), which often 
will be smaller than L and hence yield shorter 
confidence intervals while the probability of a viola- 
tion is still less than or at most equal to a. 

Comparative trials; ordered alternatives. In the 
comparative trials model, with r x 2 contingency 
tables, frequently the only alternative of interest is 
an ordered set of pi,’s. For example in a 2 x 2 com- 
parative trial involving a control and a test group, 
the question may be to decide whether the groups 
are the same or whether the test group yields 
“better” results than the control group. In the 2 x 2 
case, this situation is handled simply by working 
with the signed square root of X?, which has a 
standard normal distribution if the null hypothesis 
is true. A one-sided alternative is then treated in 
the same manner as a test for a percentage referred 
to earlier. 

For the more general r x 2 table, the most com- 
plete treatment is that of Bartholomew (1959); his 
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test, however, requires special tables. If the ex- 
perimenter believes that the p;, have a functional 
relationship to a known associated variable, x; , then 
a specific test can be derived from the basic the- 
orem. Such a test would be a particular example 
of a planned comparison. Many authors have given 
short-cut formulas and worked out examples of 
this type of problem (cf. Cochran 1954; Armitage 
1955). 

Comparative trials; dependent samples. A sam- 
ple is taken of n voters who have voted in the last 
two national elections for one of the major parties. 
Denote the parties by L and C, and suppose that 
in the sample 45 per cent voted L in the first elec- 
tion and 55 per cent voted L in the second. Does 
this indicate a significant change in voter behavior 
in the subpopulation of which this is a sample? To 
make such a comparison in matched or dependent 
samples, it is necessary to obtain information on 
the actual changes in party preference [see PANEL 
STUDIES]. These can be read from a 2 x 2 table 
such as Table 4, 


Table 4 — Voter preference in two elections 


ELECTION 1 

L z 

euection2 Yn va 
C var v22 


Such a 2 x 2 table with random marginals appears 
to fall into the double-dichotomy model, but the 
hypothesis of independence is not of interest here. 
The changes are indicated by the off-diagonal ele- 
ments V: , V , and the hypothesis of no net change 
is equivalent to the hypothesis that, given V: + Va, 
Vis is binomially distributed with probability }. Thus, 
the test of comparison of two percentages in iden- 
tical or matched samples reduces to the test for a 
percentage. If the normal approximation is ade- 
quate, the square of the normal deviate, with the 
continuity correction, is 


E A E DH 

CEN 
which has a limiting chi-square distribution with 
one degree of freedom under the null hypothesis 
that the probabilities are the same in the two 
matched groups. Cochran (1950) has extended this 
test to the r x c case. The test described above does 
not at all depend on v,, and v.. , but of course these 
quantities would enter into procedures pointed 
toward issues other than testing the null hypothesis 

of no net change. 

Chi-square tests of goodness of fit. Chi-square 
tests have been used extensively to test whether 


sample observations might have arisen from a pop- 
ulation with a specified form, such as binomial, 
Poisson, or normal. Such chi-square tests are again 
special cases of the general theory outlined above, 
although there are many other types of tests for 
goodness of fit [see GOODNESS OF FIT]. 

There are some special problems in connection 
with some nonstandard chi-square tests of goodness 
of fit for the binomial and Poisson distributions. 
The standard chi-square test of goodness of fit for 
these two discrete distributions requires (in most 
cases) estimation of the mean. The sample mean 
is an efficient estimator of the population mean; it 
appears to make little difference whether the sam- 
ple mean is computed from the raw or grouped 
data. 

There is evidence that two simpler tests are 
more powerful, at least for some alternatives, for 
testing whether a set of counts does come from one 
of the distributions. These test statistics are the 
so-called indices of dispersion, studied by Lexis and 
Bortkiewicz, which in fact compare two estimators 
of the variance—the usual sample estimator and 
the estimator derivable from the fact that for these 
distributions the variance is a function of the mean, 
Alternatively they may be viewed as chi-square 
tests, conditional on the total count and placed in 
the framework of the basic theorem. Thus if the 
observations are v; , +- , v, , which according to the 
null hypothesis come from a Poisson distribution, 
the appropriate index of dispersion test statistic is 


X (vi — 0)? 
Xi Ere 5 
0 
where @ is the sample mean of the v,. For large n 
and if the null hypothesis is true, X? is approxi- 
mately distributed as chi-square with n — 1 degrees 
of freedom [see BoRTKIEWICz; LEXIS]. 

The corresponding test for the binomial can be 
expressed similarly, but it is also useful to set out 
the n observations in a 2 x c contingency table 
such as Table 5. 


Table 5 — Arrangement of data to test for binomial 


Sample! Sample? ... Sample c 
Successes xa EEEL NT. 
Failures m — xy Ma— Xa ves NX 
Total n De geen A, 


The variance test is equivalent to the chi-square 
test of homogeneity in this 2 x c array, which has, 
of course, c — 1 degrees of freedom. 

Whereas in general the chi-square test is a one- 
tailed test, that is, the null hypothesis is rejected 


for large values of the statistics, the dispersion tests 
are often two-tailed tests, not necessarily with equal 
probability in the tails. The reason for this is that a 
too small value of X? reflects a pattern that is more 
regular than that expected by chance, and such 
patterns may correspond to important alternatives 
to the null hypothesis of homogeneous randomness. 

Contingency table association measures. If in 
the double-dichotomy model the hypothesis of in- 
dependence is rejected, it is logical to seek a meas- 
ure of association between the classifications. Dis- 
tinction must be made between purely descriptive 
measures and sampling estimators of such meas- 
ures, [See STATISTICS, DESCRIPTIVE, article on ASSO- 
CIATION. ] 

A large number of such measures have been 
presented, usually related to the X* statistic used 
to test the null hypothesis of independence. Good- 
man and Kruskal (1954-1963) have emphasized 
the need to choose measures of association that 
have contextual meaning in the light of some prob- 
ability model with predictive or explanatory value. 
They distinguish between two cases—no ordering 
among the categories and directed ordering among 
them, 

Multidimensional contingency tables. The anal- 
ysis of data that have been categorized into three or 
more classifications involves not only a consider- 
able increase in the variety of possibilities but also 
introduces some new conceptual problems. The 
basic test of mutual independence is, however, a 
straightforward extension of the two-dimensional 
one and a simple application of the main theorem. 
The test will be discussed for three classifications. 

This is a test of the hypothesis that pix, the 
probability of an observation falling in row i, col- 
umn j, and layer k, can be factored into pi..P.;- 
p..x. Under this null hypothesis the estimated ex- 
pected value in cell ijk is n*(v..)(V.j-)(U--%), 
where the dots indicate summation over the corre- 
sponding subscripts of the observed counts, Vij: - 
The X? statistic has the usual form, sum of 
(observed — expected)?/expected, and has rel — r — 
c— 1+2 degrees of freedom if there are r rows, 
c columns, and l layers. 

Tests for partial independence, for example that 
Pi = (pi-.)(p.jx), or for homogeneity (between 
layers, for example) may be derived similarly. 
New concepts and new tests are introduced by the 
idea of interaction between the different classifi- 
cations. 

In linear models, interactions are measures of 
nonadditivity of the effects due to different classifi- 
cations. With contingency models several defini- 
tions of interaction have been given; the present 
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treatment follows Goodman (1964b). Consider, for 
example, samples drawn from rural and urban 
populations and classified by sex and age, with 
age treated dichotomously (see Table 6). 


Table 6 — Classification of rural and urban samples by 


sex and age 
URBAN RURAL 
Young Old Totals Young Old Totals 
Male 20 14 34 34 n 45 
Female 42 24 66 36 19 55 
Totols 62 38 100 70 30 100 


For the urban group there is a sex ratio 20/42 
among the “young” and 14/24 among the “old.” 
The ratio of these may be regarded as a measure 
of the interaction of age and sex in the urban 
population. Similarly the same ratio of sex ratios 
is a measure of the interaction in the rural popu- 
lation. These are, of course, sample values; and the 
population interactions must be defined in terms of 
the probabilities p;j,. It is useful to define 


— Pir [Pixx 
A, = Bt 
3 i 


and to write the three-factor no interaction hypoth- 
esis as A, = A. for this 2 x 2 x 2 contingency table. 
The maximum likelihood estimator of A, is d= 
ViikUVssk/ VV, and its variance can be estimated 
consistently by sj=dju, where u= CDiLvin 
A simple statistic to test the hypothesis A, = A, is 
X? = (d, — d,)*/(s? + 82), which, if the null hypoth- 
esis is true, has a large sample chi-square distribu- 
tion with one degree of freedom. 

For the data given d, = 0.816, d: = 1.631, si = 
0.125, s? = 0.534, and X? = 1.01 so that the three- 
factor no interaction hypothesis is not rejected at 
the usual significance levels. Goodman has ex- 
tended this test in an obvious way to the 2 x 2 x l 
contingency table. Here the test statistic is 


i 


LIVE: 


which has | — 1 degrees of freedom. The extension 
to rx cx tables is based on logarithms of fre- 
quencies rather than the actual frequencies. Good- 
man also provides confidence intervals for the 
interactions A, and indicates a number of equiva- 
lent tests. A bibliography of the very extensive liter- 
ature on this topic is given in this paper. 


Alternatives to chi-square 


While the chi-square tests are classical for the 
analysis of counted data, with the original simple 
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tests going back to Karl Pearson, they are not the 
likelihood ratio tests. The latter are based upon 
statistics of the form 


4 J 
BE log sp = D v; log [1 + (v; — np;)/np;] 
=1 Jai 


in the general case with one sequence of trials. It 
is easy to show, by expanding minus twice the 
logarithm of the likelihood ratio statistic in a power 
series, that its leading term is X*, and that further 
terms are of a smaller order than the leading term 
so that the tests are equivalent in the limit. How- 
ever, they are not equivalent for small samples. 

Another test for contingency tables (comparative 
trials model) is that of C. A. B. Smith (1951). 
Further work appears to be necessary before any 
of these alternatives is accepted as preferable to 
the chi-square tests. The most widely used alterna- 
tive analysis is that indicated in the next section. 

ANOVA of transformed counted data. Contin- 
gency tables have an obvious analogy to similar 
arrays of measured data that are often treated by 
analysis of variance techniques (ANOVA). The anal- 
ysis of variance models are more satisfactory if the 
data are such that (1) effects are additive, (2) 
error variability is constant, (3) the error distribu- 
tion is symmetrical and nearly normal, and (4) the 
errors are statistically independent. [See LINEAR 
HYPOTHESES, article on ANALYSIS OF VARIANCE.] 

Counted data that arise from a binomial or mul- 
tinomial model fail most obviously on the second 
property since the variances of such data vary with 
the mean. However, some function or transforma- 
tion of the observations may have approximately 
constant variance. Transformations that have been 
derived for counted data to make variances nearly 
constant have been found empirically often to im- 
prove the degree of approximation to which prop- 
erties (1) and (3) hold also. These transformations 
include: (a) Arc sine transformation for propor- 
tions 


y=are sin yo/n , 

which is applicable to dichotomous data with an 
equal number of trials in each sequence, If the 
number of trials (n;) varies from sequence to 
sequence the problem is more complicated (see 
Cochran 1943). (b) Square root transformation: 
y = Vv + 1 for Poisson data. (c) Logarithmic trans- 
formation: y = log (v+ 1) for data such that the 
standard deviation is proportional to the mean. 
Use of the are sine transformation, and subse- 
quent analysis, is facilitated by the use of binomial 
probability paper; graphic techniques are simple 
and usually adequate. The basic reference for such 
procedures is Mosteller and Tukey (1949), 


Refinements of these transformations and dis- 
cussion of the choice of transformations is given 
a thorough treatment by Tukey (1957). If a suit- 
able transformation has been made, the whole 
battery of tests that have been developed in analy- 
sis of variance (including covariance techniques) 
is applicable. Estimation problems may be more 
subtle; in some situations estimates may be given 
in the transformed variable but in others it may be 
desirable to transform back to the original variable. 
[See STATISTICAL ANALYSIS, SPECIAL PROBLEMS OF, 
article ON TRANSFORMATIONS OF DATA.] 


Precautions in the analysis of counted data 


The transformations discussed above were de- 
rived to apply to data that conform to such models 
as the binomial or Poisson and that could be an- 
alyzed by chi-square methods, However, counted 
data often arise from models that do not conform 
to the basic assumption; in particular independence 
may be lacking, so that the chi-square tests are not 
valid. Such data are often transformed and treated 
by analysis of variance procedures; the justification 
for this is largely empirical. Examples of situations 
where this is necessary are experimental responses 
of animals in a group where dependence may be 
present, eye estimates of the numbers in a group 
of people, and comparisons of proportions in het- 
erogeneous and unequal-sized groups. In such sit- 
uations care is necessary that the proper trans- 
formation is selected to achieve the properties listed 
above and in the interpretation of the results of the 
analysis. 

The lack of independence and the presence of 
extraneous sources of variation are frequent sources 
of error in the analysis of counted data because the 
chi-square tests are invalidated by such factors. 
A discussion of these errors and others is found in 
Lewis and Burke (1949). The two careful exposi- 
tory papers by Cochran (1952; 1954) represent an 
excellent source of further reading on this topic. 
See also the monograph by Maxwell (1961). 


DoucLAs G. CHAPMAN 


[See also QUANTAL RESPONSE. ] 
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COURNOT, ANTOINE AUGUSTIN 


Antoine Augustin Cournot (1801-1877), French 
mathematician, economist, and philosopher, was 
born at Gray (Haute-Saône), the son of a notary. 
He came from a family of farmers who had lived in 
Franche-Comté since at least the middle of the six- 
teenth century. Until age 15, Cournot attended the 
collége at Gray; then, for the next four years, he 
read a great deal on his own, especially works by 
scientists and philosophers, such as Laplace and 
Leibniz. After preparing at Besançon, Cournot was 
admitted in 1821 to the scientific section of the 
Ecole Normale Supérieure, in Paris. The school 
was closed by the reactionary regime the following 
year, but Cournot stayed in Paris and in 1823 re- 
ceived the licentiate in sciences. He attended ses- 
sions at the Académie des Sciences and associated 
with the principal scholars of the day; it was 
through one of them that he met Proudhon. In 
October 1823, Cournot entered the household of 
Marshal Gouvion Saint-Cyr in the double capacity 
of literary adviser to the marshal, who wanted to 
complete some unfinished manuscripts, and tutor 
to the marshal’s son. 

Cournot spent ten years with the marshal, all 
the while continuing his scientific studies. In 1829 
he received a doctorate in science, writing a main 
thesis in mechanics and a supplementary one in 
astronomy. He also studied law. A series of articles 
that he published on scientific questions attracted 
the attention of the great mathematician Poisson, 
who was then professor at the Ecole Polytechnique 
and later at the University of Paris. Poisson was 
then in charge of the instruction in mathematics 
throughout France and arranged Cournot’s ap- 
pointment to the chair of mathematical analysis in 
the faculty of sciences at Lyon. Cournot taught 
for only one year, however; subsequently, most of 
his life was spent in university administration, in 
which he was very successful. He became, suc- 
cessively, rector of the Académie de Grenoble in 
1835, inspector general of the University of Dijon, 
and rector of the Académie de Dijon from 1854 
to 1862. Cournot then accepted no further public 
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positions and returned to Paris, where he died just 
as he was about to apply for membership in the 
Institut de France. Toward the end of his life he 
was nearly blind. 

Cournot’s administrative work left him ample 
time for a great deal of scientific writing. Unfortu- 
nately, his books suffer from a lack of the stimula- 
tion that contact with an audience gives to teach- 
ers. Ten of his works appeared between 1838 and 
1877. These books have three major themes: (1) 
algebra, infinitesimal analysis, and calculus of 
probabilities; (2) the theory of wealth; and (3) 
the philosophy of science, the philosophy of history, 
and even general philosophy. These themes were 
interdependent in Cournot’s work, and although 
the economic side will be stressed here, the pro- 
found unity of his thought must be remembered. 
Cournot started with mathematics, was then at- 
tracted to economics, and ended with a general 
interpretation of the world that was infused with 
his profound understanding of probabilities. It is 
the concept of probability that integrates the three 
parts of his work. 

Cournot’s rather melancholic and solitary tem- 
perament considerably delayed the influence he 
was ultimately to have. Modest and self-effacing, 
he did nothing to make his books attractive. They 
tend to be austere, crowded with facts and proofs. 
Even the titles he chose reflect his modesty: 
Researches, Essay, and Considerations. It is not 
surprising, then, that he is less well known in the 
field of probability than Laplace and Poisson, less 
appreciated among economists than Bastiat, Say, 
or Proudhon (to mention only French economists ), 
and less quoted in philosophy than Comte or 
Spencer. And yet Cournot, having the rare knowl- 
edge required to use the language of all of these 
authors, combined many of their qualities and 
made a quasi-prophetic synthesis of their ideas. 

Mathematics and probability theory. Cournot’s 
mathematical works, econometrics aside, appeared 
between 1840 and 1850. In his Exposition de la 
théorie des chances et des probabilités (1843), he 
put forward a definition of statistics as that sci- 
ence which deals with collecting and coordinating 
numerous facts of every kind, in such a way as to 
obtain numerical relationships that are markedly 
independent of the anomalies of chance, and that 
manifest the existence of uniformly operating 
causes whose effects have been confounded, how- 
ever, with other, accidental effects. 

Cournot, living in the first half of the nineteenth 
century, stood at the crossroads of two ways of 
mathematical endeavor: the one originated with 
Pascal and Fermat and led to the work of Jacques 
Bernoulli, Gauss, Laplace, and Poisson on the doc- 


trine of chances; the other, renouncing the mathe- 
matical study of chance and uncertainty, focused 
on the mathematics of rigorous determinations by 
algorithms that admit no margin of uncertainty— 
as if science were perfectly deterministic. This 
concept that the perfect is the determined was to 
play a major role in the development of the early 
science of economics, as one can see from the still 
current term “perfect competition.” Cournot fore- 
saw that science could not be intrinsically and 
definitively tied to such determinism. He believed 
that a science of margins and chances is not only 
viable but perhaps better suited to the needs of 
economics than a science of absolute, exact equi- 
libria. Nonetheless, Cournot is often regarded as 
having introduced into economic theory the use of 
deterministic mathematics. 

Economics. Cournot wrote three works in eco- 
nomics, in 1838, 1863, and 1877; thus, his career 
began and ended with writings on economics. 
When Recherches sur les principes mathématiques 
de la théorie des richesses (1838a) first appeared, 
it was such a fiasco that Cournot remained silent 
on the subject of economics for 25 years. He then 
wrote Principes de la théorie des richesses (1863), 
putting into “literary” language what he had pre- 
viously said in the language of mathematics. De- 
spite the concessions Cournot made in the form of 
presentation, the Principes was no better received 
than the Recherches; indeed, if Cournot had pro- 
duced only his nonmathematical work, he would 
never have been recognized as anything but a 
minor figure. 

Strictly speaking, to be sure, Cournot was not 
the first to have used mathematical language to 
express economic problems. There was Nicolas 
Canard, in France, whom Cournot did quote, but 
Canard did not have Cournot’s scope and erudi- 
tion. Great erudition is needed to combine eco- 
nomics and mathematics, and if mathematical 
economics has had its setbacks, the reason is that 
it has not always been guided by such clear minds 
as Cournot’s. 

Cournot’s great merit is that, without saying so 
explicitly, he was the first to construct a true theory 
of prices and markets. Chapter 4 of Recherches, 
entitled “Of the Law of Demand,” is the first model 
of its kind. Cournot was interested exclusively in 
the demand that is followed by an actual sale and 
that is, therefore, observable and measurable. He 
is recognized for having revealed the concept of 
function and for having made available to econo- 
mists the immensely useful language of functional 
concepts. Sales are, in general, a decreasing func- 
tion of price. This function is continuous, at least 
when the number of consumers is not limited. 


Moreover, the sales function is not purely abstract; 
it may be constructed on the basis of mean an- 
nually observed data, And since it is not merely an 
a priori function but an empirical, experimental 
one, Cournot may be considered to have made the 
initial step in developing econometrics. 

In this econometric mode of analysis, a related 
idea is that of imperfection, hence of uncertainty 
and, in turn, of chance in measurement. Here 
again, Cournot’s knowledge of probabilities saved 
him from incorrectly associating mathematics with 
the idea of rigorous precision. He asserted that 
even if the object of numerical calculation “were 
unattainable, it would be nevertheless not improper 
to introduce the unknown law of demand into the 
analytical combinations, by means of an inde- 
terminate symbol; for it is well known that one of 
the most important functions of analysis consists 
precisely in assigning determinate relations be- 
tween quantities to which numerical values and 
even algebraic forms are absolutely unassignable” 
([1838a] 1960, p. 48). We must give up trying to 
grasp what cannot be grasped rigorously. It is for 
this reason, perhaps, that we must reason mathe- 
matically “. . . by showing what determinate rela- 
tions exist between unknown quantities, analysis 
reduces these unknown quantities to the smallest 
possible number, and guides the observer to the 
best observations for discovering their values. It 
reduces and coordinates statistical documents; and 
it diminishes the labour of statisticians . . .” (ibid., 
pp. 48-49). 

Cournot’s first construction is worked out in this 
spirit: it consists in drawing up, within suitable 
limits, tables of correspondence between the values 
of the demand, D = f(p), and the price, p. In a 
second stage, the function pf(p), the total value 
of the amount sold, is considered. This becomes the 
crux of the theory of markets, Long before Alfred 
Marshall, Cournot presented a theory of elasticity: 
depending on whether AD/Ap < D/p or AD/Ap > 
D/p, the price increase will make the product 
pf(p) larger or smaller. Hence commercial statis- 
tics should begin by dividing merchandise into two 
categories, depending on whether their current 
prices are lower or higher than the value making 
pf(p) a maximum. In this formulation, Cournot 
was truly a great innovator. 

Instead of attacking the general price equilib- 
rium directly, as Léon Walras and Vilfredo Pareto 
were later to do, Cournot proceeded by gradual 
Steps. He started with monopoly, then considered 
competition limited to a few participants, and in 
the end took up the case of indefinite or unlimited 
competition, using the intercommunication of mar- 
kets to complete his theory. This procedure has 
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been criticized, for economists in the classical tra- 
dition follow the inverse procedure, starting with 
unlimited competition or, as they call it, perfect 
competition and ending up with monopoly, How- 
ever, Cournot’s point of view has again been 
adopted in the modern theory of games and of the 
rational search for decisions. Although his theory 
of duopoly was criticized in 1883 by the mathema- 
tician Joseph Bertrand, the theory of bilateral 
monopoly was later to be erected on a similar base. 
The model with two parties was changed to a three- 
sided model (triopoly), and as the number of par- 
ties increased, the system was called oligopoly, 
pliopoly, and finally polypoly; the larger the num- 
ber, the more closely the model approached that of 
classical competition. Today, makers of models no 
longer believe that there is an irreducible differ- 
ence between models with many elements and 
those with few, but instead they relate the theory 
of markets to the general theory of economic inter- 
action; hence their work is in the tradition of Cour- 
not’s model, at first so poorly understood. There is 
still much to be learned from this 1838 model. 
Although Cournot has been rehabilitated, his con- 
tribution to economics has not been exhausted. 
Philosophy. In Cournot’s philosophical works 
there emerge a philosophy of order and one of 
history. Cournot firmly maintained that, appear- 
ances to the contrary, chance does not imply dis- 
order. By virtue of the theory of probabilities it is 
possible to see regularities: it is, as it were, the 
point of intersection of multiple independent causal 
series. And it permits the joining of the sciences 
with philosophy. The meaning of history, for all its 
incoherence, and the meaning of the future, for all 
its unpredictability, are profoundly related. To dis- 
cover the course of ideas and events is the function 
of knowledge and the vocation of the human mind. 


Cournot was a pioneer. He did nothing to court 
his contemporaries, and they, in turn, not only 
failed to appreciate him but ignored him. By a 
fitting reversal, his triumph came 80 years after 
his death. The most advanced of the econometric 
school recognize him as their ancestor. The theory 
of probabilities, whose full import Cournot realized, 
is a vital component of the structure of recent sci- 
ence. The problems that did not interest the men 
of his time are those that guide the building of 
tomorrow’s world; Cournot had anticipated the con- 
cern with predictions and decisions that now pre- 
occupies economists. 

HENRI GUITTON 


[For the historical context of Cournot's work, see biog- 
raphies of the BERNOULLI FAMILY; Gauss; LAPLACE; 
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Porsson. For discussion of the subsequent develop- 
ment of his ideas, see DEMAND AND SUPPLY; OLIGOP- 
OLY; PROBABILITY.] 
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COURTS 
See ADJUDICATION; JUDICIAL PROCESS; JUDICI- 


ARY. 


COURTS MARTIAL 
See MILITARY LAW. 


CRAFTS 


The term “craft” derives from the Anglo-Saxon 
cræft, meaning “strength, skill, or cunning,” in 
contrast to “art,” which usually implies an inten- 
tion of producing beauty or pleasure. Contempo- 
rary scholars often consider “arts and crafts” 
together because of the difficulties in so many 
societies of differentiating the aesthetic from the 
strictly utilitarian. Anthropologists prefer to use 
“technology” to refer to the processes of manufac- 
ture and “material culture” for the artifacts them- 
selves. Since crafts include all activities that 
produce or modify objects by manual means, with 
or without the use of mechanical aids, such as 
looms or potters’ wheels, the range of study is very 
broad. There is an equally wide range of social 
forms within which the craftsmen operate. 

In a second sense, “craft” is synonymous with 
“guild,” commonly used as a term of class ascrip- 
tion and role delineation in contemporary sociology, 
and is applied to occupational associations. Within 
this meaning the status of craftsmen differs by 
culture, epoch, and craft. 


History of the study of technology 


It was the change in the role of the craftsman 
that first drew the attention of scholars to the 
importance of studying the history of technology. 
When it became apparent in the middle of the 
nineteenth century that the impact of the indus- 
trial revolution was causing rapid degeneration 
in traditional peasant crafts, European folklorist- 
ethnographers set themselves to recording and 
collecting as much as possible before the crafts 
became extinct. As these collections grew, the 
Study of technology grew with them and raised 
some of the basic questions with which the infant 
science of anthropology was to be concerned. The 
exhaustive collections of European museums, often 
made under the impetus of chauvinistic national- 


ism, tempted some scholars to postulate a number 
of diffusion theories of differing degrees of sophis- 
tication and other scholars to devote their careers 
to the refutation of these theories. The product of 
these controversies has been of infinite value to the 
social sciences. 

European students of local arts and crafts have 
made superbly documented analyses of the diffu- 
sion of particular objects, styles, and techniques, 
such as de Rohan-Csermak’s study (1963) of the 
spread of sturgeon hooks across Eurasia, and most 
European countries now boast folk museums or 
reconstructed villages where peasant crafts are 
produced, displayed, and sold. Folk craftsmen, 
along with folk musicians, dancers, and other 
artists, are subsidized by the national governments 
to discourage crass commercialization and to en- 
sure the continuity of tradition. In many countries, 
such as Sweden, Denmark, Mexico, and Japan, folk 
craft products have been adapted to the world 
market to such an extent that they have become 
a valuable asset in economic development and 
national prestige. 

However, American scholars view much of this 
European research as both questionable and beside 
the point. On the basis of the work of Sapir (1916, 
pp. 5-25), Dixon (1928, pp. 145-146), and others, 
they tend to feel that far-reaching historical recon- 
structions on the basis of distribution studies of 
artifacts are unwarranted, and they are particu- 
larly cynical if the result happens to support a 
nationalistic claim. Only very limited and tentative 
historical inferences are allowable from the study 
of artifacts, and then only in a restricted area 
whose historical unity can be assumed, or where 
the artifacts are used to corroborate and illustrate 
historical documentation. Archeology, which might 
be defined as the study of the more durable arts 
and crafts of past societies, provides a model. In 
spite of extreme care in distribution study and 
typing of artifacts, archeologists reach only very 
tentative conclusions and are loath to attempt the 
outlining of even a small segment of protohistory. 

Thus, in their zeal to understand more fully the 
relationships between art and culture, American 
scholars eschew historical reconstruction based on 
techniques for chipping flint or the distribution of 
beads. Instead, such monographs as Bunzel’s The 
Pueblo Potter (1929), O’Neale’s Yurok—Karok Bas- 
ket Weavers (1932), and Adair’s The Navajo and 
Pueblo Silversmiths (1944) analyze the integration 
of craft activities with the structure of living socie- 
ties, the economic base of the crafts, their social 
functions, the roles of the craftsmen, and the 
value systems that may be expressed through them; 
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these monographs are, accordingly, as pertinent to 
art historians, aestheticians, and psychologists as 
to anthropologists. But even when the art is mori- 
bund or the culture extinct, intensive study of 
museum specimens can yield useful insight into 
the nature of style and the processes of creativity. 
In this way, Olbrechts (1946) was able to postu- 
late the so-called Master of Buli as the creator of 
the long-faced Luba sculptural style, and Pros- 
kouriakoff (1950), through analysis of significant 
stylistic details on dated Maya stelae, was able to 
give provisional dates for the great mass of un- 
dated Maya sculpture. The inconclusive results of 
studies in measurement, such as Reichard’s Mela- 
nesian Design (1933), or in typology, such as 
Schweeger-Hefel’s Holzplastik in Afrika (1960), 
suggest the need for more sophisticated statistical 
and computer analyses of dimension, proportion, 
and the frequency of particular stylistic character- 
istics in objects from a given area, the ultimate 
goal being the mathematical quantification of each 
style. Research projects by Yehudi Cohen and Alvin 
Wolfe, as yet unpublished, are hopeful attempts to 
correlate particular styles or forms of art with types 
of social organization. Where the arts and crafts 
are still practiced in reasonably traditional ways, 
it would be possible to study the aesthetic value 
systems of the craftsmen and their audiences and 
thus help answer the ancient questions about the 
existence of an absolute criterion of art above cul- 
ture, epoch, and individual. 


The social organization of crafts 


Craft organization by sex. In most human so- 
cieties throughout history, the crafts have been 
organized primarily on the basis of the sexual divi- 
sion of labor, with all the men of an ethnic group 
learning with varying degrees of skill all of the 
“masculine” crafts, and all the women learning a 
separate set of “feminine” crafts. Although it would 
seem logical for women to make clothing and 
utensils for food processing, and for men to pro- 
vide shelter, transport, weapons, and hunting and 
fishing equipment, actually there is no clear con- 
sensus as to which sex should practice which par- 
ticular crafts. Women are the weavers in the 
American southwest, but men are in Africa. Men 
of the Northwest Coast Indians design the Chilkat 
blankets by making drawings on boards, but women 
do the actual twining process. Men are the potters 
in Europe, India, and central Africa, but women 
are in west Africa and the Americas, Bark cloth is 
made by men in Africa, but by women in southeast 
Asia. Baskets made for the home are usually the 
domain of women, but men may make baskets for 
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sale or barter. In spite of its obviously arbitrary 
nature, the division of craft labor by sex is usually 
considered a “law of nature,” the breaking of which 
brings serious consequences. Many nonliterate so- 
cieties provide the role of the berdache, or trans- 
vestite, for men who are not necessarily homosex- 
uals but who choose to practice, and often excel 
women in, the feminine crafts. Even in Western 
culture, where industrialization has blurred the 
sexual division of labor, female welders, engineers, 
or surgeons face as much opposition as male nurses 
or secretaries, although there is no innate supra- 
cultural reason why they cannot perform the re- 
quired services as well as, or in some cases better 
than, the opposite sex, to whom the role is usually 
assigned. 

Craft organization by family. A second basic 
structuring device for crafts is the family unit. 
This device occurs in fairly complex societies and 
is seen in its most extreme form in Indian occupa- 
tional castes. Here all the men of a family practice 
the same craft, the techniques of which are handed 
down from father to son. Ideally at least, every son 
follows the same craft as his ancestors and marries 
the daughter of a member of the same caste. 
Wherever appropriate, the women also practice the 
craft or help in the preparation of raw materials. 
Such endogamous occupational castes as sonar 
(goldsmiths), lohar (blacksmiths), kumhdr (potters), 
and camdr (leatherworkers) prove quite stable 
even when technological change or emigration re- 
quires radical modification of their occupational 
activities to keep their product marketable. Many 
occupational castes have subcastes, each endog- 
amous, which specialize in particular designs or 
types of the caste product. Although castes effec- 
tively minimize competition, they are all interde- 
pendent economically because they need one an- 
other's products. 

In many other societies castelike family organi- 
zation exists among craftsmen. The Senufo and 
Bambara of west Africa, for instance, group all 
artisans together in a single endogamous kin unit, 
whether they be male blacksmiths, jewelers, wood 
or stone carvers, or female potters. They usually 
live apart in their own villages or in compounds 
at the edge of towns. Even though they obtain 
most of their livelihood from farming, as do other 
Senufo, they have a separate initiation system and 
are both feared and disdained because of the 
magical associations of their callings. In central 
Africa those who know the secret of smelting iron 
from local ore are also considered magicians but 
are often of chiefly status and are accorded great 
admiration, rather than fear. 

In many other cultures, such as traditional 


Japan, China, Dahomey, and Polynesia, the right 
to practice a particular craft was hereditary, and 
these craft families often served as courtiers who 
supported the political authority by providing suit- 
ably elegant objects for the king and his court. In 
contemporary Euro-American culture one may find 
families who have practiced a particular and often 
highly specialized craft for many generations and 
often in many countries. Examples are the Dol- 
metsch family, makers of recorders and other 
archaic instruments, the Zildjian family, who 
alone possess the formula for making the finest 
cymbals, and, in the humbler crafts, Swiss wood- 
carvers and watchmakers, Belgian lacemakers, and 
Irish and Scottish tweed weavers. 

Craft organization by village. Since nearly all 
crafts require particular materials for their manu- 
facture, craftsmen try to settle as near as possible 
to their source of supply or, alternatively, to their 
markets. In this way, an area possessing an out- 
cropping of fine pottery clay will often develop one 
or more villages of potters. In such a situation, it 
is not difficult to see how potters originally of dif- 
ferent origins could combine forces to regulate 
prices, standardize designs or specializations, mini- 
mize competition, and ultimately to intermarry and 
form a complex kin unit. Within such a craft vil- 
lage, the family organization is likely to approxi- 
mate a caste; indeed, in India many craft villages 
are inhabited by a single occupational caste. Even 
where there is no limitation on marriage, the com- 
mon economic and technological interests of the 
young people tend to draw them together socially. 

Just as with interdependent castes, craft villages 
and families also are complementary. For instance, 
craft villages in Melanesia trade extensively with 
one another, often in connection with ceremonial 
exchanges. The need for one another's exclusive 
products is so great that trade is often carried on 
through neutrals during periods of war. In the 
Nilgiri hills of south India, three non-Hindu tribes 
have a similar symbiotic relationship, the Toda 
providing clarified butter (ghi) for food and ritual 
purposes, the Kota cultivating grain, and the 
Badaga serving as craftsmen, merchants, and 
musicians. 

Mexican popular arts, probably the richest and 
most varied being produced today, come from 
family groups in Mexico City or from small craft 
villages. In the Lake Patzcuaro district of Michoa- 
can, for instance, Uruapan produces fine lacquer 
work; Paracho, musical instruments, toys, and 
lathe-turned wooden objects; Santa Clara del 
Cobre, hammered copper utensils and fringed 
black serapes; Apatzingán (until recently), leather- 
covered equipale furniture; Patamban, Capula, and 


Tzintzuntzan, three or more distinctive pottery 
wares; Erongaricuaro, woven cambaya cotton tex- 
tiles and embroideries; and Patzcuaro itself, rebozo, 
reed mats, and silver jewelry. Even in pre-Colum- 
bian times the Tarascan Indians of Michoacan had 
a high degree of craft specialization, including 
lapidaries, stonemasons, woodcarvers, paper and 
feather workers, weavers of cotton textiles, mat- 
makers, lacquer makers, drum makers, and makers 
of bows and arrows. Today the products not con- 
sumed locally are sold in nearby Morelia, and a 
few reach international markets. Similar conglom- 
erates exist around Guadalajara, Toluca, Puebla, 
Oaxaca, and other Mexican cities, and there are 
many isolated craft-producing areas, such as Santa 
Maria del Rio for silk rebozo and Saltillo for 
serapes. 

Part-time professionals. In most societies there 
is no economic surplus to support full-time crafts- 
men, so that all practitioners of crafts, even those 
recognized as consummate experts, must make 
their livings largely through agriculture. As an 
example, the Chokwe of Angola and the Congo 
(Leopoldville) teach all boys to carve mahamba 
figures for hunting magic and jinga charms for 
female fertility in the course of the mukanda 
initiation rites. Except for a few special cases, no 
full-time artists exist, but a number of skilled 
carvers have so impressed their individual styles 
on the community that their work can be recog- 
nized far and wide, and young men pay for the 
privilege of being apprenticed to them. Yet such 
a master carver (songi) works his farm and hunts 
at the end of the rainy season. The few masks, 
figures, or stools he might be commissioned to 
make in the course of a year would bring him 
only a small amount in cash or livestock to sup- 
plement his regular income from the sale of pea- 
nuts, manioc, or other crops. The paucity of output 
helps explain why each piece is stylistically unique 
and why traditional carvers employed full time as 
producers or teachers turn out only repetitive and 
stereotypic work. Such part-time experts also super- 
vise the making of masks and paraphernalia for 
the elaborate socioreligious rites, for instance, of 
the west African Dogon (Griaule 1938) and the 
Melanesian Orokaiva (Williams 1940). 


Development of craft associations 


In all their variety of style and technique, most 
crafts can be shown to have been invented at least 
twice (in the Old World and in the New World) 
and probably many more times in the hundreds of 
centuries man has had at his disposal, and they do 
not seem to follow a strongly evolutionary pattern 
of development. On the contrary, the elegantly 
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idealized animals of the Magdalenian cave painters 
were drawn virtually with the same methods, tools, 
and materials as are found in a contemporary art 
studio. By 1500 B.c. craftsmen in Egypt and the 
Near East, using almost all the techniques now 
known to us, were able to turn out work compa- 
rable to the finest of any other epoch. Five hundred 
years earlier in Sumeria, craft guilds were formed 
to protect full-time craftsmen working in the royal 
courts and developing urban centers. Craft asso- 
ciations with such features as the apprenticeship 
system, standardization of production, and a divi- 
sion between utilitarian and luxury goods existed 
in most classical cultures, But for a long period 
after the fall of the Roman Empire, craftsmen 
could safely earn their livings only as retainers of 
an isolated and virtually self-sufficient baronial 
manor. Because a weaver, armorer, or saddler 
could produce more than even a populous manor 
could consume, full-time craftsmen in late medie- 
val times began to move into towns, where they 
could offer their services to more numerous and 
richer clients. 

Town craftsmen accepted commissions to be 
carried out to the taste of the purchaser, and often 
with materials provided by him. Craftsmen also 
discovered that they could profitably produce ob- 
jects in advance for sale to all. In this situation 
the craftsmen were no longer working for wages 
as in the manor, but had become merchants buying 
and selling raw materials and finished products in 
their own right for profit. In such politically inde- 
pendent commercial centers as Milan, Florence, 
Barcelona, Bruges, and Ghent, merchant guilds 
were established primarily to create a legal mo- 
nopoly on the sale of local products within the 
town under municipal authority. The merchants 
in return could promise a “fair” fixed price for 
products of each quality and could effectively pro- 
hibit any attempts by their members to corner the 
market in these goods. Guild rules were strictly 
enforced, and expulsion meant that the merchant 
could no longer do business in that city. Member- 
ship became so desirable that it could be attained 
only by purchase or inheritance, not unlike a seat 
on a contemporary stock exchange. A further im- 
portant function of merchant guilds was the lend- 
ing of money to members at equitable rates. In 
time the wealthier merchants began to purchase 
for export some or all of the output of other crafts- 
men at less than retail prices. This wholesale trade 
proved so profitable that merchants soon set up 
central craft workshops employing craftsmen full- 
time, foreshadowing the factory. 

By the fourteenth century the merchant guilds 
had become too cumbersome for efficiency and 
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broke down into smaller units limited to the prac- 
titioners of a single specialized craft within a city. 
These craft guilds, or “companies,” besides pro- 
tecting their members from competition, ensured 
fair wages and fair prices, forbade night work, 
inspected workrooms, and protected the appren- 
tices and journeymen. The considerable revenues 
received by the craft guilds were expended on 
public works, such as bridges and roads, religious 
processions, public festivals, and charity and occa- 
sionally were used for political purposes. The 
guilds, often run by powerful family dynasties, 
resisted both technical innovation and the growth 
of trade as destructive of their monopoly. By the 
end of the seventeenth century the improvement 
of transportation made possible a greater exchange 
of goods, and the power of the guilds was gradu- 
ally broken by the spread of laissez-faire economic 
policies. Craft guilds were legally abolished in 
England in 1835, but some continue to have ritual 
functions. Among the contemporary Yoruba of 
southwestern Nigeria, new guilds are being organ- 
ized for recently introduced crafts, and their pat- 
tern of development parallels that of European 
guilds, 

Outside the secular urban setting of guilds, full- 
time professional craftsmen are often organized by 
means of socioreligious brotherhoods. In India 
painters who were also Buddhist monks spent cen- 
turies decorating their cave retreats with gigantic 
murals, while medieval Christian monks in Europe 
lavished comparable time, effort, and taste on the 
illumination of sacred manuscripts. Female artists 
have been organized into similar associations, the 
incomparable textiles of the Incas of Peru having 
been produced by the acllacuna, women trained 
from childhood as weavers of ritual and burial 
robes. 

One characteristic distinguishes craft production 
in most cultures and epochs—a single individual 
carries on all the processes of production, from the 
gathering of the raw material to the finishing of 
the article. But when craftsmen are brought to- 
gether in a central workshop, it soon becomes 
apparent that some excel in one process and some 
in another. When each step in the production of 
a craft is carried out by a specialist who does noth- 
ing else, the result need not be inartistic even 
though standardized. Chinese porcelains pass 
through at least twenty hands in the process of 
production, and Japanese blockprints are turned 
out by a series of experts, each of whom carries 
out only one step. Although many crafts are now 
produced by production-line methods for the mar- 
ket, a surprising number of craftsmen have resisted 


specialization of technique and standardization of 
product, preferring aesthetic satisfaction to greater 


efficiency and profit. 
DANIEL J. CROWLEY 


[Directly related are the entries GILDS; PRIMITIVE ART.] 
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I 
PSYCHOLOGICAL ASPECTS 


Creativity, although currently much emphasized 
in psychological research, has been one of the most 
neglected topics in the history of psychology. The 
neglect of so complex a phenomenon as creativity 
by a psychology newly born in 1879 with Wilhelm 
Wundt’s founding of the first psychological labora- 
tory at Leipzig is understandable. In its attempts 
to cut its affiliation with its parent discipline, phi- 
losophy, and to establish itself as an empirical and 
experimental science, psychology turned its atten- 
tion to the simpler aspects of consciousness and 
behavior, e.g., simple sensory, perceptual, and mo- 
tor responses, for the study of which manageable 
techniques were available. In the second decade 
of the twentieth century, when behaviorism be- 
came the dominant emphasis in American psychol- 
ogy, the climate was even less congenial to such 
global and complex topics as that of creativity. 
Before creativity could become a conceivable con- 
cern of psychologists, the discipline itself had to 
change. 

Several events brought about that change: the 
demonstration by the gestalt psychologists, most 
notably Max Wertheimer and Kurt Lewin, that 
complex processes of thought and action could be 
brought into the laboratory and submitted to ex- 
perimental manipulation and measurement; the 
introduction of the topic of personality into aca- 
demic psychology, especially by Gordon W. Allport 
and Henry A. Murray; the demonstration by Mur- 
ray and his colleagues of the possibility of gaining 
an understanding of personality through the use 
of a multiplicity of assessment techniques; the de- 
velopment of a cognitive psychology, particularly 
by the gestalt psychologists and Edward C. Tol- 
man; the reintroduction into psychology of the 
concepts of self (George H. Mead) and ego (All- 
port), and the development of an ego psychology; 
the development of interest in and techniques for 
the assessment of the effective functioning of man 
(the United States Army Air Corps selection pro- 
gram, under the guidance of John C. Flanagan and 
J. P. Guilford, and the Office of Strategic Services 
assessment program, under the leadership of Mur- 
ray); and Guilford’s presidential address to the 
American Psychological Association, entitled “Cre- 
ativity” (1950). 

What is creativity? In spite of the enormous 
amount of psychological research on creativity in 
recent years, there is today, as in the past, little 
conceptual agreement as to what creativity is: It 
is perhaps most often conceived to be the ability 
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to bring something new into existence, yet others 
think of creativity not as an ability but rather 
as the psychological process or processes by which 
novel and valuable products are created. For others 
creativity is not the process but the product. Rather 
than attempt a complete listing of all the proposed 
meanings of creativity, it will suffice to indicate 
that definitions of the term range in scope and 
complexity from simple problem solving to the 
actualization of self. 

Instead of seeking to choose from among this 
plethora of definitions, it might be argued, as it 
has been in the case of mental health, that cre- 
ativity is not a theoretical construct at all but a 
general rubric, under which fall a variety of eval- 
uative concerns. Such will be the orientation of 
the present treatment of creativity, and there are 
four major psychological aspects of creativity upon 
which attention will be focused: (1) the creative 
product; (2) the creative process; (3) the creative 
person; and (4) the creative situation. Each of 
these can be formulated as a question to which 
empirical research, if it has not already done so, 
can provide some answers: (1) what are creative 
products, and by what qualities are they identified? 
(2) what is the nature of the creative process, and 
what are the qualities and kinds of psychological 
processes that lead to the production of new crea- 
tions? (3) what are the distinguishing traits and 
characteristics of the creative person? (4) what 
are the characteristics of the creative situation, 
the life circumstance, the social, cultural, and work 
milieu, that facilitate and encourage the appear- 
ance of creative thought and action? 


The creative product 


Anything that is experienced or made by man, 
for example, an idea, a response, a performance, 
a work of art, a scientific theory, a building, etc., 
may be a creative product. But of these, only a 
subclass that meets, at least to some degree, cer- 
tain specified requirements or criteria will qualify 
as creative products. 

To state that the first requirement of a creative 
product is simply that it be something novel or 
original is not sufficient; for one must next ask, 
novel or original within what range of experience 
or frame of reference—that of an individual, of 
a group, or of mankind? Much of a young child's 
experience and many of his ideas will be new to 
him and in that sense creative for him; but if these 
experiences and ideas are ones that practically 
every child encounters in the course of growing 
up, they are not creative products for the society 
in which the child lives. Similarly, a man may 
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think a thought new to him, yet it may be one of 
the most common thoughts in the whole world. 
Thus, the creativeness of a product, when judged 
in terms of its novelty, originality, or statistical in- 
frequence, is always relative to a given population 
of products. The most creative products are those 
that are novel or original in the experience of an 
entire civilization or of all mankind. 

Mere novelty or originality of a product is, how- 
ever, not alone sufficient to justify its designation 
as creative. There is the second requirement that 
it be adaptive to reality. It must, in other words, 
serve to solve a problem, fit the requirements of 
a given situation, accomplish some recognizable 
goal. Artistic creation, no less than scientific crea- 
tion, involves the solving of a problem: e.g., in 
painting, to find a more appropriate expression of 
one’s own experience; in dancing, to convey more 
adequately a particular mood or theme, etc. 

But all solutions to problems are not equally 
good, even though they all may be equally correct 
or right. Thus, there enters a third requirement 
that a creative product must meet, although this 
requirement, like all the others, can be fulfilled 
to varying degrees. A response or product, if it is 
a truly creative one, meets the demand that the 
answer it provides be an aesthetically pleasing one. 
It is not sufficient that a solution is offered; it must 
also be, in the mathematician’s term, elegant. A 
product that meets this criterion is at one and the 
same time simple and complex, its apparent sim- 
plicity masking great complexity and its complex- 
ity hiding a simplicity that binds many complex 
elements into a single whole. 

The truly creative product meets a fourth cri- 
terion, namely, that it in turn creates new condi- 
tions of human existence. In order to do this, it 
must transcend or transform the generally ac- 
cepted experience of man by introducing new 
principles that defy tradition and radically change 
man’s view of things. Examples of creative prod- 
ucts that have in their time transcended and trans- 
formed the then existing constraints of reality are 
the heliocentric theory of Copernicus, Darwin's 
theory of evolution, and Freud’s theory of psycho- 
analysis. 

The fifth requirement of a creative product is 
that the insightful solution that underlies it be 
realized and that it be evaluated and elaborated, 
developed to the full, and communicated to others. 
In other words, the creative product must be pro- 
duced. 

In one sense much is known about creative 
products, since they are the manifest and tangible 
expressions or resultants of creative activity. A 


thorough examination of these should permit in- 
ferences to be drawn concerning the nature of the 
creative process, and it is only through the exist- 
ence of these products that creative persons can 
be distinguished from their noncreative peers and 
from those others whose creative potential has not 
yet found expression. Yet, curiously, although the 
problem of the criteria is basic to any study of 
creativity, no fully adequate and systematic study 
of the distinguishing marks of creative products 
has been made. 

All too often investigators have settled for ob- 
viously crude and fallible criteria, e.g., performance 
on so-called tests of creativity, self-reports of sub- 
jects on their creative interests and achievements, 
and over-all ratings of the creativity of products, 
rather than struggle with the more difficult task 
of developing better-differentiated criteria, In each 
field of creative endeavor there are several dimen- 
sions peculiar to the product of the field to which 
the five criteria listed above may be applied. For 
example, in determining the creativeness of archi- 
tectural designs, one could seek to determine the 
degree to which they satisfy the five criteria of 
originality, adaptiveness, elegance, transcendence, 
and realization in meeting the several traditionally 
recognized demands of architecture, namely, firm- 
ness (technological requirements), commodity 
(planning requirements), and delight (aesthetic 
requirements). A similar extension of the criteria 
of creativity could be undertaken in all fields of 
endeavor. 


The creative process 


Stages described autobiographically. The larg- 
est literature on the nature of the creative process 
is found in the writings of highly creative persons 
who, fascinated by their extraordinary creative ex- 
periences, have sought to describe them for others. 
There is remarkable agreement among those who 
have enjoyed the peak experience of high creativ- 
ity, as well as among psychologists who have made 
systematic analyses of the introspective and retro- 
spective reports of highly creative persons, as to 
how, in the main, the process is to be described. 
Both types of studies have observed distinguishable 
stages, or phases, of creativity. To be sure, differ- 
ent terms have been used by different writers to 
describe the same phases, and there has been some 
variation in the number of stages that have been 
noted. It has even been argued that the differen- 
tiated phases should be thought of, not as stages 
at all, but rather as aspects of the creative process, 
since they blend together and do not always occur 
in the order of their usual listing. Yet despite 


these disagreements, the following generalized de- 
scription has emerged. 

The first phase of the creative process involves 
a period of preparation, during which one acquires 
the skills and techniques and the elements of 
experience that make it possible for one to pose a 
problem to oneself. In one sense the individual's 
life history up to the moment of posing a problem 
constitutes the first, protracted phase of the crea- 
tive process. Some have asserted, however, that 
only the acquisition of those skills and techniques 
and elements of experience that are directly rele- 
vant to the solution of the problem constitute the 
preparatory phase and that such purposeful and 
directed acquisition of knowledge occurs only after 
one has already posed a problem to oneself; con- 
sequently, they have argued that the creative proc- 
ess always starts with the recognition of a problem. 

There is general agreement, however, that there 
follows next a period of concentrated attention 
in an attempt to solve the problem. This may 
involve a relatively brief period of time, during 
which attention is focused solely upon the problem 
until it is solved; but perhaps more often, and 
especially when the highest levels of creativity are 
ultimately reached, there is a blocking of one’s 
efforts to solve the problem and the experiencing 
of so much frustration, tension, and discomfort 
that one is led, out of sheer self-protection, to the 
third phase, a period of withdrawal from the prob- 
lem, a psychological “going out of the field,” a 
period of renunciation of the problem or recession 
from it. Following this phase, which is usually 
referred to as a period of incubation and which 
may be of quite variable length, there is the 
fourth, brief phase, a moment or period of insight, 
accompanied by exhilaration, glow, and elation at 
the moment of insight. The fifth and final phase 
is a period of verification, evaluation, elaboration, 
realization, and communication of the insight that 
has been experienced. These phases may be tele- 
scoped into a very brief period of time, as in musi- 
cal improvisation, or may involve a considerable 
span of years, as was required for Einstein’s crea- 
tion of the theory of relativity. 

Factor analysis and qualities of thinking. Al- 
though a gross analysis of the creative process re- 
veals the phases just described, a more searching 
analysis indicates that the creative process is a 
complex set of cognitive and motivational proc- 
esses, involving perceiving, remembering, thinking, 
imagining, deciding, etc. As more is learned about 
these processes, the more fully will the creative 
process itself be understood. Indeed, it was Guil- 
ford’s approach to the study of creativity and his 
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attempt to conceptualize the intellectual processes 
that enter into it that led to his construction of a 
new model of intellect, for he soon realized that 
the intelligence tests developed prior to 1950 had 
almost entirely ignored the crucial variables of 
creative thinking. Consequently, he set about to 
construct a battery of tests hypothesized to meas- 
ure these relevant but previously neglected varia- 
bles. From the factor analysis of the correlated 
test scores of large samples of persons have 
emerged what Guilford believes to be the primary 
intellective factors that account for individual dif- 
ferences in creativity: associational fluency, idea- 
tional fluency, originality, adaptive flexibility, spon- 
taneous flexibility, redefinition, and sensitivity to 
problems. 

Out of this work there has developed a now 
widely recognized distinction between convergent 
thinking, which places a premium on analysis and 
reasoning, measured by the traditional tests of 
intelligence, and divergent thinking, which places 
a premium on richness and novelty of ideas, for 
the measurement of which the new tests of crea- 
tivity have been developed. After a short period in 
which creativity was assumed by some enthusiasts 
for the new testing to be solely the result of diver- 
gent thinking, there is now general recognition that 
both divergent thinking and convergent thinking 
are required in all creative thought, although their 
relative proportions will vary widely from one crea- 
tive task to another. 

Other methods. In addition to the introspec- 
tive autobiographical and factor-analytic ap- 
proaches to the study of creativity, three other 
methods of study are frequently employed. One 
seeks to elicit and study creative performance 
under standard conditions of observation, e.g., a 
person is assigned a specific task, such as writing 
a poem, etc., under controlled conditions and 
within certain time limits. Another method, pre- 
ferred by the experimental psychologist, seeks to 
test particular hypotheses about details of the 
creative process, e.g., comparing the relative speeds 
with which the insightful solution to an assigned 
problem is achieved under varied conditions of 
initial information, instructions to the subject, 
level of motivation, etc. A fifth method, of more 
recent origin and radically different from the 
others, seeks insight into the creative process by 
attempting to simulate creative problem solving 
and thinking on high-speed electronic computers. 

Enhancement and inhibition of creativity. From 
the use of these several methods, information is 
being gained concerning, on the one hand, those 
factors that further the creative process and, on 
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the other hand, those factors that block it. Among 
cognitive factors that have been shown to hinder 
creative thought are failure to perceive and define 
a problem correctly; too much, as well as too little, 
information; and the embeddedness of required 
elements in a functional context that makes it 
difficult or impossible to perceive their relevance 
to the solution, i.e., functional fixedness; rigid per- 
sistence of a misleading set; the formulation of 
inappropriate and rigid categories and schemata 
of information; an excessively analytical attitude; 
premature closure of cognitive processes, etc. 
Among motivational factors known to inhibit crea- 
tive performance may be noted too much motiva- 
tion (leading to incapacitating anxiety); too little 
motivation; a preponderance of extrinsic motiva- 
tion (interest in external rewards) over intrinsic 
motivation (interest in a problem for its own sake); 
and a host of inhibiting fears, such as the fear of 
making a mistake, the fear of social disapproval, 
the fear of separateness and nonconformity, the 
fear of one’s images and impulses and of one’s 
own unconscious processes, etc. The obverse of 
these factors, both intellective and motivational, 
has been shown to facilitate creative thought and 
action. 


The creative person 


The determination of the characteristics of crea- 
tive persons requires first that such individuals be 
accurately identified and the level of their creative- 
ness measured. This is the troublesome problem 
of criteria, which is seldom if ever ideally solved 
in the study of the creative person. Far too often 
subjects’ performance on so-called tests of creativ- 
ity have been taken as indicating the level of their 
creativeness, despite the fact that it has not yet 
been clearly demonstrated that, either singly or 
in combination, an individual’s scores on the pre- 
sumed factor-pure tests of creative thinking yield 
a valid measure of his performance in actual crea- 
tive work. When, on the other hand, creative 
persons are identified and the level of their crea- 
tiveness is judged on the basis of their known 
products, such judgments, even when made by 
experts, are to some unknown degree contaminated 
and confounded by factors such as the subject’s 
social prestige and reputation. When, in a third 
type of study, the criterion of a person’s creative- 
ness is taken to be the degree of his self-actualiza- 
tion or the extent to which he is a fully functioning 
individual, the criterion is even more vague and 
suspect. 

Despite these unresolved methodological difficul- 


ties, there is an impressive congruence of findings 
from quite varied studies concerning the charac- 
teristics of the creative person. Those researches 
that have contributed most significantly to the 
picture of the creative person have relied on the 
opinion of experts who have identified members 
of a single profession whose creativeness, judged 
on the basis of their tangible creative products, 
has ranged from relatively low to extremely high. 
Such researches have involved intensive psycho- 
logical assessment of the nominated subjects. 

Relation to intelligence. The creative person 
appears to be intelligent, yet there is a far from 
perfect correlation between intelligence as meas- 
ured by intelligence tests and creativity. In several 
professional groups, e.g., architects and research 
scientists, the correlation is essentially zero. The 
relationship in these instances is doubtless to some 
degree attenuated, because of a restriction in 
range of both creativity and intelligence. The gen- 
eral conclusion with respect to intelligence and 
creativity is that a certain degree of intelligence is 
required if one is to be creative, but beyond that 
point being more or less intelligent does not deter- 
mine the level of a person’s creativeness; and the 
level of intelligence required for creativity, which 
varies from field to field, is sometimes surprisingly 
low. What is more important than the level of in- 
telligence as measured by an intelligence test is 
the effectiveness with which the creative person 
uses whatever intelligence he has. 

In addition to general intelligence, special skills 
and abilities appropriate to a person’s field of crea- 
tive endeavor are clearly necessary for the achieve- 
ment of high levels of creative performance. 

Psychological health and cognitive styles. As to 
the relationship between a person’s creativeness 
and his psychological health—a relationship widely 
studied and widely disputed—the situation is again 
not a simple one. There is no doubt that some 
quite disturbed persons have been highly creative; 
yet there is no doubt, also, that many persons of 
apparent good health, both physical and psycho- 
logical, have shown unusual creativeness. Perhaps 
the more general picture, and one that reconciles 
the opposites of well-being and pathology, is one 
in which there is a good deal of psychic turbulence 
and at the same time adequate ego control. In 
several studies, creative persons have earned, on 
the average, scores some five to ten points above 
the standard score mean of 50 on the clinical 
scales of an inventory that measures tendencies 
toward the major psychiatric disturbances. The 
meaning of such elevated scores, as well as the 


self-reports of creative subjects in psychiatric 
interviews, are less suggestive of psychopathology 
than of good intellect, complexity and richness of 
personality, general lack of defensiveness, and 
candor in self-description—in other words, an 
openness to experience and especially to experience 
of one’s inner life. Most typically, one finds evi- 
dence of psychopathology in test profiles of crea- 
tive subjects, but at the same time there is evidence 
of superior mechanisms of control. 

One of the prominent characteristics of the test 
performance of male creative groups is a tendency 
to score high on scales that measure femininity. 
This finding could be easily misunderstood. The 
proper interpretation is this: creative men reveal 
an openness to their feelings and emotions, a sen- 
sitive intellect, and an understanding self-aware- 
ness, and their wide-ranging interests include 
many that in Western culture are thought of as 
feminine. They give fuller expression to the fem- 
inine side of their nature than do their less crea- 
tive peers. In the language of Jungian psychology, 
creative males are not so completely identified with 
their masculine persona roles as to blind them- 
selves to or deny expression to the more feminine 
traits of the anima [see ANALYTICAL PSYCHOLOGY; 
and the biography of JUNG]. 

The perceptiveness of the creative person and 
his openness to richness and complexity are re- 
vealed also in his preference for perceptual com- 
plexity. As between perceiving and judging, they 
are on the side of perception—open to and recep- 
tive of experience. They are challenged by dis- 
ordered multiplicity and by the unfinished, which 
arouses in them an urge, indeed a need, to achieve 
the most difficult and far-ranging ordering of the 
richness they are willing to experience. 

The creative person is not only open to experi- 
ence; he is intuitive about it. In perceiving, one 
may emphasize sense perception, focusing upon 
what is yielded by the senses, things as they are, 
the facts; and in the extreme, one can remain 
stuck there, bound to the stimulus, the presented 
material, or the situation. Or one may, in any 
perception, be intuitive about and responsive to 
its deeper meanings, its implications, and its pos- 
sibilities, immediately grasping the real, as well 
as the symbolic, bridges between what is and what 
can be. Creative persons show a marked preference 
for intuitive perception. 

With their awareness of inner life and their 
interest in the symbolic equivalents of experience, 
it is not surprising that highly creative persons are 
more often introvert than extravert, though there 


CREATIVITY: Psychological Aspects 439 


is no evidence that introverts as such are more 
creative than extraverts. 

Relation to interests and values. Studies of the 
interests of creative persons have been remarkably 
consistent in revealing them to have interests sim- 
ilar to those of certain vocational groups and un- 
like those of others. If one focuses, not upon the 
specific interests that are shown, but upon what 
they reveal about those who hold them, it may be 
said that creative persons are inclined to be rela- 
tively uninterested in small details or in facts for 
their own sake and more concerned with their 
meanings and implications, cognitively flexible, 
verbally skillful, interested in communicating with 
others, and relatively disinterested in policing either 
their own impulses and images or those of others. 

As with interests, so in the realm of values, 
creative persons show a preference different from 
that of their less creative colleagues. Of the six 
values conceptualized by the philosopher Eduard 
Spranger—the theoretical, economic, aesthetic, 
social, political, and religious—creative persons 
strongly value the theoretical and the aesthetic. 
For them it is not sufficient that problems be 
solved; there is the further requirement that the 
solutions be elegant. They seek both truth and 
beauty. 

Although highly creative persons are character- 
istically independent in thought and action and 
are nonconformists in their ideas, they are not 
deliberately nonconforming. Often, in fact, they 
are quite conventional in matters and actions that 
are not central to their areas of creative endeavor. 

General profile and summary. If a general pic- 
ture of the productively creative person were to be 
drawn, it would read as follows: 

He is dominant, possessed of those qualities and 
attributes which underlie and lead to achievement 
of personal status; poised, spontaneous, and self- 
confident in social interaction, although not of an 
especially sociable or participative temperament; 
intelligent, outspoken, sharp-witted, demanding, 
aggressive, and self-centered; persuasive and ver- 
bally fluent, self-confident and self-assured; and 
relatively uninhibited in expressing his worries 
and complaints. 

He is comparatively free from conventional re- 
straints and inhibitions, not preoccupied with the 
impression he makes on others, and thus, he is 
capable of great independence and autonomy and 
is relatively ready to recognize and admit self- 
views that are unusual and unconventional. 

He is strongly motivated to achieve in situations 
in which independence in thought and action is 
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called for, but unlike his less creative peers, he is 
not inclined to strive for achievement in settings 
where conforming behavior is expected or required. 
In efficiency and steadiness of intellectual effort, 
however, he does not differ from his fellow workers. 

Finally, he is definitely more psychologically 
minded, more flexible, and possessed of more fem- 
ininity of interests than less creative persons. 

This is the generality. What needs to be equally 
emphasized is that there is no single mold into 
which all who are creative will fit. The full and 
complete picturing of the creative person will re- 
quire many images. 

The problem of the identification of the creative 
person by means of assessment techniques is still 
largely unsolved. Many personality correlates of 
creativity, both intellective and nonintellective, 
have been reported in the literature and summa- 
rized above. Which of them, individually or in 
combination with others, will most accurately iden- 
tify creative persons or persons with creative poten- 
tial in any given profession or field of endeavor is 
today not known. That is an empirical problem 
that seems much less difficult of solution today 
than it did in the past, but it is one that must be 
solved separately for each professional group. 
There is no general solution, to be applied to all 
professions, because of the enormous differences 
between them. A core of techniques and a basic 
methodology for the identification of creativity are 
now available. What is needed are the energy and 
ingenuity to modify and extend them in the study 
of creativity, and of the creative person, in all fields 
of endeavor. 


The creative situation 


To speak of a creative situation is to imply that 
creativity is not a fixed trait of personality but 
something that changes over time, waxing and 
waning, being facilitated by some life circum- 
stances and situations and inhibited by others. 
While such an assertion would be accepted by al- 
most everyone, much less agreement can be found 
for statements that specify the types of situation 
or life circumstance that facilitate or inhibit crea- 
tivity. Such disagreement merely underscores the 
continuing need, despite a considerable body of 
research findings, for studies to determine what 
kinds of situations contribute most significantly to 
the encouragement of what kinds of creativity in 
what types of person. The problem almost certainly 
has to be phrased in this manner, for it is unlikely 
that all persons will find the same situation equally 
conducive to creative effort. 


Questions concerning the facilitating or inhibit- 
ing effect of environment on creativity can be 
directed both to the historical past and to the con- 
temporaneous present of the individual. 

Life history. Looking first at life-history data, 
the widest variety of early circumstances and fam- 
ily situations has been reported for persons of high 
creativeness. Despite this diversity, there are, how- 
ever, several themes to be noted. For example, so 
recurrent is the theme of remembered unhappiness 
in childhood that one is led to speculate about its 
role in fostering creative potential. In the absence 
of a sensitive awareness of one’s own experience 
and that of the world around one, without consid- 
erable development of and attention to one’s inner 
life, and lacking an interest in ideational, imaginal, 
and symbolic processes, one can hardly be ex- 
pected to exhibit highly creative responses. Some- 
thing less than complete satisfaction with oneself 
and one’s situation in childhood, if not a prerequi- 
site for the development of a rich inner life and 
concern for things of the mind and spirit, may 
nevertheless play an important contributory role. 

In a nationwide study of creativity in architects, 
a number of events and circumstances have been 
discovered in the life histories of the more creative 
subjects that appear to have fostered their creative 
potential and independent spirit and to have pro- 
vided the opportunity, if not the necessity, for their 
developing the secure sense of personal autonomy 
and zestful commitment to their profession that so 
markedly characterize the creative person (Mac- 
Kinnon 1962). 

These factors may be briefly summarized as fol- 
lows: an extraordinary respect by the parent for 
the child and an early granting to him of unusual 
freedom in exploring his universe and in making 
his own decisions; an expectation that the child 
would act independently but reasonably and re- 
sponsibly; a lack of intense closeness between 
parent and child, so that neither overdependence 
was fostered nor a feeling of rejection experienced, 
in other words, the sort of interpersonal relation- 
ship between parent and child that had a liberating 
effect upon the child; a plentiful supply in the 
child’s extended social environment of models for 
identification and the promotion of ego ideals; the 
presence within the family of clear standards of 
conduct and ideas as to what was right and wrong, 
but at the same time an expectation, if not re- 
quirement, of active exploration and internaliza- 
tion of a framework of personal conduct; an 
emphasis upon the development of an individual 
ethical code; the experience of frequent moving, 


within single communities or from community to 
community or from country to country, which pro- 
vided an enrichment of experience, both cultural 
and personal, but which at the same time contrib- 
uted to experiences of aloneness, shyness, isolation, 
and solitariness during childhood and adolescence; 
the possession of skills and abilities that, although 
encouraged and rewarded, were nevertheless al- 
lowed to develop at their own pace; and finally 
the absence of pressures on the child to establish 
prematurely his professional identity. 

This particular set of early circumstances and 
interpersonal experiences, although observed in 
one study, takes on added significance because of 
its congruence with the set of life-history factors 
that Otto Rank thought so conducive to man’s 
winning his own individuality and the realization 
of his creative potential; with those interactions 
between the child and the significant others in the 
environment that Erik Erikson described as so 
crucial to the fullest development of ego; and with 
those experiences that, in Robert W. White's the- 
ory, sustain and nurture the fullest development of 
competence. 

It remains something of a moot question 
whether creative thought and action are most 
stimulated by solitariness and separateness—the 
individual working alone—or by interaction with 
others in group activity. There is mounting evi- 
dence that here again individual differences play 
an important role. 

Implications for education. To many, the im- 
plications of the observed characteristics of highly 
creative persons for the training or nurturing of 
creativity seem rather clear. In contrast to earlier 
emphases in education, there is today a tendency 
to stress freedom and autonomy for the child, a 
substitution of self-discipline for discipline imposed 
from outside, an openness to all ideas and a defer- 
ment of judgment in choosing from among them, 
the adoption of a more playful attitude toward 
study, engagement in imaginative play, the nur- 
turing of a feeling for analogies, similes, and 
metaphors, a searching for common principles in 
terms of which quite different domains of knowl- 
edge can be related, etc. It has yet to be demon- 
Strated, however, that these new emphases do 
indeed foster more creativeness than did earlier 
forms of education, which stressed rote learning, 
repeated drill of material, precise memorization, 
orderly habits of study, strict discipline, etc. It may 
well be that here, as in so many other domains of 
creative performance, some reconciliation of these 
Opposites (and the resultant tension that experi- 
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ence of the opposites produces) will turn out to 
be most nurturing of creative potential. 


DonaLp W. MACKINNON 


[Other relevant material may be found in AESTHETICS; 
FANTASY; PROBLEM SOLVING; STIMULATION DRIVES; 
and the articles listed under ART.] 
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ie 
SOCIAL ASPECTS 


Creativity is a concept that emerged during the 
Renaissance and was first given formal expression 
by the philosophers of the Enlightenment; its origi- 
nal use was as a term in the “great analogy” 
(Nahm [1956] 1965, chapter 2) between the divine 
creator of the natural world and the artist with his 
power to create, as Sir Philip Sidney put it, “forms 
such as never were in Nature” ([1595] 1895, p. 25). 
Modern associations of the concept are nearly all 
psychological: creativity is often held to consist in 
personal attributes such as spontaneity, originality, 
sincerity, high intelligence, or some combination of 
these or of similar qualities. It would seem, then, 
that the sociology of creativity (however creativity 
may be defined) should consist in the study of 
those social conditions which either favor or im- 
pede the appearance of creative individuals. 

But there is another sense altogether in which 
it is possible to speak of creativity as having social 
aspects. Civilizations, social institutions, formal or- 
ganizations, and small groups can all be called 
creative, and this may imply much more than that 
they include creative individuals. One possible im- 
plication is that there is something about the way 
in which a social unit is organized without which 
none of its members would have become creative. 
Another is that the members of a social unit may 
in some sense work together to produce collective 
achievements that reflect creative qualities of the 
group as a whole, 

Creativity can also be seen as an attribute of 
objects. Indeed, the terms “creative product” and 
“creative object” have by now become standard in 
the psychological literature on creativity (see, for 


instance, Bruner 1962, p. 8; Roe 1963, p. 155) 
and can be used to denote whatever is counted as 
evidence for the existence of individual creativity. 
It is here that a basic difference becomes apparent 
between the psychological and sociological ap- 
proaches to creativity. Understandably, much of 
the psychological research has been concerned with 
measuring creative potential rather than creative 
achievement; the focus of interest has been on the 
internal psychological processes that issue in the 
finished creative object, with the general assump- 
tion that, once the object has seen the light, the 
creative process is complete. It is admitted that 
various social factors may facilitate or retard the 
process, but they are not generally thought of as 
part of it. Psychological study of creative actions 
of course involves more extensive consideration of 
social factors, but such study is usually carried out 
in a context other than that of creativity (see, how- 
ever, Fiedler 1964). 

But sociological theories, whether they are de- 
veloped by professional sociologists or by historians 
and other authors with an interest in the quality 
of human achievement, also tend to give an incom- 
plete account of the creative process. Many authors 
have remarked on the way in which the creative 
object seems to live a life independent of its cre- 
ator, in the sense that the social mechanisms by 
which it is distributed, or the amount and type of 
attention paid to it, seem to be matters that are 
mostly beyond his control (see, for example, Rank 
1932, p. 215; Valéry [1940] 1952, pp. 94-96). But 
historical studies of such processes are usually con- 
fined to the level of particular social institutions, 
such as literature or the fine arts, at particular 
periods and do not make structural comparisons 
between institutions. 

However, there is a growing realization that 
practitioners of the various social sciences who 
have studied creativity may have been talking not 
about different sets of problems but about the same 
problems from different points of view (see the 
discussion in Mooney [1957] 1962). Moreover, a 
distinctively social conception of the creative proc- 
ess is slowly gaining recognition. Thus Rogers has 
defined the creative process as “the emergence in 
action of a novel relational product, growing out of 
the uniqueness of the individual on the one hand, 
and the materials, events, people, or circumstances 
of his life on the other” ([1954] 1962, p. 65). In 
similar fashion Guérard has argued that “the word 
Art applies at the same time to the creative urge. 
to the process through which that urge is mani- 
fested, to the material result of the process, to the 
appreciation of the result. These are different 


phases or aspects of the same reality” ({1936] 
1963, p. xxvii). The key notion in both of these 
definitions is that of the interrelatedness of the 
different phases through which the creative object 
passes in the course of becoming part of a culture. 
In short, and leaving on one side for the moment 
the question of how to establish criteria of cre- 
ativity, the creative process in its social aspect is 
the process by which creative objects are produced 
and then become cultural objects (for a discussion 
of the “process versus product” issue by psycholo- 
gists, see Utah Creativity Research Conference 
1964, pp. 112-121). 

But creative objects are not merely cultural ob- 
jects, nor are they usually created with the whole 
culture in mind, To produce a work of art or sci- 
ence, for instance, is primarily to make a claim on 
the attention of other artists or scientists. The 
artist may desire self-expression but, as a Marxist 
sociologist has expressed it, this “is really self- 
socialisation, the casting of . . . private experience 
in such a form that it will be incorporated in the 
social world of art and appear as an art-work” 
(Sprigg [1937] 1963, p. 202). The creative work of 
art or science does not only claim to be incorpo- 
rated into the institution appropriate to it; it also 
claims to modify or even subvert the collective ide- 
ology of the institution in some important respect. 
It is the fact that the claim is made good that en- 
ables us to identify the work as creative. In order 
to reach this conclusion it is not necessary to make 
assumptions about the intentions of creative indi- 
viduals, but only to note that the objects produced 
by such individuals do in fact exert a fruitful influ- 
ence on other members of the same institution. A 
creative object, then, is a successful demonstration 
that the sum of established institutional values 
does not exhaust the range of possible institutional 
values. It is through the incorporation of creative 
objects that institutions are saved from eternal 
elaboration of what is already known or believed. 


The creative process 


Person-oriented approaches, which focus on the 
psychological processes that issue in creation, can 
be contrasted with object-oriented approaches, 
which are more concerned with what is created. 
Many of the early person-oriented studies of cre- 
ativity foundered because they treated established 
fame as if it were an infallible index of creative 
achievement. Such studies therefore became in- 
volved not only with the vastly complicated prob- 
lem of estimating the degree of fame a creative 
individual enjoyed at any particular time but also 
with the fluctuations to which fame is subjected by 
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changes in taste or interest. But if an object- 
oriented approach is adopted, considerations of per- 
sonal fame and reputation become secondary; the 
main problem is to determine which scientific, 
artistic, or other objects had effects of an order 
that distinguished them from the common run of 
cultural objects. This is not to equate creativity 
with extent of influence, but it is to equate it with 
a certain type of influence on a certain institution 
during a specifiable period. The length of the period 
need not depend on anything but the purpose of 
the study. 

Social institutions. A cultural system can be 
divided into as many parts or subsystems as suit 
the purpose of an inquiry, but the principal sub- 
divisions are generally conceived in terms of the 
mutually irreducible classes of values that are gen- 
erated in the pursuit of major human interests. 
The organized cultivation of a distinctive class of 
values in pursuit of a major human interest will 
here be designated as a social institution. This view 
of social institutions is quite close to that adopted 
by the Whites in their study of the nineteenth- 
century French art world (White & White 1965, 
p. 2). However, it differs somewhat from the tra- 
ditional view in American sociology, which has 
tended to follow W. I. Thomas in regarding institu- 
tions as the systems formed by the “more or less 
explicit and formal rules of behavior by which the 
group tends to maintain, regulate, and make more 
general and more frequent the corresponding type 
of actions among its members” (Thomas, Social 
Behavior and Personality, p. 52). By contrast, the 
present definition emphasizes the ultimate values, 
whether religious, artistic, scientific, or political, 
that rules of behavior are designed to preserve. 

It is generally realized that the social structure 
of a society may affect the creativity of its institu- 
tions. Most writing in this area has focused on the 
problem of cultural integration and disintegration. 
Some authors have argued that a high degree of 
cultural integration is necessary for creativity to 
reach its greatest heights in any institution, but 
they fail to explain why all institutions are not 
equally creative, even in societies that they regard 
as highly integrated. Other authors, pointing to the 
greater efficiency achieved through an increasing 
division of labor both within and between institu- 
tions, have seemed to support the opposite point 
of view, though they have not been able to demon- 
strate that a high degree of institutional autonomy 
is inconsistent with a high degree of social inte- 
gration. A very few authors have confined them- 
selves to demonstrating that creativity in one in- 
stitution can have indirect effects on creativity in 
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another. More often, comparison between creative 
achievement in different institutional spheres has 
taken the form of seeking to explain one type of 
achievement in terms of another, as when an out- 
burst of artistic creativity is said to have been 
caused by a particular state of the economic sys- 
tem. But in any case it is clear that social institu- 
tions are interconnected as well as distinct, and 
there is a strong assumption, though little system- 
atic evidence, that the type and degree of cre- 
ativity that they display is somehow related to their 
mode of interconnection [see INTEGRATION, article 
On CULTURAL INTEGRATION]. 

Social institutions vary not only in autonomy 
and distinctiveness but also in structural complex- 
ity, which usually increases with size. But the 
major institutional functions remain the same in 
spite of these changes. Thus creative objects can- 
not become cultural objects unless they are ap- 
praised by members of an appropriate institution, 
nor can they be offered for appraisal unless they 
find an appropriate institutional outlet. The proc- 
ess of appraisal, which is essentially collective, is 
the means by which the institution asserts its claim 
to a distinctive interest in the cultivation of those 
values which are its raison détre. Appraisal is 
therefore a matter of deciding, for instance, not 
only how good a work of art is (a question that, in 
any case, is never finally settled) but first and fore- 
most of deciding whether it is a work of art at all. 
If the work is never displayed for appraisal through 
a recognized outlet—that is, if it is never published, 
exhibited, or at the very least brought to the atten- 
tion of someone whose influence on the process of 
appraisal makes him a kind of living outlet-—then, 
from an institutional point of view, it does not yet 
exist as a work of art. 

The nature of an institution’s outlets is some- 
times determined largely by its appraisal system, 
though under open market conditions the relation- 
ship is often reversed. The form taken by outlets 
is also shaped by technology, including marketing 
techniques, which tends, as it grows ever more 
complex, to create multiple outlets for ever new 
classes of consumers. But the intimate connection 
between outlets and appraisal persists in spite of 
technological change, though various new types 
of appraisal may arise to match the new types of 
outlet. Indeed, if the volume of products is very 
great, release of a product through a certain type 
of outlet may largely supplant appraisal as the 
means by which products are, so to speak, stamped 
with the required institutional marks of identifi- 
cation. 


An apparent difference between art and science 


in this respect has often been noted (see, for in- 
stance, Kuhn 1962, p. 163): an artist is thought 
to produce for a public that does not consist only 
of other artists, while a scientist's work is appraised 
only by his professional colleagues. But whether 
or not one perceives a real structural difference 
here depends on where one draws the boundaries 
of art and science as institutions. If the full range 
of scientific products is considered, from high-level 
theories to minor technological innovations, it im- 
mediately becomes obvious that, like artistic prod- 
ucts, they are submitted through different kinds of 
outlets for different kinds of appraisal. 


Creative ideologies 


Creative objects affect institutional ideology be- 
cause they are themselves ideological creations. It 
is convenient to call the particular constellation of 
values expressed by a creative object its “creative 
ideology,” since in this way the elements common 
to several apparently divergent traditions of thought 
about creativity can be brought together under 
one head. 

Personal and collective ideologies. The notion 
of artistic creation as essentially a process of con- 
flict between personal and collective ideologies was 
first developed by Rank, who defined the artist as 
someone who “constructively applies his will-power 
in the service of ideological creation” (1932, 
p. 159). Rank draws attention to the institutional 
base of the art form or style by calling it “the ideol- 
ogy of the art” (ibid., p. 112; compare p. 186), 
and he further distinguishes between this and the 
“something personal” of the artist, on the one hand, 
and the “general ideology of the culture,” on the 
other (ibid.; compare chapter 4, passim). Thus art, 
for Rank, is neither mere self-expression, since it 
must make use of forms that are collective in 
origin, nor simply an expression of collective ideol- 
ogy, since artistic creation fulfills purely personal 
needs for the artist (for a similar view, see Hauser 
[1958] 1963, p. 407). In fact, the more creative the 
artist, the more his personal needs will conflict 
with the needs of the collectivity, Rank therefore 
reverses the Freudian theory of creativity. The 
artist does not create because he is neurotically 
maladjusted to society; rather, he lives in a state 
of necessary conflict with society because he is 
creative (Rank 1932, p. 169). Rank’s views have 
not been generally adopted by psychoanalysts, al- 
though the bulk of empirical psychological evi- 
dence would appear to support him (see, for in- 
stance, Barron 1963). Nor has it been pointed out 
that Rank’s emphasis on the difference between 
the neurotic and the creative personality is remark- 


ably similar to W. I. Thomas’ distinction between 
the “Bohemian” and the “creative individual” 
(Thomas, Social Behavior and Personality, p. 159). 
But Thomas’ approach to the problem of creativity 
was essentially person-oriented. Rank, on the other 
hand, went far beyond both the person-oriented 
and the object-oriented approaches to the creative 
process by including in his schema not only the 
artist's own conception of his role (ibid., p. 202) 
and the social basis of artistic creation (p. 207) 
but also the use to which society puts the creative 
object. “The artist in himself provides in his work 
the raw material the community uses in the crea- 
tion of biographies and fame as an expression of 
its own eternalization” (p. 221). 

Internal and external ideologies. The ideologi- 
cal content of a creative object can also be analyzed 
into those value elements which pertain only to the 
institutions for which the object has been created 
and those which pertain to other institutions or, it 
may be, to no institution at all. The former ele- 
ments constitute the object’s internal ideology (as 
we speak of something as being of “purely artistic” 
or “purely scientific” interest), and the latter its 
external ideology (as, for instance, the glorification 
of aristocratic ideals in a portrait or the religious 
world view implicit in a system of astronomy). 

The extent to which art seems more capable 
than science of serving external ideologies—what 
might be called its apparent ideological penetrabil- 
ity—has been pointed out so often that some 
authors maintain that science as such has no ideo- 
logical penetrability at all. Nor is it generally ad- 
mitted that scientists are in any sense engaged in 
the production and elaboration of ideologies. This 
is probably because, with reference to science, 
ideology is considered only in an external sense. 
But the concept of paradigm, as developed by Kuhn 
(1962) in his study of scientific revolutions, has a 
distinct affinity with the concept of internal crea- 
tive ideology. A paradigm is a body of scientific 
achievement that is “sufficiently unprecedented to 
attract an enduring group of adherents away from 
competing modes of scientific activity . . . [and] 
sufficiently open-ended to leave all sorts of prob- 
lems for the redefined group of practitioners to 
resolve.” In this way it provides “models from 
which spring particular coherent traditions of sci- 
entific research” (Kuhn 1962, p. 10). Clearly, the 
same kind of service is performed for art by those 
artists who, whether singly or in a group, introduce 
a major new idiom or style that serves for any 
considerable length of time to define the range of 
artistic problems worth exploring. A scientific para- 
digm such as Newton’s laws of motion and gravi- 
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tation (Kuhn 1962, pp. 39-40) can certainly be 
compared, in terms of its long-lasting influence and 
eventual replacement by other models, with a 
“musical paradigm” such as the major scale (Allen 
[1939] 1962, p. 192). Artistic and scientific para- 
digms are creative objects par excellence, since 
they institute the guiding principles that make 
many other discoveries possible. Such paradigms 
are almost never the unaided work of one individ- 
ual, and it is this collective origin of theirs that 
makes them highly appropriate units of study for 
the sociology of creativity. In this respect they can 
be compared with what Ogburn called “basic in- 
ventions” ([1950] 1964, p. 23), though Ogburn’s 
preoccupation with the larger social effects of ap- 
plied science and technology seems to have kept 
him from exploring the effects of discoveries in 
pure science upon the structure of science itself 
(see especially Ogburn 1942, pp. 235-236). 


Creativity and ideological succession 


The most striking fact about creative ideologies 
is that all of them are sooner or later subject to 
change. Here, this phenomenon will be called 
ideological succession. Any theory of creativity, 
regardless of the social level to which it is intended 
to apply, can be classified according to the kinds of 
assumptions that its author makes about the dy- 
namics of ideological succession. Four major types 
of theory can be distinguished in this field: organi- 
cist, dialectical, societal, and factorial. 

Organicist theories. At the societal level, the 
most celebrated attempt in modern times at an 
organicist account of ideological succession is that 
of Oswald Spengler, whose historical pessimism in 
his Decline of the West, published 1919-1922, 
owed more to Nietzsche than to Darwin. The most 
impressive application of biological evolutionism to 
the problem of creativity was Francis Galton’s 
Hereditary Genius, published in 1869, in which it 
was demonstrated empirically that certain families 
consistently produced more men of distinction than 
could be accounted for by chance (a point on 
which Galton has never been refuted). From these 
findings Galton inferred not only that distinction 
in any field is the result of superior genetic endow- 
ment but also, as a kind of sociological extension 
of Darwin's principle of natural selection, that 
genetic superiority inevitably overcame social ob- 
stacles to achievement. 

All such theories and approaches had in com- 
mon a tendency to reduce creativity to a merely 
descriptive category with no independent causal 
role in the process of ideological succession. Re- 
gardless of the adequacy or inadequacy of the data 
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on which they are based, organicist theories lack 
any real explanatory power because, although they 
assume that some periods of civilizations (or some 
races, nations, or families) are more creative than 
others, they have nothing to say about the dynam- 
ics of ideological succession except that efflores- 
cence is inevitably followed by decline (or, in the 
case of Galton, will be followed by decline if nature 
is left to pursue its course). 

Dialectical theories. Theories of creativity 
based on dialectical reasoning claim, like organi- 
cist theories, both to explain what has happened 
and to predict what will happen. In order to sus- 
tain this claim, they are forced to conceive the 
dynamics of ideological succession in terms of a 
fixed superstructure that determines once and for 
all the possible range of basic ideological forms. 
Creative ideologies as they actually occur and suc- 
ceed one another are held to be composed of ele- 
ments attributable to one or another of these basic 
forms, either singly or in combination. The forms 
are eternal opposites but may interact in the tem- 
poral sphere to produce unique combinations. 

If the number of basic ideological forms is lim- 
ited, it follows that any concrete ideology at a par- 
ticular moment of its existence is closer to one 
basic form than it is to the others. The implication 
is usually that the more closely the ideology exem- 
plifies some basic form, the more creative it is, 
though a position of equilibrium between two or 
more opposing forms is sometimes regarded as the 
most creative state of all. At any rate, dialectical 
theories of ideological succession tend to be preju- 
diced in favor of historical periods when some 
clearly identifiable style or set of values seems to 
have dominated a whole culture in such a way as 
to give it—at least in the eyes of the historian— 
a certain coherence and internal consistency. 

Such periods of cultural integration are sup- 
posed to have been far more productive of out- 
standing creative achievements than the more 
confused periods of transition from one state of 
cultural integration to the next. Creative people 
born during periods of transition can hope to do 
little more than hasten the destruction of outworn 
ideologies and lay the foundations of the new. In 
this way, dialectical theorists seem to assign a 
somewhat more autonomous role to creative indi- 
viduals. But this concession is an illusory one, 
since the dialectical theory itself becomes the only 
criterion of what is to count as creative, Thus, for 
the revolutionary Marxist only what is “progres- 
sive’—i.e., consistent with the Marxist interpreta- 
tion of history—can be truly creative, though some 


of the more aesthetically-minded Marxists try to 
avoid the absurdities inherent in this doctrine by 
claiming that a bourgeois writer, for instance, can 
be “progressive” if his work mirrors the supposed 
decadence of capitalist society (Fischer [1959] 
1963, chapter 3). 

Some dialectical theories, however, contain valu- 
able sociological elements, since they treat ideo- 
logical succession not only in terms of the effects 
of one ideology on another but also in terms of 
the ideology’s structural base in the social system. 
Thus Sorokin, in his Social and Cultural Dynamics, 
published in 1937-1941, emphasizes that what at 
the cultural level appears as an abstract configura- 
tion of the ideological supersystem is, when viewed 
at the individual level, a matter of voluntary hu- 
man allegiance to certain concrete institutional 
structures, According to Sorokin, transfer of alle- 
giance from one basic form to another is possible 
because cultural integration is never absolute; 
values derived from other basic forms are always 
present in a subordinate cultural role. In short, 
Sorokin recognizes that a certain pluralism of val- 
ues may exist in any social institution and that if 
this pluralism is not preserved as a basis for ideo- 
logical succession, the culture will not be able to 
make the transition from one basic form to an- 
other and so will become “petrified, and uncreative” 
(Sorokin [1941] 1957, p. 25). 

Dialectical theories of internal ideological suc- 
cession at a purely institutional level are clearly 
of sociological interest only when they are related 
to accounts of institutional organization and struc- 
ture. They are, moreover, extremely difficult to 
construct, because of the conceptual problems in- 
volved in setting up a set of categories that between 
them exhaust the known range of specialized ideo- 
logical possibilities. The outstanding example of 
such a dialectic is Wélfflin’s theory of stylistic de- 
velopment in European painting during the six- 
teenth and seventeenth centuries (Wélfflin [1915] 
1956; for extensive comment, see Antoni [1940] 
1959, chapter 6; Hauser [1958] 1963, pp. 139- 
149). 

Societal theories. The revolution in social 
thought that took place during the period 1890- 
1910 was especially notable for its emphasis on 
Society as a unique kind of entity that could not 
be reduced to a combination of other kinds of en- 
tities [see Sociery]. The societal approach had 
little use for organicist or dialectical theories, since 
they seemed to ignore such distinctively social 
Processes as symbolic interaction, cultural accu- 
mulation, and socialization. Thus Cooley (1897) 


and many others (see Ogburn [1950] 1964, pp. 
19-20) rejected Galton’s geneticism on the grounds 
that it was far more plausible to assume a con- 
stant and uniform amount of creative potential in 
all societies and to attribute differences in creative 
achievement to manifest differences in social con- 
ditions, such as literacy. This environmentalist 
position appeared to have been confirmed beyond 
all reasonable doubt by studies showing that iden- 
tical twins, when reared in different settings, de- 
veloped differences that could not, it was thought, 
be accounted for in terms of heredity (Newman 
et al. 1937). Thus the famous controversy about 
the importance of “nature,” or genetic endowment, 
as compared with “nurture,” or socialization, was 
not so much settled as abandoned. The social Dar- 
winist model of creative achievement as a product 
of natural selection gave way to a model of creative 
potential as a kind of subterranean resource that 
had only to be “tapped” by the provision of educa- 
tional opportunities in order to become creative 
achievement. This model, reinforced by popular- 
ized Freudian notions of creativity as an attribute 
of the unconscious, still underlies much of Amer- 
ican educational theory and practice. 

Cultural base as creative agent. Another key 
assumption of the societal approach was that, al- 
though the historical process is not just a succes- 
sion of unique events, it does not evolve in any 
predetermined direction. This was a natural result 
of the vast increase, by the end of the nineteenth 
century, in the stock of historical knowledge, which 
now revealed processes that seemed too complex 
to fit into any deterministic theory. One such proc- 
ess was cultural diffusion, which for the first time 
began to be studied on a scientific basis [see Dir- 
FUSION]. Discovery of the great variety of influ- 
ences that went into the making of any advanced 
culture, and of the capacity of some cultures to 
absorb influences from almost any source while 
others succumbed to alien penetration, destroyed 
the heroic image of culture as something created 
locally by a few outstanding individuals and re- 
placed it with that of the common social heritage 
or “cultural base” (Ogburn [1950] 1964, pp. 24— 
25), which set the conditions for individual 
achievement. 

This increase in historical and anthropological 
knowledge was paralleled by an increase in bio- 
graphical knowledge of creative individuals; it be- 
came clear how much these individuals owed their 
pre-eminence to the peculiar historical circum- 
stances into which they had been born. Study of 
the diffusion of ideas from one creative individual 
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to another led to a search for more general societal 
mechanisms of cultural interchange and ideological 
succession. Thus it was pointed out that major dis- 
continuities in ideological succession—the decisive 
“preaks” in the history of ideas—could be analyzed 
in terms of generational series [see GENERATIONS]. 
Most generational studies have been person-oriented 
and have hardly advanced beyond the descriptive 
level. They do not attempt to explain what general 
social or psychological mechanisms could make 
one rising generation differ so much from another 
in its ability to free itself of its predecessors’ ideas. 
One such mechanism was, however, proposed by 
W. I. Thomas, who argued that social change pro- 
ceeds through a series of situations that disturb 
habitual ways of thought. These situations, which 
Thomas called “crises,” are perceived and reacted 
to in different ways by different members of the 
population affected, but only the creative individ- 
uals in that population are fully able to discrimi- 
nate between situations in such a way as to sur- 
mount habit and adjust themselves to radically new 
social demands (Thomas, Social Behavior and Per- 
sonality, p. 169, note 29; ibid., pp. 218-220). This 
theory of Thomas’ deserves more attention than it 
has received in recent years; certainly, the ability 
to deal constructively with changing circumstances 
is one aspect of creativity, and the empirical re- 
search that now exists on the diffusion of innova- 
tions might well be re-examined in this light [see 
DirFusion, article on INTERPERSONAL INFLU- 
ENCE], 

The city as creative setting. The theory of the 
city (for which see Martindale 1958) has also shed 
some light on the mechanisms of ideological suc- 
cession. The city is often held to constitute a 
uniquely creative setting; thus Weber declared: 
“The city and it alone has brought forth the phe- 
nomena of the history of art. . . . So also the city 
produced science in the modern sense” ({1919- 
1920] 1961, p. 234). 

The model underlying the city theory of ideo- 
logical succession is one of a multiple stimulus 
proceeding from a central and cumulative cultural 
base that is open to all the available influences of 
cultural diffusion. Proponents of this theory are 
therefore inclined to equate creativity with rapid 
growth in complexity, which they see as a neces- 
sarily cumulative process taking place at ecologi- 
cally determined central locations. This viewpoint 
is partly the result of an emphasis on particular 
organizational outlets and on the physical proper- 
ties of the cultural base—properties that are com- 
ing to seem far less relevant now that modern 
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techniques of mass reproduction and communica- 
tion can diffuse knowledge of creative objects at 
low cost, regardless of distance. 

Institutional creativity. The new emphasis on 
the variable properties of social systems and their 
elements resulted in a view of social systems and 
subsystems as creative agents. Those systematic 
properties which enabled the system or subsystem 
to vary adaptively in structure and type of output 
were assumed to be the same that caused creativity 
to emerge at the individual level. Durkheim's theory 
of the collective origin of all values and concepts 
finally caused him to adopt a view of society itself 
as supremely creative ([1912] 1961, pp. 482-496). 
At the institutional level, Durkheim helped to direct 
a far more widely diffused preoccupation among 
social scientists with the processes of education 
and socialization. Since it was evident that, even if 
society was in some sense a creative agent, the 
more specialized creative ideologies were not uni- 
formly diffused throughout society, sociological in- 
terest in creativity began to focus on the ways in 
which these ideologies had been institutionalized 
for cultivation by creative elites. As the division of 
labor increased, it seemed as if creative activity 
necessarily became professional activity. At the 
same time the influence of Marxist sociology led 
to an awareness of ideological vested interests that 
might stand in the way of creativity, not only be- 
cause of their possibly uncreative character but 
also because they tended to recruit institutional 
members mainly from the upper and middle 
classes, Some theorists—not all of them Marxists 
—began to maintain that the more organized the 
cultivation of creative ideologies, the more ideo- 
logical succession was likely to be a product of 
conflict between organized ideological interests. 

Deviance and marginality. A variation on the 
theme of ideological conflict that has played a large 
part in the psychological study of creativity is that 
ideological succession is brought about by individ- 
uals who have deviant personality characteristics, 
since only such individuals could be motivated to 
overthrow received ideas. The sociological equiva- 
lent of this is the theory of marginality, which 
holds that “marginal men”—that is, individuals 
who are marginally located in the social structure 
—will, because they have been imperfectly social- 
ized into the dominant value system (and therefore 
have no vested interest in maintaining it), have a 
greater potential for creative achievement than 
those who are more centrally located, The achieve- 
ments of minority group members, especially Jews, 
have often been cited as evidence in support of this 
hypothesis (see, for instance, Veblen 1919). 


But such arguments tend to overlook the discrep- 
ancy between societal values and the values in 
terms of which creative achievement is appraised 
by experts. It is probably true in all complex soci- 
eties that entry to fields where distinction is based 
entirely on achievement is less restricted than 
entry into other fields where financial success is 
more easily come by (compare Toynbee [1934b] 
1962, vol. 2, pp. 209, 217-220). Nevertheless, the 
theory of the marginal man has some value in the 
study of creativity because it raises the question of 
whether the experience of marginality, however 
acquired, may help to strengthen those faculties 
of detachment and self-reliance which some psy- 
chologists believe to characterize creative individ- 
uals (see, for instance, Henle 1962, pp. 45-46; 
Crutchfield 1962, p. 139). There is no good evi- 
dence that socialization into a marginal status, or 
voluntary adoption of such status, is necessarily 
favorable to creativity. On the contrary, everything 
we know of minority group psychology and of the 
psychology of deviance suggests that, for most 
people, marginality tends to produce feelings of 
insecurity and even self-hatred that are more likely 
to find an outlet in compulsive overconformity than 
in creative achievement. But it does seem likely 
that the creative person—for reasons that are not 
yet understood—is an exception to this pattern, in 
the sense that he is able to turn his marginal status, 
whether sought or unsought, to good advantage. 
Biographies of creative individuals suggest that 
marginality is usually a temporary episode in a 
creative career and that when it is sought out and 
cultivated (as Rilke, for instance, sought out the 
absolute solitude of the Chateau de Muzot), this 
is done with a very definite purpose. From a socio- 
logical point of view, the striking fact about such 
careers is the ability of creative individuals to alter- 
nate periods of disaffiliation and solitude with 
periods in which a variety of social roles are sus- 
tained with great effectiveness. 

Factorial theories. The virtue of the societal 
approach to creativity was that, in spite of its 
failure to produce definitive results, it succeeded 
in demonstrating that collectivities can plausibly 
be regarded as creative agents and that their cre- 
ative properties can be analyzed in structural 
terms. The ambivalence of this approach with re- 
gard to the autonomy of the creative individual can 
be traced to a far more general ambivalence in 
sociological thought with regard to the relation of 
the individual and society. In contrast, there is a 
large class of theories that succeed in evading this 
problem because they treat creativity as a purely 
individual property and regard creative achieve- 


ment as a type of individual response to various 
social pressures. The sociological problem of cre- 
ativity, in terms of this approach, is to establish 
correlations between the distribution of creative 
individuals or objects and the presence or absence 
of one or (more usually) a number of social fac- 
tors. It therefore seems appropriate to call this the 
factorial approach. 

Factorial studies of creativity are rarely con- 
cerned with explicating structural relations between 
the social factors that are held to stimulate creative 
achievement, and still less with constructing mod- 
els of the creative process that would relate social 
factors to psychological ones. They do not attempt 
to explain why a creative response should have 
taken a particular form, or whether it need have 
occurred when it did. Because of these and other 
omissions, many factorial studies are open to the 
charge of determinism, since they tend to ignore 
the voluntary element in creative efforts, as well 
as the extent to which the creative individual de- 
fines problems for himself or chooses to deal with 
certain problems rather than others. Factorial the- 
ories also tend to play down the existence of con- 
flict between institutional values. In this sense such 
theories often seem, as Kuhn points out, to “write 
history backward” (1962, p. 137). 


Toynbee’s theories of creativity 


Toynbee’s important historical treatment of cre- 
ativity has often been criticized by social scientists 
on the grounds that it overemphasizes the “great 
man” at the expense of social factors. But this is to 
ignore Toynbee’s conception of society as a complex 
of institutions (vol. 1, pp. 454-455) and his care- 
fully stated view on the relationship between the 
individual and society (vol. 3, pp. 223, 230). The 
difference between Toynbee’s approach and that of 
most contemporary American sociologists is that 
he judges the influence of institutions on creativity 
to be mainly negative (see especially vol. 4, pp. 
133-245, 303-423). In this he can be compared 
with Thomas, who thought that the control exer- 
cised by the group over the individual “tends to 
destroy much more than to construct . . .” (Thomas, 
Social Behavior and Personality, pp. 164-165). 
But Toynbee also regards the communication of 
creative experience and the display of creative 
products as essential parts of the creative process 
(vol, 1, p. 454, note 3; compare vol. 3, pp. 235- 
236). His main contention is that the process also 
requires a withdrawal of potentially creative per- 
sons and minorities from these institutions into a 
socially marginal position where they are set free 
to carry out the creative work. The final stage in 
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the process is reached only after the creative per- 
son or minority has returned to society in order to 
gain acceptance for a creative product (vol. 3, pp. 
248-263, 366-377 ). 

This model of withdrawal and return is also ap- 
plied to whole nations and civilizations as creative 
agents, and it is here that Toynbee’s theory of 
creativity exhibits both dialectical and organicist 
features. It is organicist insofar as civilizations are 
conceived as entities that undergo the natural proc- 
esses of birth, growth, and decay; it is dialectical 
insofar as these processes are conceived as neces- 
sarily determined by cycles of creative withdrawal 
and return (see, for instance, vol. 4, p. 125). Loss 
of command over the environment, whether physi- 
cal or human, is explicitly ruled out as a cause of 
decadence; for Toynbee, it is merely a symptom 
(vol. 4, pp. 39-119). Yet Toynbee never goes as 
far as attributing creative primacy to the social 
structure or to any of its elements; structural ar- 
rangements as such originate nothing (vol. 3, p. 
230). This is because, in Toynbee’s basic model of 
society, a clear line is drawn between those who 
lead—the creative minority—and those who are 
led. Loss of creative power by the leaders is equiva- 
lent to loss of influence and cannot be replaced by 
the use of force; once the majority have withdrawn 
their assent to the leaders’ values, they have in 
effect seceded from society, which then progres- 
sively disintegrates (vol. 1, pp. 187-188; compare 
vol. 4, pp. 5-6). From such a situation a new soci- 
ety can arise only through emergence of a new 
creative minority. 


Need, demand, and support 


Most societal and factorial theories of creativity 
assume that potentially creative individuals are 
stimulated to creative achievement by some com- 
bination of social need, social demand, and social 
or economic support. But the relative importance 
assigned to each of these factors varies greatly, as 
does the social level at which they are held to 
operate. 

Demand and need in Toynbee’s theory. Toyn- 
bee explains the birth of civilization in terms of 
creative societal response to unusually severe en- 
vironmental challenge (vol. 2, p. 18; see also vol. 2, 
passim). But he does not pay much attention to the 
way in which a society sets about recognizing its 
needs, or to the possibility that its perception of 
need—that is, the nature of its societal demands 
—may not accurately reflect the true state of af- 
fairs. On the other hand, it is clear that fulfillment 
of need sufficient to ensure a society's survival is 
only part of what Toynbee means by creativity at 
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this level. Similarly, his application of the term 
“response” to creative individuals and minorities 
often implies that they succeeded in doing far more 
than was required by the situation that “chal- 
lenged” them. 

Ogburn’s theory of invention. Ogburn differs 
greatly from Toynbee, first, in viewing creative po- 
tential simply as the upper end of the normal dis- 
tribution of mental ability and, second, in adopting 
a conception of culture that makes the creative 
individual not so much a leader as a “medium in 
social change” (Ogburn [1926] 1964, p. 43; com- 
pare [1950] 1964, p. 22, note 4). 

Ogburn’s overriding preoccupation with techno- 
logical invention, and his commitment to a cumu- 
lative model of scientific development, gave rise to 
certain anomalies in his treatment of creativity 
that he recognized but was unable to resolve. Other 
social institutions did not appear to evolve in the 
same way or at the same rate as science, and this 
made it difficult to apply the theory of cultural ac- 
cumulation to whole cultures, at least when dealing 
with complex societies. Moreover, different socie- 
ties clearly evolved at different rates—a fact that 
Ogburn could not explain in terms of the accumu- 
lation of purely internal cultural materials, once 
he assumed that all races had the same creative 
potential and that cultural accumulation was the 
prime creative force. 

The social processes adduced by Ogburn in order 
to remove these difficulties have an adventitious 
character. Cultural evolution is said to be explica- 
ble in terms of four factors—invention, accumula- 
tion, diffusion, and adjustment—of which inven- 
tion is the “central” factor, while the others “all 
lead to further invention, but they do more. Each 
is a significant and special process, irrespective of 
its stimulation of new inventions” (Ogburn [1950] 
1964, p. 23, note 6). In dealing with concrete ex- 
amples, Ogburn tends to play these different fac- 
tors off against each other rather than to analyze 
the way in which they are interrelated. For exam- 
ple, he states that the “inventional process” con- 
sists in the “operation of three factors: mental 
ability, demand, and . . . the ‘cultural base: ” But 
then he admits the existence of a class of inven- 
tions that “are made accidentally while working on 
something else; for these, demand did not direct 
the invention. Such are many discoveries in pure 
science. The use of an invention, however, implies 
a demand” (Ogburn [1950] 1964, pp. 23-24). In 
other words, the absence, in a given type of situa- 
tion, of one of Ogburn’s factors does not lead him 
to question his general model of invention but 
merely to change the topic. Elsewhere, Ogburn ex- 


plains the fact that “significant invention” is not 
usually accompanied by change in all other parts 
of society by adducing his celebrated theory of 
cultural lag, which is based on the hypothesis that 
all societies seek through invention to regain the 
equilibrium that invention has upset (ibid., p. 30; 
compare Ogburn 1957). But by Ogburn’s own 
admission his theory of cultural evolution does not 
accord at all with his conception of artistic crea- 
tivity ({1936] 1964, pp. 56-57 )—though it is hard 
to believe, on this evidence, that he ever gave any 
serious attention to the history of art. 

Demand and institutional appraisal. Both Toyn- 
bee and Ogburn tend to overlook the relationship 
between social demand and institutional processes 
such as appraisal. Thus the phenomenon of neg- 
lected genius—a central problem for the sociology 
of creativity—cannot be explained wholly or even 
mainly in terms of lack of demand, as it would 
have to be under Ogburn’s theory, nor yet as a de- 
ficiency of qualities making for successful leader- 
ship, as Toynbee would probably argue. Both of 
these factors are certainly capable of overshadow- 
ing creative achievement, but it must be very sel- 
dom that they obscure it altogether. A far more 
likely explanation in many cases is that, because 
of a breakdown in the system of appraisal (which 
may be associated with a sudden upsurge of new 
creative ideologies after a period of stagnation, or 
with the emergence of a new kind of audience for 
creative products) there is as yet no outlet for a 
certain type of creative object in those social areas 
where a demand for it exists. In any case, it is 
certain that different types of appraisal are linked 
with different types of demand and that appraisal 
systems are highly complex structural entities 
which are rarely examined as such. 

Thus, for the creative individual, the existence 
of a stratified social demand mediated by a sophis- 
ticated appraisal system is likely to intensify the 
conflict between the claims of creative achieve- 
ment and those of public recognition—or, as Rank 
put it, the antagonism between success and fame 
(1932, chapter 6). Little systematic work has been 
done on any of these questions, but notice should 
be taken of Kuhn’s suggestion (1962, p. 69) that 
the breakdown of established paradigms, as evi- 
denced by their increasing failure to solve the 
problems they themselves generate, may have a 
greater effect on the emergence of scientific crea- 
tivity than the more generalized social pressures 
that preoccupied theorists such as Toynbee and 
Ogburn. 

Support. Social demand for creative products 
is so often measured in terms of economic support 


for their creators that these two factors have be- 
come inextricably confused. Yet it is clear that 
although they are for the most part closely associ- 
ated, at least in capitalist societies with a high de- 
gree of literacy, they may vary independently. Thus, 
American concert and opera audiences, possibly 
from a historically conditioned sense of cultural 
inferiority, have always preferred European to 
American composers, even when the latter were of 
outstanding merit. This has not deterred the Ford 
Foundation from subsidizing American opera on 
a lavish scale (Rockefeller Brothers Fund 1965, 
pp. 29-31). Nevertheless, there is no regular audi- 
ence for native contemporary opera in the United 
States, and American composers would probably 
cease to write operas were it not for foundation 
support. The reverse situation—a widespread de- 
mand that goes unfulfilled for lack of support—is 
probably quite rare in American society but can be 
inferred to exist in the Soviet Union, for instance, 
where the prerevolutionary classics of Russian lit- 
erature are bought and read in huge numbers, 
while the output of contemporary Soviet writers 
seems quite insufficient to match this evident de- 
mand for serious reading matter (Friedberg 1962, 
chapters 6 and 7). 

But economic support for cultural activities and 
products is not the same thing as economic sup- 
port for creativity, nor is it possible to prove, simply 
from inspection of the amount a society spends on 
art museums, symphony orchestras, libraries, and 
the like, that the arts of that society are in either 
a creative or an uncreative state. This is the most 
damaging criticism that can be made of otherwise 
useful studies, such as that of Toffler (1964). One 
distinction that would go far to clear up this and 
many other confusions about the relation of crea- 
tivity to support is that between economic and so- 
cial support. The fact that few women have dis- 
tinguished themselves in science or in most of the 
arts except acting and musical performance is 
clear evidence of the lack of social support for 
women to compete in these fields. There are also 
obvious differences in the nature of the social sup- 
port meted out to the same pursuits in different 
cultures. For instance, the amount of social support 
for the creative scientist in the United States dif- 
fers markedly from that for the creative artist—a 
fact that reflects the naively utilitarian conception 
of science that predominates in American society. 
In the Soviet Union, on the other hand, it is the 
official theory of art that is naively utilitarian, and 
the artist who succeeds in accommodating himself 
to this credo enjoys the kind of prestige that, in 
the United States, is more often accorded to scien- 
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tists. Such differences in the ideological basis of 
economic support undoubtedly affect creativity, 
but they have yet to be systematically investigated. 

Patronage systems. Most writers on creativity 
who consider patronage treat it as synonymous 
with economic support. Patrons are usually classi- 
fied according to their social status—for instance, 
aristocratic, as opposed to bourgeois—and frequent 
attempts have been made to relate the rise of this 
or that creative ideology to the growing economic 
prosperity of a social class (see, for instance, 
Hauser [1951] 1957-1958, vol. 2, pp. 26-52). 
Such analyses assume that the form taken by a 
creative product is determined largely by the tastes 
or demands of those who pay for it. 

But even if we grant the assumption that some 
kinds of patronage are especially favorable to crea- 
tivity, it is by no means clear what these are. For 
instance, Italian art patrons during the age of the 
baroque could hardly have been more generous or 
more tolerant, yet the achievements of Italian 
painting during that period were disappointing 
when compared with developments in other parts 
of Europe, in spite of the superior Italian “cultural 
base” for the fine arts. It has been suggested that 
one reason for this decline in creativity may have 
been the extreme tolerance of the patrons: “Un- 
orthodoxy was killed with kindness” (Haskell 1963, 
p. 385). On the other hand, major creative ideolo- 
gies have been propagated successfully under sys- 
tems of patronage that, by all common-sense 
criteria, would seem to have been peculiarly un- 
favorable: one has only to remember William Haz- 
litt’s embittered description of the social conditions 
that attended the birth of the English romantic 
movement ([1825] 1948, pp. 737-738). 

None of the standard explanations of these 
anomalies are very satisfactory. The usual thesis 
is that to maximize social and economic support 
for any institution is to maximize its opportunities 
for creative development. But there is always the 
risk that hypertrophy of an institution may gener- 
ate a flood of mediocrities and a corresponding 
dilution of basic institutional values (see, for in- 
stance, Butler [1962] 1963, pp. 21-22; White & 
White 1965, pp. 2, 44-54). A more systematic 
approach to the problem would be to study patron- 
age systems as institutionalized patterns of support 
in which the social status of patrons and the 
amount of their financial outlays would be only two 
of the many elements to be considered. The essen- 
tial features of this approach are already present 
in Haskell’s analysis of seventeenth-century art 
patronage (Haskell 1963, chapter 1). It is clear 
from Haskell’s account that many different types 
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of patronage, serving many different needs, can 
coexist within the same system, What differen- 
tiated this system from the one that eventually re- 
placed it was not so much the social status of its 
patrons as the set of values, shared by artists and 
patrons alike, relating to the nature of art as an 
institution. From a sociological point of view, it is 
this homogeneity of values that should probably 
be singled out when comparing baroque patronage 
with, for instance, the art market of the late nine- 
teenth century. Clearly, a patronage system in 
which painters and patrons have different concep- 
tions of art as an institution is likely to need 
entrepreneurs who will mediate between supply 
and demand. The forms taken by the mechanisms 
of display and appraisal are also likely to be 
different. 

But these and similar hypotheses cannot be in- 
vestigated systematically in the absence of a his- 
torically sound typology of patronage systems, Four 
such types can be readily identified: the personal- 
ized system described by Haskell, in which there is 
a clear distinction between patronage for public 
and private purposes, though in either case the 
patron is the direct employer of the artist or scien- 
tist; the academy system, which is essentially a 
system of ideological control based on official pa- 
tronage and is the system preferred by authori- 
tarian and communist regimes; the open market 
system, of which American show business is per- 
haps the purest form; and the subvention system, 
in which a variety of patrons, both public and pri- 
vate, underwrite the cost of cultural activities that 
would otherwise cease for lack of demand. 

All of these systems—and probably others, too— 
may coexist in the same society. Thus a society's 
art may be supported by an open market system 
and its science by an academy system. There may 
also be more than one system of support for a 
single institution, as when art is supported by both 
the personalized and the open market systems. 
Under these conditions, the creative individual may 
have a choice of patronage systems, any one of 
which will impose a definite type of demand. It 
may be difficult for him to serve more than one of 
these systems once he has made his initial choice. 
But these and related problems in the sociology of 
creativity have as yet scarcely been identified, let 
alone discussed. 


Empirical studies 


Most empirical studies of creativity are person- 
oriented. The samples on which they are based fall 
into two main categories: those in which the indi- 
viduals studied have been selected on the basis of 
acknowledged creative achievement at some arbi- 


trarily defined level and those in which an entire 
population has been sampled and classified, usually 
on the basis of projective psychological tests 
(though ratings by peers or supervisors are often 
collected as well), into “creative” and “uncreative” 
groups. Samples based on achievement criteria 
have often been institutional in scope; for instance, 
the entire population of “creative” American scien- 
tists is continually being studied (Roe 1963). A 
few attempts, of which Galton’s is still the best- 
known, have been made to study the characteristics 
of creative individuals at different historical periods 
on a national basis (Genetic Studies of Genius, 
vol. 2, 1926; White 1931). Studies using psycho- 
logical measures have usually been conducted at 
the level of formal organizations and of the organ- 
ized work groups that they contain (for a brief re- 
view, see Hughes 1963; for some of the current 
research, see Utah Creativity Research Conference 
1964, parts IV and V; Fiedler 1964). 

Few of the psychological studies examine their 
subjects over any considerable period of time: the 
outstanding exception is the work of Terman and 
his associates, who drew a sample of over 1,400 
school children with IQs of 140 or more and suc- 
cessfully conducted follow-up studies over a period 
of 25 years (see, for instance, Genetic Studies of 
Genius, vol. 4, 1947). Studies using projective tests 
are occasionally made of mature creative indi- 
viduals selected on the basis of acknowledged 
achievement in the arts or sciences (a good ex- 
ample is Barron 1963, chapter 19); such indi- 
viduals are never identified, and it is therefore im- 
possible to make an independent appraisal of their 
creative ideologies. The most interesting achieve- 
ment of the psychological studies so far has been 
the differentiation of intelligence from creative po- 
tential (Getzels & Jackson 1962), and there are 
many indications that these two factors, though 
associated, may vary independently within a cer- 
tain range (Meer & Stein 1955). Studies and auto- 
biographies of creative individuals have, of course, 
been analyzed by social scientists, but Rank’s 
devastating criticism of the psychological approach 
to biographical material is still valid (1932, p. 
150; compare pp. 129-130, 143-144). 

A complete list of studies that fall into the cate- 
gories outlined above would include nearly all the 
social science literature ostensibly concerned with 
creativity. But this list would omit most of the 
studies that have relevance for the sociological ap- 
proach to the topic. Unfortunately, there is as yet 
no agreement on the nature of this relevance, 
though it has been clear to sociologists for some 
time that it exists. In my opinion, one of the main 
obstacles to the inauguration of a sociology of cre- 


ativity is the assumption that all studies of crea- 
tivity must be person-oriented. But the study of 
creativity need not be confined to the individual 
level. Many empirical studies of social movements, 
formal organizations, and small groups deal with 
creative actions and products, even though they 
may not use the word “creativity.” Findings at 
these different levels could be compared with re- 
spect to their definitions of the creative object; the 
amount of attention they pay to institutional 
values, creative ideologies, and outlets for creative 
products; and the ways in which they describe the 
basic mechanisms of ideological succession, dis- 
play, appraisal, and support. But if a mature soci- 
ology of creativity is ever to emerge, far more 
empirical studies will have to be undertaken at the 
institutional and societal levels. 

Societies. Social systems are usually conceived 
as having various kinds of outputs. Within this 
frame of reference any attempt to explain crea- 
tivity at the societal level must be concerned with 
the quality of such outputs. But there is no agree- 
ment about what they are and considerable con- 
fusion about what they might be. Some studies 
treat the output of a single institution, such as art, 
as if it were a societal output and then relate varia- 
tions in the quality of this output to different states 
of the social system. Thus Kavolis has made use of 
the phase-cycle theory of Parsons and Bales in 
order to argue that a society’s artistic creativity is 
likely to increase when social conditions are such 
as to “increase the social utility of art as a sym- 
bolic facility for reintegration, without necessarily 
destroying or massively diverting the resources 
needed for artistic creation” (1966, p. 215). This 
theory is held to be supported by Sorokin’s data on 
the rate of internal disorders in various countries. 
But even if it is granted that the social function of 
art is social integration, there remains the question 
of how institutional values are related to societal 
needs. Highly specialized creative ideologies such 
as artistic styles are clearly not generated by the 
social system acting as a whole, nor does an up- 
surge of creativity in a single institution constitute 
evidence that the entire social system is in a crea- 
tive state. It might be possible to measure the out- 
puts of different institutions on the same scale, but 
this would only obscure the question of how the 
institutions were related to each other. It may be 
that a truly creative state of society would be one 
in which the different institutions stand in rela- 
tions of reciprocity and mutual support. In this 
case, the output of the creative social system would 
be, presumably, the civic morality that made such 
relations possible. Conversely, an uncreative social 
system would be one characterized by chronic and 
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acute disharmony between its institutions (com- 
pare Toynbee [1939] 1962, p. 133). This hypoth- 
esis attributes creativity to social integration, not 
cultural integration, since if each institution had 
as much creative autonomy as was consistent with 
civic morality, considerable heterogeneity of cul- 
tural forms and styles would result. 

Another topic on which more empirical research 
is needed is the relationship between creativity and 
national character. Although the notion of national 
character, generally accepted in David Hume's day 
(see Toynbee, vol. 1, p. 470), has now fallen into 
discredit as a tool of social science, it still appears 
to have some use in literary and art criticism. For 
instance, Nikolaus Pevsner (1956) has argued that 
the English national character, as revealed in Eng- 
lish art and architecture, has undergone a series of 
radical changes: opposing stylistic traits such as 
“moderation” and “fantasy,” which derive from the 
English social heritage, appear in different com- 
binations at different periods without ceasing to 
reflect certain dominant formal qualities. What is 
impressive about Pevsner’s method of analysis, 
which owes much to Wölfflin, is the way in which 
he links aspects of internal ideology (that is, purely 
formal properties such as emphasis on verticals, 
and disembodiment) with aspects of external ide- 
ology (that is, with values such as conservatism, 
which derive from institutions other than art). 
Since Pevsner associates creativity in the arts with 
intensity and even fanaticism, he concludes that 
the diminishing frequency of these two qualities in 
English art since the Reformation shows that Eng- 
land is no longer a “visual nation,” and he attrib- 
utes this decline in artistic creativity to the growth 
of reason and tolerance in English political insti- 
tutions ([1956] 1964, p. 206). Whatever one may 
think of such daring analogies between internal 
and external ideological succession, the facts on 
either side of the analogy are clear enough, and 
the problem of connecting them more systemati- 
cally than Pevsner has done is a genuine one for 
sociology [see STYLE]. 

Institutions. The importance of institutional 
values for the study of creativity has already been 
emphasized. There is no lack of relevant literature 
here; rather, there is a need for the development of 
formal categories in terms of which the internal 
dynamics of ideological succession can be system- 
atically described. The greatest progress in this 
area so far has been made by those art historians 
who have followed the example of Wölfflin, but 
without his bent for dialectics. 

A far more neglected area is institutional struc- 
ture. The concept of institutional structure should 
(but usually does not) include not only the means 
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by which an institution deals with recruitment, 
socialization, and ideological succession but also 
such aspects of its relations with the larger society 
as its sources of support (both social and eco- 
nomic), its outlets, and its provisions for display 
and appraisal. One of the very few sociological 
attempts at meeting all these requirements is a 
study by White and White (1965) of institu- 
tional change in the French art world during the 
nineteenth century. In this study the dimensions 
of painting as an institution are systematically 
mapped out in terms of such parameters as the 
total number of painters and the rate at which they 
produced paintings. The object-oriented ideology 
and centralized formal structure of the academic 
appraisal system are seen as contributing to the 
academy's eventual loss of control over institutional 
values (White & White 1965, pp. 88-89, 100) and 
thus to the successful emergence of impressionism 
as a creative ideology. But the study does not deal 
with the internal ideology of impressionism except 
insofar as it brought a new importance to the previ- 
ously minor genre of landscape painting. Impres- 
sionism, however, represented a far more drastic 
revolution in pictorial values, and it is possible that 
too much stress on institutional continuity may 
serve to obscure this. 

In contrast, Kuhn (1962) stresses the discon- 
tinuity that accompanies radical change in the 
realm of scientific values. The introduction of a 
new paradigm—presumably the most creative feat 
a scientist or group of scientists can accomplish— 
is essentially a noncumulative episode in the his- 
tory of science and represents such a sharp break 
with the established scientific world view that it 
can gain acceptance only after science has entered 
a period of acknowledged crisis. Thus the creative 
genius of a Galileo consists in the introduction of 
an entirely new mode of interpretation that, be- 
cause it enlarges the whole field of scientific per- 
ception, cannot be described in terms of a single 
discovery or invention—rather, it is a “scientific 
revolution” (Kuhn 1962, pp. 118-122). But, as 
Kuhn has emphasized, it is “normal science” that 
makes scientific revolutions possible, since only to 
those who have mastered a particular scientific tra- 
dition does it become evident that the tradition has 
been exhausted ([1959] 1963, pp. 349-350). 

These considerations suggest that neither per- 
sonality factors nor basic abilities have any part 
to play in the higher forms of creative scientific 
achievement unless they are associated with thor- 
ough socialization into a firmly established tra- 
dition. Thus, for Kuhn the most important precon- 
dition of scientific creativity is an institutionalized 


consensus. Again, there are obvious parallels with 
the history of art. For instance, White and White 
have pointed out that the impressionist rejection of 
tradition was made more meaningful by the exist- 
ence of a firmly established tradition to reject 
(1965, pp. 160-161); similarly, Shils (1964, p. 
356) has pointed out that the function of an avant- 
garde in any institution is to reinterpret tradition. 
But if the history of art is to be systematically com- 
pared with the history of science, it will be neces- 
sary to give an account of artistic creativity in 
terms of problem solving and to distinguish be- 
tween the different kinds of artistic problems as 
they are seen from the artist's point of view, just as 
Kuhn has examined creativity in science from the 
scientist's point of view. The creative object in art 
and science is essentially a solution to a series of 
problems; accordingly, the sociological researcher 
will always fail to understand why or in what re- 
spects an object should be considered creative un- 
less he also understands what these problems are 
and how they relate to each other. From an object- 
oriented point of view, both art and science present 
themselves as hierarchically ordered complexes of 
problems—some basic, some less basic, and some 
quite trivial. Since creativity occurs at all these 
levels, the task of appraising creative objects for 
sociological purposes is not so much one of ranking 
them according to degree of creativity as of assign- 
ing to each object its appropriate place in the total 
hierarchy of problems and solutions to problems. 
To say that one creative object is more or less crea- 
tive than another is to imply that it solves prob- 
lems of a higher or a lower order. How this order 
is to be conceived is a question that can be decided 
only with reference to the values which constitute 
the internal ideology of the institution in question. 

Other problem areas that still await investiga- 
tion at the institutional level are the relation be- 
tween ideological succession and patterns of sup- 
port, the effects of technology upon institutional 
outlets, and the systematic comparison of patron- 
age systems, 


Study of creativity at the societal and institu- 
tional levels has traditionally been a historical en- 
terprise. But it is one thing to evaluate a dead 
civilization and quite another to diagnose the state 
of an ongoing civilization of which no one knows 
the outcome. Thus Toynbee’s theories, when ap- 
Plied to the contemporary United States (Toynbee 
1964), issue in conclusions that are suggestive but 
that could be substantiated only by sociological 
analysis of data that are not likely to be available 
for many years to come. A more sophisticated ac: 


count of obstacles to creativity in modern indus- 
trialized society is that of Murphy (1958), who 
offers hope that the rigidity of the cultural base 
can be counterbalanced by deeper understanding 
of human potentialities, This raises a question that 
has escaped most historians, Toynbee included: 
What will be the social effects of the increasing 
psychological knowledge of creativity? Historical 
research has no answers to this question because 
no civilization but the present one has had the 
opportunity to develop such knowledge. Intelli- 
gence and aptitude tests are now so widely used 
that they obviously meet a permanent demand in 
modern society. If a similar demand for creativity 
tests should ever make itself felt, the structure of 
society, especially its present highly inefficient 
mechanisms for ideological succession, might be 
permanently changed. 

J. M. B. EDWARDS 


[Directly related are the entries CULTURE; FILM; FINE 
ARTS; LEADERSHIP; LITERATURE; SCIENCE. Other 
relevant material may be found in Crry, article on 
FORMS AND FUNCTIONS; COMMUNICATION, MASS, 
article on AUDIENCES; GROUPS, article on GROUP BE- 
HAVIOR; ORGANIZATIONS, article on EFFECTIVENESS 
AND PLANNING OF CHANGE; and in the biographies 
of BUBER; COOLEY; DURKHEIM; FREUD; GALTON; 
KROEBER; OGBURN; RANK; SOROKIN; TERMAN; 
THOMAS; WEBER, MAX.] 
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Tit 
GENIUS AND ABILITY 


In a popular nineteenth-century conception, gen- 
iuses were innately gifted men who somehow 
made history. The “great-man” theory, as argued by 
Carlyle and supported by Galton, Lombroso, and 
others, may be stated in the following simplified 
form: Fame is a consequence of genius and can 
seldom be achieved without it, and genius as a 
tule achieves fame, almost regardless of any en- 
vironmental and social conditions. Furthermore, 
men gifted with high abilities overcome virtually 
all obstacles, but men having social advantages 
and lacking in high abilities cannot rise to emi- 
nence. All truly great achievements, in this view, 
are made by the few eminent men of high ability. 

Although some sociologists were briefly influ- 
enced by the above view, the main trend of socio- 
logical thinking ran steadily in a contrary direc- 
tion. C. H. Cooley (1897) published a lengthy and 
effective challenge to the great-man theory, based 
on his examination of lists of distinguished Euro- 
pean men to see if any had risen to eminence de- 
spite the total lack of educational advantages. He 
found no case in which there was not at least some 
formal childhood education. Illiteracy, at least, he 
concluded, would constitute a sufficient barrier to 
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eminence for persons of any degree of innate 
ability. 

Galton (1869) had accounted for exceptional 
flowerings of culture, such as that of Athens be- 
tween 530 B.C. and 430 B.c., in terms of abrupt 
genetic improvement following unplanned selec- 
tion of superior types of immigrants. Cooley (1897) 
explained the same phenomenon in sociological 
rather than biological terms. The operating factors 
included the accumulation of traditions of tech- 
nical skills that developed atmospheres encourag- 
ing valued types of achievement and linked these 
activities to other dominant motivating enthusi- 
asms, such as political and religious interests. 
Cooley did not hold innate differences to be irrele- 
vant, but he did contend that both ability and 
favorable conditions were essential to the flowering 
of human achievement. For the most part the sub- 
sequent sociological tradition has developed in this 
direction of thinking. 


Genius as a sociological concept 


The concept of genius is not widely used in con- 
temporary sociological writings, perhaps because 
of an aversion to popular usage, in which the term 
implies an exaggeration of the abilities and powers 
of men of achievement so that they appear to pos- 
sess hereditary equipment of an order entirely apart 
from that of the rest of the population. The modest, 
but essentially correct, self-deprecation by persons 
commonly reputed to be geniuses is widely over- 
looked or dismissed as only a display of good 
manners. 

Sociological attention to the subjects of creation 
and social change has increasingly tended to be 
fixed mainly on broad social processes. The respon- 
sibility of research into the nature and importance 
of individual abilities is nowadays, for the most 
part, left to the psychologists. Even from the early 
days of their discipline, however, sociological schol- 
ars have warned against giving undue weight to 
the importance of the abilities of individual per- 
sons as a factor bearing on the course of history. 

However, direct attention to the evaluation of 
mental achievements of specific persons is not 
entirely lacking in sociology. In many cases of 
persons nominated by popular acclaim or by judg- 
ment of historians as true geniuses, it is difficult 
or impossible to separate the contribution of men- 
tal ability from that of other personal qualities and 
from factors external to the persons, including good 
luck. There is no justification, for example, for the 
sometimes attractive inference that a victorious 
military leader has, solely by the fact of his tri- 
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umph, qualified himself as a genius; the mental 
powers of Caesar, Napoleon, or Grant cannot be 
usefully estimated from the materials of history. 
The same methodological difficulty applies to the 
assessment of the intelligence, ex officio, of heads 
of governments, leaders of movements, and spec- 
tacularly successful criminals. 

Popular lists of geniuses are heavily contami- 
nated by the arbitrary standards of popular or 
critical approbation. Thus Rembrandt was widely 
held to be a great artist during the age when his 
work was in vogue and was later dismissed by 
some critics as a hack painter. Longfellow seemed 
to be a greater poet in the nineteenth century than 
in the twentieth. Whatever one’s view of aesthetic 
values, it is clear that the future standing of con- 
temporary artists of renown cannot be known in 
their own time; and this is an undeniable handicap 
to empirical research. 

The search for unmistakable instances of genius, 
in the more narrow sense of extraordinary and 
effective mental ability, has led to the examination 
of the mental operations of creative mathemati- 
cians, research scientists, and prolific inventors. 
The results of such inquiries support the theory 
that the admired feats of such persons do not rest 
on exceptional native ability or on special talents 
not possessed by the rest of the population, but can 
be attributed to a variety of factors such as effec- 
tive training or experience, possession of symbolic 
tools that facilitate mental operations, exception- 
ally strong motivation, and sometimes the good 
luck of being at the right time and place for an 
important advancement of knowledge to be made 
(Faris 1940), 

Types of ability. Psychologists have shown 
that tests of ability can usefully be divided into 
separate factors, and it is now generally conceded 
that ability is not a single entity but that there are 
various kinds, each valuable in a different way 
(Goslin 1963; see also Gruber et al. 1962). Ability 
for routine research and invention, effective judg- 
ment and wisdom, strong administrative and lead- 
ership qualities, various performing skills, and 
general creativity—all have seemed to be distin- 
guishable; and expanding attention has recently 
been turned to the development of separate meas- 
ures for the various types. 

Of these, the concept of creativity has in recent 
times aroused a particularly strong research inter- 
est, in the wake of a realization that the conven- 
tional tests of mental ability have failed to identify 
a type of person who seems to be able to make 
distinctly original contributions to knowledge or 
the arts, even though he does not achieve a score 


at the top IQ level. Attempts have been made to 
identify such creativity in schoolchildren and to 
account for its occurrence (Getzels & Jackson 
1962). No clear pattern has emerged so far, how- 
ever, and it is possible that an abundance of ideas 
and an indifference to conventionality are the prin- 
cipal qualities that are assumed to forecast a capac- 
ity for making original and valuable contributions. 
Correlations among tests designed to reveal cre- 
ativity are for the most part low and, in general, 
scarcely higher than the correlation of the cre- 
ativity test scores with IQ. 

Adults who have actually made original contri- 
butions in their professional fields have also been 
studied in order to find out the nature and sources 
of their creativity (MacKinnon 1962). Direct and 
intensive examination of a large number of spe- 
cially selected writers, architects, mathematicians, 
and research workers in science and engineering 
failed to discover a clear and uniform pattern of 
anything that could be unambiguously labeled 
“creativity.” Compared to a more representative 
sample of professionals in their fields, however, 
the persons judged to be creative were in general 
more open to experience, were independent rather 
than either conformist or nonconformist, tended 
toward introversion without restraints or inhibi- 
tion, and were relatively rich in range of thoughts. 
But there were differences among the creative per- 
sons by profession: writers, for example, scored 
high on verbal intelligence measures, and archi- 
tects high on spatial intelligence. 

The concept of creativity thus remains elusive. 
It may turn out that a variety of factors, rather 
than a unitary talent, will account for the observed 
achievements of valuable originality. 


Ability and achievement 


A contribution to the evaluation of the factor of 
genius was made by William F. Ogburn in his 
studies of the role of invention in social change 
(1922). He found that no inventor develops a 
complex device in its entirety; an invention is al- 
ways made by a series of small steps, and the vari- 
ous steps in each machine are made by a number 
of different inventors. It is, therefore, a scientific 
error to designate, as so many uninformed persons 
have done, Fulton, Edison, and Marconi as the 
inventors of the steamship, electric light, and radio, 
respectively. Each of these men contributed only a 
small part of the series of innovative ideas required 
for the above inventions. In the case of the electric 
light, for example, it was already well known that 
current passing through a wire could cause it to 
glow and emit light. Enclosure of the source of 


light in a glass container was already a feature of 
the arc light. Edison’s achievement, admittedly 
worthy of appreciation, required not so much an 
exceptional amount of mental ability but rather an 
unusual degree of optimism, coupled with a per- 
sistence in finding the right materials. In fact, his 
“invention” was achieved far more by trial and 
error than by abstract mental effort. 

Ogburn further found that when the cultural 
base of existing knowledge is ready, so that only 
one small additional step is needed to develop an 
important invention, more than one inventor is 
likely to take that step independently and at about 
the same time. Convincing evidence for this asser- 
tion is provided in an extensive list of simultaneous 
inventions (Ogburn 1922, pp. 90-102). It seems 
probable that similar conclusions apply to scien- 
tists, artists, and innovators of all kinds. The prog- 
ress of civilization does not rest solely on the 
extraordinary contributions of a gifted few but 
more importantly on the uncounted achievements 
of persons whose abilities exhibit almost the entire 
range of human possibilities. 

Civilization, and social organization, further- 
more, may be seen not only as a result of human 
abilities but also as a cause of them. The limits of 
human potentiality for mental achievement cannot 
be measured at present; there are no tests that 
succeed in measuring the utmost mental capacities 
of a person, only tests that estimate crudely what 
he has already achieved. Some contemporary judg- 
ments hold that most, if not all, persons are in fact 
capable of far greater mental performance than 
they ever actually reach (Bruner 1960; Faris 1940). 

Social development of ability. Human powers 
rest not only on the physiological equipment of the 
nervous system and supporting mechanisms (Brain 
1961) but also, and to a large extent, on the way 
in which they become organized in experience. Two 
persons starting with equally efficient brain mech- 
anisms may become greatly unequal in mental 
performance if their life experiences differ im- 
portantly in the various aspects that influence 
mental organization. 

The components of mental ability contributed 
by experience are not completely known, partly 
because the search for them has been delayed by 
the long-prevailing assumption that innate differ- 
ences were of greater significance. It now seems 
fairly well established, however, that through both 
formal education and unique individual experience, 
the latter sometimes accidental in character, per- 
sons may acquire mental tools that contribute effi- 
ciency to their accomplishments by simplifying the 
Operations, in a manner analogous to the simplifi- 
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cation of arithmetic computations through the use 
of tables of logarithms. 

Mental calculators, whose feats have appeared 
spectacularly superior to normal performances, 
achieve their results by the use of mental equip- 
ment known to be obtainable through training. All 
such prodigies have exceptional memory for digits, 
but previously “untalented” persons have been 
trained to equal the experts in this respect. Much 
of mental calculation is simplified by the use of 
tables and other arrangements of numbers held in 
the memory. A contemporary prodigious calculator, 
for example, able to multiply two 10-digit numbers 
in a little over a minute and six pairs of 3-digit 
numbers in nine seconds, is aided by his memory of 
multiplication tables up to 100 times 100 and by 
tables of logarithms to 14 decimal places up to the 
number 150. Specific devices vary, but all mental 
calculators reported in the literature make effective 
use of such memorized aids (Faris 1940). It is 
highly likely that equally prodigious mental feats 
in many other fields are simplified by such acquired 
mental structures. Chess, for example, is perceived 
by a master player in coordinated image patterns 
of a different order from those of a beginner, so 
that an extremely complex series of thoughts by 
the beginner may be handled by the expert through 
use of a single organized concept. 

It also seems probable that man’s most impor- 
tant acquired mental tool is language itself, and 
differential mastery of language may be the great- 
est contributor to ability variation in the general 
population. Size of vocabulary surely contributes to 
thinking powers, as does orderliness and consist- 
ency of language use. Special languages support 
special abilities, in fields as separate as art, music, 
and mathematics and science. An important part 
of the widely known correlation of abilities of par- 
ents and their children is a consequence of differ- 
ential linguistic heritages within families. 

It has also long been recognized that variations 
in amount of reading correlate with school achieve- 
ment and mental-test scores, The speed with which 
a person reads has been found to be related to the 
amount of reading he does, and speed is readily 
improved by training. It is also virtually certain 
that most of the population reads far below its 
potential with respect to both speed and quantity. 
Contemporary Western culture contains various 
factors operating in both directions to influence 
this variable in the population—from schools that 
train persons to read to diverting and distracting 
amusements that cut into the time they might 
have spent in reading [see COMMUNICATION, MASS, 
article on AUDIENCES]. 
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Ability and motivation. Formal education, how- 
ever, provides only part of the actual differentia- 
tion of ability in a population. A variable of possibly 
equal importance is level of aspiration, on which 
recent research is tending to place ever-increasing 
emphasis. Opportunities to learn appear to be 
mainly wasted on persons with low motivation, and 
almost all instances of exceptional mental achieve- 
ment are performed by persons whose aspirations 
are of obsessional strength. 

Contemporary research indicates that aspiration 
levels are affected by the general atmosphere of a 
residential community, by peer groups, and by fam- 
ily traditions and interaction. It may also be sug- 
gested that among the various ways in which 
family interaction contributes to the mental devel- 
opment of children are the degree of richness and 
warmth of relation between parents and children, 
variations in clarity and orderliness of communi- 
cation within the family, how much a child is 
encouraged to take the initiative in talking and 
relating his experiences, the development of early 
familiarity and ease in handling quantities and 
measurements, acquisition of advance motivation 
for reading and school learning, the creation of a 
broad appetite for orientation to the world and 
a hunger for knowledge of all kinds, a delight in 
novel thoughts, the development of a sense of con- 
fidence that answers to questions are not too hard 
to find, and a prejudice against persons and influ- 
ences that would tend to distract from intellectual 
concerns. 


The supply of ability 


To the extent that achieved ability is determined 
by experience, the general intellectual power of a 
society is improvable by conscious design. Research 
in methods of making formal education more effec- 
tive, plus research in ways of improving informal 
influences on ability, promise to give a society the 
power “to lift itself by its bootstraps.” As new levels 
of collective ability are achieved, the natural tend- 
encies of cultural life would tend to maintain the 
new level, barring the occurrence of disorganizing 
influences. 

The spread of formal education in the United 
States, especially at the college level, has been very 
rapid in recent years and gives promise of much 
more to come. Indeed, the country seems to be in 
process of upgrading ability levels throughout the 
population—a process that is likely to continue for 
half a century or more, 

At the lower levels of education, the rate of illit- 
eracy has declined in recent decades, so that the 
small number of illiterates remaining are mainly 
in the older age groups. The average number of 


years of schooling is steadily increasing, and there 
has been a long and accelerating rise in the pro- 
portion of the population graduating from high 
schools and colleges and earning graduate degrees. 
The effects of such increases cannot be fully ex- 
perienced for at least half a century, since a per- 
manent increase in the proportion of young persons 
graduating from colleges at the age of 21, for ex- 
ample, must automatically continue to enrich the 
intellectual level of the population as this genera- 
tion ages all the way to the retirement stage of life, 
replacing year by year, all through the process, the 
older and less educated generations. 

Improving formal education. An additional 
possibility of improvement lies in the enrichment 
of the educational process at all grade levels. Efforts 
are being made to speed the learning processes of 
preschool children as well as children at all other 
educational levels. Mathematical subjects formerly 
thought suitable only for college students are being 
taught in high schools, and a number of concepts 
once thought too difficult are now successfully 
taught in the lower grades. Similar enrichment is 
being tried in science, social studies, languages, 
and literature. Colleges and universities are also in 
a constant process of improvement through new 
instructional methods, teaching machines, honors 
courses, and other efforts. It appears likely that 
graduate education is also becoming ever more 
efficient, not only with the progress of knowledge 
but also with the availability of technical means, 
including computers, that reduce energy-consum- 
ing and drudgery operations. 

There is no precise way to measure the conse- 
quences of such improvements in educational pro- 
cedures, nor any realistic means of estimating 
future gains. There is reason, however, to suppose 
that the possibilities are of important magnitude. 
It has been found in a contemporary inquiry (Bray 
1954) that among children having the advantage 
of “excellent cultural opportunities,” about 25 per 
cent have IQ's of 125 or more—the level presumed 
to be adequate for earning a PH.D. Since only about 
6 per cent of children in the general population 
achieve so high an IQ, it appears likely that an 
extension of similar cultural opportunities to all 
could quadruple the proportion of those with PH.D. 
ability in the population. Since improvements in 
quality of education may, in turn, create future 
cultural opportunities superior to those of the pres- 
ent, even further advance in the ability level would 
appear to be possible. 

American society and human resources. Pres- 
ent trends toward mechanization and more elab- 
orate organization of the economic and political 
order reveal the critical importance of effecting 


such increases in the ability levels in the popula- 
tion. During the early 1960s in the United States, 
a gap between the needed and the existing capac- 
ities of the population was indicated by the rate 
of unemployment among persons of low educa- 
tional level and by an acute shortage of persons 
with high technical skills. In a situation of intense 
political and economic competition among large 
industrialized nations, it is conceivable that prog- 
ress in the sociology of ability could become a 
major factor in determining the outcome of the 
contest. Knowledge, skills, and capacity for inven- 
tion have sometimes been collectively called “hu- 
man capital,” and some economists have suggested 
that it may become a more important factor in 
economic growth than “tangible capital.” Some 
areas of the world are relatively undeveloped, even 
though rich in natural resources, because of severe 
lack of such “human capital.” More significant, 
however, is the principle that great wealth of hu- 
man capital would tend to give any nation relative 
independence of limiting environmental conditions. 


Rosert E. L. FARIS 


[See also ACHIEVEMENT MOTIVATION; ACHIEVEMENT 
TESTING; CAPITAL, HUMAN; EDUCATION; INTELLI- 
GENCE AND INTELLIGENCE TESTING; and the biogra- 
phies of CooLEY; GALTON; OGBURN.] 
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CREDIT 


For consumer credit, see INSTALLMENT CREDIT. 
For a broad treatment of credit, see BANKING; 
FINANCIAL INTERMEDIARIES, 


Raymond J, Saulnier 
Harold G. Halcrow 


I. GOVERNMENT CREDIT 
11. AGRICULTURAL CREDIT 


I 
GOVERNMENT CREDIT 


Programs under which public credit is extended 
to individuals, private businesses, cooperatives, 
and public agencies have grown in most countries 
in such variety and to such size that they must now 
be regarded as a major activity of government. 
Their variety in the United States is such that vir- 
tually every type of program may be found. Be- 
cause of this fact and because information on U.S. 
programs is more extensive than that on programs 
in other countries, government credit programs 
may best be illustrated by arrangements found in 
the United States. 

Classification. U.S. credit programs may be 
classified on several bases, of which the first is 
according to the sector of the economy served. 
These are (i) agriculture, which is perhaps the 
progenitor of most government credit programs; 
(ii) housing; and (iii) business, including foreign 
trade and regional or area development. 

Second, government credit agencies and pro- 
grams may be grouped according to the methods 
they employ to achieve their purposes. There is, 
first, the direct extension of credit, in which funds 
are made available directly to the borrower. Along 
with other agencies, the Small Business Admin- 
istration and the Farmers Home Administration 
have such programs, the former extending inter- 
mediate-term credit directly to qualifying small 
businesses, the latter extending long-term mortgage 
loans to help farmers acquire farm ownership or 
improve farms already owned. Next are the meth- 
ods of loan insurance and loan guarantees, in 
which credit is extended by a private lender under 
full or partial government protection against loss. 
Loan insurance is illustrated by the Federal Hous- 
ing Administration programs under which private 
financing institutions may, at a fee which is in- 
tended to underwrite losses and to make the pro- 
gram self-supporting, obtain full insurance on 
qualifying residential mortgage loans (on single or 
multiunit structures) in which they have invested 
their own funds. Loan guarantees are illustrated 
by the Veterans Administration program under 
which qualifying loans made by private lending 
institutions to veterans to help them buy homes or 
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acquire small businesses are guaranteed against 
loss up to a stated maximum amount without fee 
and without the use of a reserve for meeting losses. 
Under both of these programs the lenders are 
closely limited as to the interest rates and other 
contract terms on which loans may be made. A 
variant of loan insurance is the deferred participa- 
tion arrangement under which a credit agency 
agrees at the option of an original, private lender 
to take up all or part of a loan. Because the option 
can be exercised when a loan is approaching or is 
in default, it is in effect a guarantee against loss. 

A third type of public credit program is the 
secondary market operation, which is illustrated in 
the United States by the Federal National Mortgage 
Association. In this case, the agency stands ready 
to purchase specified types of financial assets (in 
the case of FNMA, only mortgages) from lending 
agencies or other investors who wish to sell them. 
The purpose is to provide a liquid and orderly 
market, which at times can be a function of crucial 
importance. 

A fourth method, which has a supporting effect 
resembling the secondary market approach, is il- 
lustrated by functions served by the Federal Home 
Loan Bank System through its 11 regional banks 
and by the Federal Reserve System through its 12 
regional banks. The former supplies short-term 
credit to member institutions, mainly member sav- 
ings and loan associations in the Home Loan Bank 
System, to meet temporarily heavy mortgage loan 
demands; the latter lends exclusively to member 
commercial banks in the Federal Reserve System 
to meet temporary reserve deficiencies that arise as 
a result of their lending and investing operations. 
[For other functions of the Federal Reserve System, 
see BANKING, CENTRAL.] 

Fifth and finally, there are programs under 
which direct subsidies are made in lieu of the ex- 
tension of credit, although these cannot perhaps 
be regarded strictly as credit programs. An illus- 
tration in the United States is the program under 
which the Public Housing Administration commits 
itself to make annual payments over periods up to 
forty years to local public housing authorities to 
make up any deficiency there may be between the 
cost of servicing obligations issued at the local level 
to finance public housing and the amount of cash 
available for this purpose that remains from rental 
income after meeting other operating costs. This 
has the legal standing of a guarantee by the federal 
government of local public housing authority bonds. 

There is one respect in which U.S. government 
credit programs differ from those found in a num- 
ber of other countries: apart from the now virtu- 


ally abandoned postal savings system and the 
essentially war-emergency-type government savings 
bonds, there never has been any determined effort 
in the United States to collect savings through a 
public agency, and there never has been any mech- 
anism at all for channeling such funds as were 
collected to nongovernment users through a gov- 
ernment credit agency. In many other countries, 
on the other hand, the collection of savings is a 
major activity of government, and savings so col- 
lected are in many cases channeled, at least in 
part, directly to nongovernment purposes, most 
often to the financing of home purchases. This dif- 
ference reflects the fact that the United States is 
amply supplied with mutual or stock savings insti- 
tutions, whereas nongovernment savings-assem- 
bling facilities are inadequate in many countries 
or absent altogether. 

Government credit agencies may be grouped 
also according to their form of organization and 
financing, that is, whether they are (i) wholly pub- 
lic; (ii) quasi-public joint ventures using a com- 
bination of public and private funds; or (iii) agen- 
cies having a relationship with government, such 
as tax exemption or guaranteed access to financial 
help if needed, that justifies their being regarded 
as federally sponsored. The Federal Intermediate 
Credit Banks, which still utilize some federal funds 
along with funds obtained in the open market by 
sale of Intermediate Credit Bank obligations and 
funds that must be subscribed by its borrowers as 
a condition of obtaining credit, are an example of 
the quasi-public type. The Intermediate Credit 
Banks operate through local “production credit as- 
sociations” to provide medium-term financing for 
farmers. The Federal Land Banks, on the other 
hand, which provide long-term farm financing 
through local “national farm loan associations,” 
used federal capital at their inception and for 
many years thereafter but now have retired all of 
this and rely exclusively on open-market borrow- 
ings and obligatory borrower subscriptions of capi- 
tal. However, because Federal Land Banks are still 
supervised by the Farm Credit Administration 
which is an integral part of the Department of 
Agriculture, and because they enjoy certain cost- 
reducing privileges not available to private lending 
institutions, they still must be considered as fed- 
erally sponsored agencies. 

A fourth basis for classifying government credit 
programs is found in the reasons that have led to 
their establishment. In the United States, many 
programs have been launched because it was felt 
that private financial institutions were not provid- 
ing credit in adequate amounts or at rates of inter- 
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Table 1 — U.S. government credit programs in fiscal year 1966 (millions of dollars) 


LOAN FUNDS ESTIMATED 
TO BE DISBURSED 
DURING FISCAL YEAR 1966 


AMOUNT OF LOANS ESTIMATED TO BE 
OUTSTANDING, END OF FISCAL YEAR 1966 


Guaranteed or 


Direct loans insured loans 
Office of Economic Opportunity 52 - 37 
Department of Agriculture: 

Commodity Credit Corporation 1,524 835 2,297 

Rural Electrification Administration 4,270 - 365 

Farmers Home Administration 2,019 1,343 1,036 
Department of Commerce: 

Area Development Administration 202 5 73 

Maritime Administration 86 560 o 
Department of Defense: 

Defense production guarantees 21 55 7 

Military assistance credits 132 126 94 
Department of Health, Education and Welfare: 

Office of Education 772 100 221 

Public Health Service 43 10 25 
Department of Interior: Reclamation loans 108 - 18 
Department of State: 

Loans to United Nations 19 - 0 

Agency for International Development 10,650 565 1,816 
Treasury Department: 

Loans to District of Columbia 192 = 42 

Foreign loans 3,658 - 0 
Housing and Home Finance Agency: 

Community Facilities Administration 2,673 - 434 

Urban Renewal Administration 231 1,856 266 

Federal National Mortgage Association 1,574 1,555 283 

Federal Housing Administration 555 52,762 194 

Public Housing Administration 47 5,641 394 
Veterans Administration 649 29,700 466 
Export-Import Bank of Washington 1,826 3,642 601 
Interstate Commerce Commission 15 208 . 
Small Business Administration 923 122 366 

Total (including small amounts and agencies not listed) 32,743 99,123 9,035 


* Not available, 
Source: Taken from U.S. Bureau of the Budget 1965, pp. 408, 410. 


est and other contract conditions thought to be 
“reasonable.” It perhaps goes without saying that 
what is adequate and reasonable are debatable 
questions. In any case, a “gap” theory has been 
basic to the establishment of financing agencies 
for agriculture, for which private credit is often 
more costly than for industrial and commercial 
borrowers. It is typical also that credit facilities 
are less available to small business than to large 
companies, and a “gap” theory has been important 
here also. Where credit programs are set up on 
this basis, it is frequently required that the bor- 
rower, in order to qualify for assistance, show he 
has tried and failed to obtain credit from private 
sources. 

A related reason for setting up a public credit 
Program is that the government wishes to provide 
assistance to what it regards as an especially im- 


portant activity and one which it believes cannot 
obtain credit in the open market at sufficiently low 
interest cost. This is illustrated in the United States 
by the Rural Electrification Administration, which 
was set up to make low-interest long-term loans 
to cooperatively owned light and power companies. 
Further examples are found in FNMA’s “special 
assistance” programs, under which the agency pur- 
chases at par mortgages which would sell at a 
discount in the open market because of the rel- 
atively low contractual interest rates they typically 
carry. Among types of mortgages so favored are 
those issued to finance college housing, nursing 
homes, housing projects for older people, coopera- 
tively owned housing projects, housing for medium- 
income groups (especially when located in urban- 
renewal areas), and the like. The now inactive 
V-loan program, under which U.S.-guaranteed loans 
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were made during World War 1 to defense con- 
tractors, is a further illustration. 

In some cases, however, the special assistance 
feature is such a prominent element in the pro- 
gram that the activity lacks what may be regarded 
as a true credit quality. Thus, loans made to farm- 
ers by the Commodity Credit Corporation against 
price-supported commodities are used only as a 
substitute for a price or income support that could 
be supplied by other methods. In this case, the 
price-support purpose is so prominent that the 
activity hardly qualifies as a credit program. [See 
AGRICULTURE, article on PRICE AND INCOME POL- 
ICIES.] 

Two other purposes for founding government 
credit agencies may be mentioned: (i) to offset 
recessionary tendencies in the economy and (ii) to 
upgrade standards of private lending practices. The 
Federal Housing Administration illustrates both of 
these. FHA was established in the early 1930s to 
promote home construction and home improve- 
ment, primarily as an anti-recession measure. But 
it has also sought to upgrade lending practices by 
offering insurance only on mortgages that meet 
certain standards of credit quality and only on 
properties constructed to meet specified physical 
standards. 

As would be expected, there has usually been a 
mixture of reasons behind the establishment of 
specific programs, and credit agencies frequently 
follow more than one general approach (that is, 
direct loans or loan insurance) in seeking to achieve 
their purpose. Thus, it is difficult to classify them 
by either of these criteria. Broadly speaking, how- 
ever, they limit their activities to a particular sector 
of the economy—agriculture, housing, or business 
—and accordingly are most often classified in this 
manner. 

Scope and size of U.S. programs. The scope and 
size of U.S. credit programs may be seen in Table 1. 
In addition there are the “quasi-public” credit pro- 
grams presented in Table 2. 

Relative importance of U.S. programs. The rel- 
ative importance of government credit programs 
in the financial system of the United States is not 
easy to measure, but it may be noted that it was 
estimated by the federal government that as of the 
end of June 1964 its outstanding direct loans to 
domestic private borrowers amounted to 2 per cent 
of the then estimated U.S. private debt of $778,000 
million. At the same time, private loans wholly or 
partly guaranteed by federal agencies constituted 
close to 10 per cent of total private debt. The per- 
centages would be much higher, of course, if com- 
parisons were made with privately supplied credit 


Table 2 — Quasi-public U.S. credit programs in fiscal 
year 1966 (in millions of dollars) 
AMOUNT OF LOANS ESTIMATED 


TO BE OUTSTANDING, END OF 
FISCAL YEAR 1966 


Farm Credit Administration: 


Banks for Cooperatives 887 
Federal Intermediate Credit Banks 2,990 
Federal Land Banks 4,371 


Federal Home Loan Bank Board: 


Federal Home Loan Banks 4,600 
Housing and Home Finance Administration: 

Federal National Mortgage Association 

(Secondary market operations trust fund) 2,233 
Veterans Administration: 

National Service life insurance fund 573 

U.S. government life insurance fund 84 


Source: U.S, Bureau of the Budget 1965, p. 414. 


of a type comparing closely with credits supplied 
or insured by the several federal programs. 

The importance of government credit programs 
is suggested also by their relative weight in the 
federal budget, although this is not readily seen in 
the United States because of the way credit pro- 
grams appear in budget accounts. In the U.S. 
budget, repayments of loans are netted against dis- 
bursements of loan funds, with only the difference, 
which can be a minus quantity, counted as an ad- 
ministrative budget expenditure. However, in the 
fiscal year 1965 disbursements under direct loan 
programs (thus excluding loan insurance and guar- 
antee programs) were close to 10 per cent of net 
administrative budget expenditures. 

Sales by government agencies of accumulated 
loans and investments further complicate meas- 
urement of the budgetary and economic impact of 
federal credit programs. As indicated in Table 1, 
the federal government estimated it would hold 
something over $30,000 million of loans at the end 
of fiscal year 1966. At the same time, it planned 
to sell something over $3,000 million of loans dur- 
ing that fiscal year. Since sales of loans are counted 
as a repayment, they tend to shrink budgetary 
deficits or expand budgetary surpluses. And if there 
is a deficit in the budget, by shrinking this below 
what it would otherwise be, the credit programs 
appear to reduce the direct burden which federal 
finances place on capital markets. But the sale of 
loans is also a burden on the capital market, S0 
that the total market effect must be regarded as the 
amount of reported deficit plus sales of loan assets. 

Credit programs in other countries than US. 
Government credit institutions similar in point of 
function and method of operation to those found 
in the United States are present in virtually all 
countries of the world. Major reliance tends to be 


placed, however, on direct lending programs. To 
cite examples in the field of agriculture: Britain 
has its Agricultural Mortgage Corporation; Ireland 
has its Agricultural Corporation of Eire (for gen- 
eral agricultural financing and for financing co- 
operatives) and has departments for rural credits 
and for mortgage banking in the Commonwealth 
Bank of Ireland; South Africa supplies similar serv- 
ices through the Land and Agricultural Bank of 
South Africa, established in 1912; Egypt has its 
Crédit Agricole d'Égypte, established in 1930; and 
India has its government-sponsored financing 
cooperatives. 

It would be a mistake, however, to conclude that 
public institutions are in all cases the major or 
even a principal source of rural and agricultural 
credits. Thus, the All-India Rural Credits Survey 
showed that in 1960 only 7 per cent of all borrow- 
ings came from government cooperatives and com- 
mercial banks, whereas 70 per cent came from 
local lenders (mainly individuals) and 23 per cent 
from relatives, traders, and other private agencies. 
In Denmark, to cite the case of a capital-intensive 
agriculture in a developed economy, only about 
8 per cent of funds used in agriculture in 1954 
came directly from government sources. In the 
more developed economies, greater emphasis is 
usually placed on loan insurance or guarantees 
than on direct extension of credit. 

The picture is somewhat different with regard 
to the financing of residential construction. It has 
been estimated that more than half of new resi- 
dential construction carried out in recent years in 
western Europe received some form of government 
assistance, ranging to as much as 98 per cent in 
the Netherlands. At least the nucleus of a national 
home and construction finance corporation exists 
in practically every Asian country (for example, 
Pakistan has its Building Finance Corporation; 
Ceylon its House Loan Board; Burma its Town and 
Country Development Board; Japan its House Loan 
Corporation, etc.). In some cases these are allied 
to, or are departments of, government agencies 
that collect savings through a widespread system 
of branches. Naturally, the use of such agencies is 
especially extensive where interest rates and equity 
requirements in home financing are high and credit 
availability is limited, reflecting a shortage of 
Savings. 

Finally, there has been an extensive develop- 
ment recently of government agencies devoted pri- 
marily to financing production facilities and infra- 
Structure under national development plans. In 
industrialized countries these agencies are designed 
Primarily to assist small business or business in 


CREDIT: Government Credit 465 


areas affected by abnormally high levels of unem- 
ployment. In less developed countries, “develop- 
ment companies,” as these agencies are frequently 
termed, have a broader function, reflecting a more 
general lack of credit facilities. Among these are 
the Istituto Mobiliere Italiano (IMI); ETIBANK in 
Turkey; the Corporacién de Fomento de la Pro- 
ducción in Chile; and the Pakistan Industrial Credit 
and Investment Corporation (PICIC), to name a 
few. Private capital plays a role in many of these, 
but some represent entirely public efforts to chan- 
nel resources into development projects. [See CAPI- 
TAL, SOCIAL OVERHEAD; DEVELOPMENT BANKS. | 

Policy questions. There are a number of im- 
portant policy questions involved in government 
credit programs which cannot be examined here. 
Among these are the following: 

What are the relative advantages of direct lend- 
ing, insurance, and guarantee of privately extended 
credits as ways of achieving program purposes? 

On what interest rate and other loan contract 
terms should government credit programs be ex- 
tended? On what terms should private credits be 
insured or guaranteed? 

What standards and policies should be followed 
in the relationship between private and public 
credit programs? 

Under what conditions should government credit 
programs be self-supporting? 

How should these programs be coordinated with 
general monetary and fiscal policies for promoting 
the economy's growth and stability? 


RAYMOND J, SAULNIER 
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LL 
AGRICULTURAL CREDIT 


Agricultural credit is the term applied to funds 
borrowed by individuals, farm businesses, and 
others for use in producing, storing, processing, 
and marketing crops and livestock products. Mod- 
ern farming requires increasingly large amounts 
of capital, and credit is one way of supplying this 
capital. Usually the borrowers are farmers, but 
they can also be other agricultural producers, such 
as ranchers in the western United States, graziers 
in Australia, cooperative farming associations, farm 
partnerships, and corporations. Sometimes the term 
includes funds extended to agricultural marketing 
firms and to those supplying services to agriculture, 
such as machinery, equipment, feed, fertilizer, and 
seed companies. 

Those supplying agricultural credit include indi- 
viduals, commercial and savings banks, cooperative 
lending associations, government and government- 
sponsored credit agencies, life insurance com- 
panies, mortgage and loan associations, farm 
service companies, and agricultural marketing as- 
sociations, including cooperatives. In general, both 
commercial credit institutions and government- 
sponsored programs are most active where indus- 
trial economies are advanced and agriculture is 
highly commercialized. Direct government aid pro- 
grams usually originate for specific purposes, such 
as aid to low-income groups, reclamation and de- 
velopment, permanent improvements, or help in 
case of disaster, 

In contrast, where economies are not highly de- 
veloped, a noninstitutionalized type of lending by 
merchants, local moneylenders, and landlords 
tends to predominate. Although in recent years gov- 
ernments have sometimes assisted by lending di- 
rectly for specific purposes, in most underdeveloped 
economies agricultural credit is not efficiently or- 
ganized, resulting in an almost static condition in 
agriculture. As a consequence, during the past 
quarter century the gap in per capita food produc- 
tion between the highly commercialized and the 
underdeveloped nations has widened considerably. 
In many instances, there has been no net increase 
in land productivity, income, or net assets. Under 


most conditions soil productivity remains low, risks 
are great, interest rates are excessive, income is 
low, and farmers remain poor indefinitely. Either 
the farmer is deeply in debt or he works for others 
as a tenant or hired hand. 

In most underdeveloped countries, where the 
problem is broad, the solution may require land 
reform and other drastic changes in the social and 
economic order. Nevertheless, agricultural credit 
can help provide conditions under which crop 
yields can be increased and farmers can get a 
larger net return from the sale of farm products. 
The small size of many farms adds to the per unit 
cost of obtaining credit and creates a problem for 
farmer and banker alike. Agricultural credit is not 
the sole means of providing needed increases in 
food production, but it is an important element. 

The importance of credit cannot be judged in 
isolation or by its amount. Land may be leased or 
rented as an alternative to ownership financed 
through credit. In commercialized agriculture the 
total debt may be only a small fraction of the re- 
sources used in agriculture. In the mid-1960s in 
the United States, for example, agricultural debt, 
or the aggregate of agricultural credit outstanding, 
was only one-seventh of total assets. Yet the fact is 
that agriculture has made great progress through 
expanded use of credit, which has served as a 
catalyst in resource development. 

Types and sources of credit. If use of credit in 
agriculture is to be effective, it must be adapted to 
the peculiar needs of farming. In commercial enter- 
prises, some loans must be made for longer periods 
than usual, and the repayment schedule must be 
flexible enough to allow for variations and uncer- 
tainties in farm income. Generally three types of 
credit are required: short-term financing to cover 
operating and family living costs between cropping 
seasons; intermediate-term credit to finance ma- 
chinery, livestock, and improvements; and long- 
term credit for land and major improvements. 

The need for short-term credit arises from the 
fact that, except on well-diversified farms that sell 
crops or livestock throughout the year, farm income 
fluctuates from season to season, and swings in 
prices or crop conditions sometimes cause addi- 
tional variation in income. This uncertainty, com- 
bined with the typically small amount of reserve or 
working capital held by farmers, creates a demand 
for short-term credit. Such credit is used to pay 
family living expenses and to buy goods and ma- 
terials used in production, such as feed, fertilizer, 
seed, and tractor fuel. 

The demand for intermediate-term credit, COV- 
ering one to five years, arises from the need to 


finance farm improvements and to buy equipment. 
Modern farming requires substantial amounts of 
capital for these purposes, often running into tens 
of thousands of dollars for a single family-type 
farm. 

Long-term credit, extending over six years or 
more, is used to finance or refinance the purchase 
of farm real-estate, to construct farm buildings, 
and to make major improvements. Long-term 
credit often extends over 20 or 25 years and, in 
some countries, 50 to 75 years, Mortgage security 
is usually required. In recent years amortization 
systems have permitted repayment of loans in regu- 
lar annual, semiannual, quarterly, or monthly in- 
stallments. 

In almost every country the first steps in devel- 
oping an agricultural credit system have been taken 
by individuals and other private lenders. In many 
countries, cooperatives and credit unions designed 
to provide credit on more favorable terms have fol- 
lowed, Government has entered the field, either as 
a direct lender or as a sponsor of cooperative credit 
agencies, to provide more adequate sources of 
credit and to standardize the terms under which 
credit is extended. In many advanced econo- 
mies, credit cooperatives and government-sponsored 
agencies provide more varied and complete services 
to agriculture than are available to any other sector. 
Government-sponsored agencies have played a more 
significant role in agriculture than in any other 
major segment of the economy. Thus, credit sys- 
tems range from private through completely coop- 
erative forms to government lending and loan 
insurance, with several systems often operating 
concurrently. 

Government-sponsored cooperative systems. 
Credit cooperatives are usually sponsored by gov- 
ernment through laws and administrations estab- 
lished for their operation. Usually government funds 
are appropriated to purchase capital stock and to 
cover some of the costs of administration. Once 
established, the government-sponsored agency 
tends to become self-supporting, and at least part, 
if not all, of the funds advanced by the government 
are usually repaid. Management may be directed in 
part by government officials. Boards of directors for 
cooperatively organized banks and loan associations 
are usually named by member borrowers and stock- 
holders, 

In most countries of western Europe, local co- 
operative societies belong to regional and state 
organizations that are part of a national system. 
Extension of credit is subject to national regulation, 
but management decisions are in the hands of local 
Officers responsible to the members. Usually the 
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local societies are small, serving a village or a single 
community. Credit operations are sometimes com- 
bined with such cooperative activities as purchas: 
ing, processing, marketing, and other services. 

Other sources of credit supplement the govern- 
ment-sponsored cooperative agencies. For example, 
Belgium, Sweden, Italy, the Netherlands, Switzer- 
land, Denmark, and West Germany each have a 
system of savings banks that loan to farmers, In 
some instances obligations are guaranteed by local 
government units. 

Government-managed systems. In a number of 
countries agricultural credit agencies are associated 
with the government through a system of central 
management. Sometimes the agency depends on 
the government for financing; in other cases a co- 
operative program is maintained and the govern- 
ment role is restricted to management. France, 
Italy, Japan, and Turkey provide examples of gov- 
ernment-managed credit systems. 

In France, the Caisses de Crédit Agricole Mutuel 
are government-managed cooperative societies that 
lend about three-fifths of the credit extended to 
agriculture. The organization includes La Caisse 
Nationale de Crédit Agricole, which enjoys finan- 
cial autonomy and is administered by a plenary 
commission or board; the Caisses Régionales, which 
are private lending organizations serving a region; 
and the Caisses Locales, which are subordinate to 
the Caisses Régionales and lend to local areas or 
communities. La Caisse Nationale is a public body 
that coordinates and controls the Caisses Région- 
ales and helps them by making advances on loans 
that they cannot provide from their own resources. 
The Caisses Régionales in turn help the Caisses 
Locales and serve as correspondent banks on large 
loans. Both the regional and the local banks are 
cooperatives with generally limited liability. 

Italy has a central farm-credit organization run 
by the state. There are also local lending institu- 
tions—savings banks, credit banks, people’s co- 
operative banks, rural and artisan banks, and agri- 
cultural syndicates. Special regional or interregional 
institutes (sometimes with local correspondents), 
under control of the state, coordinate and direct 
agricultural credit in each region. Above the re- 
gional institutes is the national central authority, 
the Consorzio Nationale per il Credito Agraria, 
which, along with the regional institutes, operates 
throughout Italy, making farm improvement loans 
that are subsidized in part by the Ministry of Agri- 
culture and Forestry. Farm operating loans, usually 
short-term, are made by the local banks and rural 
cooperative banks. 

Japan has a cooperative agricultural credit struc- 
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ture and an agricultural, forestry, and fisheries 
finance corporation. The corporation is an inde- 
pendent government institution entirely capitalized 
by the government, existing for the purpose of ex- 
tending long-term, low-interest loans to farmers, 
forestry men, and fishermen and their associations, 
where other credit is difficult to obtain. The corpo- 
ration has no local representatives but operates 
through the branches of the Central Cooperative 
Bank for Agriculture and Forestry, the prefectural 
credit federations, and local commercial banks. The 
Japanese cooperative system includes village co- 
operative unions, a federation grouping by districts, 
and the Central Cooperative Bank. This bank is 
organized as a cooperative society with limited lia- 
bility and with an administrative council, largely 
responsible to the government, which appoints the 
president, vice-president, 19 governors, and three 
monitors. The affiliated cooperatives have a coun- 
cil of delegates elected by holders of the bank stock. 

Turkey finances about three-fourths of the loans 
made to farmers through an agricultural bank, a 
government-owned institution with branches and 
agencies serving all rural areas. Cooperatives, the 
second most important source of credit, borrow 
from the bank to finance their operations. 

Direct government systems. In many underde- 
veloped countries and in some countries where 
commercial banks or cooperative lending associa- 
tions have been slow to develop, the government 
has established a direct system of lending to farm- 
ers and others in agriculture. Direct lending through 
state-controlled credit agencies is also sometimes 
undertaken for special purposes, such as develop- 
ment of new lands, promotion of a particular kind 
of agriculture, expansion of output of certain com- 
modities, aid to farmers suffering from adverse 
weather or low prices, and help to particular classes 
of the farm population who are unable to obtain 
credit from other sources. Although some financing 
may be available from private sources, government 
has provided credit in many of these cases to sup- 
plement these sources and to provide a larger aggre- 
gate supply, generally at reduced cost, to borrowers. 
Many nations with highly developed economies and 
well-developed cooperative credit systems, such as 
the United States and some of the western Eu- 
ropean countries, have also created government 
agencies for direct lending for one or more of these 
purposes. More illustrations are found throughout 
Latin America, Canada, and the Near East, 

The United States example. The United States 
offers the best example of an agricultural credit 
system in which nearly every known method of 
credit is utilized. Although both government-spon- 


sored agencies and direct lending are important, 
major reliance is on credit supplied by private 
sources. Of the total farm mortgage debt of $21,213 
million reported and estimated to be outstanding 
on January 1, 1966, about 40 per cent was held by 
individuals and private businesses, 23 per cent by 
life insurance companies, 20 per cent by federal 
land banks, 14 per cent by commercial and savings 
banks, and 3 per cent by the Farmers Home Admin- 
istration. On the same date non-real-estate credit, 
including both short-term and intermediate-term, 
totaled $18,913 million. Of this amount, about 41 
per cent was held by commercial banks and about 
14 per cent by production-credit associations and 
federal intermediate credit banks. Farmers Home 
Administration direct loans were about 4 per cent 
of the total, The balance, or 41 per cent, was an 
estimate of loans and credits extended by dealers, 
merchants, finance companies, individuals, and 
others. Not included in the above were loans of the 
banks for cooperatives ($1,055 million) and of the 
Rural Electrification Administration ($3,044 mil- 
lion). 

Federal credit services to agriculture, which 
compete directly with commercial banks, insurance 
companies, and other private lenders, including 
individuals, are more varied and complete than in 
any other sector of the economy. Mortgage loans 
and short-term production credit for farmers are 
supplied through two separate systems of coopera- 
tively organized district banks and local lending 
associations. These are grouped with the banks for 
cooperatives under the Farm Credit Administration 
to form a complete system of farm credit in which 
the government-sponsored agencies help to coordi- 
nate activities of local cooperative lending associ- 
ations, 

The cooperative farm-credit system. For many 
years prior to the close of the nineteenth century 
and in the early part of the twentieth, United States 
farmers had agitated for a system that would pro- 
vide agricultural credit on more favorable terms 
and more reliably than was then provided by pri- 
vate lenders. In many parts of the country interest 
rates were high, varying from 6 or 7 per cent up to 
11 or 12 per cent per year. Also, long-term loans 
were unusual, and farmers had to depend on the 
willingness and ability of lenders to renew their 
notes. Uncertainties in the banking system in re- 
spect to renewal added to their difficulty. 

In 1916, after several years of study, the U.S. 
Congress passed the Federal Farm Loan Act, creat- 
ing two kinds of banks: a system of joint-stock land 
banks, which were privately incorporated and 
financed, and federal land banks, whose original 


capital was furnished mostly by the federal govern- 
ment and which were made subject to close govern- 
ment direction and supervision. The joint-stock 
land banks flourished for a short time, but wide- 
spread defaults on their loans caused them to be 
placed in liquidation in 1933. Twelve federal land 
banks, one in each of the Federal Reserve districts 
of the United States, became the foundation for the 
cooperative long-term farm-mortgage system. 

In the postwar agricultural depression of the 
1920s, it became evident that farmers also desired 
a more uniform system of short-term credit. The 
Agricultural Credits Act of 1923 established 12 in- 
termediate credit banks. The $60 million of original 
capital, $5 million for each bank, was subscribed 
by the government and is still in use. The initial 
purpose was to liberalize short-term farm-produc- 
tion credit by making loans to commercial banks, 
agricultural credit corporations, and other agencies 
lending to farmers. This purpose was largely unful- 
filled until 1933, when Congress provided for a 
nationwide system of cooperative production-credit 
associations to make loans directly to farmers. 

The third major part of the cooperative farm- 
credit system is the banks for cooperatives. There 
are 13 banks, one in each of the Federal Reserve 
districts and a Central Bank for Cooperatives in 
Washington. 

The service and experience of the farm credit 
system can be grouped roughly into three periods, 
1917 to 1932, 1932 to 1940, and 1941 to date. 
During the first period the activity of the federal 
land banks was limited to serving a minor part of 
the total farm-mortgage market. By the end of 1932 
farm-mortgage holdings of the federal land banks 
totaled only $1,147 million, scarcely more than one- 
eighth of the total farm mortgages outstanding, and 
the joint-stock land banks, already in financial 
trouble, held less than half as much. The intermedi- 
ate credit banks were not a significant factor, with 
less than $80 million loans outstanding. Neither 
of these banking systems was playing a major role 
in agricultural credit. As they were then capitalized 
and organized, they were not strong enough to 
do so. 

As a means of combating depression and aiding 
agriculture, Congress in 1932 appropriated $125 
million capital for the federal land banks and in 
1933 about $189 million as a subscription to paid- 
in surplus to enable the land banks to make exten- 
sions and deferments of defaulted farm loans. 
Appraisal and lending standards were made more 
liberal and flexible. The federal land banks, lending 
through local cooperative associations, greatly in- 
creased their activities, especially from 1933 
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through 1936. By the end of 1940 the federal land 
banks and the Federal Farm Mortgage Corporation, 
which was affiliated with the banks, held more 
than 40 per cent of the total farm mortgages out- 
standing. 

The importance of federal farm credit during the 
1930s can scarcely be overemphasized. In deep de- 
pression it was used to stabilize the farm real-estate 
market and to liquidate loans of lenders, who were 
often in distress. It provided financing for farmers 
who otherwise could not have survived financially. 
Interest rates on land-bank loans were kept rela- 
tively low, about 4 to 41⁄2 per cent per year. 

From 1941 onward the federal land banks played 
a more passive role in the farm-mortgage market, 
dropping by the end of World War 11 to about one- 
fourth of total farm-mortgage debt outstanding and 
by 1954 to less than one-fifth, at which point the 
shares of the market tended to stabilize. 

The experience of major groups of farm-mort- 
gage lenders is remarkably similar, on the average, 
over the years since the beginning of the farm 
credit system. During the 1920s life insurance 
companies and commercial banks suffered heavy 
defaults and delinquencies. Until the end of the 
1920s the land banks had a very small number of 
their mortgages delinquent, but by 1932 about half 
of them were delinquent or extended. The 1930s 
were a time of heavy delinquency for all bankers. 
Foreclosures and bankruptcies would have been 
much more numerous had not the-federal govern- 
ment expanded farm-mortgage lending. Since 1940 
almost all lenders have had very low losses on 
farm-mortgage loans, as rising real-estate values 
have consistently allowed those who were in finan- 
cial difficulty to liquidate through sale of their 
property. 

Non-real-estate lending of the farm credit system 
grew slowly during the 1930s but has shown a 
strong and steady growth since that time. Produc- 
tion-credit associations (PCA’s) make production 
loans for periods of thirty days to five years, with 
interest rates generally around 5 to 6% per cent. 
The system is a government-sponsored cooperative, 
with stock in local associations held by member- 
borrowers, who are required to invest 5 per cent of 
a loan in capital stock. The PCA’s provide services 
comparable to those of commercial banks, and their 
rates and loss experience have been about the same 
over similar periods as those of commercial banks. 

Banks for cooperatives, created in 1933, make 
loans to eligible farmer cooperatives engaged in 
marketing agricultural products, purchasing farm 
supplies, and furnishing farm business services. 
Facility loans for constructing and acquiring build- 
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ings and equipment are secured by first mortgages 
and are limited to 60 per cent of the value of the 
property and a twenty-year term, Commodity loans 
are secured by first liens on storable commodities, 
warehouse receipts, and other title documents rep- 
resenting agricultural products and supplies. Oper- 
ating loans are made for short periods and may or 
may not be secured. The banks for cooperatives, 
like other agricultural lenders, suffered losses dur- 
ing the 1930s; since then, however, they have had 
few losses, and cumulative profit has easily covered 
the cost of government capital originally invested. 

Farmers Home Administration. Direct govern- 
ment lending to agriculture in the United States has 
in recent years been confined largely to the Farm- 
ers Home Administration. In 1918 and for many 
years afterward, the Congress appropriated money 
to aid financially distressed farmers who had ex- 
perienced drought or other natural disaster and 
who were too poor to obtain credit from other 
sources. In 1935 establishment of the Resettlement 
Administration combined in one agency various 
loan and aid programs for relief of low-income farm 
families. In 1937 this program was transferred, 
under the name of the Farm Security Administra- 
tion, to the Department of Agriculture. In 1946 the 
Farmers Home Administration was established to 
combine all programs of emergency loans for oper- 
ation and production, loans to disaster areas, and 
crop, drought relief, and seed loans. 

Real-estate loans made by the Farmers Home 
Administration include farm ownership loans, rural 
housing loans, and small amounts in soil and 
water conservation loans. The organization also in- 
sures loans made by other bankers for farm owner- 
ship. A new rural housing program was begun in 
1949. Since about 1961 the amount of loans made 
for rural housing has greatly exceeded the amount 
made for farm ownership and about equals the 
amount of farm ownership loans insured. 

Non-real-estate loans are made chiefly for oper- 
ating purposes, are generally shorter in term than 
the others, and tend to be concentrated in areas 
where economic conditions are unfavorable. 

The experience of the U.S. government in direct 
lending varies from high losses on emergency 
disaster-type loans to results in real-estate lending 
much the same as those of commercial lenders. At 
present the major net expense is administrative 
costs, which have in recent years run $15 to $20 
million a year. Losses during the 1930s were much 
higher. Results indicate that even the direct gov- 
ernment farm-mortgage lending that involves some 
degree of emergency can be successful if loans are 


based on the potential earning capacity of the farm 
property. 

Rural Electrification Administration. The Rural 
Electrification Administration (REA) was estab- 
lished in 1935 to provide credit to cooperative as- 
sociations and others engaged in the generation 
and distribution of electric power. The policy of the 
federal government has been to provide loan funds 
to the REA at relatively low cost. From 1935 to 
1944 the REA borrowed from the Reconstruction 
Finance Corporation (RFC) at an interest rate of 
3 per cent. The REA relationship with the RFC was 
discontinued in 1947, and loan funds were there- 
after borrowed from the U.S. Treasury, By memo- 
randum of agreement the interest rate was set at 
the average rate paid on all marketable securities, 
with a maximum of 2 per cent. The rate of interest 
paid by REA borrowers thus has been generally 
lower than rates paid by other public utility firms. 

Loan funds advanced by the REA from its incep- 
tion in 1935 to 1963 were about $4,800 million. 
Total outstanding on January 1, 1963, was $3,800 
million. Losses have been negligible, and the main 
cost to government is money appropriated for ad- 
ministration, which totaled $154.8 million from 
1935 to 1963, and a somewhat smaller amount to 
subsidize the low-interest-rate borrowing. 

The effect of the REA program is far-reaching. 
When the program began in 1935, about 11 per 
cent of the farms in the United States were served 
with central-station electricity; by 1953 about nine 
out of ten farms were being so served; and by 1963 
an estimated 97.6 per cent of all farms in the 
United States were included. About one-half of 
these were customers of REA borrowers. The REA 
policy has been to provide “area coverage,” that is, 
to extend electrical facilities into fringe areas and 
to supply all customers in a given area. Conse- 
quently, a large part of the clientele served by the 
REA is nonfarm, and this is a major source of re- 
cent and potential growth. Other power companies 
have probably developed more rapidly as a result 
of REA competition, and rural electrification has 
proceeded much more rapidly than would have 
been the case without this government program. 
Additional investment as the result of power avail- 
ability is probably on the order of $5,000 to $10,000 
million in the farm sector and possibly as much as 
$3,000 million elsewhere. 


Agricultural credit covers a wide variety of situa- 
tions and needs. It is provided by three general 
types of lenders: individuals and commercial 
agencies, government-sponsored cooperatives, and 


government agencies which make direct loans. 
How much credit is needed or should be provided 
and which system is best are matters of social ob- 
jectives and judgment. Certainly, however, agri- 
cultural credit will continue to play a crucial role 
in agricultural and economic development. 


Haroip G. HALCROW 


[See also AGRICULTURE, particularly the articles on 
CAPITAL and DEVELOPING COUNTRIES; LAND TENURE, 
article on LAND REFORM.] 
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I 
CAUSES OF CRIME 


The contemporary literature on crime causation 
theory is closely linked with the more general liter- 
ature in anthropology, psychiatry, social psychol- 
ogy, and sociology. Since criminal acts and crime 
rates are similar to other acts and rates studied by 
social scientists, the alliance of criminology with 
more general scientific disciplines is not surprising. 
Although some of the scholars specializing in the 
study of crime and criminals (criminologists) are 
concerned with penal legislation, with the sociology 
of the criminal law, or with simple fact finding, the 
Majority of them are directly or indirectly con- 
cerned with crime causation. 

This concern has been expressed in two prin- 
cipal forms. First, criminologists have studied the 
processes by which persons become criminals and 
have developed theories of criminality. Second, 
criminologists have studied the relation of crime 
rates to variations in culture and social organiza- 
tion and have developed theories of crime. The 
theories of criminality are theories about social 
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learning and personality development and thus are 
necessarily related to more general theories about 
these phenomena. Theories about the distribution 
of crime rates are theories about social systems 
and, consequently, are closely related to more gen- 
eral theories of social structure. 

Although theories of criminality are sometimes 
considered psychological theories, whereas theories 
about crime rates are viewed as sociological, it is 
clear that sociologists have contributed significant 
theories of criminality as well as significant theories 
regarding the distribution of crime rates. Accord- 
ingly, there is no distinct division of labor between 
the sociologist—criminologist and others. The the- 
oretical work of criminologists has developed most 
significantly in the last half century. Although 
there were some happy exceptions, little systematic 
theoretical research on criminality or crime was 
conducted until the twentieth century, and most of 
the books written on these subjects prior to that 
century were writien by persons outside academic 
circles, such as theologians, physicians, and re- 
formers. Most of these writers were attempting to 
find a panacea for criminal behavior, and they often 
merely selected a general “cause” of all criminality 
and then sought to convince their readers that 
elimination of that cause would eradicate crime 
both by reforming criminals and by preventing 
future criminality. There was little attempt to 
“make sense,” by means of a theory, of the known 
facts about criminals or about variations in crime 
rates, in part because few facts were known. 

Near the end of the nineteenth century, sociol- 
ogy was making its way into the curricula of 
American universities and colleges, and a survey 
conducted in 1901 indicated that criminology and 
penology were among the first courses offered 
under the general title “sociology” (Tolman 1902- 
1903). From that time to the present, the main 
American contributions to crime causation theory 
have been made by sociologists, among whom the 
tendency has been to develop and state theories of 
crime causation that are consistent with, but never- 
theless separate from, more general sociological 
and social psychological theory. On the other hand, 
the tendency among psychologically trained and 
psychiatrically trained persons is to assume that 
general knowledge of clinical psychology and psy- 
chiatry is a sufficient basis for understanding crim- 
inality, with the result that few psychological 
theories specifically directed toward explanation of 
criminality have been stated. 

Theoretical requirements. Ideally, a theory that 
explains social behavior in general, or a specific 
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kind of social behavior like crime, should deal with 
both the individual conduct and the epidemiology 
involved (Cressey 1960). Thus, in a theory of 
crime causation, there should be a statement that 
explains the statistical distribution of criminal be- 
havior in time and space (its epidemiology) and 
that can be used to derive predictive statements 
about unknown statistical distributions. But the 
same theory also should be concerned with crim- 
inality and should identify, at least by implication, 
the processes by which individuals come to exhibit 
criminality and from which can be derived predic- 
tive statements about the behavior of individuals. 
We shall review the principal theories that attempt, 
in varying degrees, to integrate explanation of the 
epidemiology of crime with explanation of individ- 
ual criminality. This arbitrary limitation eliminates 
from our immediate concern the older biological 
theories, which held that individual criminality is 
inherited or caused by nonhereditary biological 
conditions. It also eliminates from consideration 
those theories of crime that are based on economic 
or cultural determinism and that, therefore, have 
only slight implications for explanation of the be- 
havior of individual criminals. 

The multiple-factor approach. Perhaps the 
most popular approach to integrating explanation 
of crime rates and explanation of individual crim- 
inality is not, in the strict sense of the term, a 
theory at all. William Healy's early emphasis upon 
multiple causation in the cases of individual delin- 
quents (1915, pp. 33-125, 130-138), combined 
with the then prevalent attempts to discount bio- 
logical explanations of criminality, played an im- 
portant part in the development of the multiple- 
factor approach in criminology. A revolt against 
stressing one cause for all criminality led to pro- 
duction of extensive lists of causes of criminality, 
each one of which was at first said to produce a 
portion of all criminals (Burt [1925] 1944, p. 600). 
The idea that criminality is the result of multi- 
farious influences has persisted to the present, but 
the specific “factors” said to be important have 
shifted from time to time, becoming progressively 
more social in nature (Jones [1956] 1962, pp. 57- 
74). 

Since a theory of crime causation consists of a 
logical generalization about facts pertaining to 
crime and criminality, the multiple-factor approach 
actually is not a theory. In recent years the ap- 
proach has taken on two separate aspects. Some 
workers who use the approach in the study of in- 
dividual criminality maintain that one case of 
criminality results from one configuration of fac- 
tors, while another case of criminality results from 


a different configuration of factors. This is not the 
same as the earlier multiple-factor notion that one 
kind of criminality is caused by one kind of factor 
and another type of criminality is caused by a 
different kind of factor. For example, this early 
approach solved the heredity versus environment 
controversy by maintaining that some crimes are 
caused by hereditary factors, some by environmen- 
tal factors. The current idea is that each case of 
criminality results from a peculiar, if not unique, 
combination of factors, each with its own degree 
of weight. The factors may be assigned equal 
weight, or one factor may be classified as major 
while another is classified as minor. For example, 
one author observed that delinquency is caused by 
a combination of hereditary factors and environ- 
mental factors and then was able to conclude that 
heredity accounts for about 60 per cent of the in- 
fluence and environment about 40 per cent (Hirsch 
1937, pp. 55-56). Another author has commented 
that the logic behind this kind of conclusion “is 
about on a par with that which would be involved 
if the assumption were made that the element 
hydrogen is twice as important as the element 
oxygen in the compound water .. .” (Vold 1958, 
pp. 101-102). Broken home, alcoholic parents, 
poverty, defective intelligence, poor education, and 
emotional immaturity are examples of the factors 
contemporarily used in this kind of approach. 

When the multiple-factor approach is used in the 
study of crime rates, the investigator merely lists 
conditions (factors) that are statistically associ- 
ated, to a high or low degree, with crime. Generally 
speaking, the higher the degree of association, the 
more importance is ascribed to the particular con- 
dition. In this kind of study, there is little attempt 
to make sense of the statistical findings by showing 
that they are consistent or inconsistent with a the- 
oretical scheme. Reckless has advocated the use of 
this method under the name “actuarial approach” 
(1943, p. 74). 

Critique of the multiple-factor approach. In 
one of the best critiques of the multiple-factor 
approach, Albert K. Cohen (1951, pp. 5-13) iden- 
tified three principal errors. First, there is confu- 
sion between explanation by means of a single 
theory and explanation by means of a single factor. 
A single theory is a logical statement showing how 
variations in one phenomenon (variables) are 
linked with variations in other phenomena. State- 
ments of fact are made in terms of the values of 
variables, but such statements of fact are not the- 
ories of crime causation, whether they are con- 
cerned with the relationships between criminality 
or crime and the values of one variable (say, 


income) or with the relationship between crimi- 
nality or crime and the values of a number of 
variables (income, age, sex, education, parental 
discipline). Further, the fallacy that “evil causes 
evil” usually characterizes the multiple-factor ap- 
proach, although it is not peculiar to it. The fallacy 
consists in thinking that effects that are viewed as 
undesirable (criminality and high crime rates) 
must have antecedents that are undesirable (alco- 
holism, psychopathic personality, poverty, class 
discrimination). When this fallacy is present, “ex- 
planations” of criminality and crime are likely to 
be statements that merely attribute causal power 
to a list of ugly and sordid conditions that any 
“decent citizen” must deplore. Finally, factors are 
confused with causes, and each factor is assumed 
to contain within itself a fixed amount of crime- 
producing power. For example, the fact that a per- 
son has a low income is said to have some 
criminality-producing power; the fact that he is a 
young adult pushes him further in the direction of 
criminality; and the fact that he is a male is the 
last straw. Sometimes the basis for imputing causal 
power to a factor in an individual case is high 
statistical association between the factor and crime 
rates—if in a city the areas of poor housing are 
also the areas of high crime rates, a person who 
lives in a substandard house is considered on the 
road to crime. Statisticians have regularly pointed 
out the fallacy of such reasoning, and it is not part 
of the “actuarial” approach. But, equally important, 
sometimes the basis for imputing causal power to 
a factor is based only on the subjective judgment 
of the person doing the imputing and Cannot be 
determined at all by others. 

Psychological theories. Among psychologists 
such attention as has been given to the problem 
of epidemiology has been directed toward locating 
social conditions that affect the personal traits said 
to produce criminality. Thus, psychological interest 
in criminality has been logically akin to psychiatric 
interest in finding unusual conditions producing 
abnormal traits in the make-up of criminals. How- 
ever, in psychology this interest has been expressed 
principally in attempts to measure objectively the 
degree to which criminals are psychologically dif- 
ferent from noncriminals. 

Intelligence testing came into vogue in the years 
just after World War 1, and the newly devised in- 
telligence tests were rather indiscriminately given 
to delinquents and criminals. One result was a 
widespread belief that low intelligence is an im- 
portant cause of criminality. However, more recent 
Studies indicate that criminals are not intellectually 
inferior to noncriminals. Nevertheless, the assump- 
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tions that led to the attempts to differentiate crim- 
inals from noncriminals on the basis of intelligence 
test scores are still maintained. They now are most 
evident in attempts to differentiate criminals from 
noncriminals on the basis of scores on tests of 
personality. Generally speaking, the work in this 
area has been fragmentary in character, in the 
sense that studies of criminals’ scores on person- 
ality tests have been made without reference to a 
systematically developed theory of criminality 
(Schuessler & Cressey 1950). 

Psychiatric theories. With the development of 
psychiatry in the last half century, there has been 
an increasing tendency to apply psychiatric and 
psychoanalytic techniques and theories to the 
problem of criminality, especially by such practi- 
tioners as prison, probation, and parole workers. 
William Healy's The Individual Delinquent (1915) 
stressed multifarious influences on the criminal, 
but it also freed psychiatric criminology from 
Lombrosian preconceptions and thus opened the 
door to theorizing about the nature of individual 
criminal conduct. Bernard Glueck (1918) provided 
another impetus to the psychiatric study of indi- 
vidual offenders when he published the results of 
a study of 608 inmates of Sing Sing Prison. 

The early preoccupation of psychiatrists con- 
cerned with criminality was the assumption that 
criminals constitute an inferior type, characterized 
by mental disorders, alcoholism, neuroticism, and 
the like (Hakeem 1958). In more recent years, 
this concern with serious mental defects and dis- 
orders among criminals has continued, but there 
also has been a trend toward inclusion of numerous 
minor emotional traits within the scope of the con- 
ditions held to be significant in producing crim- 
inality. Psychiatrists are also moving away from 
the notion that there is an undue amount of devi- 
ation from the normal among criminals and are 
moving toward descriptions of processes involved 
in the development of criminal mentalities. Thus, 
criminality is viewed as an adjustment to an emo- 
tional problem, and the criminal act is therefore 
considered as a symbol or a symptom. Consistently, 
there is an increasing concern with explanation 
of crime rates, although attention continues to be 
focused on individual criminality. For example, 
psychiatrists are beginning to study the epidemi- 
ology of mental disorders and, as a by-product, to 
develop explanations for the high incidence of 
crime among men as compared to women, among 
city dwellers as compared to rural dwellers, among 
lower-class persons as compared to upper-class 
persons, and the like. One popular theory is that 
such variations are due to differences in child- 
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rearing processes, but other theories are closely 
allied with the sociological “cultural” and “social 
structure” theories, which account for the distribu- 
tion of various forms of deviancy. 

Sociological theories. The central theme run- 
ning through criminological studies conducted by 
sociologists and social psychologists is that crime 
and criminality are products of the same kinds of 
social conditions and processes that produce non- 
criminal social behavior. The sociological attempts 
to define and identify the processes by which per- 
sons become criminals have involved the use of 
such social psychological concepts as imitation, 
role playing, differential association, differential 
identification, compensation, self-conception, and 
frustration-aggression. The sociological attempts 
to link variations in the crime rates of societies, 
subsocieties, and groups with variations in social 
organization and culture have led to theoretical 
concern for such processes as mobility, competi- 
tion, and culture conflict; political, religious, and 
economic ideologies; population density and com- 
position; and the distribution of wealth, income, 
and employment. This kind of structural analysis 
and the theoretical propositions based upon the 
analysis declined in popularity after about 1940, 
principally because investigators became aware of 
the great hazards in making generalizations about 
crime rates as measured by conventional crime 
Statistics. At present, however, sociologists are 
again studying the relationships between crime 
rates and social organization and developing the- 
ories to account for the variations observed. 

Differential association. Edwin H. Sutherland 
hypothesized that persons acquire patterns of crim- 
inal behavior in the same way they acquire pat- 
terns of lawful behavior. It is his idea that “criminal 
behavior is learned in interaction with persons in 
a pattern of communication,” and the specific 
direction of motives, drives, rationalizations, and 
attitudes—whether in the direction of anticrimi- 
nality or criminality—is learned from persons who 
define the legal codes as rules to be observed and 
from persons whose attitudes are favorable to the 
violation of legal codes, “A person becomes delin- 
quent because of an excess of definitions favorable 
to violations of law over definitions unfavorable to 
violations of law” (Sutherland & Cressey 1960, 
p. 78). Sutherland named the process of giving and 
receiving these definitions “differential association” 
because the content of what is learned in the proc- 
ess of association with criminal behavior patterns 
differs from the content of what is learned in the 
process of association with anticriminal behavior 
patterns. Differential association refers to a ratio 


of associations with both criminal behavior pat- 
terns and anticriminal behavior patterns. Daniel 
Glaser (1956) has modified this theory of crimi- 
nality by placing stress on the process of receiving 
criminal and anticriminal behavior patterns rather 
than on the process of donating them, thus chang- 
ing the basic concept from differential association 
to “differential identification.” 

Differential social organization. When it is ap- 
plied to the variations in the crime rates of nations, 
cities, or groups, the theory of differential associ- 
ation and differential identification becomes a 
theory of differential social organization. A high 
crime rate in a city, for example, is the end product 
of a situation in which a relatively large number 
of persons have received an excess of criminal be- 
havior patterns as compared with anticriminal 
behavior patterns. From this it can be reasoned, 
as Sutherland did, that a group with a high crime 
rate is organized for crime at the same time that it 
is organized against crime. Whether the crime rate 
is high or low depends upon the degree to which 
the organized system for presenting anticriminal 
behavior patterns is counteracted by the organized 
system for presenting criminal behavior patterns. 
In contemporary societies the social conditions in 
which the influences on a person are relatively 
inharmonious and inconsistent are themselves a 
form of social organization, and there are wide 
variations in the degree of conflict between crimi- 
nal and anticriminal norms. Since the rates of 
criminality vary with the degree of normative con- 
flict, they can be attributed to differential social 
organization. 

Culture conflict. Like the theory of differential 
association and differential social organization, the 
culture conflict theory developed by Thorsten Sel- 
lin (1938, pp. 21-32) stresses the importance of 
conflicts between conduct norms. According to this 
theory, both criminality and noncriminality of in- 
dividual persons are attributable to the kinds of 
conduct norms that have been experienced. But 
learning of divergent conduct norms presupposes 
the existence of a society in which the conduct 
norms of one group are in conflict with the conduct 
norms of another. Thus, a condition of culture 
conflict underlies a condition of high crime rates, 
for it is only when there is culture conflict that 
persons can learn conduct norms that permit the 
reaction to some situations to be one of criminality. 

Sellin proposes that criminologists should not 
restrict their attention to criminality and crime but 
instead should study the conflicts of conduct norms 
that exist when divergent rules of conduct govern 
the specific life situations in which a person may 


find himself. These rules “prohibit and conversely 
enjoin specific types of persons, as defined by their 
status in (or with reference to) the normative 
group, from acting in a certain specified way in 
certain circumstances” (Sellin 1938, pp. 32-33). 
Conduct norms arise as a group reaction to be- 
havior that is not in the interests of the social 
group and acquire validity when they are incorpo- 
rated into the personalities of the group members. 
Hence, the differentiation of the personality struc- 
ture or growth process of the violator over and 
against that of the conformist is closely allied with 
variations in the degree of culture conflict and, 
therefore, with variations in crime rates. 

Criminogenic cultures and social structure. 
Where the theories of differential association and 
culture conflict stress the importance of transmis- 
sion of criminal and delinquent behavior patterns 
from one person to another, a different type of 
sociological theory extends beyond the individual 
and his personal groups to the broader culture of 
a people. This kind of theory is directed more to 
the problem of explaining why one nation, group, 
or class has a high crime rate as compared with 
another nation, group, or class than to the problem 
of identifying the processes by which individuals 
become criminals. 

One cultural theory, for example, links the rel- 
atively high American crime rates with conditions 
and processes in the American culture that are 
criminogenic in their influence (Taft [1942] 1956, 
pp. 336-349, 754-755). Among these are the cul- 
ture’s dynamic, complex, and materialistic quali- 
ties, the tradition of the frontier, the breakdown of 
primary-group relationships, and political corrup- 
tion and inefficiency. Variations in crime rates 
among different groups, classes, and categories 
within nations are attributable to the differential 
impact of the criminogenic culture on those 
groups, and individual criminality and noncrimi- 
nality are attributable to differential impact of 
these conditions on individuals (Reckless [1950] 
1961, pp. 335-359). For example, the theory ex- 
plains that the excess of crimes by men is due to 
the relative protection of women from the stresses 
of competitive economic life and to their different 
moral codes and social roles. More generally, such 
factors as relative economic welfare, relative isola- 
tion from competition, and families that give secu- 
rity and affection will isolate individuals from the 
criminogenic culture and will, thus, keep crime 
rates low in some groups. 

A “social structure” theory of crime causation 
has been developed by Robert K. Merton (1938) as 
part of a more general theory of deviancy. Although 
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the theory has been modified in some respects by 
Merton himself ({1949] 1957, pp. 161-194), by 
Albert K. Cohen (1955), and by Richard A. Clow- 
ard and Lloyd E. Ohlin (1960), the basic notion 
is that high crime rates are a reflection of a situ- 
ation in which a society places great emphasis 
upon the goal of individual “success” while effec- 
tively blocking, for some part of the population, 
the paths to achievement of that goal. In this kind 
of social system, the generally approved rules of 
the game may be known to those individuals who 
evade them, but the emotional supports that ac- 
company conformity to the law are offset by the 
emphasis placed upon achieving success. Conse- 
quently, it may be said that the social structure 
frustrates some individuals but does not provide 
mechanisms of social and personal control (Reiss 
1951) that enable them to release their frustration 
legitimately. 

Limited theories. The theories outlined above 
are directed at explanation of criminality and crime 
in general. Such general theories are desirable be- 
cause they organize, integrate, and make sense of 
the factual data on crime and criminals. However, 
it also is desirable to break crime down into homo- 
geneous units, and develop theories to explain each 
unit. Crime in general consists of a great variety 
of criminal acts; and except for the fact that such 
acts are all violations of law, they may have very 
little in common. Consequently, a theory that at- 
tempts to explain all of them must necessarily be 
quite general in nature. Such general theories can 
be, and have been, supplemented with theories 
about specific units within the broad area of crime 
and about specific units within the legal definitions 
of types of crime, such as embezzlement, robbery, 
and murder. 

General theories about criminal behavior as a 
whole can guide the research and theory directed 
at explaining particular kinds of crime and crimi- 
nal behavior, and studies of particular kinds of 
crime and criminal behavior can lead either to 
strengthening or to modifying the general theories. 
In this sense, explanation of crime is like explana- 
tion of disease. A theory purporting to explain all 
disease must be quite general, but such theory is 
quite useful in understanding the origin and trans- 
mission of illness. Nevertheless, even general the- 
ories of disease, like the germ theory, do not apply 
to all diseases, and theories about specific diseases 
have been developed to supplement them. In crim- 
inology, important specific theories of this kind 
have been developed to explain embezzlement 
(Cressey 1953), forgery (Lemert 1953; 1958), 
gang delinquency (Cohen 1955), homicide (Henry 
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& Short 1954; Wolfgang 1958), professional theft 
(Conwell 1937), vandalism (Clinard & Wade 
1958), and white-collar crime (Sutherland 1949). 


DonaALp R. CRESSEY 


[See also CRIMINOLOGY; PENOLOGy; and the biography 
of SUTHERLAND.] 
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The Crime Prob- 


LLI 
OFFENSE PATTERNS 


The concept of crime is an unusually difficult 
one, since it is difficult to find any definition of 
crime that does not have a large element of circu- 
larity. In general, crimes are defined as events and 
actions that are proscribed by the criminal law of 
a particular country. This reduces to the definition 
of crime as being “what the criminal law says it is.” 


. The boundaries of a legal system are usually those 


of the nation-state, but the national boundaries are 
not exactly similar to cultural boundaries. England 
and Wales, for example, have a different legal sys- 
tem from that of Scotland. Yet it would be difficult 
to define the difference in culture between border 
towns. 

Legal fictions are useful within any legal system; 
but since they vary across nations and cultures, 
they are not very satisfactory concepts in behav- 
ioral science when comparison is made across cul- 
tures or nation-states, Almost all actions of which 
mankind is capable have, at some time, at some 
place, been defined as “criminal”; and almost all 
acts now defined as criminal have, at some time, 
at some place, been socially approved by the cul- 
ture of the time and hence not proscribed by the 
law. 

Crime involves at least two essential actors, the 
victim and the offender. Crime may be perceived 


and defined differently according to the role of 
either of them. In the Soviet Union crime is made 
to mean something different from what is under- 
stood as crime in Western cultures. There are many 
reasons for this, not the least of which is the fact 
that the “victim,” by reason of the social and eco- 
nomic system, must frequently be the state. 

Most crime in most countries consists of of- 
fenses against property. It is difficult, in modern 
society, to imagine it otherwise, even in very dif- 
ferent cultures. But when property is owned by 
the state, crime involving property involves the 
state as the victim, and, if the state has a philos- 
ophy regarding property, crime against property 
will be seen as crime against the state. If, on the 
other hand, wealth is privately owned, crime 
against such ownership is seen as an attempt to 
obtain the wealth of another by illegal means. 

But the mere transfer of wealth does not consti- 
tute a crime. Fair exchange is not crime, But what 
is fair exchange is not determined by the same 
considerations as those used to determine what is 
crime in terms of other forms of behavior, as, for 
example, an assault on a person. Moreover, the 
type of exchange that is defined as legitimate may 
be determined more by the nature of a country’s 
economy than by its moral values. 

Crime is always defined in terms of the social 
institutions and their organization in a society. The 
crimes will be defined through some organized 
process as behavioral deviations from the values as 
institutionalized in the norms and rules. There 
could be no crime in any society where all persons 
behaved in the same way, no matter how they 
behaved. 

In the Western world we are concerned in our 
definitions of crime mainly with what is sometimes 
a very fine distinction between legitimate and ille- 
gitimate means of acquisition of wealth. Thus, the 
ownership of private wealth is valued—the “wrong” 
in stealing consists only in the means of acquisition 
of wealth. When the line between legitimate and 
illegitimate becomes very thin, different subcul- 
tures within the same legal system may perceive 
the distinction differently. 

To draw a sharp distinction between what is 
criminal and what is noncriminal does not make 
sense, Crime is human behavior. But not all human 
behavior is the same. Presumably all human ac- 
tions form a frequency distribution of actions and 
can be placed on an imaginary continuum from 
the most saintly to the most sinful, or from acts 
incurring the greatest degree of public acclaim to 
those acts which attract the greatest amount of 
public disapprobation. There are very few extremely 
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sinful acts committed in this world, and there are 
very few extremely saintly acts; most of the actions 
taking place throughout any culture are just 
“normal.” The continuum of ethical content of our 
actions may be cut for purposes of definition at 
any point, thereby including or excluding a pro- 
portion of marginal acts. The cutting points, as 
defined, may change from time to time, depending 
upon changes in statutes, cultural meanings at- 
tached to their language, or customs’ allowing stat- 
utes to lapse. This process of change may go 
unnoticed by most observers in the culture, since 
it is often gradual. 


Crimes and criminals 


It will be obvious that “crime” and “criminals” 
are two different concepts. Often these two are 
confused, particularly when reference is made to 
criminal statistics. It is necessary always to in- 
quire closely into exactly what figures relating to 
“crime” refer to. A “crime” (illegal action) can 
occur without there being an identifiable criminal, 
and a criminal can commit a number of crimes 
even in the course of one actual action or event. 
A person removing a motor vehicle without the 
owner's consent may break a large number of laws 
in this one act, or, less frequently, there are mul- 
tiple acts which are subsumed under the definition 
of one crime. To some extent crimes committed by 
persons under the legally defined age of “criminal 
responsibility” are crimes without defined crimi- 
nals. If a crime is not detected or otherwise “cleared 
up,” there is always a chance that it may be an 
outcome without a criminal. 

The problem of definition of crime is closely 
related to the problem of measurement of crime. 
Can we measure crime and express it in numbers? 
Certainly there are plenty of numbers in the U.S. 
Uniform Crime Reports, in Criminal Statistics, pub- 
lished in England and Wales, and in similar re- 
ports published in all Western countries. But do 
these numbers measure what we are speaking 
about when we speak about “crime”? 

Sutherland and Cressey ([1924] 1960) have 
criticized criminal statistics on the grounds that 
it is impossible to determine with accuracy the 
amount of crime in any jurisdiction at any time. 
But the difficulty is not a statistical difficulty; it is 
a difficulty relating to problems of definitions. 

Figures are available for “indictable offenses 
known to the police” in England and Wales, and 
there are similar figures for other countries; figures 
are available for the numbers of persons proceeded 
against in courts; there are figures for persons 
found guilty, figures for persons arrested (in the 
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United States), and many other sets of figures. Are 
any of these figures measurements of “crime”? Un- 
doubtedly not. But as numbers representing what 
they claim to represent, they are not necessarily to 
be impugned as the most unreliable of statistics. 
It is not the statistical data that claim to be what 
they are not; rather, it is the people who make use of 
these data who may be regarded as “unreliable.” 

There are a large number of points at which it 
is possible to intercept events or persons and to 
make counts of them. Whether these counts are 
correctly and efficiently made is one problem, but 
whether the figures represent what we are speak- 
ing about is another problem, It is necessary to 
keep these two problems separate. 

Any one event in the continuum ranging from 
sinful to saintly may trigger off a series of events. 
Processes involving human behavior, whether de- 
fined in law as “criminal” or not, are dynamic proc- 
esses. If the event is definable in law as criminal, 
the processes from the commission of the act, so 
defined, to the reception of the convicted offender 
in a penal institution are many and varied. Differ- 
ent decisions are made at different points in this 
process, and each decision may be considered as a 
“gate” which opens or closes and diverts the 
“offender” from one channel to another—into or 
out of the system. Each of these decision “gates” 
can provide an efficient count of the “gate pas- 
sages” (Wilkins 1964). Whether a record of any 
or all of these decisions at any or all of these points 
would be a measure of what we are speaking about 
when we speak about “crime” is debatable. 

It is obvious that counts and measurements at 
different points along the process continuum will 
give different numbers, and these different num- 
bers will measure different things. It may be desir- 
able to examine some of these distinctions. 

Crimes known to the police. It will be clear 
that figures collected at some points along the con- 
tinuum from the commission of the offense to the 
disposal of the offender by a court or by other 
means are likely to be more reliable than others. 
But perhaps the more reliable the figures can be, 
the less informative they may be regarded as being 
on other matters, such as the volume of crime. For 
example, it is often considered important to know 
about the amount of “juvenile delinquency,” but it 
is difficult, in practice, to measure the amount of 
juvenile delinquency in any area or country. Clearly, 
the age of the offender cannot be known until a 
crime has been detected and an offender identified 
who is presumed to have committed it, Thus, to 
measure juvenile delinquency, some action must 
already have been taken by Society regarding a 


reported crime or behavior. Some writers have pro- 
posed methods for questioning children themselves 
about their acts, and such methods overcome this 
difficulty but raise many others of a different kind 
(see Sutherland & Cressey [1924] 1960; Reiss & 
Rhodes 1960). 

In the general criminal statistics, it would ap- 
pear that data relating to persons are more reliable 
than data relating to events. For example, the num- 
ber of persons received into prison under sentence 
in any one year is a figure which it is possible to 
know exactly, as also is the population of any 
country’s prisons on any day or an average of days. 
There will be problems of what is meant by 
“prison,” and perhaps what is meant by “person” 
and by “under sentence,” but these terms may be 
dealt with by means of operational definitions. 
Nonetheless, such numbers, even though abso- 
lutely accurate, do not necessarily relate to a ques- 
tion about crime, unless we are careful to speak 
in exactly the same terms as those used to deter- 
mine the counting procedures. 

It has been suggested by some authorities that 
the value of a crime index decreases as the dis- 
tance from the crime itself in terms of procedure 
increases (Short & Nye 1957-1958). This means 
that figures which relate to “crimes known to the 
police” provide a better index of crime than figures 
relating to arrests, persons proceeded against or 
found guilty, or various other figures relating to 
the later processing of persons defined as offenders. 
In other words, and reminding ourselves of the dis- 
tinction between crimes and criminals, it is often 
claimed that the most meaningful figures, from 
a criminological point of view, are figures relating 
to events (crimes) and not those relating to per- 
sons (criminals). If one is speaking about events, 
one should not use data relating to persons, but it 
is doubtful whether data relating to events can be 
obtained in any satisfactory way. A person is sim- 
ply defined by his own physical identity. If persons 
pass through any decision point and that point has 
a counting mechanism, there can be little doubt 
about the number of such persons. But whenever 
an event is imagined independently of a specific 
person and a specific decision point, or a concept 
is discussed which has no relation to a specific ac- 
tivity or decision which can be observed, there is 
far more chance of uncertainty. 

In the definition of crime we are concerned with 
the cutting points in a continuum, not a gate deci- 
sion system. If we wish to discuss crime, it is 
necessary to consider the working of the system of 
definitions that may vary the cutting points in the 
continuous distribution of actions from evil to good. 


It is, of course, obvious that the police do not get 
to know of crimes by some direct process of “know- 
ing” or observation. Criminals usually go to great 
pains to make this difficult for the police. Crimes 
“known to the police” are usually reported to them 
by the public, most frequently by the victim. This 
means that an event must first of all be defined by 
a member of the public, with no special legal 
knowledge, as an event requiring that the police 
do something about it. That is to say, a crime is 
usually first defined by a democratic process as 
“something that the police ought to do something 
about.” It may be that events defined by an ordi- 
nary citizen as a “crime worth reporting to the 
police” may not be a crime within the legal code 
of the country concerned, and, although the event 
is reported because it is perceived as a crime, the 
police will not record it. Although they may still 
take action to remedy the problem, they would not 
record an event as a crime simply because the 
reporter viewed it as such. On the other hand, there 
are many events known to the same citizen which 
he does not define as “something that the police 
ought to do something about” although they are in 
fact crimes under the law. 

It would, of course, be possible to define as 
“crimes” all events which become known to ordi- 
nary citizens and are thought by them to be worthy 
of reporting to the police in the expectation of 
some action to remedy their complaint. Such events 
generally are called “complaints known to the 
police,” but not “crimes.” It might, however, be 
more useful ‘operationally to define “crimes” as 
“complaints,” since this would amount to defining 
crimes as events that give rise to a level of dis- 
approbation on the part of the victim or other 
members of the public such that they actively seek 
the assistance of the forces of social defense. 

Criminal statistics. At present the figures re- 
lating to “crimes known to the police” (in countries 
where these are available), although nearest to 
the event which is regarded as “criminal,” provide 
an amended record of events believed by the gen- 
eral public to be crimes, plus some additional 
events where the police obtain direct evidence and 
where information regarding one event leads to dis- 
closure of another. In most countries there are 
events which are crimes according to the law but 
which the average citizen does not define as any- 
thing that the police should know about. Similarly, 
in most countries there are events which are per- 
ceived by average citizens as crimes against their 
culture but which are not acknowledged in law. 

In most democratic societies the law will tend 
to be amended to agree with public opinion, but the 
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relationship between public opinion and legal defi- 
nitions is not a one-to-one correlation. An example 
of public disapprobation leading to new legislation 
may be cited—the Offensive Weapons Act of Eng- 
land and Wales, which made it a crime for persons 
to offer certain types of knives for sale. In the 
other direction, attempted suicide, for a long time 
not a criminal offense in Scotland, has recently 
been removed from the criminal definition in Eng- 
land and Wales also. 

Legal definitions of crimes, while very suitable 
for legal processes, are not directly translatable into 
sociological and psychological terms, Behavior usu- 
ally regarded as “normal” includes some acts that 
are crimes by legal definition, while at the extreme 
end of a distribution are crimes that all sane per- 
sons, even offenders themselves, regard as such. 

Crimes and morals. It is often argued that the 
cause of an increase in crime is a decline in moral 
standards of the population, If this argument is 
not circular (crime being held to be a reflection of 
morals), it is possible to throw some light on the 
relationship between moral values and crimes. Per- 
haps the layman who assumes, when he hears that 
crime has increased, that he is in greater risk of 
an attack upon his person or property is nearer to 
equating morals with crime than the official defini- 
tions allow. Indeed, it seems possible, and even 
probable, that any improvement in moral values 
would show in the official records of crime as an 
increase in the number of crimes known to the 
police and perhaps also as an increase in the num- 
ber of arrests. 

Any change in the social definition of an event 
worth notification of the police will, even in stable 
conditions, influence the bias of the sample of 
crimes represented by the “crimes known” figures. 
If a society is becoming more “moral,” it is possible 
to suggest that that society is becoming more strin- 
gent and requiring more conforming standards of 
behavior. If this occurs, the democratic definition 
of “things which the police ought to do something 
about” will tend to agree more closely with the legal 
definition of crime for the majority of events that 
are defined in the criminal law as crimes. 

It is simpler to demonstrate the effect of change 
in public opinion in regard to expected levels of 
behavior where the criminal statistics are based on 
“crimes known to the police” than where data are 
derived from arrests. It may be supposed, however, 
that the police as public servants are aware of 
public pressure, and, of course, they are themselves 
members of the culture that forms the definitions 
to which they operate in their official capacities. 

In England and Wales and some other countries, 
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what is a “crime” to be included in the “crimes 
known” figures is determined by whether the of- 
fense is “indictable.” The definition of an “indict- 
able offense” depends upon the way in which an 
adult person accused of the offense might be tried. 
Special provisions for the juvenile (persons under 
17 years of age in England and Wales and certain 
states in the United States) are ignored for pur- 
poses of this definition. Perhaps the criterion of 
recording as crimes only indictable offenses was at 
one time related to levels of public disapprobation. 
Today it has no very strong association with public 
opinion, particularly in the distinction between of- 
fenses of larceny and vandalism. In England and 
Wales, larceny of any value is an indictable of- 
fense, but vandalism is a misdemeanor. For exam- 
ple, it would be regarded as an indictable offense 
for a person to open the doors of a parked car and 
remove a half-empty package of cigarettes from 
the glove pocket, but not an indictable offense to 
carve his name on the windshield or twist off the 
door handles, This illustration is not universally 
applicable, but in all legislations some similar event 
can be found. 


The problem of trends and comparisons 


It is generally agreed that precise cross-cultural 
comparisons of crime are impossible. Attempts 
have been made by committees of the United Na- 
tions to establish some form of comparison for the 
most serious offenses, like homicide and robbery, 
but even at this level of seriousness no strictly 
comparable basis can be found. In the case of mur- 
der, the social significance of weapons differs 
among countries, and vestigial traces of the dueling 
cultures of the past still remain in the legal codes 
of many countries. Comparisons have been at- 
tempted in terms of legal definitions, because it is 
believed that the legal basis for crime cannot be 
avoided. It might be possible to find some basis for 
comparisons if crimes were described in terms of 
exactly what happened, as perceived in the current 
cultural setting, but even this would be difficult. 
Add to the difficulty of cultural differences, which 
are current in their effects, traces of history pre- 
served to greater or lesser degrees in the laws of 
different countries, and exact cross-cultural com- 
parisons are impossible, 

For legal purposes, and not without good reason, 
there is a tendency to try to preserve a law that has 
worked in the past to suffice for the newer con- 
ditions for as long as possible. The horse was a 
factor to be considered in relation to economic and 
criminal behavior in the past, but it is very doubtful 


whether the transition from horse to “horseless 
carriage” is a sufficiently direct one for the con- 
tinuity of legal definition and sociological and psy- 
chological inferences based on this transition. 

Perhaps the main problem in the study of trends 
arises from the fact that changes in the law can 
be made only in discrete steps, whereas the proc- 
esses of technological change are continuous. 
Changes in perceptual processes that follow upon 
and are related to technological changes as well 
as value systems are also continuous changes. 
Some items in legal classifications must, no matter 
how much the law is changed, be out of step with 
social and economic conditions. The changes will 
be made in different countries and different states 
at different rates and with respect to different parts 
of the law, But these discrete adjustments imme- 
diately prejudice the study of trends. Trends seem 
to require a stable definition, but changes are es- 
sential if the law is to have any meaning in a 
changing society, 

Clearly, the law cannot be in a process of con- 
tinuous change if it is to function for the protection 
of society, although society, in order to preserve 
itself, must be continuously adjusting to change. 

Changes within a culture. It may be thought 
that although the problem of trends and compari- 
sons is difficult and complex when cross-cultural 
analysis is being attempted, something more pos- 
itive may be said about trends within a culture. 
But even here there are serious problems. 

It may seem a simple matter to adjust the fig- 
ures for crime within a country or state according 
to changes in the population. The total amount of 
crime is expected to change because, if the popu- 
lation of a country increases and the definitions 
remain constant and are constantly interpreted, 
more persons would be expected to commit more 
crimes. But in some parts of the world it is not 
possible to estimate the population with any degree 
of accuracy during the years between censuses. In 
the United States this is particularly true for states 
with a rapidly growing or declining population. 
Even an accurate estimate of the total population 
may not provide sufficient information for adjust- 
ment of crime data to be meaningful. It is obvious 
that a better crime rate is provided if the crude 
number of crimes is modified by the number of 
persons available to commit them, The rate per 
100,000 of population (the conventional rate) pro- 
vides something which is obviously better than 
rates that make no allowance for population size. 
But the obviousness of the improvement may be a 
snare and a delusion. The majority of crimes, so 


far as is known from the persons identified as 
criminals, are committed by younger male persons. 
In all Western countries females commit from 
one-fifth to one-tenth of the amount of crime com- 
mitted by males. Further, persons of either sex over 
thirty years of age commit (or are found guilty of) 
relatively few offenses. 

Thus, if the age structure of a population is 
changing and if there are increasingly more young 
people in the population, then even with a con- 
stant population an increased amount of crime 
would be expected. The converse is also true. If a 
population is becoming older, the crime rate can 
be expected to drop, even if every person, age for 
age and sex for sex, remains as criminal as before. 
This argument assumes, of course, that crimes for 
which offenders are identified are similar, in re- 
spect to age and sex distribution, to crimes which 
are not “cleared up.” (The rate of clear-up varies 
between types of crimes and from place to place 
but is seldom greater than 50 per cent for crimes 
where the victim does not see the offender in a 
face-to-face situation. ) 

Taking all these points and bearing in mind 
others which could be raised, it is regarded as un- 
safe to make any statements about the state of 
crime or crime trends over time or between differ- 
ent countries, states, or districts, except perhaps in 
the crudest possible terms. 


Crime and opportunity 


Adjustments for changes in population are usu- 
ally regarded as obvious and sound. But there are 
few crimes that relate to a basis of persons. Per- 
haps the number of murders may be expected to 
change in proportion to the number of persons 
available to commit murders or to be the victims of 
murder, but if adjustments are obviously necessary 
for persons whose crimes affect persons, why are 
other adjustments not equally obvious? If, for ex- 
ample, there are more cars available to be stolen, 
why should not the number of cars stolen be ex- 
pected to rise proportionally to availability? If there 
is more money available for legitimate transfer, 
why should not the number of illegitimate transfers 
of money be expected to rise proportionally? 

Some data relating to certain indictable offenses 
known to the police, adjusted for population, are 
given in Table 1. It will be observed that the ad- 
justed number of murders has remained reason- 
ably constant over the years from 1946 through 
1959, whereas housebreaking has shown a fall and 
a rise, and embezzlement an almost continuous 
tise. Indeed, the figures for murder do not show 
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Table 1 — Certain indictable offenses known to the 
police, adjusted for population changes, 
England and Wales, 1946-1959 


Murder’ Burglary Embezzlement Housebreaking 
1946 131 4,220 799 31,270 
1947 134.5 3,872 1,052 30,285 
1948 146.3 4,153 1,732 31,417 
1949 118.1 3,712 1,734 26,770 
1950 121.2 3,592 2,153 26,057 
1951 122.4 3711 2,333 26,164 
1952 134.7 3,811 2,894 28,195 
1953 129.2 3,659 2,842 26,479 
1954 134.5 3,126 2,981 22,782 
1955 121.5 2,862 2,833 21,435 
1956 142.7 2,836 3,091 22,705 
1957 144.6 3,516 2,563 27,775 
1958 118.2 4,316 2,756 34,569 
1959 134.6 4,387 3,533 35,874 
a. Uncorrected for cases subsequently found to be manslaughter, infanticide, 

and so on. 


b. Bose year 1946 = actual figures. 
Source: Adapted from Wilkins 1964, based 
on Criminal Statistics of England 
and Wales, Annuol Report, 


any variation in excess of chance, whereas other 
offenses show considerable and signiticant varia- 
tion. This raises the question of whether the mur- 
der figures remain constant because the factor used 
to compute these adjusted figures is based on the 
relevant “population at risk.” If equally meaningful 
bases could be used to construct rates for other 
crimes, it might even be discovered that the total 
amount of crime in this and other societies remains 
constant. However, it seems unreasonable to deny 
that major changes can occur in the long run; 
what is needed is a better understanding of both 
short-term fluctuations and long-term trends. 

The problem of heterogeneity. It is possible to 
examine the problem of the heterogeneity of cur- 
rent definitions of crime in a number of ways. It 
may be postulated, for example, that if all types of 
crime tend to increase and decrease simultane- 
ously over time, it does not matter which type of 
crime is selected as an index or whether all crimes 
are added together. It might seem desirable to se- 
lect an offense that presents as little difficulty as 
possible in identification and classification. Another 
method is provided by an area analysis. It could be 
suggested that if different areas show different 
interrelationships between crimes, an index of 
crime should not mask these differences. But 
neither of these methods can begin with any finer 
breakdowns of the data than are provided by the 
original classifications. It is true that the law may 
subdivide into different crimes types of actions that 
may be sociologically and psychologically similar, 
but it may also place together into one category 
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crimes that are dissimilar in other dimensions. 
Once information has been lost by coding it, it can- 
not be recovered without a basically new encoding 
system. 

Despite these limitations, the writer has used 
both a historical analysis and an area analysis to 
gain information regarding ways of classifying 
similar crimes together and separating those which 
do not show similar patterns. The following classi- 
fication system is proposed (Wilkins 1963a): 


I. Homicide 

(a) Murder 
(b) Manslaughter 

II. Serious crimes against the person (including 
sex offenses and violence) 

Ill. Serious crimes against property (burglary, 
breaking and entry, robbery, etc.) 

IV. Social disorganization (drunkenness, dis- 
orderly conduct, petty larceny, etc.) 


In terms of variations over time and variations 
between districts, these categories tend to show 
different patterns and thus to suggest heterogeneity. 
In many jurisdictions the legal classification does 
not discriminate by degree of seriousness. Robbery, 
for example, will generally cover a bank robbery 
at one end of the scale and a small boy tripping 
another and stealing his pocket money at the other. 
Thus, the dividing line between serious and non- 
serious offenses against the person is not fixed in 
terms of injury to the victim or period of medical 
treatment occasioned. The perception of what is 
“serious” may well vary from time to time and from 
place to place according to the social attitudes of 
the culture in which the events occur. 

Two comparisons. It would be unsatisfactory 
to leave the impression that all forms of research 
into crime which involve cross-cultural compari- 
sons or even comparisons within one culture are 
invalid. It is true that it is difficult to sustain the 
common concept of crime and to make valid com- 
parisons, and the legal definitions only make the 
problem the more difficult. Nonetheless, two ex- 
amples will be given of comparisons that seem to 
be legitimate, and these may serve to indicate 
methods through which some positive contribu- 
tions may be made, 

Comparisons jtequire well-defined and specific 
forms of behavior, and, with existing data, these 
are few. Larceny from motor vehicles and drug 
addiction are the examples used, the former for a 
comparison within one culture and the latter for a 
cross-cultural analysis. 

Offenses of larceny from motor vehicles which 


Table 2 — Larceny from motor vehicles, adjusted for 
number of vehicles, England and Wales, 
1940-1961 


Registered private 


motor vehicles, Larceny from Rate per 
in thousands motor vehicles* 100 vehicles 
1940 1,423 16,849 1.2 
1941 1,503 15,672 1.0 
1942 858 12,180 1.4 
1943 718 11,084 15 
1944 755 14,509 1.9 
1945 1,487 26,520 1.8 
1946 1,770 32,546 1.8 
1947 1,943 33,984 17 
1948 1,961 32,665 17 
1949 2,131 30,297 1.4 
1950 2,258 33,156 1.5 
1951 2,380 43,127 1.8 
1952 2,508 41,125 1.6 
1953 2,762 39,739 14 
1954 3,100 39,398 13 
1955 3,526 43,304 1.2 
1956 3,888 50,782 1.3 
1957 4,187 54,937 1.3 
1958 4,549 68,466 1.5 
1959 4,966 79,899 1.6 
1960 5,526 92,704 17 
1961 5,979 112,671 19 


* Crimes known to the police. 


Source: Adapted from Criminal Statisties of 
England and Woles, Annual Report. 


were recorded as “known to the police” in England 
and Wales during the period from 1938 through 
1961 are shown in Table 2, together with the num- 
ber of motor vehicles registered. It will be noted 
that when we use as a divisor the number of motor 
vehicles “at risk,” the “crime rate” for this category 
of offenses shows a very different picture from that 
which would be obtained by dividing by the popu- 
lation. Were the moral values in England and 
Wales at a low ebb in the years 1944 to 1946, 1951, 
and 1961, when thefts from cars reached an index 
of 1.8 per cent or more? Or have moral values 
tended to deteriorate more or less steadily since 
the end of World War 11? Is the base of “oppor- 
tunity’ a meaningful base? Could thefts from 
other sources be better interpreted against the 
changes in the gross national product or some other 
economic measure of affluence? In this case, the 
highest rate of larcenies per 100 vehicles is only 
1.9 times the lowest rate, although the largest ab- 
solute number of larcenies in any year is about ten 
times the lowest number. Would similar results be 
obtained for other crimes, or do these results repre- 
sent a tendency toward a constant factor in crime 
after adjustment of crude figures to a suitable 
base? Whether any of these questions are an- 
swered one way or another, it would certainly seem 


to be meaningful to explore divergencies from the 
“constant” rather than to examine figures unad- 
justed by “vehicles exposed to risk.” 


Can the study of crime be separated from eco- 
nomics and other forms of study of human be- 
havior? Moral explanations are obviously not suf- 
ficient—unless it is seriously held that moral values 
have deteriorated (and improved) strictly propor- 
tional to the number of cars! Before there can be 
serious discussion of trends in crime, much more 
sophisticated models must be sought than the 
simple cause-effect models that have been con- 
sidered so far. If “crime” concerns economic be- 
havior, then other aspects of economic behavior 
must be considered in assessing its cause. If legiti- 
mate opportunities are increasing in any culture, 
it will usually mean that illegitimate opportunities 
are also increasing, If a society wishes to generate 
change in criminal behavior, it seems highly likely 
that it will have to consider changes also in eco- 
nomic behavior—at least insofar as the balance 
between legitimate and illegitimate opportunities 
for the exchange of wealth is concerned. 

The need for more complex models seems to be 
illustrated by the cross-cultural comparison of drug 
addiction. The United States, and particularly the 
major cities on the east and west coasts, have a 
very serious problem of addiction. England has no 
such problem. Yet it is often claimed that the sys- 
tem of drug control in England is almost exactly 
the same as that in the United States. This point 
is argued, but it would appear that the difference 
in the addiction rates cannot be due to the small 
procedural differences—that is, if similar causes 
must have similar effects. But there are types of 
models, called “deviation amplifying systems,” in 
which similar causes can have different effects 
(Wilkins 1964). In order to illustrate such a model, 
let us reverse the present example and suggest how 
England might find itself with a drug addiction 
problem. One way could be if an attempt were 
made to reduce the number of addicts (at present 
about five hundred) by more stringent control. 
The existing “image” of addicts as sick persons 
might then be changed by increased pressure from 
the police authorities. The difference between the 
image of a sick person and a sinner may be unsub- 
Stantial, but it might generate considerable differ- 
ences in behavior. No one goes out of his way to 
become sick, but sinning has some attractions! 
The deviation-generating model is very often found 
in economic behavior; what, for example, causes 
prices to fall on the stock exchange—a fall in con- 
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fidence? Or does a fall in prices cause a drop in 
confidence? Clearly this is a mutual causal system 
where a small instability can cause a major devia- 
tion to be generated. The concepts of “image” and 
“confidence” are extremely similar; indeed, con- 
fidence in the market may be described as the 
“image” of the market. 

LESLIE T. WILKINS 


[See also GOVERNMENT STATISTICS; STATISTICS, DE- 
SCRIPTIVE; and the biography of SUTHERLAND.] 
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m 
WHITE-COLLAR CRIME 


The concept of white-collar crime covers law- 
breaking among the middle and upper (or “white- 
collar”) socioeconomic classes. This type of criminal 
behavior differs from that of the lower socioeco- 
nomic classes in several important respects; differ- 
ent, also, are some of the legal sanctions for the 
two classes of offenses. To include white-collar 
violations within criminology, crime must be de- 
fined in terms broad enough to cover any behavior 
punishable by the state, regardless of whether the 
penalty is criminal, civil, or administrative, and 
regardless of the offender’s social status. 

Lawbreaking can be divided into two categories: 
conventional crimes (burglary, for example, or lar- 
ceny), which are usually punishable under the 
criminal law, and white-collar crimes, which are not 
usually punishable in this manner. An apprehended 
burglar or robber is punished by a jail sentence, a 
fine, or probation; a doctor may be punished 
through revocation of his license. Penalties imposed 
on businessmen include enjoinment by the govern- 
ment, the levying of civil damages, suspension of 
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license to do business, or (as in impure food cases) 
the seizure and destruction of commodities. All of 
these sanctions imply that the behavior is socially 
injurious; in each case, punishment is involved and 
society stigmatizes the offender. 

History of the concept. The concept of white- 
collar crime is a relatively new addition to crimi- 
nological theory, although the need for such a term 
had been apparent to at least one of the founders 
of American sociology (Ross 1907) and was re- 
affirmed in the 1930s (see, for instance, Morris 
1934). However, the term itself did not gain wide 
currency until 1940, when Edwin H. Sutherland 
published a paper entitled “White-collar Crimi- 
nality.” In it he developed the concept and outlined 
its widespread implications for criminology (Suth- 
erland 1940). Nine years later, he published the 
first major study of white-collar crimes: those 
committed by seventy of the two hundred largest 
nonfinancial corporations in the United States 
(Sutherland 1949), 

After World War 11, Clinard (1946; 1952) and 
Hartung (1950) published studies of white-collar 
crime in the form of black-market violations of 
price and rationing laws by United States business- 
men during the war. Other offenses dealt with by 
social scientists that have the same general kind of 
orientation include employers’ violations of labor 
laws (Lane 1953), embezzlement and other viola- 
tions of trust (Cressey 1953), violations of pure- 
food laws (Newman 1957), and illegal activities 
of pharmacists (Quinney 1963). Theoretical arti- 
cles on white-collar crime have been written by a 
number of American and European criminologists. 
But, with a few exceptions, such as Great Britain 
and Germany (see Mannheim 1946; 1965; Grygier 
et al. 1965; Strafrechtspflege . . . 1961; Zirpins & 
Terstegen 1963; Middendorff 1959), research in 
this field has remained a unique contribution of 
American criminologists, although white-collar 
crime occurs in nearly all countries. 

Social status of the offenders. Sutherland used 
the term “white-collar crime” to refer to violations 
of the law by persons of relatively high social and 
occupational status. “White-collar crimes” are those 
crimes committed by “respectable” persons in con- 
nection with their occupations. The concept, there- 
fore, does not include such conventional crimes as 
murder or robbery, offenses that are defined in the 
same way regardless of the status or occupation of 
the persons committing them. Nor does it include 
legal acts that are considered “unethical”; it cannot 
be a white-collar crime unless it is punishable in 
some manner by the state, 

Sutherland was not specific about which occu- 


pations should be included in the concept, and as 
a result there has been considerable confusion. In 
his study of black-market violations, Clinard in- 
cluded all gasoline-station operators and anyone 
who rented property. Newman suggested that 
“farmers, repairmen, and others in essentially non- 
white-collar occupations could, through such il- 
legalities as watering milk for public consumption, 
making unnecessary ‘repairs’ on television sets, and 
so forth, be classified as white-collar violators” 
(Newman 1958, p. 737). Others feel that white- 
collar crime should be restricted to those in high 
status positions and that those with non-high status 
positions in the middle class should be dealt with 
separately (Mannheim 1965, p. 474). Conse- 
quently, some have suggested that it would be 
advisable to change the emphasis of the concept 
from “white-collar crime” to “occupational crime” 
(Quinney 1964); it would not include all violations 
of law by white-collar persons, such as income-tax 
evasion and rent-control violations, unless they oc- 
curred in connection with the violator’s occupational 
role. 

White-collar criminal activities. Common forms 
of white-collar crime by businessmen include ille- 
gal activities of reorganization committees in re- 
ceiverships and bankruptcies; restraint of trade, 
such as monopoly, illegal rebates, and infringement 
of patents, trade-marks, and copyrights; misrepre- 
sentation in advertising; unfair labor practices; 
financial manipulations; and wartime crimes, such 
as black-marketeering. In developing countries, 
businessmen are especially prone to violations of 
the income-tax laws, import and export regulations, 
and currency-control measures. 

Embezzlement is an especially common form of 
white-collar crime. One classification of embezzlers 
identifies three types (Cressey 1953): “Independ- 
ent businessmen” is the name given to violators 
who convert “deposits” entrusted to them for spe- 
cific purposes while at the same time maintaining 
their regular businesses. The second group, the 
“long-term violators,” consists of employed individ- 
uals who convert funds belonging to their employ- 
ers, or to their employers’ clients, by taking relatively 
small amounts over a long period of time. “Ab- 
sconders” are persons who violate their trust by 
removing funds or goods entrusted to them and 
then severing connections with the trustor by leav- 
ing his employment or leaving the vicinity. 

Politicians and government employees commit 
various white-collar offenses, including direct mis- 
appropriation of public funds as well as the indirect 
acquisition of these funds through padded payrolls, 
the placement of relatives on a payroll, or monetary 


payments from appointees. Their illegal activities 
are usually more subtle than this, however. Politi- 
cians and government employees make financial 
gains by granting favors to business firms, such as 
illegal commissions on public contracts, the issu- 
ance of fraudulent licenses or certificates, and tax 
exemptions or underestimated tax evaluations. 
Labor union officials engage in such criminal activi- 
ties as the misappropriation or misapplication of 
union funds; failure to enforce laws affecting their 
unions; collusion with employers to the disadvan- 
tage of the union members; and the use of fraud- 
ulent means to maintain control over the union. 

Doctors may illegally prescribe narcotics, per- 
form illegal abortions, make fraudulent reports in 
accident cases, and split fees. Fee-splitting, wherein 
a doctor gives part of his fee to the doctor referring 
the case, is illegal in many places in the United 
States because of the danger that such referrals 
might be based on the fee rather than the practi- 
tioner’s ability. Lawyers engage in such illegal 
activities as the misappropriation of funds in re- 
ceiverships; the securance of perjured testimony 
from witnesses; the practice known in the United 
States as “ambulance chasing,” that is, investigat- 
ing as many accidents as possible soon after they 
occur in order to secure the damage-suit business 
of the victims; and collaboration in making fraud- 
ulent claims for damages. In these cases, an appre- 
hended offender is more likely to be disbarred from 
practice than prosecuted. 

Extent of white-collar crime. Obviously, to re- 
strict the definition of “crime” solely to conventional 
crimes results in an underestimation of the extent 
of crime and its effect on society. Studies have 
indicated the widespread nature of white-collar 
crime; however, these crimes are far more difficult 
to tabulate than ordinary crimes. The categories of 
offenses covered by the term are not as precise as 
in conventional crime; the criteria for determining 
the social class of the offender are often not clear; 
and the criminal law, with few exceptions, makes 
no distinction regarding the social class of offenders 
(Caldwell 1958). National crime statistics, such as 
the Uniform Crime Reports of the Federal Bureau 
of Investigation in the United States, tabulate only 
violations of criminal law and not those of admin- 
istrative and civil law, and therefore include very 
little about white-collar crime. Some of the latter 
information may be obtained from reports of vari- 
ous other government agencies and investigations, 
but this requires considerable effort, and often the 
required information is not fully available. The 
financial losses to society in a single case of white- 
collar crime may be equal to the total amount 
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involved in thousands of larcenies, burglaries, and 
robberies. However, society does not conceive of 
the white-collar criminal as a true criminal; this is 
reflected in the fact that poor and relatively un- 
educated offenders are usually sentenced to prison, 
whereas white-collar criminals are seldom im- 
prisoned. 

Sutherland’s study (1949) of 70 large nonfinan- 
cial corporations reported a total of 980 decisions 
rendered against them for violation of government 
regulations, an average of 14 per corporation. Re- 
straint of trade, infringement of patents, and 
unfair labor practices were the most frequent vio- 
lations. Sixty per cent of these decisions were ren- 
dered during a ten-year period (1935 through 
1944) of increased government enforcement of 
business regulations. Although only 158 cases were 
dealt with by the criminal courts, crimes were 
actually committed in 779 of the 980 cases. Two- 
thirds of these corporations had been convicted in 
criminal court with an average of 4 convictions 
each; 97 per cent of the corporations were repeaters. 

Violations of rationing and price-control laws by 
businessmen (black-marketeering) were a serious 
problem in many countries during World War n. 
Clinard (1952) found that approximately one in 
fifteen of the three million U.S. business concerns 
were punished for such violations, but only 6 per 
cent involved the criminal sanction. Of the 250,000 
concerns selling gasoline in the United States dur- 
ing the war, one in sixteen was punished for ration- 
ing violations, most of which were complex, eva- 
sive, and willful. A significant case of white-collar 
crime involved conspiracy in price-fixing and price- 
rigging violations of the federal antitrust laws by 
many leading U.S. electrical concerns (Herling 
1962). Twenty-nine companies were convicted of 
illegalities in sales of heavy electrical equipment to 
both government and private purchasers; the fed- 
eral court imposed fines totaling $1,924,000. Of 
the 45 company executives who were convicted, 7 
at the policy-making level were sentenced to 30 
days in jail, and 24 others received suspended 
sentences. These sentences, and the severity of the 
fines, are fairly unusual punishments for white- 
collar crimes. Civil suits involving millions of 
dollars were brought against many of the firms to 
recover damages. 

The case of the electrical companies does serve, 
however, as a reminder that white-collar crime in 
many areas is a well-organized affair, ranging from 
the comparatively simple reciprocal relationships 
involved in doctors’ fee-splitting to the more com- 
plex illegal activities of large corporations. Most 
techniques of violation are, of course, selected be- 
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cause they involve the smallest danger of detection, 
but they may also recommend themselves, espe- 
cially to a corporation, because the violators are 
confident that the case will either not be prose- 
cuted, or not result in a conviction, or receive a 
minor penalty. Indeed, they may even hope to 
change the law itself, or at least demonstrate that 
it is unenforceable. 

Punishment for white-collar crime. The punish- 
ments for white-collar offenses vary considerably, 
but almost without exception they differ from those 
given for ordinary offenses. One of the most sig- 
nificant of the many factors involved in this varia- 
tion is the fact that many of these acts were not 
made illegal until recent years. For instance, the 
following practices were not made illegal in the 
United States until the beginning of the nineteenth 
century: restraint of trade, false advertising, in- 
solvency of banks due to fraud or negligence of 
officials, sale of fraudulent securities, and misuse 
of trade-marks. Previously, the philosophy of laissez- 
faire (and its time-honored companion, the slogan 
caveat emptor) had completely dominated public 
thought and policy. Legislation directed toward con- 
trolling the more powerful economic groups was 
an inevitable, although somewhat tardy, outcome 
of industrialization, as the entrepreneur was gradu- 
ally displaced by the corporation and large-scale 
labor unions began to appear on the economic 
scene. Public recognition of the socially injurious 
nature of white-collar crime has also developed 
slowly. White-collar crimes are usually both more 
complex and diffused over a longer period of time 
than ordinary crimes; and so the essential crimi- 
nality of the acts tends to be obscured. Further- 
more, the type of publicity given white-collar 
crimes, as contrasted with the more overt crimes 
like burglary or larceny, seldom creates much pub- 
lic resentment. It is therefore difficult to create 
and sustain the kind of public pressure needed 
for the enactment of stronger legislation desig- 
nating this type of behavior, however antisocial, as 
“criminal,” 

Most statutes outlawing white-collar crime dif- 
fer from conventional criminal laws in five ways: 
origin, determination of responsibility or intent, 
philosophy, enforcement and trial procedures, and 
sanctions against violators (Newman 1958, 
p. 738). Responsibility for enforcement is delegated 
primarily to specially created agencies. The admin- 
istrative process of hearing cases closely approxi- 
mates juvenile court procedures, and the actions 
taken are more often remedial in nature—injunc- 
tions, for example, rather than fines or imprison- 


ment. In fact, there has been a tendency in the 
United States to enact rather lenient statutes and 
then to enforce them in similar fashion, with 
favoritism shown to offenders of high social status. 
Even when criminal sanctions are included, they 
have generally been used hesitatingly. Criminal 
action against corporations presents difficulties, for 
even after long litigation, often the only result is a 
fine that is modest by corporate standards or a 
sentence for an officer that is light when matched 
against the gravity of the offense. Some of the 
specific legal problems connected with white-collar 
crime have been discussed by Mannheim (1965, 
pp. 481-484), 

Criticisms of the legal concept. Some law pro- 
fessors and sociological criminologists have criti- 
cized the expansion of the concept of crime beyond 
the criminal law, stressing the criminal law’s more 
rigorous procedures, greater stability, and greater 
capacity to stigmatize the offender (Tappan 1947; 
Caldwell 1958). Such critics think that “crime” and 
“criminal” relate only to overt acts of ordinary con- 
victed offenders: “there is an obvious and basic 
incongruity involved in the proposition that a com- 
munity’s leaders and more responsible elements 
are also its criminals” (Vold 1958, p. 253). Mann- 
heim has pointed out that the basic issue involves 
the elements of conviction and stigma. He feels 
that the legalistic view that only punishable acts 
dealt with by criminal law are crimes and the view 
that punishable acts are always crimes are both 
unsatisfactory: “What is needed is an injection of 
sociological thinking into the whole administration 
of criminal justice to insure that the present in- 
congruities between stigma and conviction will be 
reduced to an unavoidable minimum” (Mannheim 
1965, p. 33). 

It is true, of course, that the criminal law is 
more integrated into the mores than civil law, since 
the latter is more recent and derives largely from 
the prevailing economic system. But the boundaries 
have also expanded; indeed, the nature of modern 
criminal law has been used as evidence in favor 
of a concept of criminality that includes almost any 
behavior showing “a lack of a developed social 
feeling and ethical code” (Hurwitz [1947] 1952, 
P- 31). Such a view certainly includes white-collar 
crime, but it neglects the crucial factor of legality, 
which is an essential part of the definition (see 
Clinard 1952, pp. 226-262; Mannheim 1965, pp- 
30-31). The concept of white-collar crime is not 
as tendentious and moralistic as has been claimed 
by some of its critics (see, for instance, Jones 
1956, p. 8). It was not Sutherland who designated 


these types of behavior as undesirable; they had 
already been so designated by laws in the United 
States and elsewhere. We must therefore conclude 
that the white-collar offender is indeed a criminal; 
what kind of criminal remains to be seen. 

Typologies of white-collar crime. The study of 
white-collar crime has been handicapped by sub- 
suming under this term a large variety of behaviors 
needing different causal explanations. To avoid 
this, it has been suggested that white-collar crimes 
be separated into three categories: those com- 
mitted by relatively autonomous professionals, such 
as lawyers or doctors; those committed by em- 
ployees against corporations, for example, embez- 
zlement; and those committed in the form of 
policy-making decisions by officers of corporations 
(Bloch & Geis 1962), Whatever scheme is adopted, 
there seems to be a need for some classification 
according to the occupational status of the offend- 
er. Geis has thought that white-collar crimes should 
be concerned with more homogeneous occupational 
groups; he therefore has restricted the concept to 
“corporate violations” (1962). 

Quinney has argued that more homogeneous oc- 
cupational units would be desirable, but that they 
have to be delineated by the researcher, since the 
construction and explanation of occupational crime 
rates is a task that has been largely neglected by 
sociologists. He therefore suggests concentrating on 
studies both of the violation of legal norms and of 
deviation from occupational norms; both types of 
deviance could then be related to occupational be- 
havior in general. Occupations change, and so do 
the types of crime and other deviance associated 
with them; thus an increase in the extent of an 
occupational practice hitherto considered merely 
deviant may evoke a law that renders it illegal. In 
distinguishing categories of crime based on occu- 
pation, one should consider the kind of occupation, 
its position in the occupational structure, the occu- 
pational role of the offender, and the nature of the 
institutional setting. The researcher should also 
acquaint himself with the effect of the law on that 
particular occupation and with the relevant norms 
and values embodied in the law (Quinney 1964). 

Theories of white-collar crime. The concept of 
white-collar crime, in the fifteen years since it be- 
came current, has had little effect on the theory 
and research of criminologists in the field of psy- 
chiatry and psychology. But a useful theory of 
crime should apply alike to the ordinary criminal 
and the white-collar criminal. Most current studies 
of ordinary criminals have largely disregarded this 
fact. On the basis of studies of offenders, chiefly 
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of the lower socioeconomic groups, various theories 
have attempted erroneously to explain criminal be- 
havior as the result of childhood emotional inse- 
curity, unrepressed primitive desires, guilt feelings, 
and the like. Similarly, attempts have been made 
to explain crime as a result of poverty, poor hous- 
ing, broken homes, and feeble-mindedness. Such 
writings have made almost no references to the 
contradictory evidence presented by white-collar 
offenders. 

Sutherland stated that white-collar crime could 
be explained within the general framework of 
criminal behavior as a process of differential as- 
sociation, wherein the behavior is learned from 
those who consider it favorable, in isolation from 
those who do not. For example, lawbreaking may 
be normative in certain business concerns, and per- 
sons who are isolated from other situations may 
learn values, rationalizations, and techniques that 
will enable them both to violate the law and to feel 
justified in so doing. Several factors isolate busi- 
nessmen from unfavorable definitions of illegal 
activity. The media of mass communication usually 
treat white-collar crime leniently as compared with 
their treatment of conventional crime. Further- 
more, government officials often shield business- 
men from severe criticism, since many of the 
officials either were formerly in business or may 
have accepted political contributions from business 
sources. In addition, businessmen associate chiefly 
with other businessmen, both at work and in their 
social activities, so that the implications of white- 
collar crime are removed from outside scrutiny 
(Sutherland 1949). 

Clinard (1952) has pointed out that most white- 
collar violations appear to originate in behavior 
learned through association with others. But this 
does not explain why some persons engage in such 
behavior and others do not. Such an explanation 
would require an examination of the roles played 
by the individual. Also, some violations may in- 
volve the independent invention of complex tech- 
niques without learning from others. 

Cressey (1953) has developed what he believes 
to be a universal explanation of trust violations by 
individuals entrusted with funds or property, al- 
though not an explanation of white-collar crime 
generally. He has identified three essential elements 
in a trust violation: a “nonshareable” financial 
problem, knowledge of how to violate, and rational- 
izations about the violations. Financial problems 
which are experienced as nonshareable include 
business reversals and important obligations in- 
volving the person's status or sense of personal 
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responsibility. Trust violators are aware that these 
problems can be resolved secretly by violating their 
positions of trust, and they have acquired knowl- 
edge of the techniques to do so. They define the 
situation through rationalizations in terms that 
enable them to regard their criminal behavior as 
essentially noncriminal. For example, they explain 
that they were “only borrowing,” that their behavior 
is part of the “general irresponsibility” for which 
they are not completely accountable, or that their 
behavior is due to unusual circumstances. Unfor- 
tunately, Cressey’s study does not describe the char- 
acteristics of trust violators and does not specify 
the situations that most often result in violations. 
Future studies may enable us to predict more ac- 
curately which persons will be violators and what 
situations are most conducive to violations. 

Conception of self. The white-collar criminal is 
set apart by his distinctive conception of himself. 
He generally regards himself as a “respectable citi- 
zen” rather than as a “criminal”; at most, he sees 
himself as a “lawbreaker.” In this sense, he is 
similar to some offenders convicted of such crimes 
as statutory rape, nonsupport, and drunken driv- 
ing. The white-collar criminal gains support for 
this image of himself from the general public, 
which, although it does not necessarily condone his 
activities, finds it hard to conceive of them as be- 
ing associated with “real” criminal behavior. More- 
over, the white-collar offender usually suffers little 
loss of status among his associates. Although some 
of them may frown upon his behavior, others may 
even admire it, especially if his transactions reveal 
considerable shrewdness. This last reaction may be 
related to a general contempt for laws affecting 
them as well as for government (especially gov- 
ernment personnel) as a whole on the part of 
large sections of an occupational group. 

Role orientations. So far, we have emphasized 
characteristics that the white-collar offender shares 
with other kinds of criminals. But there is one 
important aspect in which he differs from them: 
he may play a variety of other, noncriminal roles, 
and the degree to which he recognizes the exist- 
ence of conflict between these roles and his criminal 
role may vary considerably. 

The theory of differential association, especially 
if there has been continuous and intimate associa- 
tion with unethical and illegal norms and some 
isolation from other norms, cannot explain why 
many people never become white-collar criminals 
at all, even though they are likely to have both the 
means and the opportunity to do so, as well as the 
inducement. Nor it is enough to say that most 
people are “honest,” but some are not. The known 


variations in individual rates of deviance cannot 
be explained simply in terms of the differential 
acceptance of general social values, although this 
is of course an important fact. The best available 
empirical evidence shows that people appear to 
accept or reject opportunities for white-collar crime 
according to their orientations toward the various 
roles they play in society, especially their occu- 
pational role. 

But most criminologists interested in white-collar 
crime have been concerned more with establishing 
its existence than with studying variations in of- 
fense rates within various occupations. Quinney, 
who did study such variations among retail phar- 
macists, found that prescription violations are re- 
lated both to the structure of the occupation and 
to the differential orientation of retail pharmacists 
toward it. The retail pharmacist was found to have 
two divergent occupational role expectations—that 
of the professional and that of the businessman; 
the structural strain inherent in this situation is 
usually met by the adoption of an “occupational 
role organization” that, in both including and em- 
phasizing the “professional” aspect of the occupa- 
tion, tends to restrain the pharmacist from vio- 
lating the prescription laws. On the other hand, 
pharmacists who emphasize the “businessman” 
aspect of the occupation tend to have the highest 
violation rate, whereas those who emphasize 
neither aspect have a rate somewhere between 
those of the two other groups (Quinney 1963). 

White-collar crime and the social structure. 
White-collar crime cannot be fully understood with- 
out reference to the value conflicts presented to 
people in higher-status positions. These vary ac- 
cording to the social structure and value system 
of society. For instance, the values involved in the 
state regulation of commercial transactions in the 
United States may conflict with those of the Ameri- 
can free-enterprise system. When such conflicts 
occur, attitudes that are involved in selective obedi- 
ence to a “good” or “bad” law become the key to 
compliance. “The demand for law arises out of the 
conflicts in cultures; and because there is a conflict 
in cultures, the law is not effective as a deterrent 
upon the other groups that did not at first demand 
the law” (Sutherland [1929] 1956, p. 108). Further 
research should focus on the conditions that lead 
to a definition of behavior as criminal and to a 
definition of the way in which legal norms intersect 
and are integrated with the norms of other institu- 
tional structures (Jeffery 1956). Study of the 
values, norms, and other aspects of middle- and 
upper-class subcultures and occupations may help 
to explain white-collar crime in much the same 


way that knowledge of the subculture of the lower 
class and the culture of poverty is necessary to 
understand many conventional crimes. 

The recent concern of sociologists with white- 
collar crime brings into focus a long-neglected re- 
lationship between criminal behavior, criminal law, 
penal sanctions, and social structure (Aubert 
1952). Studies of white-collar crime in highly dif- 
ferentiated societies have focused on the ambiva- 
lent attitudes of average citizens toward the law, 
with the result that structured conflicts have been 
discovered between social roles and the larger social 
system. In this way, as Cressey (1961, p. xii) has 
emphasized, there is need for study of the entire 
society if we are to understand why white-collar 
crime is often not reported or studied. White-collar 
crime can often be viewed as violation of laws 
which are not part of the moral values of the 
groups concerned (Fuller 1942). 


MARSHALL B. CLINARD 


[See also PUNISHMENT and the biography of SUTHER- 
LAND.] 
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Iv 
HOMICIDE 


Homicide is the killing of one human being by 
another and may be criminal or noncriminal. Non- 
criminal homicide is considered to be either excus- 
able, that is, accidental or in self-defense, or justi- 
fiable, such as when a police officer shoots a felon 
or an executioner carries out a penalty of death. 
Criminal homicide, on the other hand, may be mur- 
der (first-degree or second-degree) or manslaughter 
(voluntary or involuntary). The legal distinctions 
are important because the sanction imposed on the 
offender in each case is a function of the adjudged 
degree of criminal homicide. 

Among the different state and federal jurisdic- 
tions in the United States, there is considerable 
similarity in the meaning of these degrees of homi- 
cide, partially because the Pennsylvania homicide 
statute of 1794 was used as a model. First-degree 
murder generally means premeditated killing, “by 
lying in wait” or by poison, and with malice; all 
the states but Alaska, Hawaii, Iowa, Maine, Michi- 
gan, Minnesota, New York, North Dakota, Rhode 
Island, Vermont, West Virginia, and Wisconsin 
retain the death penalty as the maximum sanction. 
In addition, by statute any death that occurs during 
the commission of one or more types of felonies, 
which vary among the states, is a first-degree 
murder. Although some states include all felonies 
under this “felony-murder rule,” most restrict it 
to rape, arson, burglary, robbery, and kidnaping. 

Second-degree murder refers to a killing without 
premeditation but with malice and in the heat of 
passion. A maximum sentence of 20 years is com- 
mon. Voluntary (or nonnegligent) manslaughter 
is a killing that occurs with the presence of intent 
to do bodily harm; and involuntary (or negligent) 
manslaughter is unintentional killing, usually dur- 
ing commission of some criminal act other than 


a felony or through negligence during an act that 
is otherwise noncriminal. 


Legal codes of other countries reflect different 
juridical situations. Mord and Totschlag have meant 
murder and manslaughter, respectively, in the 
German code, Meurtre in the French code is not 
murder but manslaughter, and premeditated homi- 
cide is assassination (assassinat), which is a 
common designation in Latin countries. The Italian 
penal code distinguishes between omicidio doloso 
(willful killing, which may be aggravated by pre- 
meditation or other circumstances); omicidio pre- 
terintenzionale (the intent is not to kill, only to do 
harm, but death results); and omicidio colposo (no 
intent, but death occurs by negligence or the like), 


Homicide statistics 


Like other crimes, criminal homicides may be 
reported by the police as the number of offenses 
known or the number of persons arrested, by the 
courts as the number of persons convicted, or by 
the prisons as the number of persons committed. 
Unlike most other crimes, offenses may be counted 
by the number of persons victimized; if two persons 
are killed in a single criminal event, the public 
authorities will generally count two criminal homi- 
cides even though there is only one offender, More- 
over, because there is a death, governments at 
almost all levels, using the international list of 
causes of death, will record the act under the title 
of homicide in the mortality statistics. 

Valid international statistical comparisons are 
difficult to make because of differences in the legal 
definition of criminal homicide and the range of 
variations encompassed by the term. Hence, all 
such references must be examined with caution 
and understanding of these variations. The Demo- 
graphic Yearbook of the United Nations reports 
homicide data for many countries; in 1960 the 
rates (per 100,000 population) ranged from 34.0 
for Colombia to 0.2 for Ireland (United Nations 
1961). Although the United States does not have 
a criminal-homicide rate as high as those of 
Mexico, Colombia, and several other nations, the 
contrast with England, which is one of the leading 
low-rate countries, is dramatic. For example, the 
city of Philadelphia (population two million) has 
about the same number of criminal homicides each 
year as all of England, Scotland, and Wales com- 
bined (population 45 million). Yet the rate per 
unit of population for Philadelphia is about the 
same as that for the United States as a whole. 

For the United States, one of the major sources 
of criminal homicide data is the Uniform Crime 
Reports, published annually by the Department of 
Justice. These are compilations of information sub- 
mitted voluntarily by police departments through- 
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out the country, and they provide interesting 
breakdowns by city size and region. The New 
England states consistently have the lowest rates; 
the southeastern states, the highest rates. In 1960 
the regional rates (per 100,000 population) varied 
from 1.5 for New England to 9.7 for the South 
Atlantic states (U.S, Federal Bureau of Investiga- 
tion 1960). The criminal homicide rate in the 
United States has been declining since 1933, when 
the rate was 7.1. Although there were some slight 
upward changes in 1945 and 1946, the rate grad- 
ually fell to 5.1 in 1960; this rate still means that 
as many as 9,136 persons met death as a result of 
murder and nonnegligent manslaughter. A break- 
down by murder and manslaughter or degrees is 
not available on a national level. In Standard 
Metropolitan Statistical Areas (county or counties 
having at least one core city of 50,000 or more 
inhabitants), the rate was 4.9 in 1960. All other 
cities had a lower rate of 3.8, and rural areas had 
a higher rate of 6.4 (U.S. Federal Bureau of In- 
vestigation 1960). 


Patterns in criminal homicide 


Research has shown that although criminal hom- 
icide is largely an unplanned act, uniformities and 
patterns exist nonetheless. There is, for example, 
a statistically significant association between crim- 
inal homicide and the race and sex of both victim 
and offender, Negroes and males involved in hom- 
icide far exceed their proportions in the general 
population of the United States, and rates for these 
two groups are many times greater than the rates 
for whites and females, In a study of criminal 
homicides in Philadelphia from 1948 to 1952, the 
rate per 100,000 by race and sex of offenders was 
in the following rank order of magnitude: Negro 
males (41.7), Negro females (9.3), white males 
(3.4), and white females (0.4) (Wolfgang 1958). 
The association between race and homicide is 
Statistically more significant than that between sex 
and homicide. Thus, the proportion (but not the 
rate) of female offenders is much higher in Eng- 
land than in the United States; it has been sug- 
gested that in those countries with high homicide 
rates the proportion of female offenders is low, 
while in countries with low homicide rates the 
Proportion of female offenders is high (Verkko 
1951, pp. 55-56). In fact, it seems likely that there 
is a greater constancy to female homicide. 

Variations by age. Among offenders the age 
group 20-24 predominates, while the highest rate 
for victims is in the age group 25-34. Victims are 
generally older than their offenders; the median 
age of the former, in the Philadelphia study, was 
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35.1 years and of the latter, 31.9 years. The race 
differential by age is particularly striking; the low- 
est five-year age-specific rates for Negro males and 
females are similar to, or higher than, the highest 
five-year age-specific rates for white males and 
females, respectively. Although males of both races 
more frequently commit criminal homicide during 
their twenties than during any other period of life, 
Negro males in their early sixties kill as frequently 
as do white males in their early twenties. Sociocul- 
tural and psychological evidence appears to offer the 
best explanations for these age—race differentials. 

Variations by method. Methods of inflicting 
death vary, but about one-third of criminal homi- 
cides in the United States are shootings, one-third 
are stabbings, and the remainder are mostly beat- 
ings, with a small percentage involving miscella- 
neous methods, such as poisoning. There appears 
to be some cultural preference for particular 
weapons: males, if Negro, usually stab and are 
stabbed, if white, beat and are beaten; females 
generally stab their victims with a butcher knife, 
but are themselves often beaten to death. 

Temporal variations. Although homicides tend 
to increase during the hot summer months, there 
appears to be no significant association by seasons 
or months of the year (Brearley 1932, pp. 189-190; 
Wolfgang 1958, pp. 98-100). But homicide is sig- 
nificantly associated with days of the week and 
hours of the day. The weekend in general and 
Saturday night in particular are related to homi- 
cide, as are the hours between 8 P.M. and 2 A.M. 
In one study (Wolfgang 1958, pp. 106-110) it 
was pointed out that about two-thirds of all homi- 
cides occur during the weekend, as compared to 
one-third from Monday to Friday. 

The social significance of the weekend period as 
a time of increased social intercourse between per- 
sons of similar groups is even more strikingly re- 
vealed by data on the use of alcoholic beverages 
by both victims and offenders. Either the victim or 
offender or both had been drinking immediately 
prior to the slaying in nearly two-thirds of the 588 
cases in the Philadelphia study (Wolfgang & 
Strohm 1956); other studies have shown slightly 
different proportions. Caution must be exercised 
in evaluating the presence of alcohol in homicides, 
because drinking—particularly on Saturdays, the 
day of highest incidence of homicide—is an inte- 
gral part of the mores of the lowest socioeconomic 
group, whose members are most likely to be in- 
volved in this crime. The consumption of alcohol 
is involved in a significantly higher proportion of 
weekend homicides than of homicides occurring 
during the remainder of the week, 
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Victim and offender. Criminal homicide usu- 
ally results from a domestic quarrel, jealousy, ar- 
gument over money, robbery, or some more vaguely 
defined altercation. These are the “motives” that 
are commonly recorded by the police and that have 
been mentioned with only slightly varying terms 
and classifications in many studies over time and 
space (Gillin 1946; Hentig 1948). Most victim— 
offender relationships may be classified as “primary- 
group” relations, or those that include intimate, 
close, frequent contacts. Close friends and relatives 
alone usually account for more than half of the 
known relationships that involve male offenders, 
and much more than half of those that involve 
female offenders. Mate slayings are common and 
may constitute as much as one-fifth of the cases 
in some areas. Wives killed by their husbands make 
up nearly half of all female victims, but husbands 
slain by their wives rarely constitute more than 
10 per cent of all male victims. When a woman 
commits homicide she is more likely than is a man 
to kill her mate; and when a man is killed by a 
woman, he is most likely to be killed by his wife. 
With rare exceptions, criminal homicide is an 
intragroup phenomenon; where there is racial 
heterogeneity, as in the United States, victims and 
offenders are of the same race in more than 90 per 
cent of the cases. 

Previous criminal records. Many studies report 
that participants in homicide do not generally have 
previous criminal records (Brearley 1932, pp. 85- 
86; Gillin 1946, p. 551). They have also proved to 
have good disciplinary records in prison and are 
among the best risks for parole. However, the defi- 
nition of “previous record” is often unclear in 
reported studies and sometimes refers to prior 
police or arrest record, other times to prior con- 
viction or prior institutional commitment. In con- 
trast to earlier research, the Philadelphia study 
revealed that nearly two-thirds of the offenders 
and almost half of the victims had a previous 
arrest record, and that more male victims than 
female offenders had such a record. Moreover, 
when an offender had a previous record, it was 
more likely to be one of offenses against the person 
than against property; and when he had a record 
of offenses against the person, he was more likely 
than not to have a record of having committed a 
serious assaultive offense. 

Rates of clearance and conviction. Among seri- 
ous offenses, homicide has the highest rate of 
“cleared by arrest,” that is, the taking into custody 
by the police of one or more suspects held for 
prosecution. In the United States, among the seven 
offenses used by the Uniform Crime Reports to 


construct a crime index, the over-all “clearance 
rate” is usually less than 30 per cent, but that for 
homicide is consistently about 90 per cent. The 
intragroup, unplanned, primary-group character of 
the offense partially explains this high rate. How- 
ever, the rate of conviction among persons charged 
with homicide is relatively low, often less than 50 
per cent. The seriousness of the charge, the sever- 
ity of the penalty, the greater use of a jury trial, 
the greater strain placed on the meaning of “rea- 
sonable doubt,” and the greater probability that 
cases of homicide will be disposed of by ways other 
than a conviction all contribute to a low conviction 
rate. Those who commit suicide (usually directly 
after homicide) and those who plead insanity may 
together constitute as much as 8 to 10 per cent of 
offenders known to the police. Homicide—suicide 
is relatively uncommon in the United States, how- 
ever, for studies show that only about 2 to 4 per 
cent of offenders kill themselves after killing others. 
In England homicide-suicide has reached very 
high proportions: as much as half the homicide 
cases in 1939 and, not uncommonly, as much as 
one-fourth to one-third each year (Griinhut 1952). 
About the same proportions appear to hold for both 
the United States and England in regard to homi- 
cide offenders declared to be insane. 


Theories about criminal homicide 


There are more studies using descriptive data 
and case reports of homicide than there are accept- 
able theories. Unfortunately, there is little inter- 
disciplinary theory or research, and most “explana- 
tions” of homicide are limited to the biological, 
psychological, or social aspects of the problem. 

The published anthropometrical and medical 
studies are few and contribute little to explanation. 
It is generally asserted (Buss 1961) that the causa- 
tive chain of aggression is traced back to stimuli 
that are external to the organism and that, although 
there may be individual differences in the reactivity 
to external stimuli evoking aggression, these inner 
characteristics do not by themselves explain ag- 
gressive, particularly homicidal, behavior. 

Psychometric studies. With few exceptions, psy- 
chometric studies of homicide have not emerged 
from testable hypotheses; furthermore, they have 
failed to distinguish different types of homicides 
and have neglected the use of control groups and 
adequate statistical analysis. Most studies on the 
differential psychology of homicide have been ex- 
ploratory and tentative. The intelligence of homi- 
cide offenders in different countries has generally 
been noted to be lower than that of property of- 
fenders (Berg & Fox 1947; Lazzari et al. 1958). 


However, the differences are not very discrimina- 
tive, and the intragroup differences are greater 
than the differences between various criminal 
typologies. 

Personality tests have been used to some descrip- 
tive advantage, indicating, for example, that with 
the Thematic Apperception Test, “heroes” of an 
antisocial type and themes of rebellion and impul- 
sivity occur with high frequency (Paolella 1960). 
In a careful study of an aggressive-content scale 
applied to the TAT, Stone (1956) found that the 
scale was able to correctly identify murder cases. 
Most psychometric studies have used the Rorschach 
test; leading traits of homicide offenders as ob- 
tained from Rorschach protocols appear to be 
introversion, impulsivity, and explosive emotions. 
In summarizing several Rorschach studies of homi- 
cide, Endara (1960) stated that one of the most 
important findings is the fact that those who com- 
mit homicide produce about half the number of 
“human-content” responses as compared to other 
criminals, which indicates a lack of empathy and 
the presence of hostility and rebelliousness against 
authority. However, as Buss (1961) claimed in his 
review of studies assessing aggression through 
projective techniques and questionnaires, the tests 
appear to measure behavioral, not latent, aggres- 
sion. Moreover, as Schuessler and Cressey (1950) 
earlier had stated in their criticism of studies on 
the personality characteristics of criminals, the 
studies generally apply a personality test without 
reference to any hypothesis about the relation of 
personality elements and criminal behavior. 

Frustration-aggression theory. The frustration- 
aggression theory (Dollard et al. 1939) was readily 
accepted by many sociologists and psychologists 
as a useful theoretical framework, but few psychol- 
ogists today would assert that the presence of frus- 
tration inevitably leads to aggression or specifically 
to homicide. In a review of the literature on the 
social psychology of aggression (McNeil 1959) it 
was concluded that scientific research on this theory 
is difficult because of the nearly inherent problems 
of circular logic. However, Leonard Berkowitz 
(1962), using the theory, has been doing some 
very interesting experimental research on violence 
in general, and Lewis Coser (1962) in a theoretical 
paper demonstrated that violence can be viewed 
from a sociological perspective. 

Henry and Short (1954) and, to a more limited 
extent, Palmer (1960) have used elements of the 
frustration thesis in specific analyses of homicide. 
Henry and Short assume that in American society 
such lower status groups as Negroes, females, and 
the elderly are socially deprived of upward status 
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mobility opportunities; hence frustration will lead 
them to aggression. The hypothesis, they contend, 
is supported by data relative to Negroes but not to 
all females. Suicide has higher incidence among 
the aged than does homicide. They further argue 
that the degree of internal restraints is greater for 
members of the upper social classes, who commit 
suicide more frequently than members of the lower 
social classes. External restraints operate more 
among lower social classes, for whom other-oriented 
aggression is legitimized and manifested frequently 
as homicide. There is some theoretical connection 
in these terms to Reckless’ containment theory 
(1950) of criminal and noncriminal behavior. 
Palmer's study (1960) defines frustration in such 
a general way that items like epilepsy, severe 
measles, head trauma, are taken as “frustrating” 
factors, with no concern for the aggressive person- 
ality deformations that they can cause by them- 
selves, exclusive of any “frustration” hypothesis. 
Moreover, Palmer’s study lacks the more system- 
atic logic and analytical consistency of the study 
by Henry and Short and is methodologically inade- 
quate in many ways. 

In sum, frustration, it is generally agreed, is an 
inevitable and necessary part of the socialization 
process. The frustration—aggression thesis is chal- 
lenging and should not be abandoned, but its pres- 
ent value for research purposes in homicide studies 
appears to be limited (Buss 1961; McNeil 1959). 

Homicide and suicide. Much the same may be 
said of the concept of anomie, although Bohannan 
(1960) has provided a taxonomy of homicide in 
primitive societies that follows the “anomic,” “ego- 
istic,” “altruistic” classification earlier proposed for 
suicide by Durkheim (1897). It should be noted 
that theory and research on suicide and homicide 
often have considered these two phenomena to be 
related—either as complementary phenomena, as 
similar manifestations of the same stream of ag- 
gression, or as the antithesis of one another (Henry 
& Short 1954; Verkko 1951; Hentig 1948, chap- 
ter 12; Wolfgang 1959). 


The subculture of violence 


A sociopsychological theory of a subculture of 
violence has been suggested as an explanation for 
homicides, most of which are crimes of passion or 
violent slayings that are not premeditated or psy- 
chotic manifestations (Wolfgang 1958, pp. 188— 
189; Wolfgang & Ferracuti 1962). The empirical 
data have shown that homicide rather consistently 
occurs with highest frequency among certain social 
groups and that in these groups there is close con- 
tact between offender and victim. The group mem- 
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bers’ characteristics and expected responses to 
certain stimuli, their perceptual differences in the 
evaluation of the stimuli, and the limited impor- 
tance they place upon human life in the scale of 
values appear to be conducive to the use of violence 
as a principal way to solve everyday problems. If 
there exists among these certain social class—sex— 
age groups a subculture of violence, it is further 
suggested that the greater the degree of integration 
of the individual into this subculture, the higher 
the probability that his behavior will often be vio- 
lent; or, that there is a direct relationship between 
rates of homicide and the degree of integration into 
the subculture of violence to which the individual 
belongs. 

This thesis has been used to view the high rates 
of homicide among nonwhites in American society. 
As a group nonwhites have been segregated both 
physically and socially and thus constitute to some 
extent a “subcultural” area. This subgroup is char- 
acterized by poor housing, high population density, 
overcrowded home conditions, and often by a sys- 
tem of values that condones violence and physical 
aggression, all the way from child-rearing processes 
to adult interpersonal relationships, which some- 
times end in criminal slayings. To a lesser degree, 
whites in the lower socioeconomic class also are 
part of a subculture of violence and participate in 
criminal homicide. By the dispersal of this cultur- 
ally isolated group and the integration of its mem- 
bers into the general community of morality and 
values, there may occur some reduction of the vio- 
lence that results in homicide. 

Finally, this theory contains working hypotheses 
that can be tested by psychological tools. The fact 
that an individual belongs to a deviant subculture, 
which is reflected in commitment to a set of values 
that is characterized by the ready use of violence, 
will cause him to adopt a differential perception of 
the environment and of its stimuli. In this subcul- 
ture of violence, the continuous challenges and 
daily frustrations that are faced and solved by the 
adaptive mechanisms of the individual have a 
greater chance of being perceived and reacted upon 
as menacing, aggressive stimuli, which call for im- 
mediate defense and counteraggression. Such a 
hypothesis lends itself to objective study through 
appropriate psychological methodologies. 

A review of homicide studies tends to reaffirm 
the contention of many criminologists that mean- 
ingful and significant contributions to an under- 
standing of the phenomenon can occur only through 
parsimonious, integrated theory that generates hy- 
potheses to be tested by the collection of data from 
allied disciplines. 

Marvin E. WOLFGANG 


[See also AGGRESSION, article on PSYCHOLOGICAL AS- 
PECTS; FEUD; SUICIDE.] 
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CRIMINAL LAW 


Criminal law, in the substantive sense, is a body 
of norms, formally promulgated through specified 
governmental organs, contravention of which war- 
rants the imposition of punishment through a spe- 
cial proceeding maintained in the name of the 
people or the state. Criminal procedure is formal- 
ized official activity that authenticates the fact of 
commission of a crime and authorizes punitive 
treatment of the offender. These are at best core 
definitions; they do not comprehend all legal sys- 
tems, all stages in the development of a legal 
system, or all elements within a given legal system. 


Substantive criminal law 

Substantive criminal law embodies both specific 
prohibitions and the general principles within 
which those prohibitions are construed. From the 
Specific prohibitions one may glean the particular 
problems about which the society that produced 
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them is concerned or at least has been concerned 
in the past. From the penalty provisions one can 
ascertain whether the community believes that 
punishments should be meted out in exact retribu- 
tion for harm done or believes that flexible periods 
of segregation suit its needs best. From the state- 
ment of general principles in a penal code one can 
learn the factors or circumstances that the law- 
makers view as significant in deciding whether 
punishment shall be mitigated or eliminated en- 
tirely in an individual case. 

Moreover, how one views the context of a penal 
code depends on his view of law in general. Typi- 
cally, legislators and judges appear to believe that 
legal norms are self-executing, so that the important 
thing is enactment of a law on any subject about 
which there is current concern. To make certain 
that citizens listen to it, the legislature usually 
attaches relatively heavy penalties to its enact- 
ments. The fact of enactment is usually equated 
with actual control. 

At the other end of the philosophical spectrum, 
however, stand those who maintain that legal 
norms, including those embodied in the criminal 
law, are valid only to the degree that they express 
the generally accepted values in the community 
and thus are enforceable only for whatever period 
of time the community's attitudes toward these 
values remain basically unchanged. In support of 
this approach, one need only cite the mass of out- 
moded sections or statutes that are typically part 
of a state’s legislation. The traditionalist will insist 
that statutes like these are still law, while the func- 
tionalist will say that they are not law at all be- 
cause they are never invoked. 

Furthermore, when law is viewed as a pragmatic 
effort to cope with contemporary problems, the 
primary problem is less that of identifying what 
must be controlled than that of ascertaining the 
most effective means of achieving control over that 
minority in the group which does not conform 
through group pressures or inculcation of group 
standards. Therefore, according to the functional 
approach, controls in the nature of license revoca- 
tion, administrative penalties, and, in some in- 
stances, preventive detention are usually far more 
efficient means of control than are criminal penal- 
ties. There is implicit, if not explicit, in this view 
a repudiation of the classical deterrent theory, ac- 
cording to which the threat of future harshness 
administered through legal agencies outweighs the 
immediate sense of pleasure or benefit that the 
actor contemplates as a result of his act. In short, 
in a society-oriented framework of criminal law, 
one identifies first the interest or value that the 
group wishes to promote, second the most efficient 
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means of effecting promotion of the value, and 
last the most effective methods of either rehabili- 
tating the offender or segregating him indefinitely 
from the community. 

Criminal codes are generally organized accord- 
ing to one of two patterns. One is an alphabetical 
listing of offenses. This has the advantage of any 
good index, that is, ease of access from a known 
starting point; but it places definitions in strange 
contexts. The other, and more common, arrange- 
ment involves a division into general principles 
and specific problems, under the established head- 
ings “The General Part,” “The Special Part,” and a 
further categorization of specific crimes according 
to the type of harm sought to be prevented. Each 
crime, therefore, appears in an identifiable context 
that facilitates its application to concrete problems. 
Though in form the general part precedes the 
special part, a layman often finds it less difficult to 
comprehend criminal law concepts if he first con- 
siders concrete problems; for that reason we will 
start with the special part. 

The special part. One primary aim of the 
special part is to protect the existence of govern- 
ment itself and its ordinary operations. Laws evi- 
dencing this aim include those against treason, 
sedition, counterfeiting, theft and intentional de- 
struction of government property, obstruction of 
public officials in the exercise of their office, 
bribery, and official oppression or coercion. In addi- 
tion, prohibitions against perjury safeguard the in- 
vestigative functions of government. 

A second major category of crimes includes 
crimes against public order, health, and morals: 
unlawful assembly, riot, interference with lawful 
group meetings, vagrancy, public intoxication, so- 
licitation to sexual acts, prostitution, and traffic in 
obscene materials. Public health considerations 
underlie prohibitions against prostitution and traf- 
fic in narcotics and, to a degree, support regulation 
of traffic in alcoholic beverages and tobacco prod- 
ucts. The latter two commodities, however, have 
such revenue potential that it is usually only the 
traffic in which the state has no financial interest 
that is outlawed. 

A third very important grouping comprises 
crimes against the person, including intentional 
homicide or murder, manslaughter, assaults of 
various kinds, and rape; the last often includes 
intercourse obtained through fraud or misrepre- 
sentation of identity, as well as sexual activity with 
certain classes of women who are viewed as in- 
capable of consent, such as mentally ill persons or 
girls below a certain age. 

A fourth category is that of offenses against 


property. It includes theft or larceny, burglary or 
other criminal intrusion into premises, arson, ma- 
licious destruction of property, robbery, extortion, 
and forgery. In some of these offenses (e.g., arson, 
burglary, and robbery) protection of life and limb 
looms as large as or larger than protection of prop- 
erty, while in others (e.g., theft) only property 
interests are protected. 

A fifth category of offenses against family rela- 
tionships is common to all codes: bigamy and 
polygamy, incest, extramarital and nonmarital in- 
tercourse, homosexuality, abortion, infanticide, and 
child abuse and nonsupport. 

A final grouping comprises crimes relating to 
businesses and occupations, including public health 
controls. There are few occupations that are not 
subjected to controls through criminal law penal- 
ties, at least in form. 

The general part. The general part provides 
the framework into which the particular offense 
must fit. In some applications it indicates when 
the court is competent to act or has jurisdiction. It 
states whether the legislature has intended citi- 
zens or aliens to be penalized for acts done outside 
the geographical limits of the state in which trial 
is held (the forum state). It decrees when the state 
has lost the power to prosecute because of certain 
limitations: that is, because the period of prescrip- 
tion or the statute of limitations has run, or be- 
cause a prior prosecution by the same state, or 
perhaps by another state, requires application of 
either the double-jeopardy concept (in Anglo- 
American law) or the principle of non bis in idem 
or that of non bis poena in idem (in the civil law). 

Retroactive punishment prohibited. The gen- 
eral part, in civil-law codes at least, also requires 
that all crimes be stated in law before penalties 
may be imposed (the principle of nullum crimen, 
nulla poena sine lege). Anglo-American statutes 
sometimes state the same principle, but equivalent 
control is usually achieved indirectly through the 
constitutional requirement that criminal statutes 
not be vague and indefinite, so that a man may be 
apprised of the legality of projected activity before 
he engages in it. The common-law tradition of 
judge-created criminal law, however, transgresses 
this principle. An expansion of criminal law by 
judicial decision is retroactive in the case in which 
the decision is given. What is forbidden to the 
legislature is thus assumed proper for a court. Per- 
haps all one can say in justification of this is that 
the extension of doctrine is usually so gradual that 
there may be very little hardship in most cases and 
that in fact the same process occurs in civil law, 
in which the broad terms of a statute are often 


expanded by judicial interpretation. The concep- 
tual difference, however, is still appreciable. What 
is common to both systems is the difficulty of de- 
veloping a meaningful application and adaptation 
of succinct formulations of law to the amorphous 
context of the society within which legal rules are 
to be utilized. 

Elements of an offense. The general part de- 
fines the structure of a crime: it requires that there 
be an act, including an omission or failure to act— 
accompanied by a certain subjective state of mind 
(intent, or mens rea)—by a person, including 
legal entities like corporations or associations, that 
causes a specified harm or injury for which penal- 
ties may be imposed through prosecution. This 
provides a guide to interpretation of each crime in 
the special part. 

Intent element of crimes. The sections on in- 
tent, or mens rea, embody the greatest verbal dif- 
ferences between the Anglo-American and the 
Roman-derived systems of law, Both systems pun- 
ish acts done purposely to inflict any injury that 
the penal statute is designed to prevent. The act 
may warrant punishment if the actor knows there 
is a substantial likelihood that the harm or injury 
delineated in the statute will be inflicted if he acts; 
this principle is embodied in the dolus eventualis 
concept of the civil law and the “reckless disregard 
of known consequences” of Anglo-American law. 
But the systems diverge in the matter of whether 
(a) “negligence” can be the basis for exacting 
criminal penalties or (b) “strict” or “absolute” lia- 
bility can be imposed for the doing of an act, de- 
spite the actor's unawareness of its dangerousness 
or undesirability. Anglo-American common law 
has long utilized negligence as a basis of criminal 
liability in manslaughter and certain other offenses 
against the person, but it has not used it in prop- 
erty crimes, even serious crimes like arson. Legis- 
latures, however, have increasingly included negli- 
gence as a basis for imposing punishment for 
property crimes and offenses involving government 
activities. Moreover, “strict liability” is a common 
ingredient of crimes regulating the economy. An 
entrepreneur, for example, may be liable to crimi- 
nal penalties if the commodity that he produces 
does not meet statutory requirements, though he 
may in good faith have believed that it does. In 
contrast, most civil-law commentators deny that 
Strict liability is either possible under their law or 
compatible with criminal-law theory, penalizing as 
it does ignorance and not purpose. 

Is the civil-law position preferable? As a matter 
of abstract criminal-law theory, the careless person 
is certainly not as “evil” as one who either intends 
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harm or is willing to ignore the almost certain con- 
sequences of his activity, that is, consequences of 
which he is fully aware. Criminal penalties are 
crude instruments to promote knowledge or cau- 
tion. If, however, enforcement is stressed, strict 
liability may be necessary in some cases. In most 
serious offenses, purpose may be inferred from 
activity. For example, if a man stabs another, he 
usually does so with intent to kill or severely in- 
jure. But one who sells a commodity may not know 
that it is adulterated or mislabeled; common ex- 
perience may not equate act with intent. To require 
the state to prove “intent” in the first instance may 
not significantly impair the enforceability of the 
statute, because the act usually purports intent. A 
similar requirement, however, may render the sec- 
ond statute totally unenforceable. If a standard of 
“strict liability’ is imposed, by which knowledge 
of impropriety is rendered superfluous, prosecu- 
tions are easy; this position assumes that most 
persons in fact know what they are doing, and 
therefore only a few “innocent” criminals are num- 
bered among the many “guilty.” 

Nevertheless, to give the “innocent” an oppor- 
tunity to explain, should the legislature, as an 
alternative, make the doing of an act “prima-facie 
evidence” of the required intent, and permit the 
defendant to persuade the tribunal that he did not 
in fact have the mens rea (guilty mind) which 
merits imposition of criminal penalties? Certainly 
many American statutes embody this compromise 
between the ideals of pure legal theory and the 
realities of law enforcement. 

Furthermore, many civil-law scholars in effect 
evade the issue by equating “penal law” with the 
“penal code,” thus completely ignoring a broad 
array of penal legislation (outside the penal code) 
that imposes strict liability or a separate code of 
“economic offenses” that parallels Anglo-American 
strict liability legislation. Though comparison of the 
two systems, civil and common-law, is rendered 
difficult because Anglo-American law embodies 
the jury system and the complicated doctrines 
of evidence law, including presumptions, which it 
has engendered, and the civil law does not, there 
is still a decided similarity in penal economic legis- 
lation in legal systems the world over, whatever 
the theoretician may say. 

Legal significance of mistake. The general part 
also lists special states of mind that affect criminal 
responsibility. One is ignorance or mistake, of 
either fact or law. Mistake of fact, which means 
nonknowledge of a factor or circumstance known 
to others, affects the mental element of the crime, 
not the objective elements. If the actor's error sug- 
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gests lack of purpose to inflict harm, but he is 
punished anyway, he is punished for his ignorance 
only, But this can hardly be distinguished from 
negligence, which embodies a failure to know what 
the “reasonable man” would know. Therefore, if 
only “reasonable” mistakes of fact are taken into 
account as extenuating, negligence extends to all 
crimes. Civil-law codes usually recognize this by 
saying that any “honest,” that is, nonperjured, 
claim of mistake is to be considered in its impact 
on the required state of mind, unless the crime is 
based on negligence, in which case only a “reason- 
able” mistake excuses. Anglo-American law is 
somewhat more ambivalent: some judicial inter- 
pretations require that all mistakes be “reasonable,” 
but others recognize “good faith” claims of mistake. 

Mistake of law involves either ignorance of the 
existence of the statute under which the prosecu- 
tion is maintained or ignorance of a collateral 
doctrine of law affecting, for example, property, 
contracts, or marriage. Few systems permit an 
absolute defense based on ignorance of the exist- 
ence of the criminal law under which the defend- 
ant is prosecuted, but civil-law codes commonly 
authorize mitigation of penalty. Collateral mistake 
of law that negates a required specific intent war- 
rants acquittal, if not under a general provision to 
that effect, then because the trier cannot find that 
intent to have existed as a fact, 

Mental conditions negating criminality. The 
other most important states of mind affecting 
criminal responsibility are unconsciousness, in- 
sanity, and intoxication. Civil-law codes usually 
state only that if a person is either unconscious or 
irresponsible because of insanity, mental illness, 
mental deficiency, and the like when he commits 
a criminal act, he is not to be punished. Expert 
data are received as a matter of course, but courts 
do not generally explain in detail what underlies 
their use of code labels of unconsciousness, mental 
illness, or mental deficiency. 

Anglo-American law, however, has had to formu- 
late legal tests for insanity, primarily because the 
jury must be formally instructed as to what “in- 
sanity” is. That term, it must be noted, is a legal, 
not a medical, term. The prevailing test is the 
so-called M’Naghten Rules, under which a man is 
deemed insane if he does not know that he is per- 
forming an act or, assuming he has that awareness, 
if he does not know that the act is “wrong,” which 
traditionally means legally rather than morally 
wrong. The original formulation of this test also 
included a third alternative—‘insane delusion” — 
according to which a defendant is excused if the 
facts as he deludedly believed them to be would have 


justified an acquittal; it might be called an “insane 
mistake of fact” test. This test lingers on in a 
handful of American states. Many states have 
added the alternative of “irresistible impulse”: 
that is, if the defendant was uncontrollably im- 
pelled by his mental condition to do the act with 
which he is charged, he is to be viewed as non- 
accountable. 

These tests have been violently criticized by be- 
havioral scientists as scientifically outmoded and 
as embodying value judgments that a psychiatrist 
or psychologist cannot be called upon to make in 
his professional capacity. It is indisputable that 
expert testimony is distorted, in the typical case, 
because of insistence that the expert testify in con- 
clusory terms. The emphasis on “right—-wrong” and 
“knowledge-nonknowledge” has caused medical 
opinion and lay opinion to be used indiscriminately, 
which in turn has prevented the development of a 
scientifically sound concept of legal insanity. Ac- 
cordingly, several new tests have been formulated, 
though as yet they have not been widely adopted. 
They stress mental defect or disorder, beyond the 
actor’s control, that causes the act on which the 
criminal charge is based. All demand qualified ex- 
pert data and exclude lay opinion. 

The value of these modern tests does not rest on 
the legal terms in which they are couched but on 
the freedom which they offer the expert to describe 
the defendant in scientific terms. A change in 
terminology is useful chiefly because it requires 
judges and lawyers to re-evaluate what they do. 
Statutory changes may, therefore, be in order in 
civil-law countries; however, there the problem in 
regard to the concept of legal insanity arises from 
legislative understatement rather than from out- 
moded overstatement, as in common-law countries. 

Intoxication and criminal responsibility. Intoxi- 
cation and the broader problem that it presents, 
namely addiction, are ambivalently dealt with in 
both civil-law and common-law systems. Whenever 
the defendant is identified only circumstantially, 
evidence of intoxication is always relevant to prove 
that he was not physically able to commit the 
crime with which he is charged. However, intoxi- 
cation is relevant as a matter of formal law only 
when the defendant is admittedly the actor and 
the issue is that of his intent. Logically, if the de- 
fendant is unable because of intoxication or addic- 
tion to entertain the required state of mind, he 
ought not to be punished. His addiction as such 
would be more appropriately controlled through 
Protective custodial treatment than through pun- 
ishment. But most codes view intoxication and 
addiction as, if anything, aggravating factors; and 


thus they either require increased punishment for 
intoxicated persons or demand that the judge ex- 
clude from consideration the actual impact of in- 
toxication on the defendant's state of mind and 
decide the case on the assumption that the defend- 
ant was in full control of his acts. 

Defenses to criminality. The general part also 
takes account of external circumstances that affect 
criminality, including duress, consent of the victim, 
defense of self or others, defense or protection of 
property, the needs of law enforcement, and other 
extraordinary circumstances, such as famine or 
extreme peril to life, that can be avoided only by 
acts against the life or property of others. These 
concepts most commonly apply to homicide and 
assault, though some defenses based on necessity 
affect property crimes. In effect, the actor's motive 
for forming an otherwise prohibited intent or pur- 
pose serves to eliminate or mitigate the criminal 
punishment provided for the act and intent in the 
special part. Common-law courts recognize only 
certain stereotypes of necessity, like self-defense. 
Codes, however, may include a more general clause 
on necessity based on a “choice of evils,” so that 
the harshness of the standards in the special part 
may be tempered in unusual cases. 

Punishable attempts. Yet another function of 
the general part is to identify preparatory conduct 
which, though it does not eventuate in the ultimate 
harm prohibited by one or more criminal pro- 
visions, comes close enough to warrant punish- 
ment. This is called “attempt.” Though efforts are 
made to develop legal standards to control its ap- 
plication, the concept of attempt is simply a device 
for extending the coverage of a criminal statute 
beyond its literal scope. It may be applied if the 
actor has done enough to evidence his criminal 
purpose, if his attitude and purpose suggest that 
he is likely to try again in the future with perhaps 
greater success, and if the harm which he sought 
to inflict is of appreciable gravity. The decision is 
basically one for the individual case. Conspiracy 
is also penalized as one form of preparatory ac- 
tivity, that of agreement to commit a crime. 

Punishment of accomplices. The general part 
authorizes imposition of punishment on others 
than the actor himself, by imposing vicarious re- 
sponsibility for the acts of the principal on those 
who have induced him to act, conspired with him 
in the conception of the act, or aided or encour- 
aged him in its commission. Some codes reach 
backward into a criminal transaction to punish un- 
successful solicitation to commit a crime, and some 
reach forward in time to punish those who, as ac- 
cessories after the fact, harbor or aid the criminal 
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actor after the specific crime has been committed. 
The latter, however, is a special crime of obstruct- 
ing the administration of criminal justice. 

Disposition of offenders. Last, the general part 
grades offenses according to seriousness and cre- 
ates standards by which sentence is determined in 
the individual case. Some civil-law codes include 
in this portion of the general part the so-called pro- 
tective or security measures, by which a defendant 
is channeled into special institutions for mentally 
ill persons, alcoholics, or youthful offenders. Re- 
cidivism may also call for special forms of in- 
carceration as well as increased maximum and 
minimum terms of imprisonment. 


Criminal procedure 


It is difficult to equate and compare civil-law 
and common-law proceedings on the basis of the 
legal terms that each embodies. But there are cer- 
tain stages and functions common to both, even 
though the details of procedure differ. 

Investigation. The first function of criminal 
procedure is investigation. Civil-law codes of crimi- 
nal procedure delineate fairly precisely the relative 
roles of police and prosecutor in investigations of 
crime. The primary responsibility to investigate is 
with the police, but the public prosecutor has power 
to supervise and intervene. The police have only a 
limited power to arrest; advance judicial authoriza- 
tion is required unless the offender is caught in the 
act or unless there are special circumstances that 
make it difficult or impossible to obtain judicial per- 
mission to arrest beforehand. Special statutes or 
regulations may permit “preventive detention,” but 
they are usually comprehended as one aspect of 
administrative law. 

Civil-law codes strictly limit the period for which 
an arrested person can be held in police custody. 
A judicial order of detention is required after a 
few hours; this order is valid for only a brief 
period while a formal charge is prepared. After the 
filing of the formal charge, a court through a more 
formal proceeding may authorize continued deten- 
tion pending trial. Bail or some other form of pro- 
visional release may be proper after the suspect or 
accused has been detained on judicial authority, 
but this tends to be limited to the period after 
formal accusation and then is at the discretion of 
the court. 

In civil-law codes judicial control extends not 
only to arrest and detention but also to acquisition 
of evidence. Police and public prosecutors have 
limited power to search for and seize contraband 
and evidentiary material, but for the most part they 
must seek a judicial warrant before entering build- 
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ings or seizing matter in the hands of third parties. 
In some systems a special investigating judge takes 
the initiative of ferreting out evidence, while in 
other systems it is incumbent on the public prose- 
cutor or command police officers to apply for au- 
thorization to search, specifying what is sought. 
Acquisition of evidence includes formal question- 
ing of witnesses, whose statements under oath are 
preserved for later use. In some civil-law codes the 
defense may also ask that evidence be preserved 
for its benefit, but in others the defense may only 
inspect whatever the state has taken the initiative 
in gathering. 

The Anglo-American system places much great- 
er responsibility on the police and makes rela- 
tively little use of warrants. Warrants are available 
for arrest, search, or seizure, but there is no duty 
to apply for them. Police are authorized to arrest 
without warrant if they “reasonably believe” that 
a felony has been committed by the arrestee or if 
any offense is committed in their presence. If a 
valid arrest has been made, the arrested person and 
his immediate surroundings may be searched and 
evidentiary matter seized. If an arrest is made, the 
police must produce the arrestee before a magis- 
trate within a short period of time; delays for the 
purpose of interrogation are usually viewed as con- 
travening this requirement. The magistrate may 
then commit the arrested person, pending a pre- 
liminary hearing to determine if there is some 
basis for the charge levied against him. In most 
American jurisdictions bail must be set in all but 
very serious cases. Anglo-American law has done 
very little to develop a systematic means of im- 
pounding evidentiary material under judicial super- 
vision. The preliminary examination or grand jury 
proceeding may be used for this purpose by the 
State, but it is not often useful to the defense. Dis- 
covery, by which one or both parties are required 
by court order to make evidence in its hands ac- 
cessible to the opponent before trial, is in a rudi- 
mentary stage but may in time serve to permit 
acquisition and preservation of evidence for later 
trial use by either the state or the defense. 

Preparation of the formal charge. The second 
function of criminal procedure is preparation of 
the formal charge. Here, too, one must identify 
where discretion to charge lies, for not all arrested 
or suspected persons are brought to trial. In some 
systems the police have sole control, and in all 
systems they in fact dispose of many cases with- 
out trial or other formal proceedings. Private citi- 
zens control some prosecutions; if, for example, 
prompt complaint of rape must be made by the 
victim, or if only the offended spouse can complain 


of adultery, nonaccusation by the citizen for whose 
benefit the criminal statute exists bars all proceed- 
ings. In most countries the public prosecutor has 
discretion whether to press charges, though in 
some instances he may have to obtain judicial con- 
currence. In those parts of the Anglo-American 
world where the grand jury survives, that body 
exercises substantial discretion to refuse prosecu- 
tion. In some civil-law countries the police and 
public prosecutor are required to refer the material 
they have gathered, together with the draft of a 
pleading, to a judge or court, which then deter- 
mines whether the trial shall occur, In the last- 
mentioned system, a person moves from “suspect” 
status to “accused” status to “defendant” status; 
where discretion is vested in police or public prose- 
cutor, only “suspect” and “defendant” statuses are 
formally recognized. 

If discretion is acknowledged in a legal system, 
controls must be placed on its exercise. Thus, 
though a private citizen may determine that cer- 
tain cases shall not be pursued, the public prose- 
cutor has discretion over those prosecutions which 
are desired by private citizens, some of which will 
not be carried through. A grand jury indictment 
can be dismissed on motion of the district attorney; 
on the other hand, a refusal to indict can be cir- 
cumvented by presenting the same matter to an- 
other, presumably more tractable, grand jury. 
Sometimes a judge may order the institution of 
prosecution despite the prosecutor's refusal to do 
so, or an advisory body of citizens may be called 
upon to review that refusal. Whatever the primary 
allocation of discretion, however, at some point the 
judiciary has the final word. 

The code of criminal procedure also specifies 
the contents of the pleading. Legal rules differ pri- 
marily as to how much detailed information the 
defendant requires to prepare an adequate defense 
and as to the extent to which the pleading can be 
amended during the proceedings. Civil-law plead- 
ings are usually more detailed than common-law 
pleadings but may also be amended and changed 
with greater freedom; this comparison suggests a 
balance of function at about the same point in each 
system. The defendant is not usually required to 
submit a formal answering pleading, but this may 
be modified in the case of certain special defenses, 
such as insanity or alibi. 

Trial procedure. The third major phase of 
criminal procedure is trial. In each system the 
place of trial must be fixed. Civil-law codes permit 
one case to be brought simultaneously in more 
than one court, based on place of the crime, resi- 
dence of the defendant, or place of arrest; one 


court is then selected on the basis of convenience. 
Common-law crimes are to be prosecuted where 
they are committed; in theory this means one court 
only, but in practice it permits some choice of 
court. The civil law provides for further transfer 
of a case for reasons both of convenience and of 
bias, while the common law usually acknowledges 
only the latter, and then often only if the defend- 
ant asserts it. However, the differences between 
the systems are less in application than they are 
in theory. Another factor involved in properly as- 
signing a case for trial is the gravity of the offense: 
the court that is selected must, of course, be au- 
thorized by law to adjudicate criminal cases of the 
gravity of the offense charged. 

Lay participation in criminal trials. The civil 
law and the common law differ in outward form 
in the degree of lay participation in trials. In some 
legal systems, particularly after a revolution, all 
judges are laymen; at times, indeed, the com- 
munity as a group both accuses and tries, as it is 
reputed to have done in more primitive eras. In 
most modern systems, however, the judiciary is a 
professionally trained career group, and the issue 
is one of whether judges are to be joined by lay- 
men. Some countries place sole responsibility on 
the judge or on a multiple-judge, or collegiate, 
court. Others utilize lay assessors who sit with 
judges over a period of weeks or months while 
many cases are tried. At times laymen join judges 
to try one case only, as voting members of the 
tribunal. 

The Anglo-American system relies on a lay jury 
selected by lot to hear the particular case and to 
make a “final” determination of guilt. The judge 
supervises the conduct of the trial and instructs 
the jury about relevant law, while the jury deter- 
mines the facts and whether, under the judge's 
instructions, the defendant is guilty. However, the 
jury is not necessarily as independent as common- 
law tradition would have it. In minor cases there 
is no jury. Moreover, even if a jury sits and con- 
victs, the trial judge may repudiate its verdict and 
either discharge the defendant or order retrial. 
Only an acquittal is final. This, however, is a safe- 
guard to the citizen which should not be under- 
estimated, for acquittal and the protection of the 
double-jeopardy concept which it produces have 
been very strong protections against overzealous 
public officials who have prosecuted minorities for 
their beliefs, 

Procedural guarantees to the defendant. The 
outward trappings of the trial are also important. 
There has been a strong reaction in most countries 
against secret proceedings, so that most codes re- 
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quire public trial. Some civil-law statutes permit in 
absentia proceedings, though it is usually possible 
for the defendant to procure a rehearing that he 
can attend. In absentia proceedings are prohibited 
in common-law countries, except perhaps for minor 
traffic offenses. Moreover, the trier must be im- 
partial. Therefore, most codes permit challenge of 
prejudiced judges, assessors, or jurors. It is im- 
portant, at least in theory, that a defendant be rep- 
resented by counsel. Certainly common-law lawyers 
are vocal in urging this as a primary factor in 
assuring fairness of trial. Whether representation 
by counsel is important, however, cannot be de- 
termined solely by whether a lawyer attends and 
participates. He may not be trained as an advocate 
and may view himself only as a special attaché to 
the court. The primary responsibility for interro- 
gating witnesses and evaluating data may rest with 
the court or with the court and public prosecutor, 
so that defense counsel has little to do. Sometimes 
the controlling stage of the proceeding is at an 
earlier time, when evidentiary material is assem- 
bled and a decision reached whether to carry for- 
ward the prosecution. If defense counsel does not 
participate at this earlier stage, then his activity in 
the later “formal” trial has little impact on the 
formal adjudication of guilt by the court. The 
heavy reliance by the court on the data presented 
to it by the prosecutor at the time he institutes 
prosecution means that the decision to prosecute 
in effect guarantees a judgment of guilt. At best 
defense counsel may be able to call extenuating 
circumstances to the attention of the court. In this 
field, also, function and not form is important. 

The two major systems differ in regard to the 
material received to support the final adjudication 
and in regard to the responsibility to marshal and 
present it. The common law embodies elaborate 
rules of evidence, by which some data can be uti- 
lized only for limited purposes and other matter 
cannot be used at all. The responsibility to gather 
and present evidence is on the person who wishes 
to make affirmative use of it. Evidence must be 
presented either through witness testimony given 
in open court or through documents, the authen- 
ticity and contents of which are vouched for by 
witnesses testifying under oath. The opponent 
must be given an opportunity to cross-examine the 
proponent's witnesses; the defendant's right of con- 
frontation ensures that this be done by him or in 
his presence in the case of prosecution witnesses, 
The judge is a neutral arbiter, who intervenes in 
the questioning only to clarify the testimony. 

In contrast, in civil-law systems the judge plays 
the predominant role in the trial; he weighs evi- 
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dence solely in terms of its probative value, not its 
admissibility. However, whether the systems are as 
dissimilar in function as in form is doubtful. Strict 
as common-law rules of evidence may appear, most 
evidence can in fact be submitted either under one 
of the many exceptions to the rule forbidding hear- 
say or as “lay” opinion evidence. Though the judge 
may caution the jury to consider evidence for a 
limited purpose only, there is neither guarantee 
nor probability that jurors will so limit their use 
of it. Moreover, strict invocation of rules of evi- 
dence characterizes only the portion of the trial in 
which the jury participates, Strict rules of evidence 
do not apply to pretrial and procedural matters. 
No limitations apply to material bearing on sen- 
tence; the judge relies principally on hearsay docu- 
ments. In the civil-law tradition, the trial aims 
primarily at final approval of the material gathered 
through earlier proceedings and at pronouncement 
of sentence. Thus, rules of evidence play no more 
useful a role in civil-law trial procedure than they 
do in American pretrial and postconviction pro- 
ceedings; the opportunity to contest and offer 
counterdata is what matters most. Lay participa- 
tion is also a less critical part of the system; and 
perhaps the distrust of the jury is less prevalent 
than it has been in the common-law system, so that 
little purpose is served by creating rules of evi- 
dence. Here, too, the differences are probably 
greater in form than in application. 

When fact-finding ends—which in some sys- 
tems, such as that of the Japanese, may be after 
a series of intermittent hearings—the case pro- 
ceeds to judgment. In all modern systems the bur- 
den is heavily on the state to establish guilt. In the 
common-law tradition the jury is told that it must 
be persuaded of the defendant's guilt “beyond a 
reasonable doubt,” while the classical statement in 
the civil law is that “all doubts are resolved in favor 
of the accused” (in dubio pro reo). Neither term 
is capable of precise definition, but both are similar 
in application, in that a man acquitted under one 
system would very probably have been acquitted 
under the other. 

The form of the judgment differs in the two tra- 
ditions. The common-law judgment records the 
bare conclusion of guilt or innocence and the sen- 
tence; no explanations of any kind are appended 
by either jury or judge. The civil-law judgment, in 
contrast, contains a fairly elaborate recital of the 
facts that suggest guilt, the Statutory provisions 
that have been invoked, and the factors that have 
led the court to impose the particular sentence. 
This difference in the form of the judgment in turn 
indicates a functional difference. In the civil law 


there is no division of the proceedings into guilt- 
ascertainment and sentence-assessment; both are 
done simultaneously and embodied in the same 
judicial document. In the common-law tradition 
the jury ascertains guilt, and the judge then 
assesses sentence. 

Appeal. The final major phase of judicial pro- 
ceedings is appeal, including extraordinary reme- 
dies after the time for ordinary appeal has elapsed. 
Appeals may be designed to relieve the defendant 
or the state from prejudicial error in earlier pro- 
ceedings, to ensure procedural regularity by dic- 
tating compliance with law on the part of lower 
courts, or to perform a didactic function of in- 
structing the population concerning their rights 
and duties under law. Anglo-American law places 
formal emphasis only on the first, and then solely 
for the defendant's benefit. Civil-law appeals permit 
one review for relief against hardship, but review 
beyond that point promotes only regularity and 
purity in legal processes themselves. Both systems 
formally disclaim any purely didactic purpose, but 
in practice many American and Japanese opinions 
are judicial essays intended for public edification. 

After judicial activity is exhausted, there may 
still be administrative relief against court-imposed 
penalties, in the form of pardon, amnesty, or 
parole, It is usually through these devices that soci- 
ety adapts its simple but sweeping rules of law to 
the requirements of common sense and justice in 
the individual case, and works a meaningful inte- 
gration between criminal law and other relevant 
areas of law. 


Criminal law and welfare law 


Traditional criminal law and procedure presup- 
pose a rational actor who exercises considerable 
free will. The legal order encourages acceptable 
choices of conduct; it contemplates that the crimi- 
nal will cooperate in his own rehabilitation. If, 
however, the actor is totally or substantially in- 
capable of cooperation, criminal-law processes 
falter and must be either adjusted or replaced. 
Three problems—mental disorder, addiction, and 
immaturity—account for much of the current diffi- 
culty in administering traditional criminal-law 
machinery. 

Criminal law and mental illness. Mental dis- 
order manifests itself in many ways, some of which 
concern only a diagnostician but others of which 
affect the peace and order of the community. When 
a mentally ill person injures other people or their 
property, disturbs the peace, offends community 
sensibilities, or acts so as to imperil himself, official 
activity almost certainly results. That activity be- 


comes stereotyped, and new legal doctrines de- 
velop. Most systems of law recognize voluntary 
hospitalization as legal; in others the family ac- 
knowledges a responsibility to control its own men- 
tally ill members. Some mentally ill persons, how- 
ever, must be coerced into hospitalization. All legal 
systems, therefore, recognize some form of civil 
commitment, either judicial or administrative, re- 
sulting in hospitalization of the afflicted person 
until he may safely be released. 

Nevertheless, a number of mentally ill persons 
will have been apprehended as “criminals” before 
their mental abnormality is recognized. Since their 
acts have already been tentatively characterized as 
“crimes,” ordinary civil commitment proceedings 
are considered inappropriate, and special proce- 
dures based on their mental condition are devel- 
oped within criminal procedure itself. The defend- 
ant may be characterized as “incompetent to stand 
trial” because of his inability to comprehend the 
nature of the proceedings against him and to par- 
ticipate in his own defense, and thus he may be 
confined indefinitely in a mental institution for the 
“criminally insane” until an administrator certifies 
him “competent” and returns him for trial on the 
original charge. Or he may be denominated a “crim- 
inal sexual psychopath” or a “defective delinquent” 
under a special statute and confined to an institu- 
tion, again usually a hospital for the “criminally 
insane.” If he is tried, he may be found “not guilty 
by reason of insanity” or, in some systems, “guilty 
but insane,” in which case, too, under a special 
statute he is committed to the same facility for the 
“criminally insane” that receives men in the pre- 
ceding two categories. In yet other cases, the de- 
fendant may be found “sane” in the legal sense 
and sent to prison. However, from the prison he 
may be administratively transferred to the same 
institution for the “criminally insane” to which he 
would have gone if he had been acquitted by reason 
of insanity, though he remains there only for the 
maximum term of his sentence. 

Mental abnormality ought always to be ascer- 
tained through a civil commitment proceeding, 
which should supplant criminal proceedings if the 
defendant's otherwise criminal act is symptomatic 
of a pattern of activity that is caused by mental 
illness or abnormality. If that activity endangers 
public safety or order, the actor should be com- 
mitted to a treatment-oriented hospital or, if no 
treatment is possible, to some form of custodial 
institution. A person too disoriented to stand trial 
manifests a mental condition that probably ac- 
counted for his “criminal” activity. Only the acci- 
dent of arrest may explain why no civil commit- 
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ment proceedings have been undertaken. To permit 
criminal charges to sleep while the defendant is 
hospitalized as “incompetent to stand trial” makes a 
fair trial at a later time almost impossible; prose- 
cutorial or judicial discretion is probably not a 
sufficient safeguard to him. Moreover, a mentally 
abnormal person in a prison disrupts prison ad- 
ministration. Much wasted activity would be 
avoided if final disposition were made soon after 
the mentally abnormal actor has been first placed 
under restraint. 

Reform, however, is difficult. For one thing, it is 
less expensive to maintain a “hospital” prison than 
a treatment hospital. For another, lawyers, judges, 
and private citizens fear any system in which psy- 
chiatrists, psychologists, and social workers make 
the critical diagnosis; thus, instead of supporting 
modifications in traditional procedure or the devel- 
opment of new but fair procedures, they oppose 
change altogether. Moreover, a treatment-oriented 
hospital system requires open institutions and out- 
patient treatment. Most citizens are apprehensive 
enough about “lunatics” who have been civilly com- 
mitted; to contemplate that “criminal” mentally ill 
persons may be placed in ordinary hospitals with 
“civil” patients and released on outpatient status is 
almost unbearable. 

Nonetheless, the older traditions are beginning 
to crumble; in many jurisdictions efforts are under 
way to develop a single state hospital system and 
early, final coordination of criminal prosecutions 
and civil commitments. Success, however, will re- 
quire major capital expenditures, for to relabel a 
“prison” a “hospital” makes no major difference so 
far as the average inmate is concerned. 

Criminal law and addiction. Another dilemma 
for traditional law is drug or alcohol addiction. 
Indeed, since addiction is increasingly viewed as 
symptomatic of serious mental disturbance, it may 
be that this is ultimately a specific instance of the 
broader problem of mental illness. But other ex- 
planations of addiction stress socioeconomic fac- 
tors, so that addiction may have to be viewed as an 
independent problem in itself. Criminal law, par- 
ticularly in the United States, has considered nar- 
cotics addiction a result of free choice and has 
relied on increasingly heavy penalties to deter citi- 
zens from using narcotics, particularly from traf- 
ficking in them. The legal significance of intoxica- 
tion has already been touched on, But repression 
achieves only the illusion of control. Narcotics 
users and alcoholics constitute a fairly constant 
percentage of the populace, whatever the legal pro- 
hibitions may be. Prohibition, however, forces the 
price of narcotics upward, so that the addict must 
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commit crime to obtain money to buy narcotics 
through illicit channels. Therefore, the system de- 
veloped in England and western Europe, by which 
addicts receive drugs under medical supervision, 
makes pragmatic sense, if only because crimes 
committed by addicts for gain tend to diminish 
under that system. If addiction is a symptom and 
not a separate condition, then the causes of addic- 
tion must be isolated before addiction itself can be 
controlled. Exclusive reliance on the criminal law 
not only produces hardship in particular cases, 
except as unfettered police and prosecutor discre- 
tion may invisibly ameliorate it, but also fosters in 
the community an illusion of effective control that 
militates against developing more efficient means 
of control. Here, also, there are some signs of 
change, but the successful integration of criminal- 
law processes with administrative public health 
and economic measures lies further in the future 
than integration of criminal-law and mental-health 
procedures. 

Criminal law and immaturity. There was a 
time when immaturity had only limited signifi- 
cance in traditional criminal law: one who had not 
reached a certain chronological age at the time he 
acted was either not within the coverage of the 
criminal law or was presumed incapable of forming 
the requisite criminal intent. But the age at which 
full criminal responsibility was imputed was so low 
(usually 13 to 15 years) that many adolescents 
were processed through the courts and incarcerated 
or executed with adult criminals. The reaction to 
this was the creation of special juvenile courts and 
treatment facilities. 

The new system, however, has proved at best a 
mixed blessing. Special detention facilities are 
often nothing more than juvenile prisons, in some 
instances with fewer educational or vocational 
facilities than adult prisons. In the enthusiasm for 
eliminating the grosser aspects of a criminal trial, 
important safeguards have also been discarded, 
Minors are often given no notice of the allegations 
against them, are at times not permitted to be 
present in court or represented by counsel, are 
usually not given even a summary of the data on 
which the court's findings are based, and are often 
consigned to a custodial institution for a longer 
period than an adult would be for the same act. 
In general, there is also a failure to differentiate 
between juvenile neglect cases, in which some in- 
formality is perhaps in order, and juvenile delin- 
quency cases, in which specific acts form the basis 
for an adjudication against the minor. Even in 
neglect proceedings the procedural tights of the 
parents or custodian of the child are minimized and 


the child disposed of on the basis of anonymous 
reports or the recommendations of welfare agen- 
cies whose representatives either do not appear or 
are not required to substantiate their reports. 

A countertrend has appeared, however; there is 
increasing judicial scrutiny of juvenile court pro- 
ceedings. This has brought about insistence on 
advance notice of the allegations against the par- 
ent or juvenile, participation by attorneys in the 
adjudicative process, and appellate review of find- 
ings, which themselves must be specific and sup- 
ported by relevant data. Nevertheless, even this 
measure of reform is ineffective if the community 
does not provide adequate treatment facilities, The 
past history of juvenile court practice and stand- 
ards forecasts the direction that new efforts to dis- 
pose of the mentally ill and the addicted may take, 
as well as the resources that must be expended and 
the procedural standards that must be created if 
unfairness and hardship are to be avoided. 


B. J. GEORGE, JR. 


[See also Crime; Law; LEGAL SYSTEMS; PENOLOGY; 
PUNISHMENT.] 
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CRIMINOLOGY 


It is proper to use the term criminology to desig- 
nate either (a) a body of scientific knowledge about 
crime, including its causes and prevention, the 
handling of offenders, and the pursuit of such 
knowledge regardless of where it stems from; or 
(b) a didactic discipline, which assembles, ana- 
lyzes, and integrates the findings of criminological 
research in all scientific disciplines and indicates 
the best way in which they can be applied in prac- 
tice to secure socially desirable ends. 

Since Raffaele Garofalo (1885) invented the 
term, no agreement has been reached on the defi- 
nition of criminology. Traditionally, it has con- 
cerned itself with the study of violations of the 
criminal law and of those who commit them, but 
opinions vary on the nature and scope of such 
study, Some hold that it should concentrate on the 
scientific investigation of the causes of crime and 
form a subclass of a more general catch-all disci- 
pline called “criminal science.” Criminal science 
would be composed of many specialized branches 
of study, some concerned with etiology—crimin- 
ology, further subdivided into biological, psycho- 
logical, and sociological criminology—and others 
with police science, substantive and procedural 
legal problems, and penology. By contrast, there 
are those who would regard all these branches as 
parts of criminology, a view reflected in American 
textbooks in particular, which consequently employ 
the term merely as a pedagogical device. This cus- 
tom is opposed by those who see in criminology an 
empirical and naturalistic science, but even this 
more restrictive view raises problems. This is be- 
cause the scientific study of offenses, offenders, 
and the whole complex of penal and law-enforcing 
institutions is carried on by researchers in a variety 
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of scientific fields, each having its own theories, 
hypotheses, and techniques of investigation, which 
are often poorly understood, misunderstood, or re- 
jected by researchers in other fields—either in 
principle or because of faulty interdisciplinary com- 
munication arising from increasing specialization 
in research activity. One cannot, therefore, speak 
of a science of criminology in the narrow sense of 
a discipline that possesses universally accepted 
theoretical concepts. 


The search for causes 


It is in the search for an understanding of why 
people commit crime that most research has oc- 
curred. The history of such inquiries can be said to 
have begun with the nineteenth century, which 
witnessed the development of the psychological and 
social sciences. Previously, indeed going back to 
antiquity, the problem of criminal conduct occu- 
pied the minds of natural philosophers and proto- 
scientists, and various theories were advanced to 
explain it. Observers of the social scene also specu- 
lated on the causes of criminality long before 
sophisticated inquirers with more adequate sources 
of data began their studies. Generally speaking, one 
might say that the search for the causes of crime 
has either been made by those who believe that 
criminal conduct can be explained chiefly by the 
biological or mental characteristics of offenders, or 
by those who believe that environmental conditions 
and circumstances are the chief operative factors. 
We shall call the former the individualists and the 
latter the environmentalists. 

The individualists. Attempts to interpret the 
significance of relationships between body and 
mind led in ancient times to the development of 
physiognomics. This method of character diag- 
nosis survived well into the Middle Ages, and one 
of its practitioners, Giambattista della Porta ( 1536- 
1615), may have been the first criminologist. He is 
said to have made anthropometric measurements 
on criminals in order to establish a typology. These 
medieval studies are not unlike some that ap- 
peared in the last century, and della Porta’s draw- 
ings comparing human and animal faces have 
found counterparts in relatively recent studies by 
responsible scientists working with different under- 
lying hypotheses. However, physiognomics soon 
fell into disrepute and remained so until Lombroso 
revived it in a new connection. 

A different approach to the explanation of crime 
was that of Franz Joseph Gall (1758-1828), the 
greatest brain anatomist of his age, who was in- 
fluenced by the faculty psychology of the time. His 
study of the brain and the nervous system caused 
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him to propound a theory of the localization of 
brain functions, each operating through an organ 
in a cluster of other organs distributed over the 
outer layer of the brain. Criminal conduct then oc- 
curred if an overdeveloped organ of combativeness 
or acquisitiveness, for instance, was too stimulated. 
In vogue for a few decades, Gall’s theory was aban- 
doned, but it called attention to the necessity of 
studying the offender in order to understand his 
conduct. Following Gall’s lead, Lauvergne (1841) 
studied prisoners at Toulon and described a crimi- 
nal type that Lombroso later was to call a criminal 
by nature. 

Psychiatrists were also interested in the etiology 
of crime. Benjamin Rush (1786), a Philadelphia 
physician, published an essay on the influence of 
physical causes on the moral faculties, in which he 
described persons who, although in other respects 
normal, became criminals because their moral fac- 
ulty was impaired, a disease he called anomia. This 
theory, later elaborated by Philippe Pinel and Jean 
E. D. Esquirol, given the name “moral insanity” by 
James C. Prichard, and strongly defended by 
Prosper Despine and Henry Maudsley, took on a 
new lease of life in the late 1930s, when the notion 
of the constitutional psychopathic inferior aroused 
a discussion that seems likely to continue. [See 
PSYCHOPATHIC PERSONALITY. ] 

The political revolutions of the eighteenth cen- 
tury may have proclaimed the equality of man, but 
scientists of the period were fully aware of the 
great variations within the human species. The 
prevalence of physical and mental defects, ill- 
health, poverty, and criminality gave rise to B. A. 
Morel’s theory of “degeneration” (1857), which 
held that all these phenomena were the results of a 
progressive pathological process in which heredity 
played an important role. This process led to the 
formation of a variety of human types, which he 
described in a monograph (Morel 1864) that con- 
tained many of the ideas later adopted by the 
Lombrosians. He also suggested the need for a 
science of “morbid anthropology.” Dugdale’s study 
of the Jukes (1877)° shows some affinity with 
Morel's ideas, 

In 1876 a work with the title L'uomo deliquente 
(“The Criminal Man”), by Cesare Lombroso (1835— 
1909), introduced a new variant among individu- 
alistic theories—the most important one, judging 
from its ultimate influence rather than its intrinsic 
value. On the basis of clinical and anatomical 
studies of criminals, Lombroso was convinced that 
the criminal was a throwback (atavism), a person 
who has the body and mind of our primitive an- 
cestors, a kind of vestigial survivor from a day 


when mankind stood on a lower rung of the ladder 
of evolution, who, acting in a way natural to such 
a person, breaks the laws of modern society. The 
theory is a perfect example of the confluence of 
many contemporary sources of ideas. Jacob Mole- 
schott and Friedrich K. C. L. Büchner supplied the 
materialistic doctrine, Charles Darwin the frame- 
work, Ernst H. Haeckel’s biogenetic principle of 
recapitulation the mechanism that, interfered with, 
would account for the atavism (Mendel’s discovery 
was still unknown), psychiatrists the concepts of ` 
moral insanity and degeneration, Paul Broca the 
techniques and instruments of anthropometry, the 
psychophysicists the methods of psychometrics, 
and Joseph A. de Gobineau the documentary data 
on the customs of primitive man. 

Lombroso’s theory aroused both support and 
Opposition among researchers, not to mention the 
clergy or the laity represented by the legal profes- 
sion. The first serious attempt to test it was done 
by C. B. Goring (1913), who found no support for 
it, and the last by E. A. Hooton, who failed to sub- 
stantiate the claims he made for it. It no longer 
survives anywhere in its original formulation, but 
the orientation it represents, namely the stress on 
the importance of biological factors, still dominates 
criminological thought in Italy, the Iberian penin- 
sula, and the Latin American countries. 

A parallel avenue has been followed by the con- 
stitutionalists, seeking a correlation between somatic 
body types and criminality. The rediscovery of 
Mendel’s hypothesis and the development of new 
psychometric devices such as intelligence tests led 
to numerous researches on the relationship of he- 
reditary or psychological factors in delinquency, 
best demonstrated in studies of identical twins. It 
is within this context that we should appraise the 
psychoanalytic theories advanced in explanation of 
crime. Biological theories also fall under this head; 
indeed, it looks as if ancient beliefs in the role of 
body fluids in shaping temperaments have reap- 
peared in a new guise in physiological studies of 
endocrine influences on behavior. As each new 
hypothesis of a biological or psychological nature 
has appeared, or new diagnostic devices have been 
found (the electroencephalograph or projective 
psychological tests, for instance), attempts have 
been made to determine their usefulness in re- 
search explaining criminal conduct. 

The environmentalists. The earliest environ- 
mentalists were probably the astrologers who 
believed that traits ascribed to planets were mys- 
teriously transmitted to humans, and that persons 
born when certain planets were in ascendancy of 
conjunction were doomed to crime. In later ages, 


such primitive conceptions were replaced by be- 
liefs in the effect of climate on behavior (cf. 
Montesquieu), systermatized less than a century 
ago by Enrico Ferri (1881a) in his discussion of 
telluric factors in the etiology of crime. Today, 
observed seasonal changes in criminality are recog- 
nized but explained in social terms. Indeed, the 
contributions of the environmentalists to an under- 
standing of criminality have been predominantly 
sociological. Their ideas, at first based on common- 
place experience and observation, have undergone 
many different, if not always basic, changes, be- 
cause of improved sources of data, greater methodo- 
logical sophistication, and the stimulus of scientific 
developments of the behavioral sciences in general. 

It is said that Galen, noting the malpractice of 
Roman physicians, claimed that their conduct was 
made possible only by the anonymity of city life. 
Had they lived in small towns, where everybody 
would have known them, they could not have per- 
sisted in their conduct. The effect of poverty on 
crime was seen by Sir John Fortescue in the 
fifteenth century, and described by Sir Thomas 
More and Juan Luis Vives in the sixteenth cen- 
tury. The economic and social consequences of the 
Black Death and the endemic wars of the following 
three centuries created a criminal class, the exist- 
ence of which was attributed to social causes. 
Eighteenth-century writers like Bernard Mande- 
ville, Henry Fielding, and Patrick Colquhoun cited 
police corruption, the moral contagion of prisons, 
poor law enforcement, gambling, the saloon, illiter- 
acy and ignorance, etc. as responsible for criminal- 
ity. Both Fielding and Colquhoun gave graphic de- 
scriptions of organized crime, as did Avé-Lallemant 
in Germany half a century later. 

It is during the last three quarters of the nine- 
teenth century that we discern in the social sciences 
a development paralleling that already described 
in our discussion of the individualists. In 1825 
France set up the first system of judicial criminal 
statistics. It was to be imitated by most other Euro- 
pean countries. These annual series of data in- 
spired the first important statistical studies by 
Charles J. M. Lucas (1827) on the relation of edu- 
cation to crime, by Adolphe Quetelet in 1831 on 
the relation of age to criminality, and by André M. 
Guerry (1833) on economic conditions, education, 
sex, etc. as related to crime. To facilitate the under- 
standing of their tables, they presented maps of 
France showing the distribution of some of the 
phenomena they investigated, a technique much 
employed by contemporary and later authors and 
revived in the present century by American social 
ecologists. 
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The socioeconomic consequences of the indus- 
trial revolution were seen as criminogenic by 
numerous authors, especially those influenced by 
the economic theories of Karl Marx. Indeed, two of 
them, Napoleone Colajanni (1889) and Enrico Ferri 
(1881b), produced the first treatises on criminal 
sociology. Ferris work became particularly influ- 
ential. His effort to reconcile the divergent views of 
the social scientists and the criminal anthropolo- 
gists led to a multiple-factor approach to the study 
of causation, which took account of anthropologi- 
cal, telluric, and social factors. His earliest studies 
(Ferri 1881a) were statistical analyses of crimi- 
nality in France since 1825. 

Two French sociologists were to have consider- 
able influence on criminological thought—Gabriel 
Tarde (1843-1904) and Emile Durkheim (1858- 
1917). Both presented sociological theories of crim- 
inality worthy of comparison with the biological 
theories of men like Morel and Lombroso, and the 
dates of their formulation are evidence of the com- 
paratively later maturation of sociological thought. 
Tarde (1890) held that as a special activity crim- 
inality was explained by the laws of imitation. The 
criminal represented a social type. Forms of crime 
originated in the upper classes and spread down- 
ward by imitation; thus an individual might be 
born vicious, but societal influences make him a 
criminal. Durkheim's best contribution was his 
theory of anomie, a condition that he held to be 
created by social evolution, as it transforms homo- 
geneous societies into heterogeneous ones by the 
increasing division of labor, and by the rise of more 
and more social groups with divergent norms that 
may be in conflict with legal norms. Both of these 
authors, and especially Durkheim, have influenced 
the thinking of American criminologists. 


Criminology in modern times 


In this brief review, scores of names of scholars 
and scientists who have engaged in criminological 
research and who have assisted in enriching it have 
necessarily been omitted. And it will be noticed 
that, with one exception, Benjamin Rush, only 
Europeans have been mentioned. This is because 
in the United States, until the present century, 
studies of criminality or criminals only imitated 
work done abroad. The poverty of criminal statis- 
tics, which is still a problem, made the kind of 
social investigations done in Europe impossible. 
Psychologists and psychiatrists faced no such prob- 
lem; the publication of Healy’s The Individual De- 
linquent (1915) marked the beginning of a new 
era for their studies. Sociologists had to find dif- 
ferent approaches. Before World War I, no leading 
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American sociologist, except perhaps Charles H. 
Cooley, Franklin H. Giddings, or E. A. Ross, showed 
any theoretical interest in criminology, Courses in 
criminology were offered in many American col- 
leges and universities by 1905, but they were 
largely concerned with social reform, Only in the 
last forty years has a scientific orientation grown 
up in American sociological research on crime, as 
in other sociological areas of inquiry. Such research 
has tended to focus on the causation of crime. 
Psychiatric research in criminology, clinically ori- 
ented, has also increasingly come to recognize the 
significance of social and cultural influences on 
criminal conduct. [See CRIME, article on CAUSES 
OF CRIME,] 

Whereas in the United States criminological re- 
search has been done mostly by sociologists, else- 
where clinical criminology, practiced mainly by 
psychiatrists, dominated research until World War 
1. In most countries of basically Latin culture, this 
is still the case. Since then, however, sociological 
research has been gaining in importance in many 
countries and has reversed the psychiatric domi- 
nance, especially in northern Europe. This process 
has been largely due to the phenomena of postwar 
criminality and delinquency and to the growing 
familiarity with the products of American empiri- 
cal criminological research, which before World 
War 11 seemed to be almost unknown outside the 
United States, 

Sociologists have also become interested in the 
study of correctional institutions seen as social 
systems; significant research in that connection has 
been conducted in the United States, England, and 
Norway. The effectiveness of various forms of cor- 
rectional treatment is being studied in many coun- 
tries. Such research may be expected to have an 
influence on legislation on crimes, their sanctions, 
and correctional administration. Attempts are also 
being made in some countries to develop prognostic 
instruments useful in spotting future delinquents 
among young children, in the selection of offenders 
for placement on probation or parole, and in the 
assignment of prisoners to different types of treat- 
ment programs. In this connection, the central diag- 
nostic clinics established by many state or national 
correctional departments, of which California, 
France, and Italy offer good examples, have aided 
in bringing about closer cooperation among staff 
representatives of the various disciplines concerned 
with criminal conduct. 

Concurrent with the growth of scientific research 
activity and with the proliferation, especially in the 
United States, of state or local programs and 
agencies for the prevention of crime and delin- 


quency and for the treatment of offenders has been 
the increase in the demand for criminologically 
trained research and treatment personnel. Peda- 
gogical institutes of criminology, aiming to broaden 
the knowledge of the judiciary and of correctional 
administrators or candidates for such civil service 
offices, have existed for more than half a century 
at law schools in many foreign countries. Institutes 
combining staff research and teaching have ap- 
peared more recently. The most active are those at 
Cambridge, England, and Vaucresson, France. In 
the United States, a few universities offer programs 
of graduate study leading to advanced degrees re- 
lated to criminology. Such courses are, with rare 
exceptions, designed to train teachers of crimi- 
nology or correctional administrators rather than 
researchers; the same holds true for the offerings 
of the institute recently established in Japan by the 
United Nations, 
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CRISIS 


“Crisis” is a lay term in search of a scholarly 
meaning. Some scholars treat it synonymously with 
Stress, panic, catastrophe, disaster, violence, or 
potential violence. Others, adhering to the medical 
connotation, regard it as a “turning point” between 
a fortunate and an unfortunate change in the 
state of an organism, In decision-making analysis, 


it is one kind of situation or event. The concept 
is used by historians, sociologists, political scien- 
tists, and psychologists. 

Because of its varied meanings the term “crisis” 
has not been useful in building “systematic knowl- 
edge” about social phenomena. Terms that cover 
almost any situation are not helpful in analysis 
that emphasizes variables and the relations among 
variables. If many different kinds of situations are 
labeled crises, then the factor becomes a constant 
and cannot be related to variations in other aspects 
of social process. 

Uses of the term are substantive and procedural. 
Substantive uses specify the content of a policy, 
problem, or situation. Procedural conceptions em- 
phasize generic characteristics of situations, with- 
out regard to whether a particular case involves, 
for example, an international cri a political 
crisis, or an individual crisis. 

Substantive definitions. Among substantive def- 
initions the most elaborate is that of Kahn (1965), 
who enumerated 44 distinguishable steps in polit- 
ical—military escalation, from a minor provocation 
to full-scale nuclear holocaust. His conception is a 
helpful instrument of policy making because it 
suggests that military and political decision makers 
have available many stages through which to in- 
crease pressure on an adversary, without, however, 
necessarily converting a hastened escalation into 
full-scale nuclear war. (This presumes that both 
adversaries have similar perceptions of the scale.) 
The theoretical potential of this definition is, how- 
ever, limited because it is confined to nuclear crises 
and, at least at present, calls for far more data 
than are likely to be found. 

Procedural definitions. Procedural definitions 
identify elements that occur in any crisis. These 
uses have been reviewed in two papers. Wiener 
and Kahn (1962) enumerated 12 generic dimen- 
sions: (1) Crisis is often a turning point in an 
unfolding sequence of events and actions. (2) 
Crisis is a situation in which the requirement for 
action is high among participants. (3) Crisis 
threatens the goals and objectives of those in- 
volved. (4) Crisis is followed by an important out- 
come whose consequences shape the future of the 
participants. (5) Crisis consists of a convergence 
of events that results in a new set of circumstances. 
(6) Crisis produces uncertainties in assessing â 
situation and in formulating alternatives for deal- 
ing with it. (7) Crisis reduces control over events 
and their effects. (8) Crisis heightens urgency, 
which often produces stress and anxiety among 
participants. (9) Crisis is a circumstance in which 
information available to participants is unusually 


inadequate. (10) Crisis increases time pressures 
for those involved. (11) Crisis is marked by 
changes in the relations among participants. (12) 
Crisis raises tensions among participants, espe- 
cially in political crises involving nations. 

Miller and Iscoe (1963) reviewed traits of crises 
as used in psychological and sociological studies: 
(1) A crisis situation is acute rather than chronic, 
although its length is usually unspecified. (2) 
Crisis results in behavior that is frequently “path- 
ological,” such as inefficiency or scapegoating. (3) 
Crisis threatens the goals of persons involved. (4) 
Crisis is relative; what is a crisis for one party or 
participant may not be for another. (5) Crisis 
causes tension in the organism, including physical 
tension and anxiety, 

Studies by Wiener and Kahn and by Miller and 
Iscoe identified some of the same traits of crisis. 
Both studies list threat to goals and pathological 
effects, such as frustration and anxiety. Both ad- 
here to a bias that appears in scholarly writings 
on crisis, as well as in the more conventional con- 
ception of crisis taken by participants and laymen, 
namely, that crisis is something to be avoided. 

Several political analysts have experimented 
with different definitions in their empirical re- 
search. Robert North and his associates (including 
Richard Brody, Ole Holsti, and Dina Zinnes ) have 
engaged in extensive studies of the crisis resulting 
in World War 1 and of more recent Sino—Soviet 
and Soviet-American crises, including the Cuban 
crisis of 1962. North and his colleagues adhere to 
the original Greek meaning, which persists to this 
day in the medical conception of the term (North 
et al. 1963, p. 4). This definition identifies crisis 
as a “turning point” that distinguishes the outcome 
of an event favorably or unfavorably, between life 
or death, violence or nonviolence, and resolution or 
protracted conflict. The difficulty with this concep- 
tion is the obverse of that with Kahn’s definition. 
Whereas Kahn’s refined 44 stages are too numerous 
for theory building, the conception of crisis as a 
turning point is too restricted. An event is either 
a crisis or it is not; it is either a turning point or 
it is not; it is either a favorable outcome or an 
unfavorable outcome, Such categories are too gross. 

These difficulties reveal a familiar dilemma that 
occurs in the development of new concepts. Defini- 
tions are either extraordinarily precise and specific, 
and hence not widely applicable to a variety of 
situations, organizations, and subjects; or they are 
so unrestricted in meaning that, in this case, it is 
difficult to distinguish crisis from noncrisis. 

Crisis as a decision situation. Robinson (1962) 
set forth a threefold conception of crisis founded 
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on a number of case studies of political decisions. 
This provisional characterization of crisis as a de- 
cision situation or as an occasion for decision 
(Barnard 1938; Snyder et al. 1962) included: (1) 
identification of the origin of the event—whether 
external or internal for the decision makers; (2) 
the decision time available for response—whether 
short, intermediate, or long; and (3) the relative 
importance of the values at stake to the partici- 
pants—whether high or low. The origin of the 
situation was selected because of apparent differ- 
ences between such crises as the Korean invasion 
of 1950, which surprised American foreign policy 
makers, and the Bay of Pigs crisis of 1961, which 
was precipitated by the United States. It was recog- 
nized that a crisis for one party may not be a crisis 
for another and that the existence of crisis may 
depend upon whether the decision unit precipitated 
the crisis or was confronted by it. 

Decision time was regarded as important be- 
cause of apparent consequences for the content 
of decisions. For example, Snyder and Paige (1958) 
reported that the number of alternatives available 
to the United States in the Korean decision were 
few and that they were quickly reduced to one in 
an effort to meet the action that the situation 
seemed to demand. In contrast, the Marshall Plan, 
developed during 15 weeks in 1947 in response to 
rapidly deteriorating economic and political stabil- 
ity in western Europe, offered a longer period to 
search for ways to deal with the problem (Jones 
1955). Different from both the Korean invasion 
and the Marshall Plan was the settlement of the 
Japanese peace treaty, which was negotiated for 
several years and which allowed for extensive 
search for alternatives and for the exploration of 
the acceptability of alternative treaty formulations 
(Cohen 1957). 

Relative importance of the values at stake was 
selected because crisis confronts decision makers 
with potential consequences of profound impor- 
tance. In international affairs, the stakes may be 
violent or nonviolent, and between violent out- 
comes, they may involve “conventional” warfare or 
varying degrees of nuclear war. In crises other 
than international political ones, the stakes may 
be economic stability or instability, or varying 
degrees of either; organizational survival or demise; 
personal well-being or illness, inter alia. 

Working initially with this typology of crisis 
situations, Charles Hermann (1963) reviewed or- 
ganizational studies and theorized about conse- 
quences of crisis for organizational viability and 
decision making. For purposes of simulating for- 
eign policy crises through internation simulation, 
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Hermann (1965) categorized occasions for deci- 
sions as either anticipated or unanticipated, as 
involving short or long response time, and as 
involving low, medium, or high threat to the goals 
or objectives of the decision-making unit. Because 
of limitations in the simulation technique, he arbi- 
trarily dichotomized anticipation and response time 
and trichotomized threat, Obviously, these three 
dimensions could be scaled in a more refined 
fashion, but their combination yielded 12 occasions 
for decision, which was expected to be a practicable 
number for analysis. Later, however, it was dis- 
covered that the simulation technique was not rich 
enough to deal with this many and that it was 
necessary to compromise by comparing most crisis- 
like with least crisislike situations. [See SIMULA- 
TION, article on POLITICAL PROCESSES, ] 

Decision time. Consideration of this concep- 
tion of crisis uncovered serious difficulties with the 
element response time. Robinson and Snyder (1965, 
pp. 440-442) emphasized the relative effect that 
time may have in different decisions. What is a 
short time for one decision may be a long time for 
another. The complexity of tasks confronting de- 
cision makers may require different amounts of 
time for identification, alternative search and selec- 
tion, and, in different cases, implementation. 

Decision time should not, therefore, be equated 
with clock time. There are two reasons for treating 
decision time differently. The first is that time has 
varying meanings and effects for different decision 
makers. Owing to variations in cognitive capacity 
or decision-making styles, some individuals need 
a short amount of time to work on a task for which 
others require a longer period. Decision makers 
differ in reflectiveness and decisiveness. The same 
decision may be taken by the reflective as by the 
decisive, but their personal procedures for deciding 
may not be the same. This problem can be over- 
come by relating personality variables to crisis 
variables and by treating the problem-solving or 
decision-making characteristics of individual deci- 
sion makers elsewhere than in the conception of 
crisis. 

Another difficulty with clock time, however, can- 
not be so arbitrarily disposed of or transferred con- 
ceptually. When calendar time is long, the decision 
may be so complicated that many tasks need to be 
performed in formulating a policy. A case that il- 
lustrates this is the set of British decisions concern- 
ing membership in the Common Market, Between 
1960 and 1963, the British Foreign Office con- 
fronted a wide range of detailed tasks in an effort 
to negotiate entry into the European Economic 
Community (Young & Robinson 1962). Although 


the calendar for negotiation was nearly three years, 
it was crowded by demands and competition. 

Decision time, therefore, should not be treated 
as an absolute. It varies with the intricacies of the 
decision and with the number of participants. [See 
LEADERSHIP, article on POLITICAL ASPECTS.] 

Related terms. Stress, conflict, tension, panic, 
catastrophe, and disaster are terms used more 
frequently by social scientists than crisis (C. Her- 
mann 1965, p. 23). Like crisis, they too have many 
meanings, as Horvath (1959) showed with respect 
to stress. The most common usage of stress is 
probably that formulated by Lazarus and Baker 
(1956): thwarting some motive state or potentially 
thwarting, resulting in effective arousal. “Negative 
affect” is a frequent consequence of stress and 
constitutes a collective term for anxiety, fear, frus- 
tration, hostility, and tension. The difficulties with 
using stress synonymously with crisis center on 
the one-dimensional character of stress. As a threat 
to goals, it is only one of the major elements of 
crisis, as we have formulated the term 

Similar problems arise for a term like conflict. 
Conflict is an incompatibility between parties with 
respect to a goal (Boulding 1962, p. 5; North et al. 
1960, p. 356). This is the same as threat to an 
objective, one of the three major aspects of crisis. 
[See ConF.ictT.] 

Tension is another related concept. It, however, 
is a consequence of crisis and not a characteristic 
of crisis. It seems useful not to include in the con- 
ception of crisis the outcomes or effects of crisis. 

Panic, catastrophe, and disaster, although sub- 
ject to study in important social contexts, are less 
technical terms and ordinarily have not been the 
subject of investigation in organizational and polit- 
ical contexts. 

Another limitation of related concepts is that, 
with only a few exceptions (for example, M. Her- 
mann 1965), they have been investigated in prob- 
lem-solving rather than decision-making research. 
Problem-solving experiments typically present sub- 
jects with situations for which alternatives are 
given and for which a determinate and “best” solu- 
tion exists. Decision making differs from problem 
solving in that decisions are not confined to the 
selection among alternatives but rather extend to 
the search for alternatives and to the formulation 
and negotiation of alternatives. Moreover, these 
Stages of the decision process probably consume 
a larger share of the time for making a decision 
than does selection among alternatives (Simon's 
article in Easton 1966). 

In addition, many decisions are not subject to 
determinate and “best” solutions. Routine, recurring 


decisions may be subject to linear programming or 
other computational techniques that result in near- 
determinate solutions. The kinds of situations that 
crisis ordinarily connotes, however, are not. Ac- 
cordingly, the relevance of problem-solving experi- 
ments to crisis decision making is limited. Dahl 
(1961, p. 98) and Wood (1961, p. 17) caution 
against applying findings from nondecision-making 
experiments (such as problem solving) to political 
arenas in which policy decisions are taken. 

In addition to these limitations on stress as a 
synonym for crisis, others apply to panic, catas- 
trophe, and disaster. These concepts usually have 
been used to study mass behavior and other re- 
sponses dissimilar to decision-making processes. 

The state of theory. Although no theory of 
crisis has been developed as yet, theorizing about 
this phenomenon has already begun. Charles Her- 
mann (1963; 1965) inductively reviewed hypoth- 
eses and placed them in a chain of independent, 
intervening, and dependent variables. In addition, 
Hermann (1965) advanced ad hoc predictions 
about some of the relations among crises and other 
variables. In some cases, however, the ad hoc pre- 
dictions were arbitrary, and contradictory reasons 
could be given for opposite predictions. However 
innovative this effort was for testing hypotheses, 
it indicates the lack of a rich deductive theory or 
theories involving crisis. 

Not only is crisis theory barren, but theories of 
decision making rarely accommodate explicit refer- 
ence to crisis. For example, the theory of games, 
which deals with decision making under varying 
conditions, does not treat the concept. The critical 
aspects of decision with which game theory deals 
are conflicts between parties under conditions of 
uncertainty. Conflict is only one dimension of crisis 
(threat to values). It is important to note also that 
uncertainty is a variable. The amount of uncer- 
tainty in situations or occasions for decision varies, 
and all crisis decisions involve uncertainty. Because 
the theory of games treats uncertainty as a param- 
eter, not as a variable, it is difficult to draw prob- 
able hypotheses that will apply to both game theory 
and crisis. 

Theories of psychotherapy, including those of 
Erikson (1959, pp. 50-100) and Dabrowski (1964), 
regard crisis as inevitable in the development of 
the identity of individuals. The relevance of this 
for a theory of crisis is not immediately apparent, 
except for its suggestion of “positive” effects of 
crisis in the development and evolution of person- 
ality. Likewise, in theorizing about negotiation, as 
in labor-management bargaining, crisis has been 
identified as an inevitable and positive occurrence 
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that precedes resolution or settlement (Douglas 
1962). 

Most studies, however, emphasize the “negative,” 
pathological aspects of crisis. Communication the- 
ories concerned with the processing of information, 
with the possibilities of overloading networks of 
messages, and with selective perception and dis- 
tortion of content bear on crisis (Deutsch 1963; 
Meier 1963; Pool & Kessler 1965). With respect to 
one dimension of crisis, that of anticipation or un- 
anticipation, the processing of information in an 
organization may determine whether warnings 
about impending events reach top level decision 
makers. The “level of crisis” is a category for 
analysis in the conceptual scheme of “the policy 
science orientation” where variations in the crisis 
level are associated with variations in the outcomes 
and effects of social processes (Lasswell & Kaplan 
1950). 

Crisis will become a useful concept when it plays 
a part in theoretical formulations. Just as a fact is 
regarded as without meaning apart from a theory, 
so a concept can hardly be productive if it does 
not relate to other variables in a theory. 


James A. ROBINSON 


[See also CRISIS GOVERNMENT; Srress. Other relevant 
material may be found in DECISION MAKING; DEcI- 
SION THEORY.] 
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CRISIS GOVERNMENT 


The concept of crisis government is founded on 
the assumption that in certain situations of polit- 
ical, military, or economic emergency the system 
of limitation and balance of power typical of a 
constitutional government has to give way before 
the enlarged power of the executive or even the 
military. 

There exist, however, vast differences of opin- 
ion as to the theoretical foundation and the range 
of practical measures, as well as to their legal in- 
terpretation and political justification. Moreover, 
historical experience shows that the institution of 
crisis government can often bring about the very 
consequences of political and social crisis that it 
was designed to prevent. If employed prematurely, 
it may impede further attempts at solving the prob- 
lems by the means of constitutional government. 
Provisions of crisis government may thus invite 
dictatorial ambitions of political parties, groups, 
and persons. When practiced over a longer period, 
those provisional emergency arrangements de- 
signed for the protection of the existing constitu- 
tional order may even become tools for pseudolegal 
revolution and dictatorial domination at the cost 
of that order. 

Historical examples. For reasons of political 
logic and practical experience, the problem of crisis 
government has occupied political theory from its 
beginning, but in particular since the rise of the 
modern state. In antiquity the constitutional frame- 
work of the Roman Republic, with its emergency 
provision of a temporary dictator, represented the 


most elaborate attempt at institutionalizing crisis 
government, as distinct from Oriental despotism 
and unlimited tyranny. The concept of “constitu- 
tional dictatorship” contained, however, the ever- 
present danger of turning into permanent dictator- 
ship. The Roman institution of a temporary dictator 
nominated for a period of six months by the acting 
consul—who alone had the power to proclaim or 
prolong the state of emergency that gave the dicta- 
tor his special powers—was applied successfully 
during the earlier history of the republic. However, 
with the emergence of revolutionary generals in 
the civil wars of the first century B.C., it was turned 
into a permanent regime leading to the dictator- 
ships of Caesar and his successors. 

Modern constitutional arrangements reflect the 
same lack of distinction between constitutional 
emergency provisions and legalized dictatorial re- 
gimes. The transition from the first republic in 
France to the regime of Napoleon is a classical 
example of a pseudo-revolutionary turnover. Be- 
tween the two world wars, many feeble democ- 
racies surrendered to the same tendency. The 
proclamation of a protectionist dictatorship to meet 
crisis situations determined developments in Italy 
(Mussolini), Poland (Pilsudski), Turkey (Kemal 
Atatiirk), Spain (Primo de Rivera and Franco), 
Portugal (Salazar), Germany (Papen and Hitler), 
Austria (Dollfuss and Schuschnigg), and the Bal- 
kan countries; in South America, such develop- 
ments belonged to everyday politics. 

In Germany, Hitler's coming to power was a 
clear result of use and misuse of the emergency 
powers given to the president of the Weimar Re- 
public, Under article 48 of the constitution he had 
the right to subsede normal parliamentary legisla- 
tion by crisis decrees and to apply military force 
for the execution of such federal decrees in the 
states. In 1923, under a democratic president 
(Ebert), this power was exercised as a strictly tem- 
porary measure for the protection of the republic; 
after 1930, under Hindenburg, emergency govern- 
ment became gradually institutionalized, and par- 
liamentary controls were paralyzed by successive 
dissolutions of parliament. It was at this very point 
that antidemocratic forces of the right succeeded 
in breaking up the constitutional government. The 
Nazi dictatorship was founded and legalized by 
posing as a presidential crisis government, before 
Hitler managed to override parliament and to sup- 
press the non-Nazi groups, which until the last 
party elections (March 1933) commanded a ma- 
jority of German votes. 

Although these examples demonstrate the dan- 
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gers contained in the institution of crisis govern- 
ment, many exponents of constitutional theory 
maintain that emergency provisions remain essen- 
tial to the existence and maintenance of democratic 
regimes. The discussion distinguishes between 
emergency situations caused by exterior forces 
(military threat, war) and revolutionary situations 
conditioned by domestic developments. Since dis- 
turbances in the operation of parliamentary govern- 
ments can hardly be cured by extraparliamentary 
provisions, it remains open to discussion whether 
emergency articles should be formally included in 
a constitution and how far they should be applied 
to either external or internal crises. Consequently, 
constitutional structure and political practice in 
modern democracies have attempted many varied 
solutions. In substance, the problem is always how 
far civil rights may be restricted and to what degree 
such increased executive power should be trans- 
ferred to certain bureaucratic, political, or even 
military authorities. 

Contemporary provisions. In such Western de- 
mocracies as the United States, Great Britain, and 
Switzerland, the constitutions contain no explicit 
clauses regarding crisis government. To meet emer- 
gency situations, either parliament is expected to 
convey necessary powers to the government or 
constitutional theory agrees on implied and inher- 
ent powers for the president (as in the United 
States). By common understanding, however, the 
controls by the courts and parliamentary legisla- 
tion are not to be curtailed by such concentration 
of power. 

The constitution of the semiauthoritarian French 
Fifth Republic (1958), which is reminiscent of the 
dualistic, parliamentary-presidential structure of 
the Weimar Republic, contains far-reaching emer- 
gency powers conferred on the president “when the 
institutions of the Republic, the independence of the 
Nation, the integrity of its territory or the fulfill- 
ment of its international obligations are threatened 
with immediate and grave danger, and when the 
regular functioning of the constitutional govern- 
mental authorities is interrupted” (France, Con- 
stitution, 1958, art. 16). 

Such sweeping stipulations, while leaving every- 
thing to the personal qualities and intentions of the 
president, contain all the dangers discussed above. 
This certainly does not present a strong case for 
institutionalized emergency provisions. Neverthe- 
less, there are strong efforts in West Germany to 
change the constitution of 1949 and to add an 
“emergency constitution” that would allow for re- 
strictions on civil rights and suspension of parlia- 
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mentary legislation by government decrees, with 
controls remaining (as a compromise formula) in 
a small committee of the two houses of parliament. 
In the author's opinion, this contradicts the original 
intentions of the Bonn constitution, which was 
designed to avoid the mistakes of Weimar by sub- 
stituting for emergency provisions strong regula- 
tions against antidemocratic activities as well as 
regulations supporting stable governments and 
working majorities in parties and parliament. 

Problems of crisis government. The basic prob- 
lems posed by any attempt to provide, by constitu- 
tional and institutional means, for strong crisis 
government may thus be summarized in five ques- 
tions, 

(1) Who is to be in charge of proclaiming the 
state of emergency, or the need for crisis govern- 
ment? A widespread view, often justified with 
technical reasons and arguments of greater effi- 
ciency, tends to leave it to the executive. This 
coincides with a general tendency toward the bu- 
reaucratic administrative state; yet it contradicts 
the practice of proved democracies to maintain the 
responsibilities of parliament. The right to declare 
a state of emergency should be reserved to an 
organ not invested with the powers derived from 
that state. For the same reason, this requires a 
qualified majority larger than the governing ma- 
jority. 

(2) When does the state of crisis eventuate, and 
how can it be defined? In general, the practice of 
confining crisis government to the case of war 
appears the least dangerous way. All attempts to 
apply it to situations of “imminent danger” (as in 
recent German drafts) or domestic problems (as 
in the Weimar Republic and Fifth French Repub- 
lic) will lead into a jungle of divergent interpreta- 
tions inviting misuse of power and enabling the 
overthrow of democratic government by one-party 
systems or military dictatorship, thus paving the 
way to authoritarian or totalitarian regimes. 

(3) What does crisis government really mean? 
Here again the basic distinction would be between 
(a) a concentration of power functions, with some 
simplification of the governmental process in the 
framework of responsible government that leaves 
intact the competence of legislatures and courts; 
and (b) the concept of temporary dictatorship, 
which, suspending the democratic system and the 
rule of law, aims at empowering the executive to 
enact laws on its own authority. In the first case, 
the parliament may temporarily yield power to the 
executive by passing “enabling acts,” but retains 
its prerogatives; in the second case, the executive 


commands emergency powers to be mobilized by a 
proclamation of a state of emergency and effected 
by decrees. However, there is wide agreement that 
the constitution should not be changed during the 
period of emergency. 

(4) Which controls are to remain or should be 
introduced to reduce the dangers inherent in any 
of these solutions? The answer again depends on 
the degree to which responsible government is 
abandoned in favor of a bureaucratic and dictato- 
rial interregnum. If crisis government means 
merely a strengthening of democratic government 
by bipartisan politics or an all-party coalition gov- 
ernment, no basic changes in the system of con- 
trols seem necessary. On the other hand, any 
serious shift in function between the legislative and 
executive agencies, even if only for a restricted 
period, requires special controls designed to meet 
the dangers of unrestricted government with its 
tendencies to perpetuate and extend the anomalous 
power situation. 

Among the institutions granting a minimum of 
essential control, so that the efforts to defend 
democracy do not destroy the very existence of 
democracy are parliamentary committees and coun- 
cils representing various groups, an independent 
position for the constitutional courts, and the wid- 
est possible freedom of the press. The question 
remains whether the principle of voluntary coop- 
eration and self-restriction as practiced in countries 
without fixed emergency provisions (e.g., Switzer- 
land or Britain during the war) is not more effec- 
tive than the most elaborate schemes of crisis 
government, since these will never be able to fore- 
see all the actual forms of emergency measures 
necessary in a future crisis. 

(5) How is crisis government ended? The end 
of crisis government poses a final problem that all 
constitutions with emergency provisions have to 
meet with special care. Considering the crucial 
danger of constitutional dictatorship—its perpetu- 
ation and transformation into a new regime—the 
question of who, how, and when an emergency 
government is to be terminated and must relin- 
quish its powers is the acid test of any institution- 
alized crisis government. Such a decision ean 
certainly not be left to the executive enjoying his 
increased power; it appears as the particular func- 
tion of the parliament. But this implies a continuity 
of parliamentary life during the period of suspen- 
sion. In fact, unlike the Weimar constitution, that 
of the Fifth Republic makes the dissolution of pat 
liament impossible during the period of emergency 
government. After specifically limited periods, pat- 


liament must always be in a position to decide on 
the end (or continuation) of emergency politics. 

Crisis government and democracy. Any review 
of the problems arising from the attempts, old and 
new, to design safe and effective forms of crisis 
government leaves doubts as to their compatibility 
with democratic systems. In many cases, such at- 
tempts have been favored by exponents of political 
theory or practice who turned out to be more inter- 
ested in nondemocratic rule. One well-known ex- 
ample is Carl Schmitt, the German professor of 
public law who started by posing perfectionist 
demands to the democratic state, then concentrated 
on the “decisionist” superiority of emergency gov- 
ernment (decision before constitution), only to 
become the outstanding protagonist of Hitler’s to- 
talitarian dictatorship. In other cases, as in present- 
day France or West Germany, the antiparty effect 
of a great man or the perfectionism of a bureauc- 
racy may be the prime mover behind the schemes 
for institutionalizing crisis government, advocated 
as they may be by arguments of military threat, 
anticommunism, socioeconomic problems, or the 
failure of parliamentarism. 

But historical experience as well as political rea- 
soning favor the other side, In times of crisis, 
government should be founded on as broad a basis 
as possible, with parliament, parties, and a free 
press participating in, not eliminated from, the 
political mastering of the problems. This contra- 
dicts the widespread assumptions that democracy 
is feasible during “normal” times only; that the 
stronger a state has to be the less democratic it 
can be; that security can be achieved by constitu- 
tional perfection and elaborate precautionary meas- 
ures; and that bureaucratic government without 
parliamentary interference would grant a higher 
degree of efficiency in handling emergency situa- 
tions, 

Constitutional possibilities, once established, al- 
ways produce an appetite to use them. What is 
more, they tend to seduce the responsible author- 
ities—parties and parliament—to escape from dif- 
ficult or unpopular situations by leaving the field 
to emergency measures that do not have to be 
justified to the public. Such political and psycho- 
logical consequences can hardly be avoided, unless 
all elements of the democratic state remain fully 
incorporated into the mechanism of emergency 
tule. Crisis government remains a task of demo- 
cratic cooperation and trust in the final superiority 
of free societies with responsible governments, 
rather than a matter of reshaping constitutions and 
Suspending basic rules of democracy. Good as the 
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purpose of such experiments may be, the way back 
to a normal situation is most difficult once the road 
toward dictatorial forms of government has been 
pointed out. 

KARL DIETRICH BRACHER 


[See also Crisis; DELEGATION: OF POWERS; DICTATOR- 
SHIP; and the biography of SCHMITT.] 
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CROCE, BENEDETTO 


Benedetto Croce (1866-1952) was a writer with 
an unusually wide-ranging mind who dominated 
Italian intellectual life from the eve of World War 1 
to the middle of the twentieth century. His more 
than sixty volumes embrace history, literary criti- 
cism, political polemic, and formal philosophy, par- 
ticularly ethics, aesthetics, and the philosophy of 
history. It is in this last capacity that he is of con- 
cern to social scientists. As a philosopher of history, 
Croce, while continuing the idealist tradition of 
Droysen and Dilthey, eliminated a number of the 
problems that perplexed his German predecessors 
and contemporaries: he made a radical separation 
between science (including social science) on the 
one hand—dismissing it as arbitrary and “prac- 
tical” in its aim—and the realm of history and 


philosophy on the other. But rather than impose 
philosophy on history, as Hegel had done, Croce 
included philosophy within history, as the method- 
ology of history. He asserted that this philosophi- 
cal type of history is supreme among intellectual 
disciplines and that the “truth” of history lies in 
the logic applied by the historian himself and in 
his skill in reconstituting in his own mind the 
thoughts and actions of past generations, 

Born of Neapolitan parents, Croce chose to spend 
virtually his entire life in Naples, His substantial 
independent means freed him from the necessity 
of a university career, and he remained a private 
scholar, largely self-taught and supremely confi- 
dent in his chosen role as the thinker who would 
“de-provincialize” Italian intellectual life and bring 
it into the mainstream of European thought. The 
most urgent task, he believed, was to combat the 
vulgarized positivism which pervaded Italian writ- 
ing and teaching in the last quarter of the nine- 
teenth century and to re-establish the concept of 
the “spirit” in human affairs. Starting his career as 
an antiquarian-scholar of his beloved Naples, Croce 
naturally encountered the writings of his fellow 
Neapolitan Giovanni Battista Vico, who became his 
most influential intellectual guide (but as the foun- 
der of history philosophically understood, rather 
than as a schematizer of the past—something with 
which Croce had little patience). After a brief en- 
counter with Marxism at the turn of the century, 
and after a more sustained interest in Hegel (al- 
though he never became more than a qualified 
Hegelian), Croce, just before World War 1, put 
together the four systematic volumes of his Filo- 
sofia come scienza dello spirito (1902-1917; 
“Philosophy of the Spirit”), on which his reputa- 
tion primarily rests. These volumes deal respective- 
ly with aesthetics, logic, economics and ethics, and 
historiography. 

The latter part of Croce’s life brought him into 
closer touch with public events and partisan poli- 
tics. Appointed a senator of the realm in 1910 and 
having served from 1920 to 1921 as minister of 
public instruction, Croce ranked from the mid- 
1920s onward as the chief intellectual opponent 
of Mussolini’s fascist regime. Concomitantly, in his 
writings he began to concern himself with the 
question of value judgments, which he had earlier 
neglected, most notably in four substantial volumes 
of history published between 1925 and 1932 and 
in his last major philosophical work, History as the 
Story of Liberty (1938). In these books he argued 
that the vita morale (“ethical life”) is the central 
thread of history and that, consequently, history, aS 
“the story of liberty,” is necessarily liberal in char- 


acter. With the fall of Mussolini in 1943 Croce 
emerged as Italy's leading citizen. Too old, how- 
ever, to give a clear direction to the course of 
events, he was able to participate only in the first 
postfascist governments, retiring from active poli- 
tics shortly after the end of World War n. 

Croce's influence on subsequent generations of 
historians and philosophers of history has been 
widely ramifying, although his reputation in the 
English-speaking world has suffered from the fact 
that his chief interpreter, R. G. Collingwood, pushed 
his teaching to extreme conclusions. Certainly 
Croce ranks as the single most important member 
of the twentieth-century idealist school. Some of 
the difficulties his theory presents are character- 
istic of most idealist argumentation—a tendency 
to denigrate science, both natural and social, and 
to give no clear account of the processes the his- 
torian follows in arriving at historical understand- 
ing. (In this connection, it is important to note that 
Croce was largely ignorant of the twentieth- 
century philosophy of science and that he paid only 
scant attention to the work of such leading con- 
temporaries as Freud and Max Weber.) 

Other difficulties are peculiarly Croce’s. His most 
celebrated dictum that “every true history is con- 
temporary history” has proved most illuminating in 
showing how contemporary relevance lifts histori- 
cal writing above mere antiquarianism. But his ap- 
proach has raised two crucial problems. First, 
Croce’s concern with values and with what he 
called ethico-political history led him, in the later 
part of his life, to stress the abstractions of “rea- 
son” and “liberty” as against that understanding of 
the past in its own terms which had been central 
to traditional idealist historiography. Nor did Croce 
ever satisfactorily explain whether he considered 
these values absolute or relative. Second, although 
Croce’s work was more consistently rationalistic 
than that of most idealists (with the significant 
exception of Hegel), he offered no more than a 
metaphorical validation (for example, a “lightning- 
flash” of understanding) for the truth of a given 
historical interpretation. The paradoxical result is 
that what started as a rationalist theory terminated 
in an “act of faith.” 

H. STUART HUGHES 


[For the context of Croce’s work, see History, article 
ON THE PHILOSOPHY OF HISTORY; and the biographies 
of DILTHEY; HEGEL; Vico.] 
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CROSS PRESSURE 


Cross pressure refers to that social situation in 
which an intrapersonal conflict arises when the 
motives affecting a decision are incompatible. Two 
broad categories of such conflicts can be distin- 
guished—attitudinal and affiliative. Attitudinal 
conflict may occur when a person is faced with a 
choice between alternative beliefs or courses of 
action under conditions which bring into play atti- 
tudes motivating different and opposing choices. 
Affiliative conflict can result from a person’s at- 
tachment to several groups which have preferences 
for different alternatives. These two types of con- 
flict can be illustrated by voting behavior, the field 
of study to which the cross-pressure hypothesis has 
most frequently been applied. If a voter who gen- 
erally agrees with the foreign policy of one party 
prefers another party's domestic policy, his voting 
decision will be affected by attitudinal cross pres- 
sure. If his personal friends belong to one party 
and his business associates to another, his voting 
choice may be subject to affiliative cross pressure. 
While the two types of cross pressure are not mutu- 
ally exclusive and frequently occur together, they 
are based on different psychological mechanisms. 
In attitudinal conflict the individual is assumed 
to be striving for consistency between his actions 
and the various relevant parts of his attitude 
structure, whereas in affiliative conflict he is trying 
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to adjust satisfactorily to various relevant parts of 
his social environment. 

The concept of cross pressure is typically used 
to link the choice-behavior of individuals to social 
processes. This implies that the sources of cross 
pressures are part of the social world. That is, in 
the case of attitudinal cross pressure the condi- 
tions to which the individual responds must have 
objective reality. To illustrate, the specific domestic 
policies which motivate a person to vote for a 
political party must actually be the policies of that 
party or at least the policies which public opinion 
generally attributes to that party, not merely those 
the individual alone attributes to it. In the case of 
affiliative cross pressure, the attitudes which the 
individual imputes to his various relevant groups 
must actually be the attitudes which characterize 
them. Again, the voting preferences a person im- 
putes to friends or business associates must be 
their real voting preferences. If this assumption 
of objective social reality were not made, there 
would be no reason to treat cross pressure as a 
special phenomenon apart from the general field 
of internal psychological conflict. 

While the causes of attitudinal cross pressure 
include a large variety of social, individual, and 
situational factors, the etiology of all kinds of 
affiliative cross pressure reduces itself to what 
Riecken (1959, p. 178) calls “social transition.” 
The term covers both vertical and horizontal mo- 
bility, as well as certain processes of culture 
change, i.e., acculturation and cultural evolution. 

There are special problems of definition with 
respect to affiliative cross pressures. In Simmel’s 
usage the term “social circle” (which Bendix trans- 
lated as “group”) referred to any kind of social 
aggregate, from the most intimate primary group 
to the community and from involuntary social 
groups to voluntary associations (Simmel [1908] 
1955, pp. 127-130). More recent writers have 
sometimes broadened or restricted the meaning of 
the term “group.” In their Erie County voting study, 
Lazarsfeld, Berelson, and Gaudet (1944) dealt 
chiefly with broad social strata such as classes, 
religious sects, and residents of urban or rural 
areas, In their more recent study in Elmira, N. Y., 
Berelson, Lazarsfeld, and McPhee (1954, pp. 118- 
127) put greater stress on the voter's personal 
associates (his friends, family, co-workers). They 
believe that these personal associations translate 
the more abstract group memberships into con- 
crete personal experience and thus make them 
effective. The mechanism which accounts for con- 
flict, then, is that of primary group attachment, 
i.e., the individual’s need to maintain a sense of 


belonging to groups of close associates by conform- 
ing to their basic values and attitudes. Yet another 
version of the affiliative cross-pressure hypothesis 
holds reference groups accountable for the genera- 
tion of conflicts. Campbell et al. (Michigan, Uni- 
versity of . . . 1960, pp. 370-380) regard the 
individual as subject to cross pressure if he identi- 
fies with a social stratum to which he does not 
belong according to objective criteria. Other defini- 
tions of cross pressure include conflict between 
group norms or preferences and internalized val- 
ues and between group and mass-media pressures. 
Effects on individuals. Simmel ([1908] 1955, 
pp. 140-143) saw cross pressures as the social 
mechanism which leads to the development of 
individuality. In a complex society, he argued, 
each person stands at the “intersection” of a large 
number of social circles. To the extent that the 
“interests” of these “circles” are not identical—to 
the extent that they are in conflict—the person 
has freedom of choice, and the particular set of 
choices he makes establishes him as an individual 
distinct from all others. Thus, in Simmel’s view 
cross pressures stimulate innovation, although he 
acknowledged that they may also have inhibitory 
effects and occasionally lead to “psychotic breaks.” 
Some. researchers dealing with acculturation 
processes support Simmel’s views. Thus Lerner 
(1958, pp. 49-54) describes the “transitional’— 
the person exposed to both the traditional and the 
contemporary culture—as the chief agent of 
change. There are, however, other reports which 
point to the destructive or debilitating effects of 
cultural cross pressures (cf. Opler 1959). 
Students of voting behavior have shown that 
cross pressures are associated with a reduction of 
partisanship and interest in both the election cam- 
paign and its outcome. Voters under such cross 
pressures tend to make up their minds late in the. 
campaign; they tend to split their ticket; and more 
of them fail to vote than in the general population. 
Effects on society. Cross pressure is believed to 
reduce social tensions and political conflict, since 
cross-pressured individuals serve as bridges be- 
tween social and political groups. 
For instance, an individual who identifies with 
a social class different from his own tends to vote 
according to the norms of the chosen class rather 
than those of his social environment (Michigan, 
University of . . . 1960, pp. 369-375). This pre- 
vents the polarization of society into status groups: 
Similarly, cross pressures arising from multiple- 
group membership tend to restrain interest groups 
from extreme and uncompromising stands (Tru- 
man 1951, p. 158). Workers whose locus in $0- 


ciety keeps them in a state of relative isolation 
are more strike-prone than workers subject to 
social cross pressures (Kerr & Siegel 1954), And 
since individuals under cross pressure are likely 
to be the members of society who are most directly 
involved in processes of social change, their pres- 
ence in the system makes for flexibility and ability 
to accommodate to new conditions (Parsons 1959, 
p. 98). 

Second, cross pressures are believed to stabilize 
the political system’ by strengthening dominant 
opinion. Cross-pressured voters tend to resolve 
their conflicts in favor of the party or policy which 
commands majority support or is otherwise domi- 
nant in the community (Berelson et al. 1954, 
p. 100). Lipset et al. (1954, pp. 1133-1134) 
further believe that the cross-pressure mechanism 
contributes to the maintenance of existing power 
relations, since lower-class people are subject to 
the pressures of both their lower-class culture and 
the upper-class-controlled mass media; for upper- 
class people, there is no such divergence between 
affiliative and mass-media pressures. 

Critique. The main burden of the cross-pressure 
hypothesis is the postulated decrease in interest, 
partisanship, and other forms of involvement as 
the pressures upon the individual become more 
evenly balanced: the notion is that such pressures 
then tend to cancel each other. This physical 
analogy carries over into the realm of social and 
political relations: the lower the aggregate energy 
potentials carried by all its members, the less the 
tensions in the society. 

The seeming obviousness of this physical anal- 
ogy may have prevented theorists from specifying 
the exact conditions under which the hypothesis 
can be expected to hold. Clearly, it is possible to 
imagine cases in which a strong attachment to a 
congruent set of attitudes or to groups with con- 
gruent attitudes goes with low interest in both the 
decision process and the outcome. Thus, a person 
firmly attached to the moral conventions of his 
society or to a variety of groups committed to 
society’s ethical code may routinely choose the 
conventionally correct alternative course of action 
without evincing any involvement in either the 
decision process or its outcome. Conversely, a per- 
son not so firmly attached and thus experiencing 
greater conflict may be more deeply involved. 

Therefore, two conditions appear to be necessary 
for the cross-pressure hypothesis to be applicable: 
the outcome of the impending decision must have 
emotional significance; and the outcome must be 
delayed or uncertain. Emotional significance in 
Social decisions is usually supplied by the presence 
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of social dissensus over norms or policies. Delay 
or uncertainty gives rise to the anxious expecta- 
tions or anticipations of reward which stimulate 
involvement. The cross-pressure hypothesis thus 
applies mainly to situations of social conflict or 
political competition. 

If so restricted, the cross-pressure hypothesis 
may seem a tautology: it argues that people whose 
attitudes or group attachments make them hesitate 
between two alternatives are not very partial to 
either of them. Certain psychological theories sug- 
gest, however, that the reaction to internal conflict 
can be, quite to the contrary, an exaggerated com- 
mitment to one of the alternatives. In such a case, 
the unpleasant consequences of internal conflict 
are eliminated by blocking out the other alterna- 
tive—for example, by selective perception or selec- 
tive exposure to the situational stimuli (Festinger 
1957, pp. 123-176). 

This possibility is clearly present in cases of 
attitudinal conflict. In a less direct way, it may 
apply to affiliative conflict as well. Some individ- 
uals experiencing strong internal conflict may 
overcome their discomfort by selective perception 
of their associates’ attitudes or by one-sided selec- 
tion of primary groups. Others, similarly con- 
flicted, may rationalize their indecision by (rightly 
or wrongly) attributing divergent views to the 
members of their several primary groups. The 
existing evidence on voting behavior does not pre- 
clude such interpretations, since it relies exclu- 
sively on the reports of subjects about their asso- 
ciates, not upon direct testing of these associates 
themselves (Luce 1959, pp. 349-352). Thus, some 
individuals whose primary groups do in fact di- 
verge in their attitudes may escape from the 
psychological pressures resulting from such affilia- 
tions by taking extremely partisan positions and 
falsely attributing similar positions to all of their 
associates. 

Neither version of the cross-pressure hypothesis 
contains any theoretical notions, nor are there any 
data that might help in deciding how an individual 
will react to cross pressure—by an increase in, 
or a reduction of, one-sided commitment. Thus, it 
is not possible to assert unconditionally that cross 
pressures will always reduce social and political 
tensions. Three types of response to cross pressures 
seem possible: (1) withdrawal or alienation (as 
hypothesized in the voting studies); (2) over- 
reactive affirmation of, or extreme commitment 
to, one of the alternatives (Festinger’s hypothesis ) ; 
and (3) careful weighing of all alternatives with 
possible innovation as a solution (Simmel’s hy- 
pothesis). Which of these responses will actually 
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occur may be a matter of the individual's ability 
to tolerate internal conflict. It has been suggested 
that such ability to tolerate “ambiguity” or “dis- 
sonance” is a part of personality structure (Frenkel- 
Brunswik 1949; Festinger 1957, pp. 268-270). 
Furthermore, threatening social situations may 
reduce such ability among the general population. 
Both the frequency of personality types with high 
and low conflict-tolerance and the frequency of 
threatening situations may have historical ante- 
cedents and thus be rooted in a given culture. If 
this is true, then it is not possible to predict any 
particular effects on the individual or society only 
from the knowledge of patterns of cross pressures. 
Rather, certain personality characteristics, the fre- 
quency of their occurrence in a given society, and 
their distribution over the structure of social roles 
must be considered contributory sources of vari- 
ability. 

FRANK A. PINNER 


[See also COGNITIVE THEORY; ELECTIONS; POLITICAL 
BEHAVIOR; PUBLIC OPINION; SOCIALIZATION. ] 
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CROSS-SECTION ANALYSIS 


Empirical analysis is concerned with the estab- 
lishment of quantitative or qualitative relations be- 
tween observable variables. From a temporal point 
of view two kinds of data are used in empirical 
analysis—cross-section data and time series data. 
Cross-section data are observations on variables at 
a point of time, whereas time series data are ob- 
servations covering several time periods. Sometimes 
the two kinds of data are used together to over- 
come some specific difficulties. 

The unit of observation in a cross section is gen- 
erally, although not necessarily, elementary, such 
as a firm or a consumer; the unit of observation in 
a time series is generally an aggregate. The em- 
pirical association between the type of data and the 
level of aggregation contributes to differences in 
the results obtained with the two kinds of data. 

While cross-section analysis is used in many of 
the social sciences, this article focuses on applica- 
tions of cross-section analysis in economics. [For 
other applications, see SURVEY ANALYSIS.] 

Some of the more important areas of economics 
in which cross-section data have been used are the 
estimation of Engel curves (Liviatan 1964; Prais 
& Houthakker 1955), the estimation of consump- 
tion functions (Friedman 1957), the estimation of 
production functions (Bronfenbrenner & Douglas 
1939), and the estimation of investment functions 
(Kuh 1963). 

Reasons for using cross-section data. There are 
several reasons why cross-section data are often 
superior to time serjes data for estimating eco- 
nomic relations. 

(1) Cross-section data contain large variations 


in some variables whose variations over time are 
only moderate and often subject to trend. For in- 
stance, there is a much larger variation in income 
among consumers at a particular point of time than 
in average per capita income over time. Likewise, 
there are wide variations in productive capacity and 
sales among firms in a cross section, but only rela- 
tively small variations in these variables in a time 
series. 

(2) The size of a cross-section sample of data 
can usually be increased enough to make sampling 
variance relatively negligible (Kuh 1963). 

(3) Multicollinearity among variables in a cross 
section is often less acute than among correspond- 
ing variables in a time series. 

(4) The problem of interdependent disturb- 
ances, which frequently arises in the analysis of 
time series because of trends in time series data, 
usually does not arise in the analysis of cross- 
section data, simply because the order of the 
observations has no meaning. 

(5) In some cases cross-section data are more 
reliable; and the variables that can be measured 
in a cross section often correspond more closely to 
the variables defined and studied in economic the- 
ory. For instance, cross-section data on consumer 
budgets furnish a precise account of consumption 
by commodity, and such data can be more useful 
than time series data for estimating demand func- 
tions. On the other hand, some variables are sub- 
ject to larger errors in a cross section, although the 
errors are of a different nature than those encoun- 
tered in a time series. For example, the discrepancy 
between observed income and permanent income 
is larger in a cross section than in an aggregate 
time series (Friedman 1957). This problem is dis- 
cussed below, 

(6) The distinction between the individual eco- 
nomic unit and the market, which is basic in 
economic analysis, leads to different classifications 
of variables at the microeconomic and macroeco- 
nomic levels of analysis. For instance, prices can 
be assumed to be given (exogenous) for a con- 
sumer, but they should be considered as dependent 
(endogenous) variables when markets are ana- 
lyzed. Cross-section analysis usually deals with the 
behavior of microeconomic units; therefore it can 
often proceed on the basis of a simpler economic 
model than time series analysis of macroeconomic 
data. 

It should be noted that cross-section analyses had 
been undertaken long before the statistical ad- 
vantages of cross-section data were recognized; and 
in some cases, such as the study of Engel curves, 
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cross-section analyses also preceded the rationale 
that was to be provided for them by economic 
theory (Staehle 1934-1935; Stigler 1954). 

The scope of cross-section analysis. The ex- 
planatory variables appearing in economic relations 
can be divided into three groups: (1) variables 
that vary in a cross section and over time; (2) 
variables that are stable in a cross section and vary 
over time; (3) variables that vary in a cross sec- 
tion and are stable over time. 

It is the existence of the first group that makes 
cross-section analysis valuable in economics, For 
instance, income is an important variable in con- 
sumption functions, and its impact on consumption 
can be learned from cross-section analysis. If in- 
come did not vary in a cross section, cross-section 
analysis of consumption functions would be im- 
possible; if income did not vary over time, the re- 
sults of cross-section analysis would be of little 
interest. 

The existence of the second group implies that 
cross-section analysis by itself is not sufficient to 
explain the variations of many economic variables. 
For example, variations in consumption resulting 
from changes in income can be determined by 
cross-section analysis, since income varies in a 
cross section; but variations in consumption result- 
ing from changes in prices cannot be determined 
by cross-section analysis, since prices generally do 
not vary in a cross section. Thus, to obtain com- 
plete economic relations, variables in the second 
group must be included in the relations, and the 
coefficients of these variables must be estimated by 
time series analysis. 

The variables in the third group may be very 
important in explaining variations of economic 
variables in a cross section, For instance, age and 
sex and their interaction may explain more of the 
variations in consumption of certain commodities 
among individuals than does income. Yet they may 
be relatively unimportant variables to consider in 
most economic decisions or predictions, since 
neither the average age nor the sex composition of 
the population varies much over time. 

The variables in the third group are largely non- 
economic, that is, they are not endogenous varia- 
bles in current economic theory. Among the im- 
portant variables in this group are those measuring 
the uncertainty faced by decision-making units, 
particularly firms. According to the theory of the 
firm facing no uncertainty, the amount of a good 
supplied by a firm and the amounts of productive 
factors demanded by a firm depend on the price of 
the good and on the prices of the productive factors. 
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Given input and output prices, the firm should 
choose the input-output configuration that maxi- 
mizes its profits, But in many empirical analyses of 
cross-section data, it is found that prices explain 
only a small proportion of variations in input de- 
mands and output supplies among firms. The unex- 
plained variations may be attributable to differ- 
ences in the degree of uncertainty among firms or 
to differences in the response to uncertainty among 
firms. Firms may be certain of the prices that will 
prevail when they execute their decisions, but they 
may be uncertain of the quantity of output that 
can be sold or the amounts of inputs that can be 
purchased at those prices. 

Given uncertainty of this kind, a firm may devi- 
ate from its profit maximizing input-output con- 
figuration, so as to avoid partially the costs that 
would be incurred if plans cannot be realized, e.g., 
the costs of undesired inventory accumulations. In 
such cases a firm might consider all input-output 
configurations that yield profits greater than a pre- 
assigned fraction, say 95 per cent, of maximum 
profits and select one of these configurations for 
final execution, The range of acceptable configura- 
tions may be very large. Hence, firms may differ 
considerably in their decisions, the differences re- 
sulting from variations in uncertainty and in the 
response to uncertainty among firms. 

From these considerations it is evident that eco- 
nomic variables may well explain only a small frac- 
tion of the variations of a dependent variable in a 
cross-section relation and that other variables that 
appear or should appear in the relation will per- 
form the major explanatory role. Thus, there is no 
a priori requirement that cross-section relations 
produce high degrees of explanation or that, if they 
do, their explanatory powers result from the eco- 
nomic variables included (Grunfeld & Griliches 
1960). Yet when all individuals are taken together, 
the noneconomic variables generally offset each 
other, and economic variables, such as prices, may 
turn out to be the important explanatory variables. 

The problem of multiperiod relations, In gen- 
eral, the time horizon of economic decisions ex- 
tends beyond a single period, Consequently, “true” 
economic relations contain variables of several 
periods; but in cross-section analysis one can 
usually observe only the current variables, and the 
problem of inferring a complete relation from such 
partial information arises. To illustrate, according 
to the permanent income hypothesis the consump- 
tion function is a relation between permanent con- 
sumption and permanent income. While permanent 
consumption may be approximately equal to ob- 
served consumption, permanent income is an aver- 


age of incomes of several periods. Hence, consump- 
tion in any given year will depend, apart from 
errors, on a stream of income which, except for its 
current component, is unobserved. Similarly, in 
deciding whether to invest in durable assets, a firm 
takes into account the profits that the assets will 
yield not only in the year of the investment but also 
in future years. Thus, in order to ascertain em- 
pirically the determinants of business investment, 
it is necessary to take into account firms’ expecta- 
tions of the future, which are unobservable, 

The nature of the problem can also be seen by 
expressing an economic relation as 


F(x, æi, +++, Me, +++) =0, 


where x, is the vector of variables from period t 
that enter the relation. The problem discussed 
above exists when observations are available only 
for a particular æ, (as is the case in cross-section 
analysis) and when f is not separable with respect 
to that x,. In such cases the estimated cross-section 
relation is subject to bias resulting from the omis- 
sion of the unobserved variables. This is all that 
can be said at this level of generality. 

More specific conclusions can be arrived at when 
the underlying theory is more specific with respect 
to the economic relation to be estimated and with 
respect to the relationships between the observed 
and the unobserved variables. For instance, in the 
case of the permanent income hypothesis the stream 
of income over time in the true relation is replaced 
by one unobserved variable—permanent income. 
For purposes of estimation it is assumed that ob- 
served income measures permanent income with 
an unsystematic error. With specifications of the 
properties of the error term, the estimation prob- 
lem reduces to a regression problem with errors in 
the variables, and the appropriate statistical meth- 
ods are applied [see LINEAR HYPOTHESES, article 
On REGRESSION]. 

It should be noted that if the measurement errors 
are unsystematic, then aggregating the observa- 
tions will reduce the measurement errors. Thus, 
while measurement errors are a serious problem in 
estimating most multiperiod relations with cross- 
section data, they usually do not present much of a 
problem in estimating such relations with aggre- 
gate time series data. 

In studying multiperiod relations, considerable 
information can be gained by taking repeated ob- 
servations on microeconomic units over time. 
Liviatan (1963) used such information to per- 
form a rich variety of tests of the permanent in- 
come hypothesis. 

The study of multiperiod relations is further 


complicated by the existence of uncertainty in the 
decisions of microeconomic units. Consumers and 
firms do not have complete information on the 
future values of the variables exogenous to them. In 
such cases observed values of variables cannot be 
identified with expectations, and the utilization of 
repeated observations does not solve this problem. 
Assumptions must be made regarding the forma- 
tion of expectations and regarding the behavior of 
individuals under uncertainty. Some of the conse- 
quences of uncertainty were noted in the preced- 
ing section. 

Estimation. An initial step in estimating cross- 
section relations is choosing the explanatory varia- 
bles to be included. Returning to the classifications 
noted above, variables in the second group obvious- 
ly cannot be included, since they do not vary in a 
cross section, Variables in the first group should 
be included for two reasons. First, their variations 
in the cross section may contribute to explaining 
the variations in the dependent variable of the rela- 
tion, Second, since they vary over time, cross- 
section estimates of their coefficients will be useful 
in making intertemporal forecasts with the esti- 
mated relation. Variables in the third group, which 
are specific to the cross section, may be of little 
interest for forecasting intertemporal changes in 
the dependent variables. However, if they are cor- 
related with variables in the first group, they should 
also be included in the relation, to avoid bias in the 
estimates of the coefficients of the variables in 
the first group. For example, the size of the family 
is included as a variable in cross-section studies of 
Engel curves, even though it may have little to 
contribute in making intertemporal forecasts (Li- 
viatan 1963; Prais & Houthakker 1955). 

Sometimes the variables in the third group are 
not quantifiable but their attributes can be speci- 
fied. They are then introduced into the analysis by 
grouping individuals according to the attributes 
and estimating separate relations for each group. 
For example, in studying consumption functions 
one of the variables in the third group might be 
“place of residence.” Individuals in a cross-section 
sample might then be grouped according to geo- 
graphical areas, After fitting the relation to each 
group, one may test the hypothesis that individuals 
in all groups behave in the same manner, i.e., that 
the same relation holds for all the groups and that 
the so-called group effects are insignificant. Co- 
variance analysis provides the statistical framework 
for testing the equality of intercepts and the equali- 
ties of some or all of the slopes in the group rela- 
tions. In such analysis, within-group variations of 
the variables (deviations of observations in the 
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group from the group mean) are utilized, so at 
least two observations per group are required. 

It may happen that the appropriate groups are 
identical to the units of the observations, For ex- 
ample, managerial ability is not quantifiable but 
must be allowed for in a cross-section study of pro- 
duction functions. However, the unit of observation 
may be the firm, and managerial ability probably 
differs for each firm. In that case, covariance 
analysis is impossible with just one cross section, 
since there is only one observation per group. If 
managerial ability is to be handled by the use of 
covariance analysis, repeated observations over 
time must be made on each firm, Since this calls 
for a combination of time series and cross-section 
data, some of the variables in the second group 
must also be included in the relation. If some of 
these variables are not directly quantifiable, their 
effects may be allowed for by introducing different 
intercepts and slopes for the various years in the 
sample (Mundlak 1963). 

Analysis that is based merely on within-group 
variations of the variables ignores the between- 
group variations (deviations of the group means 
from the mean of all the observations), which are 
often much larger. The between-group variations 
can be utilized if the explanatory variables in the 
estimated relations are not correlated with the 
group effects. For example, if income is uncorre- 
lated with “place of residence,” then the mean con- 
sumptions of the groups can be regressed on the 
mean incomes of the groups. This is particularly 
desirable when the variables are subject to un- 
systematic measurement errors (as is the case 
when testing the permanent income hypothesis), 
because averaging observations for each group will 
eliminate most of the measurement errors. [See 
LINEAR HYPOTHESES, article on ANALYSIS OF VARI- 
ANCE.] 

The problem of measurement errors is also han- 
dled in cross-section analysis by the use of instru- 
mental variables [see SIMULTANEOUS EQUATION 
ESTIMATION], For instance, in estimating the con- 
sumption function it is assumed that observed in- 
come measures permanent income with error. If 
this error is not serially correlated, the income of 
one year may be used as an instrumental variable 
for estimating the consumption function in another 
year. Note that this again calls for repeated ob- 
servations over time on the incomes of the micro- 
economic units. 

When variables in the relation to be estimated 
are jointly determined, the explanatory variables 
in the relation may not be independent of the dis- 
turbance term. For instance, factor inputs may not 
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be independent of the disturbance term in a pro- 
duction function (Mundlak 1963; Walters 1963). 
In such cases estimating the relation by direct 
least squares will result in biased estimates of the 
coefficients. Various multiequation estimation pro- 
cedures are available to overcome least-squares 
bias. However, they depend fundamentally on re- 
strictions that may be satisfied only over time and 
not in a cross section, An exception is the instru- 
mental variables method which, for example, 
uses lagged (or lead) factor inputs as instrumen- 
tal variables in estimating a production function. 

Special problems exist in estimating dynamic 
cross-section relations that involve adjustment 
processes [see DISTRIBUTED LAGS]. The empirical 
implications of many of the currently used ad- 
justment models may be more applicable to group 
behavior than to individual behavior, While indi- 
viduals facing uncertainty may not react instan- 
taneously to changes in the variables on which they 
base their decisions, changes in their decisions may 
be discrete rather than continuous. It may be ad- 
vantageous for them to make larger adjustments 
less often. However, the adjustment models em- 
ployed in empirical work generally assume con- 
tinuous adjustment. Since the frequency and size 
of adjustments may vary among individuals, con- 
tinuous adjustment might be the result for the 
group. Here again, repeated observations over time 
on individuals can be utilized to surmount this 
difficulty. 

Problems of application. The application of 
cross-section estimates of economic relations to 
intertemporal predictions of aggregates is subject 
to several difficulties, The cross-section estimates 
may depend on the values of the variables that are 
constant in the cross section but vary with time. 
Presumably this problem should be solved by a cor- 
rect specification of the estimated relations, so that 
the variables that are stable in the cross section 
will not affect the estimates of the coefficients of 
the cross-section variables, For instance, income 
coefficients should be independent of prices, so that 
an estimated income coefficient will be applicable 
to periods with different prices. While such inde- 
pendence may nearly exist for income coefficients 
and prices, it may not exist in relations such as 
donde functions, where less regularity is the 
rule. 

Furthermore, estimation of the coefficients of 
variables that are constant in a cross section but 
vary over time requires time series data. In esti- 
mating their coefficients by time series analysis, it 
is possible to use cross-section estimates of the 
coefficients of the variables in the first group in 


the time series relation. This of course can be done 
only if the cross-section estimates do not vary much 
from year to year. Income elasticities estimated 
from cross-section analysis are often grafted onto 
time series demand equations (Tobin 1950), 
Finally, the transformation of estimates ob- 
tained for individuals to estimates applicable to 
markets is somewhat problematic. Aggregation over 
individuals is sensitive to the distribution of the 
explanatory variables among individuals and may 
lead to aggregate relations that differ in form from 
the individual relations (Houthakker 1955; Tobin 
1950). 
YAIR MUNDLAK 


BIBLIOGRAPHY 

BRONFENBRENNER, MARTIN; and Dovectas, P. H. 1939 
Cross-section Studies in the Cobb-Douglas Function. 
Journal of Political Economy 47:761-785 

FRIEDMAN, Mitton 1957 A Theory of the Consumption 
Function. National Bureau of Economic Research, Gen- 
eral Series, No. 63. Princeton Univ. Press. 

GRUNFELD, YEHUDA; and GRILICHES, Z. 1960 Is Aggre- 
gation Necessarily Bad? Review of Economics and 
Statistics 42: 1-13. 

HOUTHAKKER, HENDRIKS. 1955 The Pareto Distribution 
and the Cobb-Douglas Production Function in Activity 
Analysis, Review of Economic Studies 23, no. 1:27-31. 

KLEIN, LAWRENCE R., 1953 Textbook of Econometrics. 
Evanston, Ill.: Row, Peterson. > See especially pages 
211-241. 

Kun, Epwin 1963 Capital Stock Growth: A Micro-econo- 
metric Approach. Contributions to Economic Analysis, 
32. Amsterdam: North-Holland Publishing. 

LiviaTan, Nissan 1963 Tests of the Permanent-income 
Hypothesis Based on a Reinterview Savings Survey. 
Pages 29-59 in Measurement in Economics: Studies 
in Mathematical Economics and Econometrics in 
Memory of Yehuda Grunfeld. Stanford Univ. Press. 
=> A “Note” by Milton Friedman and a reply by Livia- 
tan appear on pages 59-66. 

LiviaTan, Nissan 1964 Consumption Patterns in Israel. 
Jerusalem: Falk Project for Economic Research in 
Israel. 

MICHIGAN, UNIVERSITY or, SURVEY RESEARCH CENTER 
1954 Contributions of Survey Methods to Economics. 
Edited by Lawrence R. Klein. New York: Columbia 
Univ. Press. 

Monptax, Yarr 1963 Estimation of Production and Be- 
havioral Functions From a Combination of Cross- 
section and Time-series Data. Pages 138-166 in 
Measurement in Economics: Studies in Mathematical 
Economics and Econometrics in Memory of Yehuda 
Grunfeld. Stanford Univ. Press. 4 

Prats, S. J.; and Hournaxxer, H. S. 1955 The Analysis 
of Family Budgets With an Application to Two British 
Surveys Conducted in 1937-1939 and Their Detailed 
Results. Cambridge Univ. Press. 

STaEHLE, HANS 1934-1935 Annual Survey of Statistical 
Information: Family Budgets. Econometrica 2:349- 
362; 3: 106-118. 

STIGLER, GEORGE J. 1954 The Early History of Empiri- 
cal Studies of Consumer Behavior. Journal of Political 
Economy 62:95-113. 


Tos, James 1950 A Statistical Demand Function for 
Food in the U.S.A. Journal of the Royal Statistical So- 
ciety Series A 113, part 2:113-141, 

WALTERS, ALAN A. 1963 Production and Cost Functions: 
An Econometric Survey. Econometrica 31: 1-66. 


CROWDS 
See COLLECTIVE BEHAVIOR; MASS PHENOMENA; 
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CULTS 
See SECTS AND CULTS and the biography of 
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CULTURAL ADAPTATION 


ee under CULTURE. 


CULTURAL ANTHROPOLOGY 
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CULTURAL COOPERATION 
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CULTURAL LAG 
See CULTURE, article on CULTURE CHANGE; 
SOCIAL CHANGE. 


CULTURAL RELATIVISM 
See under CULTURE. 


CULTURE 


The articles under this heading deal primarily 
with the nature and history of the concept of cul- 
ture. The study of cultural and social anthropology 
is discussed under the heading ANTHROPOLOGY; 
the development of the concept of culture and its 
applications are reviewed in CULTURE AND PERSON- 
ALITY; DIFFUSION; ECOLOGY; EVOLUTION; HISTORY, 
article on CULTURE HISTORY; SOCIAL STRUCTURE. 
Culture patterns and configurations are described 
in INTEGRATION, article on CULTURAL INTEGRATION, 
and in the biography of BENEDICT. Other utiliza- 
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tions of the concept are reviewed in POLITICAL 
CULTURE and STRATIFICATION, SOCIAL, article on 
CLASS CULTURE. The biographies of BOAS; KROEBER; 
MALINOWSKI; and TyLor should also be consulted. 
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v. CULTURE CHANGE 


I 
THE CONCEPT OF CULTURE 


In his charter definition of the anthropological 
concept of “culture,” Tylor stated: “Culture or Civi- 
lization, taken in its wide ethnographic sense, is 
that complex whole which includes knowledge, be- 
lief, art, morals, law, custom, and any other capa- 
bilities and habits acquired by man as a member 
of society” ({1871] 1958, vol. 1, p. 1). His defini- 
tion does not distinguish social organization and 
social institutions from a general concept of culture. 

This inclusive use of the term “culture” was con- 
tinued by Boas, Malinowski, and other ethnolo- 
gists. In other respects the later usage differs from 
Tylor’s in the new emphasis on the plurality of local 
cultures as functioning and organized wholes and 
in the loss of interest in the long-run evolution of 
discrete customs and institutions. With these many 
focuses, the three axioms of ninetenth-century an- 
thropology—the psychic unity of mankind, the 
unity of human history, and the unity of culture— 
began to fade away. If such unities existed, it was 
thought, they would have to be laboriously pieced 
together from the comparative and intensive studies 
of many individual societies and cultures. Such 
universal principles could not be invoked as ex- 
planatory postulates. 

This pluralistic and relativistic conception of 
culture, a product of the “Boas revolution” in 
anthropology, has characterized anthropological 
thought for almost fifty years, at least until the 
early 1950s, when a revival of interest in universal- 
istic theories occurred. In its relativized form, an- 
thropology did not, however, devote itself exclusively 
to the study of Tylorian culture. Under the leader- 
ship of Radcliffe-Brown social anthropology devel- 
oped and was made the basis for a separation be- 
tween social anthropology as the comparative study 
of “social structures” and ethnology and cultural 
anthropology, which study cultures comparatively 
or historically. This separation was probably first 
dramatized in a famous debate between W. H. R. 
Rivers, the teacher of Radcliffe-Brown, and A. L. 
Kroeber on the proper interpretation of L, H. Mor- 
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gan’s distinction between classificatory and descrip- 
tive kinship systems. Out of this debate about the 
nature of kinship systems grew the two major rival 
anthropological theories of culture—the theory of 
“culture patterns,” best represented by Kroeber, and 
the theory of “social structure,” best represented 
by Radcliffe-Brown. 

This rivalry is still very much alive, although 
some anthropologists have tried to moderate it with 
peacemaking formulas and with new, integrated 
theories. It has mobilized the major factions in 
modern anthropology and sociology, so that in 
Great Britain, Malinowski and his followers are 
regarded as students of culture and of cultural an- 
thropology, while Radcliffe-Brown and his followers 
are regarded as students of social structure and of 
social anthropology. In the United States the con- 
trast between culture and social structure has sym- 
bolized the institutional rivalry between anthropol- 
ogists and sociologists. Not until 1958 did the dean 
of American anthropologists, A. L. Kroeber, and 
the dean of American sociologists, Talcott Parsons, 
agree to sign a nonaggression pact in which both 
culture and society are recognized (Kroeber & Par- 
sons 1958). 

British social anthropologists usually set them- 
selves off from American anthropologists who have, 
with few exceptions, until recently emphasized 
studies of culture and cultural anthropology (Mur- 
dock 1949; Firth 1951). The national labels are 
out of place, since the “British” Radcliffe-Brown 
derives from the work of Morgan and the French 
sociological school, while the “American” cultural 
anthropologists derive from Tylor and, through 
Boas, the German diffusionists. Bronislaw Mal- 
inowski wrote the article on culture for the Encyclo- 
paedia of the Social Sciences (1931), placing 
emphasis on culture as a functioning, active, effi- 
cient well-organized unity, which must be analyzed 
into component institutions in relation to one an- 
other, in relation to the needs of the human organ- 
ism, and in relation to the environment, man-made 
as well as natural. This concept of culture became 
the “common sense” of an entire American genera- 
tion of anthropologists in the 1930s and 1940s. 
Similarly, in the 1950s and 1960s, much of British 
social anthropology became the common sense of a 
younger generation of American anthropologists. 

Behind this rivalry there are, of course, intel- 
lectual issues; however, in order to separate the 
genuine issues from the spurious, we cannot take 
at face value what the members of one school say 
about the views of another, nor can the chapter 
headings in an ethnographic monograph—religion 


and art, family and marriage—tell us whether the 
material is treated in a framework of culture pat- 
terns or of social structure. The decisive criterion 
is the general framework of theory which is used 
for the interpretation and explanation of a par- 
ticular set of facts. There are two frameworks to 
discuss, that of “culture patterns” and that of 
“social structure.” 


The pattern theory of culture 


A significant text for pattern theory is the his- 

torical and critical review by A. L, Kroeber and 
Clyde Kluckhohn of several hundred definitions of 
culture and their heroic effort to arrive at a sum- 
mary formulation which, they believed, would be 
acceptable to most social scientists: 
Culture consists of patterns, explicit and implicit, of 
and for behavior acquired and transmitted by sym- 
bols, constituting the distinctive achievement of hu- 
man groups, including their embodiments in artifacts; 
the essential core of culture consists of traditional (i.e., 
historically derived and selected) ideas and especially 
their attached values; culture systems may, on the one 
hand, be considered as products of action, on the other 
as conditioning elements of further action. (1952, 
p. 181) 

This represents a condensation of much of what 
American anthropologists, at least in the 1940s and 
1950s, would call culture. And it is certainly a 
richer and more adequate formulation than the 
well-known formula of the 1920s and 1930s, “cul- 
ture is learned behavior,” which seemed so satis- 
fying then. For as Kroeber and Kluckhohn observed, 
while the logical construct of culture is based on 
the study of behavior and behavioral products and 
makes behavior intelligible, 


culture is not behavior nor the investigation of be- 
havior in all its concrete completeness. Part of culture 
consists in norms for or standards of behavior. Still 
another part consists in ideologies justifying or ration- 
alizing certain selected ways of behavior. Finally, every 
culture includes broad general principles of selectivity 
and ordering (“highest common factors”) in terms of 
which patterns of and for and about behavior in very 
varied areas of culture content are reducible to parsi- 
monious generalization. (ibid., p. 189) 

A. Irving Hallowell, himself one of the first an- 
thropologists to apply learning theory to the study 
of culture, has come to a somewhat similar con- 
clusion in a recent discussion of personality, cul- 
ture, and society in behavioral evolution: “Cultural 
adaptation cannot be equated with learned and 
socially transmitted behavior, although it is one of 
the necessary conditions underlying it. Equally im- 
portant in behavioral evolution is how much is 


learned and what is learned, relative to the psycho- 
logical capacities and total life adjustments of the 
animal” (1963, p. 492). But even if we accept the 
Kroeber and Kluckhohn definition of the culture 
concept, as they themselves say: “But a concept, 
even an important one, does not constitute a 
theory. . . . In anthropology at present we have 
plenty of definitions but too little theory” (1952, 
p. 181). 

There are, as they also point out, adumbrations 
of a general theory of culture in the works of Boas 
(1911), Sapir (1927), Benedict (1934), Linton 
(1936), Bateson (1936), Kluckhohn (1941), 
Kroeber ([1923] 1948, chapter 8; 1952; see also 
“Anthropological Horizons” 1962), White (1949), 
Opler (1945; 1946; 1959), and others. Essentially 
this general theory emphasizes the study of pattern, 
form, structure, and organization in culture rather 
than diserete culture traits and culture content. 
While influenced by biological analogies, the pat- 
tern theory is also closely affiliated with the nine- 
teenth-century German school of cultural history 
and with gestalt psychology. Culture patterning is 
an “emergent” of human creativity transcending 
the limits of biology and the natural environment. 

Different spheres of social life differ in suscep- 
tibility to patterning, and culture patterns differ in 
degrees of consciousness and complexity as well 
as in kind. The simplest patterns are the explicit 
and more or less objective patterns of behavior ex- 
pressed in customs of dress, diet, work, and salu- 
tation, and in artifacts, Then there are the more 
complex patterns underlying social, political, and 
economic organization and the systems of religion, 
language, law, philosophy, science, and the arts. 
Among these Kroeber has distinguished those 
“basic” or “systemic” patterns in different fields of 
culture which have persisted (at least in their cul- 
tural descendants) for several thousand years as 
coherent organizations of traits with functional 
value (e.g., the alphabet, plow agriculture, mono- 
theism), and the “secondary” patterns (of formal 
social organization, systems of thought, etc.), 
which are subject to greater variety and insta- 
bility, Different again from all these kinds of cul- 
ture patterns are those qualities of cultural organ- 
ization which come to pervade all or most spheres 
of some cultures and give them a distinctive indi- 
vidual “slant.” Such are the implicit and uncon- 
scious configurations which Ruth Benedict de- 
scribed in her Patterns of Culture (1934). Kroeber 
saw these configurations as “patterns of patterns” 
in those cultures which have achieved stylistic inte- 
gration. Acknowledging that they may have psy- 
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chological correlates in personality traits, he pre- 
ferred to analyze these total cultural patterns in 
cultural and historical terms. 

The totality of human culture also contains an 
element of patterning that provides the general 
framework for individual cultures and represents a 
historical summation of those cultures which have 
segregated themselves out as crystallized historical 
configurations of culture. The “universal pattern” 
of human history, in either sense, is not yet known 
but can only be gradually discovered through com- 
parative-historical studies of the systemic, second- 
ary, and configurational patterns in all cultures, 
primitive and civilized (Kroeber & Kluckhohn 1952, 
p. 185; “Anthropological Horizons” 1962). 

There are no absolute units or natural bound- 
aries for cultural—historical studies, “The lines of 
demarcation of any cultural unit chosen for de- 
scription and analysis are in large part a matter of 
level of abstraction and of convenience for the 
problem at hand. Occidental culture, Graeco-Roman 
culture, nineteenth-century European culture, Ger- 
man culture, Swabian culture, the peasant culture 
of the Black Forest in 1900—these are all equally 
legitimate abstractions if carefully defined” (Kroeber 
& Kluckhohn 1952, p. 185), 

Culture patterns tend to persist as organized 
bodies of custom in spite of changes in items of 
culture content. Changes in individual items can 
be explained as selections and rejections consistent 
with the cultural patterns, But the patterns them- 
selves are also subject to change. There is a cul- 
tural orthogenesis, in which “the direction of at 
least some culture change is more predetermined 
by earlier forms of the culture than caused by en- 
vironmental press and individual variability” (ibid., 
p. 189). Sapir called this “cultural drift”: “When- 
ever the human mind has worked collectively and 
unconsciously, it has striven for and often attained 
unique form, The important point is that the evo- 
lution of form has a drift in one direction, that it 
seeks poise, and that it rests, relatively speaking, 
when it has found this poise” (quoted in ibid., 
p. 182). 

The relevance of “cultural drift” to studies of 
cultural continuity and change is obvious. Eggan 
(1963) has recently applied it to an analysis of 
cultural changes in the Philippines, and Redfield 
and the present writer have suggested ways in 
which the role of cities in cultural change may be 
interpreted as the product of “orthogenetic” and 
“heterogenetic” processes (see the article by Red- 
field & Singer in Redfield 1962; also see Singer 
1959). 
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Another important kind of pattern change has 
been analyzed by Kroeber in his Configurations of 
Culture Growth (1944). In this and in later dis- 
cussions Kroeber showed that the rise and the de- 
cline of civilizations can be viewed as phases in the 
growth and realization of stylistic configurations. 
The clustering of the peak periods of cultural crea- 
tivity in each civilization within limited time peri- 
ods suggests the importance of critical periods of 
“ripeness” in the process of cultural growth and 
innovation (Kroeber 1957; 1963). 

Pattern theory assumes that culture is created by 
individuals and groups and interacts with them as 
well as with the environment. However, these 
interactions of biology, psychology, and geography 
are the given conditions and starting points for 
cultural growth but not its determinants. Such a 
theory views the process of cultural growth as a 
historical process, as Boas emphasized, a “growing 
together” of elements of culture content from differ- 
ent sources, which have become associated in a 
historical configuration, The end result of this his- 
torical process, at any given time, is an associated 
set of patterns, a precipitate of the history of a par- 
ticular group, of its past choices, conscious and 
unconscious, Culture is this precipitate “present in 
persons, shaping their perceptions of events, other 
persons, and the environing situation in ways not 
wholly determined by biology and environmental 
press. Culture is an intervening variable between 
human ‘organism’ and ‘environment’” (Kroeber & 
Kluckhohn 1952, p.186). It is “an abstract descrip- 
tion of trends toward uniformity in the words, acts, 
and artifacts of human groups” (ibid., p. 182). 

Culture pattern theory has also been used in ac- 
culturation studies (Spicer 1962) and in studies 
which try to relate culture patterns to personality 
Structure (reviewed in Singer 1961; and in Mead 
1962) or to changes in environment and demogra- 
phy (Steward 1955), 

Applications of pattern theory tend to avoid 
causal hypotheses, because culture is so intricate, 
multiple and cumulative that there seems no point 
in looking for specific external causes of specific 
cultural forms, either in deterministic laws or in 
cross-cultural statistical correlations, The primary 
research task of a pattern theorist is to delineate 
culture patterns and, beyond that, to compare and 
classify types of patterns as well as to distinguish 
the most fundamental and constant patterns from 
the secondary and variable ones. 


Social structure as a theory of culture 


The theory of social structure was first developed 
in an important series of papers by Radcliffe-Brown 


in the 1930s and 1940s and has since been con- 
siderably extended, “Social structure,” is defined by 
Radcliffe-Brown as a network or system of social 
relations including persistent social groups and dif- 
ferentiated social classes and social roles. In Rad- 
cliffe-Brown’s formulation the theory follows the 
organic analogy very closely; thus comparative 
social morphology is concerned with studying and 
classifying the different types of social structure 
and social physiology with studying how particular 
types of social structure function. It is assumed as 
a working hypothesis that each structural system 
is a functional unity in which all the component 
parts contribute in a harmonious way to its ex- 
istence and continuity. To test this hypothesis all 
kinds of social phenomena—morals, law, etiquette, 
religion, government, economics, education, lan- 
guage—need to be studied “not in abstraction or 
isolation, but in their direct and indirect rela- 
tions to social structure, i.e, with reference to the 
way in which they depend upon, or affect the social 
relations between persons and groups of persons” 
(Radcliffe-Brown 1952, p. 195). 

The study of how particular types of social struc- 
ture change into new structural types forms a third 
branch of the theory of social structure. This study 
requires assistance from history and archeology to 
trace the actual processes in the formation and 
transformation of particular structural types. Rad- 
cliffe-Brown suggested that the hypothesis of social 
evolution, which he accepted as plausible, should 
be defined “as the process by which wide-range 
systems of social structure have grown out of, or 
replaced, narrow range systems” (ibid., p. 204). 

Radcliffe-Brown’s formulation of the theory of 
social structure is universal; it is intended to apply 
to societies of all kinds, at all places and times. In 
practice, Radcliffe-Brown and others at first re- 
stricted social anthropology to a comparative study 
of the social structure and social physiology of con- 
temporary nonliterate and simple societies, Since 
primitive societies were assumed not to have his- 
tories or historical records, the study of structural 
change was also restricted to cases of contact with 
civilized societies. These limitations led to a defini- 
tion of social anthropology as the intensive study 
of the structural systems of small, nonliterate com- 
munities, or “primitive isolates.” 

Developments in social anthropology have since 
relaxed these restrictions, and studies of structural 
change in primitive and simple societies have 
undertaken to combine history and archeology 
with structural—functional analysis (as in the work 
of Eggan, Evans-Pritchard, and M. G. Smith) oF 
to restudy the same society at different periods 


(as in the work of Redfield, Firth, and others); the 
study of structural conflicts in simple societies was 
undertaken and the assumption of stability was 
dropped (Leach, Gluckman, Fallers); and studies 
of the social structures of peasant and modern 
communities and of civilizations were begun (Red- 
field, Warner, Geertz, Firth, Schneider, M, Freed- 
man, E. Wolf and others). 

These developments have broadened social an- 
thropology and have brought it closer to the origi- 
nal universal scope of the theory of social structure. 
The expansion of social anthropology to include 
macrostructural studies of peasant societies, mod- 
ern communities, and civilizations has gone more 
smoothly in comparative social morphology than 
in social physiology. The reasons for this are ob- 
vious—it is relatively easier to trace networks 
of social relations, social classes, and social groups 
in a large-scale society than it is to demonstrate 
how such a macrostructural system constitutes a 
functioning, integrated unity, To demonstrate the 
existence of such a functional unity requires con- 
sideration of the results of different approaches, 
such as economics, political science, literary 
studies, and art history, each of which has made 
some specialized aspect or subsystem of the total 
society its peculiar subject matter. Structural 
changes, on the other hand, are easier to trace at 
this level because of the longer time perspectives 
and data provided by historical and archeological 
studies. 

The boundaries and identity of the unit of study 
also became more problematic when social anthro- 
pology moved beyond the primitive isolate. Rad- 
cliffe-Brown was very much aware of this diffi- 
culty: “It is rarely that we find a community that 
is absolutely isolated, having no outside contact. At 
the present moment of history, the network of 
social relations spreads over the whole world, with- 
out any absolute solution of continuity anywhere” 
(ibid., p. 193). 

If this is so, what then is a unit society, asks 
Radcliffe-Brown? “Is the British Empire a society 
or a collection of societies? Is a Chinese village a 
society, or is it merely a fragment of the Republic 
of China?” The answer is pragmatic: “If we take 
any convenient locality of a suitable size, we can 
study the structural system as it appears in and 
from that region, i.e. the network of relations con- 
necting the inhabitants amongst themselves and 
with the people of other regions” (ibid. ). 

Thus, finding the suitable and convenient unit 
of society becomes a matter of the problem at hand 
and the resources one has available for dealing 
with it, In effect, then, there is no “natural” unit 
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of society most suitable for structural analysis 
which can be defined at the beginning of fieldwork; 
natural units emerge in the form of structural 
types only as the results of intensive field studies, 
comparison, abstraction, classification, and gen- 
eralization. 

Radcliffe-Brown and other adherents of the the- 
ory of social structure tended to avoid using the 
term “culture” after the early 1930s. This avoid- 
ance is based on the claim that social anthropology 
studies social structure, not culture, This claim is 
misleading. In fact, the theory of social structure 
both explicitly and implicitly incorporates a con- 
cept of culture. Fortes, for example, writes that 
social structure and social organization are not just 
“an aspect of culture but the entire culture of a 
given people handled in a special frame of theory” 
(1953, p. 21). Fortes uses “culture” in almost pre- 
cisely the same sense as Kroeber and Kluckhohn. 
In this frame “the facts of custom—the standard- 
ized ways of doing, knowing, thinking, and feeling 
—universally obligatory and valued in a given 
group of people at a given time” are then seen “as 
symbolizing or expressing social relations” (ibid.). 

This special frame of theory is obviously Rad- 
cliffe-Brown’s social physiology. Without using the 
word “culture” Radcliffe-Brown acknowledges the 
concept when he defines a social system as “the total 
social structure of a society together with the 
totality of social usages in which that structure 
appears and on which it depends for its continued 
existence” (1952, p. 181), These social usages in- 
clude morals, law, etiquette, religion, government, 
education, and every kind of social phenomenon 
which is a part of “the complex mechanism by 
which a social structure exists and persists” (ibid., 
p. 195). Social physiology, in other words, is a 
frame of theory which tries to relate all aspects of 
culture, in Tylor’s sense, to social structure as a 
network of social relations. 

That a concept of culture is implicit in the 
theory of social structure is usually overlooked be- 
cause of the notion that social anthropology deals 
with “actually existing social relations” and not 
with such abstractions as culture, Radcliffe-Brown, 
who occasionally writes in this vein (e.g., ibid., 
pp. 189-190), nevertheless makes it quite clear 
that while actually existing social relations may 
provide the raw data of observation, they are not the 
same thing as the social structure, which is derived 
from them by abstraction and generalization. 


In the study of social structure the concrete reality 
with which we are concerned is the set of actually 
existing relations, at a given moment of time, which 
link together certain human beings. It is on this that 
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we can make direct observations. But it is not this that 
we attempt to describe in its particularity. ... What we 
need for scientific purposes is an account of the 
form of the structure. For example, if in an Australian 
tribe I observe in a number of instances the behaviour 
towards one another of persons who stand in the rela- 
tion of mother’s brother and sister's son, it is in order 
that I may be able to record as precisely as possible 
the general or normal form of this relationship, ab- 
stracted from the variations of particular instances, 
though taking account of these variations. (ibid., 
p. 192) 


A social structure, then, is not something di- 
rectly observed but an abstraction of structural 
forms from the actually existing relations, which 
are observable. These abstracted “structural forms” 
or “normal forms of social relations” cannot be 
described or understood without reference to cul- 
ture: “Social relations are only observed, and can 
only be described, by reference to the reciprocal 
behaviour of the persons related, The form of a 
social structure has therefore to be described by the 
patterns of behaviour to which individuals and 
groups conform in their dealings with one another” 
(ibid., p. 198). 

Such behavior patterns sound very much like 
Kluckhohn and Kroeber’s explicit and implicit pat- 
terns: “These patterns are partially formulated in 
rules which, in our own society, we distinguish as 
rules of etiquette, of morals and of law. Rules, of 
course, only exist in their recognition by the mem- 
bers of the society; either in their verbal recogni- 
tion, when they are stated as rules, or in their 
observance in behaviour” (ibid.). 

At the very heart of the theory of social struc- 
ture we find the concept of culture as a set of rules, 
implicit or explicit, of standardized modes of be- 
havior and thought. The concept of culture is also 
implicit in Radcliffe-Brown’s definition of “a social 
relation” as a mutual adjustment of interests be- 
tween persons: “Whenever we say that a subject 
has a certain interest in an object we can state the 
same thing by saying that the object has a certain 
value for the subject. Interests and values are cor- 
relative terms, which refer to the two sides of an 
asymmetrical relation” (ibid., p. 199). 

This conception of a value as any object of any 
interest, derived from the American philosopher 
R. B. Perry, is extended by Radcliffe-Brown to a 
definition of “a social value” as the object of a com- 
mon interest. This leads Radcliffe-Brown to the 
position that values—and their correlative interests 
—are the determinants of social relations, and 
hence of social structure. The foundations of the 
theory of social structure are thus two intangibles 
—social values and psychological interests. 


It is now clear why the theory of social structure 
can dispense with the word “culture”: it has incor- 
porated the culture concept into the core of the 
theory, for the theory of social structure deals with 
social relations not simply as concrete actually 
existing objects of observations but as institution- 
alized and standardized modes of behavior and 
thought whose normal forms are socially recog- 
nized in the explicit or implicit rules to which the 
members of a given society tend to conform 


Culture patterns and social structure as 
parallel and as complementary 


There is a striking formal parallelism between 
the theory of culture patterns and the theory of 
social structure. Both are holistic theories in the 
sense that they try to cover all aspects of society 
and culture—law, politics, economy, technology, 
kinship and social organization, art, literature, lan- 
guage, religion, philosophy, science, and so on. 
Fortes’ formulation applies to both theories: each 
provides a special frame of theory to handle the 
entire culture of a people. Or, to put it a little dif- 
ferently, each theory incorporates Tylor’s omnibus 
concept of culture in a different frame. Both the- 
ories are universalistic: they are intended to apply 
to all kinds of societies and cultures, and not to 
just one special kind. Each theory defines its basic 
concepts in such a way that it is possible to deal 
with different levels and hierarchies of pattern and 
structures, including the possibility of a single 
world-wide culture pattern and a world-wide net- 
work of social relations. 

The early field studies of “primitive” and small- 
scale societies and cultures were in part associated 
with two kinds of theoretical interests: the linger- 
ing interest in the origins and evolutionary place 
of “contemporary primitives” and the belief that 
primitive societies and cultures are instances of 
simple, functionally integrated units. It was in the 
latter connection that both the theory of culture 
patterns and that of social structure came to be 
thought of as essentially theories of the primitive 
isolate. As both theories were extended to peasant 
villages and to modern urban communities, how- 
ever, the primitive isolate gradually faded as 4 
natural unit; thus liberated, both theories were 
applied to morphological, functional, and historical 
studies of both the culture patterns and the social 
organization of civilizations. 

The parallels between the theory of culture pat 
terns and the theory of social structure are numer- 
ous and striking. Both theories have explanatory 
aims, although each finds different factors to be 
primary. 


The relation between basic and secondary pat- 
terns, on the one hand, and between the “substruc- 
ture” of social relations and the “superstructure” of 
culture, on the other, is not necessarily causal, but 
it has explanatory value. This, therefore, is the 
parallelism between the two theories; the difference 
arises from the fact that the pattern theory does 
not specify which aspects of culture and society are 
most likely to form basic patterns—they may be 
matters of religion, technological invention, or 
ideas—while the structural theory assigns basic 
explanatory value to social relations. This differ- 
ence, in relation to a particular individual society 
or culture, is not very great, because the structural 
theory considers an “explanation” achieved when 
it has shown how each part contributes function- 
ally to the existence and continuity of a particular 
type of social structure, while the pattern theory's 
desideratum for “explanation” is to show how each 
part fits into an over-all configuration or stylistic 
pattern of the culture. 

The difference between the two concepts is not 
that one is an abstraction and the other a concrete, 
observable unit of behavior, for both are abstrac- 
tions of regularities from observations of actual 
behavior, whether these regularities are implicit 
and unconscious or explicit and verbalized. That 
social structure, too, is an abstraction and not a 
directly observable, concrete reality was first effec- 
tively argued by Bateson (1936) and subsequently 
reaffirmed by Fortes (1949), Firth (1951), Kroeber 
and Kluckhohn (1952), Nadel (1951), Eggan 
(1955), Redfield (1955; 1956), Lévi-Strauss 
(1953), Leach (1954), Schneider (1965), and 
others. Radcliffe-Brown himself in an earlier paper 
acknowledged this when he wrote: 


In human society the social structure as a whole can 
only be observed in its functioning. Some of the fea- 
tures of social structure, such as the geographical dis- 
tribution of individuals and groups can be directly 
observed, but most of the social relations which in 
their totality constitute the structure, such as relations 
of father and son, buyer and seller, ruler and subject, 
cannot be observed except in the social activities in 
which the relations are functioning. (1952, p. 181) 


A particularly concise and lucid formulation of 
the issue has been made by Firth: 


If , . . society is taken to be an organized set of indi- 
viduals with a given way of life, culture is that way 
of life. If society is taken to be an aggregate of social 
relations, then culture is the content of those relations. 
Society emphasizes the human component, the aggre- 
gate of people and the relations between them. Culture 
emphasizes the component of accumulated resources, 
immaterial as well as material, which the people inherit, 
employ, transmute, add to, and transmit. (1951, p. 27) 
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Both Eggan (1955) and Redfield (1962) have sys- 
tematically analyzed the differences and similari- 
ties between the complementary abstractions, cul- 
ture and social structure. Eggan’s summary includes 
the essential points: social structures are more 
limited in variety of forms and more predictable 
in terms of change than culture patterns and may 
vary independently; social relations are more ab- 
stract and more difficult to grasp than are cultural 
forms and less likely to be borrowed; social integra- 
tion and cultural integration are defined by differ- 
ent criteria and can vary independently of one 
another; cultural integration is perhaps the more 
essential for personality integration, although the 
integration of the individual into groups is also 
important; the data and forms of culture seem to 
be more amenable to historical study, and social 
structures more amenable to classification and com- 
parison, although the method of controlled compar- 
ison can and should be applied both to social 
structure and to culture patterns in a given histori- 
cal framework (Eggan [1955] 1962, pp. 490-501). 

Eggan explains the major differences between 
British social anthropology and American ethnol- 
ogy in terms of the emphasis given to one or the 
other side of this complementarity: 


The British social anthropologists tend to think of 
themselves as sociologists concerned primarily with 
the social structures and institutions of primitive so- 
cieties, or they utilize social structure as a frame for 
the organization and interpretation of cultural phe- 
nomena; most American ethnologists consider culture 
as the major concept and point of departure and sub- 
ordinate social structure to it, if they utilize this con- 
cept at all, preferring to operate with concepts of 
culture pattern and cultural form. (ibid., p. 490) 


Yet Eggan, along with everyone else who has 
stressed the complementary nature of the concepts 
of social structure and culture, emphasizes that 
these 
two aspects of social behavior—social structure and 
cultural pattern—cannot exist independently of one 
another in human society: society and culture are 
mutually dependent, and social relations are carried, 
or exemplified, only in cultural behavior. Social insti- 
tutions partake of both aspects: they are composed of 
individuals organized through recurring social relation- 
ships into a social structure, with a set of attitudes, 
beliefs, and behavior patterns through which the struc- 
ture is exemplified and the institutional ends achieved. 
(ibid., p. 492) 

If the complementarity between social structure 
and culture pattern is so intimate and interdepend- 
ent, how is it possible to construct a global theory 
of either concept by itself? Eggan suggests that the 
actual field studies by British social anthropologists 
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and American ethnologists show fewer differences 
than their theoretical formulations would lead one 
to expect. This is probably true, but leaves the the- 
oretical formulations in doubt. For if social struc- 
ture and culture pattern are complementary ab- 
stractions, then it follows that a theory of social 
structure must also make a place for the concept 
of culture pattern and that a theory of culture pat- 
terns must make a place for the concept of social 
structure. Each is a comprehensive theory which 
includes both concepts, but not on a basis of equal- 
ity. The pattern theory subordinates social structure 
to culture, and the structural theory reverses the 
subordination, Acceptance of the complementarity 
of the two concepts is consistent with both theories 
and does not, therefore, account for the difference 
between the structural and pattern theories of 
culture. 


Cultural versus structural explanations 
of kinship systems 


The best clue to the difference between the pat- 
tern theory and the structural theory is to be found 
in the Rivers—Kroeber—Radcliffe-Brown debate over 
Morgan’s distinction between “classificatory” and 
“descriptive” kinship systems. This debate is well 
known in the history of kinship studies and served 
to crystallize the difference between the concepts of 
a kinship system as a “cultural system” and as a 
“social system” and the difference between a pattern 
and a structural theory of culture. All the issues in 
this debate are by no means dead, so far as can 
be judged from the revival of Kroeber’s earlier posi- 
tion in the recent development of “componential 
analysis” of kinship by Goodenough and Lounsbury 
and, to some extent, in Lévi-Strauss’s “structural- 
ism.” We shall deal briefly with the debate here 
only insofar as it stimulated both Kroeber and 
Radcliffe-Brown to sharpen the difference between 
structural and cultural explanations of kinship 
systems, and, a fortiori, of other systems as well. 

Kroeber’s 1909 paper, “Classificatory Systems of 
Relationship,” in addition to criticizing Morgan’s 
conception of classificatory kinship systems, pro- 
posed to compare and classify all kinship terminol- 
ogies in terms of a limited number of categories of 
relationship and the degree to which these cate- 
gories are recognized and expressed in different 
terminological systems. Kroeber identified eight 
such categories: difference in generation, differ- 
ence of age in generation, difference between lineal 
and collateral relatives, sex of relative, sex of 
speaker, sex of connecting relative, difference be- 
tween relation by marriage and by blood, and con- 
dition of life of connecting relative. In the 1909 


paper he used these categories to compare English 
kin terms and those of several North American 
Indian languages. His conclusion from this com- 
parison was that while the English terms give ex- 
pression to a smaller number of the categories, each 
set of terms forms a consistent and self-contained 
system. From the comparison he also drew a more 
general conclusion—that one cannot reconstruct 
specific social institutions and forms of marriage 
from specific systems of kinship terms, as was the 
common practice, because “terms of relationship 
reflect psychology not sociology,” and are “deter- 
mined primarily by language and can be utilized 
for sociological inferences only with extreme cau- 
tion” (p. 84), 

Under the pressure of criticism from Rivers and 
later from Radcliffe-Brown, both of whom defended 
the “sociological” theory, Kroeber clarified what he 
meant by “psychological” and “linguistic” determi- 
nation of systems of kinship terms. He meant, he 
explained later, that the systems were “clean-cut 
made-to-order patterns of culture, speech, and con- 
ceptualization” (1952, p. 174). Certainly, kinship 
terminologies are part of a language, and as such 
they are classifications, and classifications are 
based on categories, The categories are conceptual 
categories subject to the patterning of “unconscious 
logic.” The result is “a pattern of semantic classifi- 
cation for thinking and speaking of blood relation- 
ship.” Some of these patterns may possess “a sur- 
prising historical tenacity,” as Kroeber shows in the 
case of the Philippine kinship system. Such seman- 
tic patterns, or “logical schemes,” may also corre- 
spond to social institutions, as in the Philippine 
system, but the correspondence is between the total 
pattern of terminology and the total pattern of in- 
stitutions, and reflects the underlying logical scheme 
rather than any exact detailed causality between 
kin terms and institutions. The correspondence of 
the terminology with the general level of culture— 
language and religion, for example—may be even 
looser. A kinship terminology, in Kroeber’s view, is 
primarily a semantic pattern, that is, a pattern of 
speech and thought, and only secondarily and in 
special cases is it correlated with a pattern of social 
institutions. 

Kroeber was interested in linguistic and logical 
systems for their own sake and believed that cul- 
ture was too intricate to be easily unraveled accord- 
ing to any formula of exact causal determinism. 
He saw kinship terminologies, like other aspects of 
culture, as historically developed and, to some ex- 
tent, independent, stylistic patterns. Patterns of 
formal social structure, such as clans, moieties, 
totems, and unilateral groups, are also examples of 


such stylistic differentiation and secondary elabora- 
tion of the primary patterns of subsistence and 
residence. Kinship terminologies and formal social 
structures are kinds of culture patterns which, 
among primitive peoples, express the impulse to- 
ward cultural play, innovation, and unconscious 
experiment. The extent of their mutual correspond- 
ence is a matter of historical and functional adjust- 
ment and varies from culture to culture. “Kinship 
terminologies are pattern systems of semantic logic, 
highly variable in detail and historically derivable, 
but also classifiable” into “natural types” (ibid., p. 
172). In this respect they are analogous to linguis- 
tic families and biological types. 

Radcliffe-Brown was stimulated by the Rivers— 
Kroeber debate to develop a concept of kinship 
system which differed from both of these and which 
became the foundation of his structural theory. 
Radcliffe-Brown included in his concept several of 
the components suggested by Kroeber; he agreed 
with Kroeber that the terminology of kinship is an 
intrinsic part of the system and an important start- 
ing point for its study, He accepted the position 
that kin terms designate “categories of relation,” 
although Radcliffe-Brown did not restrict his analy- 
sis to Kroeber’s underlying eight categories. For 
Radcliffe-Brown, however, a set of kinship terms 
and associated categories did not constitute a kin- 
ship system as it did for Kroeber in the form of a 
relatively independent semantic system. Radcliffe- 
Brown agreed that terms and categories do reflect 
the way a people generally think and feel about 
kinship, but he believed that social institutions also 
reflect such general modes of thought and should 
also be included in the conception of a kinship 
system. On this point he sided with Rivers rather 
than with Kroeber: social practices, including 
forms of marriage, are regularly connected with 
kinship terminologies. Radcliffe-Brown’s distinctive 
contribution was the way in which he analyzed 
these connections, He did not accept Rivers’ causal 
analysis of the connections or their use for histori- 
cal reconstruction; kin terms and social institutions 
are, for him, related, not as cause and effect, but 
as component and interdependent parts of a struc- 
tural system, Within such a system the kin terms 
are used to establish and recognize particular cate- 
gories of relatives, and the categories fix the actual 
social relations between the relatives who belong 
to these categories. The particular behavior mani- 
fested in the social relations between relatives is 
defined by legally formulated rights and duties or 
by socially approved usages. In this functionally 
interdependent system, there is no line or direction 
of causality running either from kinship terminol- 
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ogy to standard kinship behavior or conversely. 
Both terminology and behavior are reflections of 
the underlying structural principle or principles by 
which the system is organized and characterized. 
These principles, such as that of sibling solidarity 
or solidarity of the patrilineal lineage, need to be 
discovered by intensive study and comparison of 
different kinship systems. Why some societies differ 
in the structural principles they have selected as 
the basis of their kinship systems and of their 
respective social structures can only be answered 
by historical study of how the systems developed in 
particular environments. 

The difference between Kroeber’s and Radcliffe- 
Brown’s ideas of a kinship system is an illustra- 
tion and source of the difference between the pat- 
tern theory of culture and the structural theory. A 
kinship system for Kroeber is one of several kinds 
of culture pattern—as a semantic system it is gov- 
erned by an inner logic, is historically derived, and 
has some functional significance. Its relation to 
social institutions and w other aspects of culture 
is not causal and not entirely accidental but is 
rather the relation of one culture pattern to other 
culture patterns which have become historically 
associated with it and which have undergone some 
mutual adjustments. In this paradigm, social struc- 
ture is subordinated to culture only in the sense 
that social institutions are also subject to the pat- 
terning of the experimental play impulse of human 
creativity. 

The Radcliffe-Brown structural paradigm, on the 
other hand, obviously includes culture as a com- 
ponent of the system, Culture is subordinated to 
social structure only in the sense that both kin 
terms and the social usages defining socially ap- 
proved behavior between relatives are brought to- 
gether into a single system organized by structural 
principles. Although history is considered essential 
for understanding how a system came to be organ- 
ized the way it is, the morphology and functioning 
of the system can be understood without reference 
to its history. If Kroeber’s pattern theory makes 
social institutions subject to patterning, Radcliffe- 
Brown’s structural theory makes culture a compo- 
nent of a structural system, that is, subject to 
“structuring.” 


The social organization of culture 

Both Kroeber and Radcliffe-Brown regarded kin- 
ship systems as natural systems. This was so for 
Kroeber if they could be shown to be systemic cul- 
ture patterns and for Radcliffe-Brown if they could 
be shown to conform to a type of social structure. 
The natural systems, whether culture patterns or 
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structural types, emerge in the course of field 
studies and comparative analysis; the boundaries 
of such systems are relative to the problem being 
studied. 

Thus we may study the culture patterns or the 
social structures of villages, towns, cities, regions, 
nations, civilizations, as well as of occupations, 
social classes, castes, religious sects, or of any 
groups that may turn out to have them. The inter- 
relation of patterns and structures in groups of 
different size and composition may also be traced, 
albeit with extensions and modifications of the 
methods of analysis that have been used to study 
the cultural patterns or social structures of rela- 
tively small, isolated, and homogeneous groups. 
How is this extension to be made? 

In 1948 Kroeber wrote that “perhaps how it 
comes to be is really more distinctive of culture 
than what it is” (see [1923] 1948, p. 253). And in 
1949 and 1952 Radcliffe-Brown talked of culture 
as “the process by which a person acquires, from 
contact with other persons or from such things as 
books or works of art, knowledge, skill, ideas, be- 
liefs, tastes, sentiments” (1952, pp. 4-5), In a 
particular society one may discover processes of 
cultural tradition and “in complex modern societies 
there are a great number of separate cultural tradi- 
tions, By one a person may learn to be a doctor or 
surgeon, by another he may learn to be an engineer 
or an architect” (ibid., p. 5). 

“Culture” is thus reintroduced by Radcliffe-Brown 
as the process by which, in a given social group or 
social class, learned ways of thinking, feeling, and 
acting are transmitted from person to person and 
from one generation to the next, He does not, how- 
ever, reinstate it as an independent concept but 
assimilates it to the social process, 

Neither Radcliffe-Brown nor Kroeber developed 
his respective theory further to take into account 
those processes of interpersonal interaction by 
which, in a particular society, separate cultural 
traditions are formed, transmitted, and modified 
or those processes by which communication is 
established among separate local cultural tradi- 
tions, one with another and with wider regional 
and national cultural traditions, It was Robert Red- 
field who suggested that these problems might be 
fruitfully studied through a study of the institu- 
tionalized social relations involved in the transmis- 
sion of cultural traditions. He formulated his sug- 
gestion in the concept of “the social organization 
of tradition,” which he defined as “the way in which 
elements of action are put together in any particu- 
lar case of transmission of tradition.” In analogy 
with Firth’s distinction between social organization 


and social structure (1951) and Radcliffe-Brown’s 
distinction between organization and structure 
(1952, p. 11), the social organization of a cultural 
tradition represents the expression in concrete, or- 
ganized activities of “the social structure of tradi- 
tion,” or, in Redfield’s words, “those persisting and 
important arrangements of roles and statuses ap- 
pearing in such corporate groups as castes and 
sects, or in teachers, reciters, ritual leaders of one 
kind or another, that are concerned with the cul- 
tivation and inculcation of the great tradition” 
(1956, p. 101). Redfield also generalized these con- 
cepts of the social organization and structure of 
cultural traditions to a conception of a civilization 
as a structure of different levels and kinds of cul- 
tural traditions (“little” and “great”) in mutual 
contact and communication. Civilizations have 
both a “societal structure” and a “cultural structure” 
(Redfield 1962). 

These conceptions have proved very fruitful for 
the study of civilizations, particularly in India 
which, indeed, has been the major empirical source 
and proving ground, and they are beginning to be 
applied to other civilizations. They have stimulated 
studies of how specific networks of social relations 
(marriage, trade, political administration, etc.) 
also serve as channels of cultural transmission; of 
“cultural performances” as the chief vehicles for 
discovering and expressing a sense of “cultural iden- 
tity”; of the roles of different kinds of “cultural 
specialists” and “cultural policy makers” in forming 
and changing “cultural identities”; and of continuity 
and change in the cultural traditions of a historic 
civilization, and so on (see Singer 1964a, for a 
review of some of these studies), One reason why 
this approach is fruitful is that it bypasses the older 
antinomies—“How can one culture pattern produce 
another without the intervention of specific agen- 
cies?” or “How can ‘social structures’ be causally 
correlated with ‘culture patterns’?” By concentrating 
on the institutionalized social relations, media, and 
functionaries which transmit specific cultural tra- 
ditions from person to person and from group to 
group, this approach is at once both structural and 
cultural. 

This approach to the study of culture is some- 
times criticized for being too humanistic, too sub- 
jective and evaluational in contrast to the alleged 
objective and value-free character of other ap- 
proaches. This is an unjustified criticism. All defi- 
nitions of culture have contained implicit positive 
evaluations of the elements of culture and have not 
been neutral and objective. To bring these implicit 
evaluations to the surface it is only necessary t0 
conduct the experiment of turning the elements of 


any of the definitions into their opposites or to ask 
why, if culture is a neutral concept, one would not 
think of applying it to a group that lacked the ele- 
ments of the definition—for example, language, 
art, knowledge, and skills. The implicit evaluation 
is that culture consists of positive achievements 
and desirable characteristics, Even in the context 
of evolutionary discussions, where one problem is 
to differentiate human culture from animal behav- 
ior, the ostensibly objective definition of culture as 
learned behavior in contrast to instinctive behavior 
has not proved very differentiating. And it will not 
be very useful until, as Hallowell and others have 
pointed out, we can specify what and how much 
has been learned and how the learning is trans- 
mitted and modified; in other words, until we can 
tell how learned is learned behavior. 

George Stocking, Jr., is probably correct in link- 
ing the normative elements in Tylor’s definition of 
culture to nineteenth-century humanistic discus- 
sions such as are to be found in Matthew Arnold’s 
Culture and Anarchy (Stocking 1963). But Stock- 
ing’s highly illuminating article tends to understate 
the differences between the Tylor conception and 
that of Arnold. For what was distinctive about 
Tylor’s conception of culture was not its nonnorma- 
tive character—Stocking is quite right to challenge 
this myth—but the breadth of its application. Tylor 
and anthropologists after him have been able to 
find evidences of culture among the most primitive 
and lowly peoples, but many humanists are not yet 
prepared to accept this finding. In spite of his ap- 
parently ethnocentric preference for nineteenth- 
century English institutions, Tylor sought “to treat 
mankind as homogeneous in nature, though placed 
in different grades of civilization,” without regard 
to hereditary differences. From this broad perspec- 
tive he could see “scarce a hand’s breadth differ- 
ence between an English ploughman and a negro 
of Central Africa.” 

Some humanists have come to accept the anthro- 
pologists’ broad use of the term “culture.” T. S. 
Eliot, for example, in his Notes Towards the Defini- 
tion of Culture is very close to this anthropological 
usage when he writes that in his usage “it includes 
all the characteristic activities and interests of a 
people,” among the British, for example, “Derby 
Day, Henley Regatta, Cowes, the twelfth of August, 
a cup final, the dog races, the pin table, the dart 
board, Wensleydale cheese, boiled cabbage cut into 
sections, beetroot in vinegar, nineteenth century 
Gothic churches and the music of Elgar” ([1948] 
1949, p. 30). 

The problem then in studies of the social organ- 
ization of cultural traditions and of the relations 
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between the “little traditions” of the uneducated 
and the “great traditions” of the learned, is not to 
avoid the normative aspects of culture, an impos- 
sible task in any case, but to develop further the 
methods for observing and analyzing specific cul- 
tural traditions within a framework of a general 
theory of culture. 

I have not referred to recent studies of the social 
organization of cultural traditions in order to intro- 
duce still a third general theory of culture to com- 
pete with the pattern and social structure theories. 
On the contrary, these studies illustrate one of the 
ways in which these two general theories can be 
extended, with a little modification, to the study 
of culture in composite societies and in complex 
historic civilizations. 


Current deyelopments—the new ethnography 


In the 1930s and 1940s, Radcliffe-Brown’s anal- 
ysis of kinship systems as social systems tended to 
prevail over Kroeber’s analysis of them as semantic 
patterns, even among American anthropologists 
who had accepted the pattern theory of culture in 
preference to a structural-functional theory, This 
situation was later reversed by a return to Kroe- 
bers semantic analysis of kinship systems. The 
revival of the semantic analysis of kinship is largely 
the result of new developments in linguistics and 
is in turn generating a new theory of culture to 
compete with structural—functional theory, 

Componential analysis. In their papers on the 
componential analysis of kinship terms, both pub- 
lished in 1956, Goodenough and Lounsbury ac- 
knowledge Kroeber’s precedent. The more imme- 
diate sources of componential analysis, however, 
probably come from the methods of phonemic anal- 
ysis. The analysis of kinship terminologies into 
their necessary and sufficient conceptual compo- 
nents resembles Kroeber’s earlier efforts to identify 
a minimal list of conceptual categories of relations 
and results in a similar conception of a kinship 
system as a semantic or cognitive system. Methods 
analogous to componential analysis have been ap- 
plied to other kinds of folk terminologies—for 
plants, diseases, colors, directions—in order to de- 
termine the underlying semantic structures of the 
cognitive systems. 

These extensions of componential analysis have 
been regarded as applications of a new method in 
ethnography and even as a new discipline, “ethno- 
science.” Underlying these extensions and the pro- 
grams of the “new ethnography” is a theory of 
culture, most explicitly formulated by Goodenough. 
According to his formulation, culture “is not a ma- 
terial phenomenon; it does not consist of things, 
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people, behavior, or emotions. It is rather an or- 
ganization of these things. It is the forms of things 
that people have in mind, their models for perceiv- 
ing, relating, and otherwise interpreting them” 
(Goodenough [1957] 1964, p. 36). 

The externals and observables—what people say 
and do, their social arrangements and events—are 
“products . . . of their culture as they apply it to the 
task of perceiving and dealing with their circum- 
stances” (ibid.). Culture then consists of the “con- 
cepts” and “models” which people have in their 
minds for organizing and interpreting their experi- 
ences. Goodenough believes that “it is obviously 
impossible to describe a culture properly simply by 
describing behavior or social, economic, and cere- 
monial events and arrangements as observed mate- 
rial phenomena. What is required is to construct a 
theory of the conceptual models which they repre- 
sent and of which they are artifacts” (ibid.). 

Since the theory is to be empirical, Goodenough 
proposes as tests of its validity that it predict how 
informants will behave in response to what goes 
on in the community as well as in response to the 
ethnographer’s behavior. In operational practice, 
however, the concepts and models of a culture are 
learned “when we learn the system of meanings for 
which its linguistic forms stand. Much descriptive 
ethnography is inescapably an exercise in descrip- 
tive semantics” (ibid., p. 39). This is as true of 
every human being learning his own culture as it 
is of the ethnographer trying to learn another's 
culture, Accordingly, “language is not only a part 
of culture” but “a major instrument for learning it” 
(ibid.). 

This approach does not exclude the nonlinguistic 
aspects of culture, since “nonlinguistic forms have 
systematic relationships to each other in paradigms 
and combine in accordance with principles analo- 
gous to those of linguistic morphology and syntax” 
(ibid.), To one who knows the culture, the non- 
linguistic aspects “are also signs signifying the 
cultural forms or models of which they are the 
material representations” (ibid., P. 36). 

Goodenough’s semantic and conceptual theory of 
culture bears a resemblance to Kroeber’s pattern 
theory. Kroeber’s semantic analysis, however, was 
explicitly applied only to the linguistic aspects of 
culture, to language, and to kinship terminologies. 
He did not explicitly extend this kind of analysis 
to other kinds of culture patterning, for example, 
to patterns of social structure. In Goodenough’s 
theory, and in the new ethnography, the semantic 
analysis is generalized to all aspects of culture, All 
patterns in a culture are conceptual patterns, and 


if some of the conceptual patterns are not dire ctly 
expressed in the vocabulary of the language, their 
semantic structure can nevertheless be determined 
by analyzing the nonlinguistic forms as “artifacts” 
and “signs” of the conceptual patterns. All culture 
thus becomes cognitive and conceptual, and since, 
according to Goodenough, an individual can know 
only the concepts in his own mind, his “private 
culture” is more real than any “public culture.” 

The new structuralism. The componential anal- 
ysis of kinship terminologies treats social structure 
explicitly only insofar as it can be reduced to a 
semantic structure and only in terms of a corre- 
sponding “cognitive structure.” More direct atten- 
tion is given to social structure in Lévi-Strauss’s 
structural anthropology. This theory combines fea- 
tures of both Kroeber and Radcliffe-Brown and adds 
some original elements of its own. As in the other 
cases, the analysis of kinship systems is paradig- 
matic of the general theory. Lévi-Strauss accepts 
Radcliffe-Brown’s views that there is a relation of 
interdependence between kinship terminologies and 
kinship behavior and attitudes and that the inter- 
dependence is not one of linear causality. In agree- 
ment with Kroeber, however, he does not believe 
that the relations of interdependence are point-for- 
point correspondences. Terminologies, on the one 
hand, and behavior and attitude, on the other, can 
be analyzed as separate systems and compared with 
one another as well as with other kinds of systems, 
such as those of social organization, religion, myth, 
ritual, and political ideology. These comparisons 
can be made within a single society or culture as 
well as between different societies and cultures. A 
single culture is “a fragment of humanity which, 
from the point of view of the research at hand and 
of the scale on which the latter is carried out, 
presents significant discontinuities in relation to 
the rest of humanity” (Lévi-Strauss [1953] 1963, 
p. 295), 

So far this sounds very much like the pattern 
theory—a kinship terminology is one kind of cul- 
ture pattern related by varying degrees of morpho- 
logical and historical relations to other kinds of 
culture patterns. Lévi-Strauss introduces, however, 
a far more abstract and mathematical notion of 
system and structure than either Radcliffe-Brown 
or Kroeber. Lévi-Strauss makes explicit the distinc- 
tion in Radcliffe-Brown’s analysis between networks 
of social relations as the raw materials of observa- 
tion and social structure as an abstract model of 
these relations; he goes on to generalize the con- 
cept of structure to its mathematical and logical 
level, as the regular order of relations among ele- 


ments of any kind. Distinguishing the relationship 
between kinship terminologies and kinship behav- 
ior is thus a problem of constructing structural 
models for systems of kinship terminologies and 
for systems of kinship behavior and then investi- 
gating whether the relations existing between the 
structures are homologous, contradictory, and so 
on. Lévi-Strauss believes that the relations between 
the structures of kin terms and behavior are dia- 
lectical and functional; while behavior and attitudes 
reflect the terminological classification somewhat, 
they are at the same time responsive to contradic- 
tions created by the terminological classification. 
Resolving these contradictions leads to termino- 
logical changes that call for new behavior patterns, 
and so on (ibid., pp. 310-311). He has made a 
similar analysis of the dialectical relations between 
the structures of myths and the structures of rituals 
and believes the analysis can be extended to other 
kinds of structures as well. 

The existence of contradictions between differ- 
ent structures and the resolution of these contradic- 
tions through changes in the structures is not 
usually conscious to the participating members of 
a society, The structures and their relations exist at 
a deep unconscious level and reflect, in their par- 
ticular modalities of space and time, universal men- 
tal processes. In some societies, however, there may 
be “home-made” conscious models of these struc- 
tures, which need to be taken into account by the 
social anthropologist because they may be accurate 
and, in any case, form an important part of the 
data (see, e.g., Leach 1954). 

Lévi-Strauss does not believe that all aspects of 
a culture and society are equally structured or that 
every culture has a single all-embracing structure. 
The degree and kind of structuring is a matter for 
anthropological investigation. He believes the fol- 
lowing possess well-ordered structures—language, 
Kinship, social organization, law, religion, myth, 
ritual, art, etiquette, cooking, and political ideology. 
Other domains are either not structured or at least 
their structures have not yet been discovered. The 
structures of specific domains are not microcosms 
of the whole society or culture but are “partial ex- 
pressions of the total society.” 

Although he uses a general mathematical-logical 
concept of structure, Lévi-Strauss has acknowl- 
edged the important influence of structural linguis- 
tics, especially as developed by Jakobson and 
Trubetzkoy, and game theory and cybernetics as 
developed by Von Neumann and Wiener. Both kinds 
of models are used in his structural analysis, not 
to establish the identity of language and culture or 
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the identity of communication and society, but as 
formal analogies only, whose methodological fruit- 
fulness depends on the existence of structural 
homologies among language, culture, and society. 
This is the sense, I suggest, in which the exchange 
of messages, the exchange of goods and services, 
and the exchange of women all represent, as Lévi- 
Strauss says, different levels of the communication 
process and the sense in which culture itself con- 
sists of “rules stating how the ‘games of communi- 
cation’ should be played both on the natural and on 
the cultural levels” ({1953] 1963, p. 296). 


Summary and conclusion 


We can now summarize the major conclusions 
of the analysis in a series of brief propositions: 

(1) Tylor’s omnibus conception of culture is still 
the basis of most modern anthropological theories 
of culture, although the conception has been re- 
fined and developed in several different directions. 

(2) Two theories of culture which have domi- 
nated anthropological thinking from about 1900 to 
1950 are the process-pattern theory derived from 
Boas and best represented by Kroeber and the 
structural—functional theory derived from Malinow- 
ski and Radcliffe-Brown. 

(3) While the process-pattern theory takes the 
concept of culture pattern as basic and the struc- 
tural theory takes social structure as basic, both 
theories cover the full range of Tylor’s culture 
concept. 

(4) Each theory is holistic and universal, each 
seeks to explain all aspects of culture within a 
single theoretical framework, and each intends to 
apply that framework to societies and cultures of 
any kind, from small primitive societies to complex 
civilizations. 

(5) The difference between the two basic the- 
ories cannot be derived from the complementary 
character of the two basic concepts—culture pat- 
tern and social structure—since each theory accepts 
this complementarity but deals with it in a different 
way. 

(6) The difference between the two theories is 
to be found in the different ways they connect cul- 
ture and social structure within explanatory sys- 
tems. 

(7) The precise nature of these explanatory sys- 
tems may be inferred from a paradigmatic model 
of the analysis of kinship systems, According to 
pattern analysis, a kinship system is a terminologi- 
cal system which expresses a system of classifica- 
tion and an underlying unconscious logic. The 
relation of a kinship system, so defined, to social 
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institutions and other aspects of culture is the 
relation of one kind of culture pattern to others 
and varies with the history and mutual association 
of the patterns within particular places and times. 

(8) In the structural analysis, a kinship system 
is a social system which includes a network of 
social relations, as expressed in customary modes 
of behavior, feeling, and thought, as well as a set 
of terms and categories of relations classified by 
the terms. The interrelation within the system is 
one of functional interdependence among the parts. 
The terms express and establish specific categories 
of relation and the categories fix and regulate spe- 
cific modes of behavior and feeling in accordance 
with a limited number of structural principles 
around which the system is organized. 

(9) These different analyses of kinship systems 
are both examples and models for cultural and 
structural analysis of all kinds and, hence, for two 
general theories of culture, 

(10) Neither theory attempts to explain the 
nature of cultural or structural systems in terms 
of linear causality. Each regards such systems as 
outcomes of the multiple influences of biology, 
psychology, and the natural environment, as well 
as of historical processes and of the creative human 
responses to these “givens.” 

These propositions suggest an underlying con- 
vergence of the pattern theory and the structural— 
functional theory of culture, Although the pattern 
theory has probably been influenced more by study 
of language, literature, and the arts and the struc- 
tural-functional theory more by biological and 
organic analogies, the direction taken by both kinds 
of theory has been the same, In closing, it may be 
useful to characterize this trend as it represents 
the mainstream of culture theory since Malinow- 
skis 1931 article and also represents many recent 
theoretical developments, 

Most characteristic is a shift away from a theory 
of discrete culture traits within a framework of 
universal cultural history or cultural evolution to 
a study of the functions, patterns, and structures 
of cultural forms within a plurality of organized 
contexts, There has been a corresponding shift 
from an interest in artifacts and other external 
manifestations of material culture to an almost 
overriding interest in social culture and in mental 
culture. 

The definition of culture in terms of learned be- 
havior (or standardized behavior) seemed at first 
to promise a unified theory of social and mental 
culture. But with the failure of behavioristic learn- 
ing theories to account for the differentiated proc- 
esses and kinds of learning involved in the acquisi- 


tion of language, kinship systems, and other aspects 
of culture, this promise has not been fulfilled. An- 
thropologists, linguists, and psychologists inter- 
ested in higher mental processes have begun to 
look to genetics, neurophysiology, maturation the- 
ory, ethology, and ego psychology for the mech- 
anisms that enable organisms to acquire, transmit, 
and modify culture (see, e.g., Hallowell 1963; 
Chomsky 1959). 

Recent definitions and analyses of culture have 
grown progressively more abstract, formal, and 
conceptualistic, Behavior, observed social relations, 
and material artifacts may provide the raw data 
for a construct of culture but are not themselves 
considered the constituents of culture. Rather, the 
patterns, norms, rules, and standards implicit in 
the behavior, social relations, and artifacts are con- 
sidered as the constituents of culture. They are the 
systems of meanings, ideologies, conventionalized 
understandings, and cognitive and unconscious 
structures, which may be recognized in a given 
society with varying degrees of consciousness and 
explicit verbal formulation but which, in any case, 
are to be brought to conscious awareness and pre- 
cise formulation by anthropological studies (see, 
e.g., the work of Goodenough), 

Those social anthropologists who have been influ- 
enced by the theories of Durkheim, Weber, and 
Parsons are producing a fruitful synthesis of the 
pattern and structural theories of culture (see, 
e.g., the work of Fallers, Geertz, and Schneider). 

The new ethnography, ethnolinguistics, and 
ethnoscience and Lévi-Strauss’s new structuralism, 
share this new emphasis on cultures as abstract 
structures. They differ only in the procedures they 
propose for discovering the unconscious structures 
—in the form of codes, models, rules of the game— 
which determine the underlying cognitive struc- 
tures (see, e.g., Romney & D’Andrade 1964; Lévi- 
Strauss 1953). 

Taken as a working hypothesis the cognitive 
conception of culture offers a promising program 
of research, the results of which should improve 
cross-cultural understanding. Taken as a definition 
of the nature of man or as a general theory of 
human culture, however, it seems just as narrow 
and one-sided as previous definitions and theories. 
That man is a rational animal was long ago enun- 
ciated by Aristotle. And it is perhaps timely that 
anthropology, after neglecting this aspect of mans 
nature for over fifty years while it explored causal 
and structural models of culture, should now €x- 
plore a logical model. This exploration will not 
forget, we hope, that there is more to human Tâ- 
tionality than how different people classify kin, 


colors, plants, and diseases and that there is more 
to human culture than knowledge and the logic of 
classification. Even Tylor, whose theory of culture 
has so often been criticized for being too intellec- 
tualistic, left room in his omnibus concept, and in 
his writings, not only for science and language but 
also for all “the arts of life,” “the arts of pleasure,” 
religion, all forms of social organization, history, 
and mythology, and any other capabilities and 
habits acquired by man as a member of society. It 
is going to take more than one kind of theoretical 
model to do justice to the variety, complexity, and 
richness of human culture, 


MILTON SINGER 
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u 
CULTURAL RELATIVISM 


Cultural relativism is to be understood as (1) a 
method in ethnology and social anthropology; (2) 
a theory of cultural determinism and a philosophy 
of cultural reality; (3) a guide to the evaluation of 
value systems, especially ethics, politics, and aes- 
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thetics; and (4) an attitude toward practical prob- 
lems of sociocultural reform and change. 


Relativism as a method 


Cultural relativism may be described as the 
method whereby social and cultural phenomena 
are perceived and described in terms of scientific 
detachment or, ideally, from the perspective of 
participants in or adherents of a given culture. 
Further, cultural phenomena are evaluated in 
terms of their significance in a given cultural and 
social context. The methodology of cultural rela- 
tivism rests on the assumption that the ethnologist 
is able to transcend, or to eliminate for the mo- 
ment, his own cultural conditioning and values 
and to assume the subjective, ethnocentric atti- 
tudes and mentality of an adherent of or a partici- 
pant in the culture. This requires a measure of 
imagination and empathy on the part of the ob- 
server so that he can see others as they see them- 
selves or as they wish to be seen. The anthropolo- 
gist is required to report what actually happens 
and to attempt to interpret his data from the stand- 
point of his subjects. This may, and usually does, 
lead the observer to participate in the very institu- 
tions he is describing in order to get the “feel” and 
emotional concomitants of the behavior he ob- 
serves. He must also collect ethnographic state- 
ments, narratives, typical utterances, and items of 
folklore, inasmuch as they reflect ideology. Thus, 
although he is objective in collecting his data and 
observations of cultural phenomena, the anthro- 
pologist also tries to identify with his subjects so 
as to perceive their mentality and their vision of 
society and the world about them. 

The social anthropologist tries to evaluate the 
artifacts, socifacts, and mentifacts of a culture in 
relation to given institutions; he seeks to appreci- 
ate the “function” a given object or act performs 
in satisfying the needs of the society. When the 
anthropologist observes how a given artifact is 
used and for what purposes in a given context, he 
is able to provide a relative evaluation of the arti- 
fact through its function; he does not rely on the 
observation of the form alone. Similar detachment 
is required when he observes and reflects upon the 
long-range consequences of a ritual act. In evalu- 
ating the function of an institution from this im- 
personal, objective standpoint, the social anthro- 
pologist often introduces his own mental construct 
or hypothesis, which he tests against his observa- 
tions. To the extent that social scientists tend to 
differ in their analyses of the function of a given 
institution—for example, “magic”—they introduce a 
new subjective and relativistic factor of their own. 

All anthropologists are in agreement on the 
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value of the method of cultural relativism and the 
relative objectivity required to report and inter- 
pret data from the perspective of the adherents of 
the culture. There is considerable disagreement 
in the use of the method of impersonal, objective, 
functional evaluation. 


Relativism as a theory and philosophy 


Cultural relativism is a controversial doctrine 
that was quite fashionable in the second quarter 
of the twentieth century but that has since lost 
much of its support. Melville Herskovits (1948), 
the most articulate contemporary exponent of a 
philosophy of cultural relativism, found support 
for his approach in the Neo-Kantian historical 
idealism of Ernst Cassirer. According to Cassirer’s 
“spiritual anthropology” (1944), man lives in a 
symbolic universe of his own creation. There is, 
for him, no reality other than the symbolic forms, 
and hence all reality is cultural or symbolic reality. 
The physical world is discerned through the screen 
of enculturation; perception of time, distance, 
weight, size, and other “realities” is mediated by 
the recognized conventions of any given group 
(Herskovits 1955, p. 35). All reality as known 
is cultural reality, and all human experience is 
culturally mediated. And if all human experience 
is structured by enculturation, it follows that all 
cultural judgments, perceptions, and evaluations 
are a function of, and are relative to, a given cul- 
tural system. Moral values are but one element in 
cultural experience, and moral relativism is only 
one aspect of a general theory of cultural relativism. 

It should be noted, however, that the theory of 
cultural determinism originated with Auguste 
Comte and Herbert Spencer, both of whom postu- 
lated levels of natural phenomena, with social 
phenomena constituting the top level of phenom- 
enal reality. In America A. L. Kroeber (1917) 
adopted Spencer’s term “the superorganic” and 
applied it to culture, which he at first identified 
with the level of social phenomena. Culture was 
regarded as an entity sui generis, subject to its 
own laws of evolutionary development. As such, it 
was viewed as a closed system in which all cul- 
tural phenomena were to be explained only 
through other cultural phenomena and not by 
“reduction” to a lower order of reality, After 1948 
Kroeber virtually abandoned his old theory of the 
superorganic and came to think of culture as es- 
sentially an abstraction from human behavior. 
Leslie A. White, however, has continued to uphold 
the doctrine of the cultural Superorganic under the 
banner of “culturology.” 

The point to be marked here is that a theory of 


cultural relativism can be derived from, and is com- 
patible with, a variety of philosophical approaches: 
First, as in the case of Herskovits, it may be ex- 
plained by a theory of Neo-Kantian historical ideal- 
ism. Cassirer’s own philosophy of symbolic forms 
was based on the essential postulate of human 
freedom and creativity. Although Herskovits has 
utilized Cassirer’s “spiritual anthropology,” it is not 
necessary to adopt Cassirer’s philosophy of cultural 
idealism in order to explain the facts of cultural 
conditioning. Second, as in the case of White, cul- 
tural relativism depends on a philosophy of his- 
torical materialism, Historical materialism is also 
a philosophy of cultural determinism whereby the 
economic core of a culture determines its ideational 
manifestations and values. Third, for Kroeber (as 
for Comte and Spencer) cultural relativism de- 
pends on a positivistic philosophy of science that 
postulates the autonomy of cultural phenomena, If 
human behavior is determined by culture, there is 
little scope for individual freedom and for creative 
innovation. All individuals participate in the cul- 
ture of their society and their age; individual dif- 
ferences are insignificant when compared with the 
over-all pattern or configuration of a given culture. 

Modern ethnology has made us aware of the role 
played by social institutions in culturally condition- 
ing the personality and experience of the indi- 
vidual. As a result of cultural conditioning, there 
is some degree of cultural relativism that differ- 
entiates the participants in diverse cultures. The 
philosophical problem is to explain systematically 
how this diversity of cultural conditioning is to be 
understood. It is not necessary to subscribe to his- 
toric idealism, historic materialism, or evolutionary 
positivism in order to account for the facts of cul- 
tural relativism. All that is really necessary is to 
recognize that culture is one essential condition 
of human experience and that all experience is to 
some extent culturally mediated. It is equally im- 
portant to recognize other dimensions of reality, 
those of nature, both human and cosmic, that pro- 
vide human experience with a common frame of 
reference within which to test its cultural con- 
Structs. In the last analysis, culture is not the 
measure of all things, but nature is, and there are 
more things in nature than are ever grasped 
through our human, cultural symbols. Culture is 
but our human means of adjusting to nature and 
utilizing its powers in the service of mankind. This 
postulate of a metacultural reality renders scientific 
progress possible and saves us from the culture- 
centric predicament of historic idealism, historic 
materialism, and evolutionary positivism. Cultural 
relativism is a fact of human experience as con- 


ditioned by culture. However, a scientific method- 
ology that includes the comparative study of di- 
verse cultures enables men to transcend some 
ethocentric limitations and to live in a common 
world of reality. 


Validation and evaluation of values 


The principle of cultural relativism affirms that 
all values are a function or product of their culture 
and reflect the interests of their society and cul- 
ture, The fundamental assumption of both socio- 
logical determinism and relativism is that society 
is a newly emergent self-explanatory reality through 
which all the values of cultural life are to be ex- 
plained historically. All cultural values are thought 
to be functions of social organization and to vary 
with the modes and interests of society. The work 
of the sociologists Emile Durkheim and William 
Graham Sumner is devoted to the demonstration 
of this thesis. 

Moral or ethical relativism is one element in a 
general theory of cultural relativism. It is possible, 
however, to maintain a theory of moral relativism 
without subscribing to total cultural relativism. 
For example, Friedrich Nietzsche in his Genealogy 
of Morals transvaluated traditional moral val- 
ues by indicating how they served the respective 
interests of masters and slaves. Similarly, the con- 
temporary communist ideology rests on the rela- 
tivity of cultural values to the class interests of 
capitalists and proletarians. 

Furthermore, one may adhere to a theory of 
moral relativism on psychological and epistemo- 
logical grounds rather than on those of social and 
cultural relativism. For example, Edward Wester- 
marck (1932) maintains that moral evaluations 
are expressions of individual emotions and have 
no objective basis in fact. His argument, like that 
of the philosopher David Hume and some modern 
logical positivists, rests on the assumption of the 
subjective origin of value judgments, which ren- 
ders all value judgments relative to the emotional 
preferences of the individual. 

Insofar as moral relativism is considered to be a 
function of society and culture, it may be said to 
rest on two distinct postulates or assumptions. 
First, all value judgments are culturally conditioned 
or determined, and this limits their validity to the 
social and cultural context in which they originate. 
Second, it is impossible to establish any universally 
acceptable criterion for measuring and comparing 
values. Hence, all value systems are to be regarded 
as having equal validity. 

This practical assumption of the equality of dis- 
parate value systems leads to the prescription of 
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tolerance as a prime virtue. Reverence for cultural 
values, rather than reverence for life, becomes the 
absolute virtue advocated by the cultural relativist. 
Intolerance is said to be a product of an ethnocen- 
tric, uncritical prejudice in favor of the absolute 
validity of one’s own cultural values. 

It should be noted, especially, that the cultural 
and moral relativist does not advocate moral skep- 
ticism and nihilism. Those, he would say, are 
philosophical diseases to which he is not subject. 
Morality is a cultural universal and is essential for 
the corporate existence of any society. The mem- 
bers of any given society are obligated to conform 
to the rules and norms of their society on the prac- 
tical, utilitarian ground that without such obedi- 
ence social life would be impossible. Pragmatically, 
the individual must conform to the rules of his 
society, and each society must tolerate the codes 
of other societies in the interest of mutual sur- 
vival. Obedience and conformity are cultural im- 
peratives once a code of behavior is accepted. 

The cultural and moral relativist differentiates 
between moral universals and moral absolutes. 
Morality as a whole is a cultural universal in the 
sense that all cultures have some system of moral- 
ity. Within all systems of morality there is a lim- 
ited number of universal values, such as a standard 
of what constitutes a “good” man, a standard of 
truth, and some appreciation of beauty. However, 
the criteria for evaluating and delimiting the con- 
tent of these values are culturally defined. The 
cultural relativist would argue that there are no 
absolute, universal norms that are valid for all 
cultures. The only possible exception would be the 
prohibition of incest. 

Against this universal formalism of the moral 
relativist it may be argued that some of the actual 
values reflected in cultural systems are much the 
same everywhere and that differences arise over 
the relative importance of particular values and 
the extent of their application. In all cultures the 
perpetuation of the society takes precedence over 
the life of the individual, and hence no society 
tolerates treason, murder, rape, and incest. All so- 
cieties recognize mutual rights and duties in mar- 
riage and condemn acts that threaten family 
solidarity. Similarly, all societies give recognition 
to some personal property and provide some tech- 
niques for the distribution of economic surplus to 
the needy. The fact of common cultural values 
provides a basis for mutual understanding between 
adherents of diverse cultures. 

The point at issue is whether, following the 
cultural relativist, we recognize only empty, formal, 
universal categories of value with unlimited diver- 
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sity of content or whether a comparative study of 
cultures reveals some common content in univer- 
sal values. As Malinowski, Linton, Kluckhohn, and 
Redfield have advocated, there is an actual com- 
mon core of cultural values in all societies, which 
derives from the universal functions certain acts 
fulfill in satisfying human needs and aspirations. 
There are concrete cultural, universal values be- 
cause there are universal needs, biological, derived, 
and integrative, common to all societies. These 
cultural universals are not merely abstract cate- 
gories but actual regulative modes of conduct and 
norms of conduct common to all cultures. Such 
transcultural values may be called absolutes as 
well as universals. Cultural relativists tend to stress 
cultural differences but neglect the uniformities 
and common elements based on the imperatives 
of a universal human nature. 

The cultural relativist is very much concerned 
lest we commit the fallacy of ethnocentrism, which 
consists in the attitude that “one’s way of life is 
to be preferred to all others.” Hence, he prefers not 
to judge others at all. 

Here again it is necessary to distinguish be- 
tween the fact of ethnocentrism and its value. If 
by ethnocentrism is meant judgments based on 
irrational preferences incapable of rational valida- 
tion, then it is a fact that some degree of ethno- 
centrism is to be found in all societies and cultures. 
Modern theories of racism and extreme national- 
ism are vicious forms of ethnocentrism. Uncritical 
preference for one’s own culture and its mores and 
prejudice against alien cultures, the notion that 
one’s own ethnic society has a true appreciation 
of spiritual values of civilization whereas other 
groups and states are debased by materialistic 
values—these are expressions of ethnocentrism. 
It is not, however, the mere fact of preference for 
one’s own cultural values that constitutes ethno- 
centrism but, rather, the uncritical prejudice in 
favor of one’s own culture and the distorted, biased 
criticism of alien cultures. The only antidote to 
ethnocentric prejudice is comparative knowledge 
of one’s own and other cultures. This implies that 
it is possible to transcend the limits of cultural 
conditioning by empirical observation of cultural 
behavior. 

The cultural relativist, like the skeptic, main- 
tains that cultural determinism leads to ethnocen- 
trism in value judgments but, strangely enough, 
need not lead to ethnocentrism in judgments of 
fact. The anthropologist, by profession, is not 
subject to ethnocentrism so long as he reports the 
actual facts of behavior and belief of his subjects; 
he becomes ethnocentric only when he indulges 


in value judgments and comparative evaluations 
of values. If it is admitted, however, that cultural 
relativism does not preclude objective judgments 
of fact with transcultural, universal validity, why 
should it preclude judgments of value that have 
similar objectivity and validity? Only the prejudice 
induced by a positivistic philosophy of science, 
which divides the world of phenomena into facts 
and values, prevents recognition of the factual ob- 
jectivity of values. 

The advocates of cultural relativism counsel us 
to suspend comparative judgment and to grant, in 
principle, the equality or equivalent value of all 
value systems. This assumed ability to doubt and 
suspend judgment presupposes, as Descartes recog- 
nized long ago, an inherent freedom of judgment 
that liberates the mind from its own prejudices 
and past cultural conditioning. Hence, the exercise 
of freedom of judgment and rational analysis in 
accord with empirical data is as much a fact of 
human behavior as is cultural determinism, Were 
it not for man’s innate ability freely to evaluate 
and verify the truth of his ideas by subjecting them 
to empirical and critical tests, it would be impos- 
sible to overcome ethnocentric prejudices in any 
way whatsoever. Man would be the pr 
victim of his own cultural conditioning, and a sci- 
ence of anthropology would, in effect, be impossi- 
ble. The cultural relativist, in the interests of a 
science of culture, stresses cultural determinism 
but completely overlooks the primary, ineluctable 
fact that freedom of judgment and cultural crea- 
tivity are absolutely essential for an understanding 
of the very existence of cultural phenomena. 


Pragmatic attitude and cultural ideal 

It is necessary to distinguish between the uni- 
versal fact of cultural relativism and the ideal 
value of cultural relativism advocated by the liberal 
ethnologist. The fact of cultural relativism coexists, 
and is historically compatible, with the very ethno- 
centrism the ethnologist deplores. As a result of 
cultural conditioning, especially if it is of a puri- 
tanic, dogmatic nature, a people may come to Te- 
gard their own values as absolute and to disparage 
those of other societies as inferior to theirs. In an 
“enlightened” society, however, people will be 
conscious of the facts of cultural relativism and 
believe that there is no criterion for making com- 
parative value judgments. Such enlightened people 
will readily infer that they should treat all other 
cultural values as equal and equivalent to their 
own. Cultural relativism may thus be viewed as 
an ideal postulate of liberal culture which is tol- 
erant of all other cultures. It is assumed implicitly 


that there is a kind of pre-established harmony of 
cultures that makes it possible for all to coexist in 
a pluralistic cultural world. It has taken the shock 
of World War 11, with its brutalities, to awaken this 
romantic cultural optimism of our modern Can- 
dides to the reality of cultural crises and the actual 
conflict of cultures. 

Thus, the ideal of cultural relativism is to be 
contrasted with the real, ethnocentric cultural rela- 
tivism of history. As an idealist, the culturologist 
may ask other liberals to transcend and shed their 
ethnocentric prejudices, and especially their predi- 
lections for absolute values. He infers from the 
facts of cultural relativism the ideals of cultural 
reverence and tolerance. In Kantian terms, he 
might describe a new cultural imperative: so act 
as to treat all cultures as ends and never as means. 
We are not informed, however, how this leap from 
the ethnocentric real to the ideal of relative objec- 
tivity is to be made. It is equally logical, as many 
a philosopher has seen, to reach the conclusion of 
nihilism and to treat all cultural values as equally 
worthless. 


Cultural relativism involves tolerance based on 
skepticism of universal, objective standards of 
value as well as of the idea of progress. The com- 
parative study of cultures has made us conscious 
of the dangers of uncritical ethnocentrism, but it 
has also provided us with the materials and the 
incentive to transcend the limitations of both cul- 
tural relativism and ethnocentrism through the 
pursuit of scientific truths concerning facts and 
values. 

Dav BIDNEY 


[Directly related are the entries CULTURE, article on 
CULTUROLOGY; EVOLUTION, article on CULTURAL EVO- 
LUTION; OBSERVATION; VALUES, article on VALUE 
SYSTEMS. ] 
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m 
CULTUROLOGY 


Culturology is the branch of anthropology that 
treats culture (institutions, technologies, ideolo- 
gies) as a distinct order of phenomena, organized 
upon principles of its own and behaving in terms 
of its own laws. The culture process is regarded as 
self-contained and self-determined. Variations in 
culture are explained in cultural terms, rather than 
biological or psychological terms. The science of 
culture had, of course, to wait upon the develop- 
ment of a scientifically adequate concept of cul- 
ture. Preliterate peoples have been aware of differ- 
ences of custom, speech, and belief among them- 
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selves. But even so sophisticated a people as the 
Greeks of Aristotle’s day had no word equivalent 
to our term culture. The term was borrowed by 
E. B. Tylor, the great pioneer in English anthro- 
pology, from German culture historians. Tylor de- 
fined culture as “that complex whole which includes 
knowledge, belief, art, morals, law, custom, and 
any other capabilities and habits acquired by man 
as a member of society” (Tylor 1871, vol. 1, p. 1). 
He made it clear that culture was the exclusive 
possession of the human species. 

The symbolic process. Numerous and varied 
definitions of culture have been formulated since 
Tylor’s day, but his is substantially the one that 
prevails today. Culture is the name of those behav- 
ioral traits that distinguish man from other species: 
articulate speech; institutions; codes of ethics and 
etiquette; ideologies; a continuous, cumulative, and 
progressive tool process; etc. Man is unique in that 
he alone possesses the ability to symbol, i.e., to 
bestow, freely and arbitrarily, meaning upon things 
and events, objects and acts. Articulate speech is 
the most characteristic and important form of 
symboling. All culture was produced and has been 
perpetuated by symboling in general and by articu- 
late speech in particular. 

But things and events that are dependent upon 
symboling (called symbolates) may be, and com- 
monly are, considered in two different contexts. 
In a somatic context, their significance lies in their 
relationship to the human organism, and as such 
constitute behavior. In the extrasomatic context, 
symbolates are significant, not in terms of their 
relationship to the human organism that produces 
them, but in relationship to other symbolates. In 
this context they are culture. Thus, mother-in-law 
avoidance is to be regarded as behavior when 
viewed in terms of the concepts, acts, and attitudes 
of human organisms; it is culture when considered 
in its relationship to other customs, such as forms 
of marriage, place of residence of the newly mar- 
ried couple, roles of the sexes in subsistence, of- 
fense and defense, etc, Culture, therefore, is a class 
of things and events dependent upon symboling, 
considered in an extrasomatic context. 

Prior to the emergence of culturology in the ex- 
pansion of the scope of science (White 1949), 
naturalistic (i.e., nonmythological, nontheological ) 
explanations of the behavior of peoples were bio- 
logical, psychological, or sociological. Accordingly, 
peoples behaved as they did because of their physi- 
cal type; or because of the way their minds worked; 
or as a consequence of certain processes of social 
interaction. In all these explanations, man, individ- 
ually and collectively, was the independent vari- 


able; his customs, institutions, beliefs, etc., the 
dependent variable. Man was the cause; culture, 
the result. 

The culturological explanation. The culturo- 
logical revolution reversed this kind of interpreta- 
tion. Culturology asserts that peoples behave as 
they do because they have been born and reared 
in particular cultural traditions. The behavior of a 
people is determined, not by its physical type or 
genetic constitution, nor by its ideas and desires 
and hopes and fears, nor by processes of social 
interaction, but by external, extrasomatic cultural 
tradition. Peoples born into a Tibetan linguistic 
tradition will speak Tibetan, not English. A people 
practices monogamy, polygyny or polyandry, or 
loathes milk, avoids mothers-in-law, or uses the 
multiplication table, because they are obliged to 
react to these cultural traditions. The behavior of 
a people is a function of its culture. 

If the behavior of a people is determined by its 
culture, what determines the culture? The answer 
is that it determines itself. Culture may be regarded 
as a process sui generis. It is a process in which 
culture traits interact with one another, forming 
new permutations, combinations and syntheses. 
One trait, or combination of traits, is the result of 
antecedent and concomitant traits and the cause 
of subsequent traits and trait combinations. One 
form of language, writing, social organization, tech- 
nology, or of culture as a whole, grows out of a 
previous stage or emerges from an anterior condi- 
tion. 

Every sociocultural system is of course affected 
by its terrestrial and celestial environment. Cli- 
mate, topography, flora, fauna, and mineral re- 
sources may or do exert influences upon cultural 
systems. But environments merely permit or pro- 
hibit the existence of certain elements or features 
of culture; they do not determine them. The influ- 
ence of environmental factors finds expression only 
in and through cultural means; consequently, they 
may be dealt with culturologically. Certain ele- 
ments present in the environment, such as iron or 
petroleum, do not enter the culture process except 
at certain stages of development. Finally, in dealing 
with cultures in general, or with culture as a whole, 
the factor of environment may be considered 2 
constant and therefore disregarded in explanation 
of the culture process. 

Although culturology treats the culture process 
without regard to the biological and psychological 
processes of human beings, the culturologist recog- 
nizes the intimate and necessary relationship be- 
tween culture in general and man in general. 
Generically, culture is what it is because man is the 


kind of animal that he is. If man, the animal, were 
different, his culture would be different. If man did 
not have spectroscopic, chromatic vision, his cul- 
ture would be different. If he could subsist only 
upon meat or cereals, his culture would be different. 
If he had a rutting season, or reproduced with lit- 
ters rather than individuals, his culture would be 
different. Culture was brought into being by the 
human species, and it functions to serve the needs 
of this species. Thus, when one considers the ques- 
tion of the origin and functions of culture one must 
take biological man into consideration. But once 
culture has come into being, its subsequent varia- 
tions—its changes, its growth, its additions and 
subtractions—are to be explained without refer- 
ence to the animal man, either individually or col- 
lectively. We do not need to invoke man when we 
consider such questions as the evolution of mathe- 
matics or currency, sociocultural processes of inte- 
gration and disintegration, the relationship between 
social systems and technological systems, the diffu- 
sion and distribution of the keystone arch, etc. To 
be sure, these cultural processes could not take 
place without people. But it is their culture, not 
their innate natures, that causes them to behave 
as they do. Man is necessary to the existence and 
functioning of the culture process, but he is not 
necessary to an explanation of its variations. 

An atom cannot be understood merely by a con- 
sideration of its components; an atom is a system 
that must be understood in terms of itself. The 
properties of sugar cannot be discovered in its 
component atoms of carbon, hydrogen, and oxygen; 
the molecule functions as a molecular system. A 
living cell cannot be understood in terms of its 
component molecules; a biological organism cannot 
be understood in terms of its cells. Individual or- 
ganisms do not reveal the properties of societies. 
Every kind of system exists in terms of its own 
structure and functions, in terms of its own princi- 
ples and laws. In the human species, societies are 
cultural; i.e., sociocultural systems. Like other kinds 
of systems, they must be understood in their own 
terms, 

Linguistic systems are comprehended in terms of 
lexicon, grammar, syntax, phonetics, and so on. 
Languages can, of course, have no existence with- 
out human beings. But linguistic science proceeds 
as if mankind did not exist. So it is with culture 
as a whole. The evolution of culture can be worked 
out as a cause-and-effect sequence of cultural 
events, The influence of technologies upon social 
systems, the interrelationships among technologies, 
Social systems, and ideologies can be ascertained 
and measured without reference to the human car- 
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riers of these systems. Such problems as the evolu- 
tion of mathematics or of tribal confederacies, 
processes of integration and disintegration of social 
systems, or the mechanisms of the regulation and 
control thereof can be attacked and solved without 
regard to human organisms as such. We could not 
have symphonic music, trial by jury, or the Deca- 
logue were it not for respiration and metabolism, 
but a consideration of these physiological processes 
does not in the least help us to understand these 
cultural phenomena. 

The phenomena of culture, like biological and 
physical phenomena, are to be treated scientifically 
from four standpoints. Our point of view may be 
temporal or nontemporal, generalizing or particu- 
larizing. If we combine these two dichotomies, we 
get a fourfold classification of scientific interpreta- 
tion, or treatment of cultural or any other kind of 
natural phenomena, as shown in Table 1. 


Table 1 
TEMPORAL NONTEMPORAL 
PARTICULARIZING Historical Descriptive ethnography 
GENERALIZING Evolutionist Functional—Structural 


Consideration of cultural things and events from 
a temporal—particularizing point of view results in 
culture history. The temporal—generalizing ap- 
proach yields evolutionist interpretations. The non- 
temporal-generalizing interpretation deals with the 
structures and functions of sociocultural systems. 
And the nontemporal—particularizing treatment is 
descriptive ethnography. Thus, all the schools of 
ethnological theory are embraced by this classifica- 
tion of kinds of interpretation: mere ethnographic 
description; the historical schools of Graebner, 
Elliot Smith, Boas; the evolutionist schools of Tylor, 
Morgan et al.; the functionalist schools of Mali- 
nowski and Radcliffe-Brown, and their successors, 
the structuralists of modern British social anthro- 
pology. Culturology employs, therefore, these four 
ways of treating cultural phenomena. 

The culturological point of view has encountered 
considerable opposition from various quarters, Many 
scholars have insisted that it is people, not cultures, 
that meet and interact. The culturologist is accused 
of “reifying” culture into a mystic entity that exists 
apart from society. 

To consider culture—languages, institutions, 
ideologies, and technological systems—as distinct 
orders of phenomena, explainable in terms of them- 
selves, is not to reify them. They are real, observ- 
able things and events in the external world, just 
as atoms, cells, and stars are. 

The origin and development of culture traits, 
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such as jury trials, firearms, constitutional govern- 
ment, the theory of relativity, etc., cannot be ac- 
counted for psychologically; they can be explained 
only in terms of a developmental culture process. 
Again, people are necessary for the existence of 
cultural events, but they are not necessary to an 
explanation of their origins or variations. 

The psychosocial explanation. Emile Durkheim 
contrasted psychological and culturological inter- 
pretations of human behavior and institutions. 
When “one sees in the organization of the family 
the logically necessary expression of human senti- 
ments inherent in every mind, the true order of 
facts is reversed. On the contrary, it is the social 
organization of the relationships of kinship which 
has determined the respective sentiments of parents 
and children” (Durkheim [1893] 1960, p. 340). 
“Every time that a social phenomenon is directly 
explained by a psychological phenomenon, we may 
be sure that the explanation is false” (Durkheim 
[1895] 1958, p. 104). We may properly substitute 
“cultural phenomenon” for social phenomenon in 
the preceding quotation, Race prejudice, war, capi- 
talism, etc. are not to be explained as “the logically 
necessary expression of concepts and sentiments 
inherent in the human mind.” On the contrary, it 
is the structure and behavior of the extrasomatic 
culture process that establishes racial, marital, and 
capitalistic ideas and sentiments in the minds of 
individuals. 

The sociologist tends to regard culture as the 
product of social interaction, One kind of social 
interaction produces polygyny, another kind, poly- 
andry; one kind produces capitalism, another, com- 
munism. But if social interaction alone could pro- 
duce culture, we would find it among baboons. The 
institutions of polygyny and polyandry cannot be 
explained as consequences of the interaction of 
individuals. But the interaction of one man (hus- 
band) and more than one woman (wives), or one 
woman (wife) and more than one man (husbands) 
can be explained as a consequence of the influence 
of external, extrasomatic cultural structures upon 
them. And these institutions must be explained— 
in their origins and variations—in terms of other 
cultural elements, such as the requirements of the 
sex division of labor, customs of residence, occupa- 
tional hazards and death rates of the sexes, wealth 
and prestige, etc. 

There is, and can be, no justifiable conflict be- 
tween the science of psychology and the science of 
culturology; these sciences complement, rather than 
conflict with, each other. Both are essential to a 
complete comprehension of anything that man 
does as a human being. Just as the origins of insti- 


tutions must be explained culturologically, the 
experiences that people undergo within these insti- 
tutions is the concern of psychology. What are the 
conceptions and attitudes held by the individuals 
most directly involved in the mother-in-law taboo, 
namely, a man, his wife, and his wife’s mother? 
Are they imbued with supernaturalism, or are they 
naturalistic in character? Are they attitudes of re- 
spect, fear, or contempt? What is it like to be a 
wife in a polygynous household? Or a husband in 
a polyandrous one? These are questions for the 
psychologist rather than the culturologist. 

The anthropocentric viewpoint. The opposition 
to culturology arises principally from the age-old 
and deeply entrenched philosophy of anthropomor- 
phism and anthropocentrism. Man has been con- 
ceived as a prime mover, as a first cause, and is 
often endowed with free will. This is expressed by 
Sapir: “It is always the individual that really thinks 
and acts and dreams and revolts.” There have also 
been others who insist that culture originates in the 
creative acts of individuals, or that only the indi- 
vidual is real, 

This anthropocentric conception receives support 
from the fact that the behavior of all nonhuman 
species is a function of their respective biological 
constitutions. The biological principle applies to 
ducks, sharks, sunflowers, and all other nonhuman 
species. But it does not apply to human beings, who 
live in a symbolic environment and respond to 
different kinds of extrasomatic traditions. The 
institutions of man are to be explained in terms of 
culture. 

The anthropocentric point of view receives sup- 
port also from the fact that, as a biological organ- 
ism, man is a dynamic system. He reacts positively 
to his terrestrial habitat and to his enveloping cul- 
ture. But with regard to the latter, he can respond 
only within the limits set by his culture. Although 
physically and biologically a thermodynamic sys- 
tem, man is, and remains, a puppet of his culture. 

LESLIE A. WHITE 
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Iv 
CULTURAL ADAPTATION 


Culture, a uniquely human attribute, is some- 
thing which man interposes between himself and 
his environment in order to ensure his security 
and survival. As such, culture is adaptive. 

This way of looking at culture, uncommon a few 
decades ago, has within recent years won increas- 
ing support among anthropologists. Leslie A. White 
(1949, p. 360), for example, speaks of culture as 
“a specific and concrete mechanism employed by 
a particular animal organism in adjusting to its 
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environment,” and Ralph Piddington ([1950] 1952, 
p. 219) holds that “culture is essentially an adap- 
tive mechanism, making possible the satisfaction 
of human needs, both biological and social.” 

As this attitude has gained currency, cultures 
have ceased to be treated as unique constellations 
of traits, each the fortuitous product of history. A 
culture is now more frequently thought of as the 
resultant of a parallelogram of forces—forces whose 
identity can be ascertained and whose effects can 
be weighed. 

Within culture as a whole, anthropologists have 
generally recognized the adaptive function of tech- 
nology. As Robert Lowie succinctly expressed it, 
the purpose of an ax is to fell trees. Some anthro- 
pologists, however, have seemed unsure of the 
function of social organization. Lowie ([1920] 1947, 
p. 439) said it had “unknown ends,” and A. L. 
Kroeber ([1923] 1948, p. 307) found many of its 
manifestations “strange, unpredictable, . . . [and] 
unreasonable.” To Kroeber (1938, p. 308), “the 
phenomena of formal social organization . . . rep- 
resent a field of experimentation or play on the 
part of cultures....” 

That social life is not something mysterious or 
capricious, but that like technology it is adaptive, 
has long been appreciated by biologists. The ecol- 
ogist Angus Woodbury (1954, p. 412) recognized 
that social life is an adaptation for efficient use of 
time and space upon the earth, The existence of a 
social life, he thought, was forced upon members 
of a group by their need to cooperate in order to 
succeed in the struggle for existence. The zoologist 
William Etkin (1954, p. 134) wrote of the essen- 
tial role played by social behavior in an animal's 
adaptation to its environment and stated that vari- 
ous forms of group behavior were as necessary to 
survival as any of an animal’s physiological or 
structural characteristics. In their joint work, Life: 
Outlines of General Biology, J. Arthur Thomson 
and Patrick Geddes (1931, vol. 1, pp. 120-126) 
listed seven advantages of social life, and in Nat- 
ural Communities, Lee R. Dice (1952, pp. 285, 
287) enumerated nine such advantages. 

Social organization, generally, is the way in 
which the individuals of a group within an animal 
species coordinate their behavior in adapting to 
the exigencies of life. Human social organization is 
simply the way in which human beings, through 
cultural means, do the same thing. As White has 
observed: “Human beings have to be related to 
each other in an effective manner in order to carry 
on the business of life successfully. Social, political, 
ethical, artistic, ecclesiastical, and educational sys- 
tems operate to accomplish this purpose. They are 
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means of coérdinating, integrating, regulating, and 
directing human endeavor toward the goal of 
all life: a secure and agreeable existence” (1947, 
p. 183). 

But granting that social organization is adaptive, 
can the same be said, for example, of rituals and 
ceremonies? Some anthropologists have held them 
to lie outside the realm of the rational and the 
functional, and therefore to be nonadaptive. Yet 
it is possible to examine rituals and ceremonies in 
the context of a broader theory of adaptation, and 
the results of such analyses have been to throw 
new light on them. 

The celebrated potlatch of the Northwest Coast 
Indians provides an example of this. It has always 
been regarded by anthropologists not only as non- 
adaptive but also as ostentatiously wasteful. Suttles 
(1960, p. 304), however, writing of the potlatch 
among the Coast Salish, concludes that “the drive 
to attain high status is clearly not the explanation 
of the potlatch. Nor is the production of surplus. 
Nor the cooperation achieved by the potlatching 
community. The potlatch is a part of a larger socio- 
economic system that enables the whole social net- 
work, consisting of a number of communities, to 
maintain a high level of food production and to 
equalize its food consumption both within and 
among communities. The system is thus adaptive 
in an environment characterized by the features 
indicated before—spatial and temporal variation 
and fluctuation in the availability of resources.” 

Another example may be cited. After taking a 
second look at the practice of scapulimancy among 
the Montagnais—Naskapi, who used this form of 
divination in selecting hunting routes, O. K. Moore 
(1957, p. 73) suggested that “it is in essence a 
very crude way of randomizing human behavior 
under conditions where avoiding fixed patterns of 
activity may be an advantage.” In other words, by 
leading them to hunt randomly over their territory, 
and thus preventing the overhunting of any one 
area, the practice of divination may well contribute 
to the preservation of game animals and thus ulti- 
mately to improved chances for the survival of the 
Montagnais—Naskapi themselves. 

Acknowledging, then, that all aspects of culture 
may contribute to the ecological adjustment of a 
society, let us examine the process of cultural 
adaptation more closely, Basically, societies adapt 
to their environment by three means: technological, 
organizational, and ideational. 

Technology, which may be taken to include tools, 
machines, utensils, and weapons, along with the 
techniques associated with their use, is the most 
directly adaptive facet of culture. Man’s occupancy 


of every corner of the earth could not have come 
about without the implements necessary to cope 
with the peculiar problems posed by each habitat, 
This is especially well illustrated by harsh environ- 
ments, To survive in their Arctic habitat the Es- 
kimo must rely on such sophisticated and specialized 
traits as the igloo, the kayak, the seal-oil lamp, the 
dog sled, the harpoon with bladder float, and snow 
goggles, 

Other cultures in other environments furnish us 
with additional examples of highly specialized and 
effective technical adaptations. The boomerang of 
the Australian aborigines, the bolas of the Pata- 
gonian guanaco hunters, the snowshoe of the sub- 
Arctic, the blowgun of Malaysia and South America, 
pemmican among the Plains Indians, the outrigger 
canoe of Polynesia, the manioc squeezer of Ama- 
zonia, and the agricultural terraces of the Incas are 
only a few of the illustrations that could be cited. 

In addition to implements of subsistence, tech- 
nology includes weapons of war. Hostile relations 
often prevail among neighboring societies, and mil- 
itary technology may at times be as important as 
subsistence technology. 

In bronze age China, the invention and use of 
the chi, a combination of the halberd and the 
spear, is credited with helping the state of Ch'in 
to defeat its rivals and to unify China for the first 
time (Li 1957, p. 58). 

Stirrups, an Asiatic invention which reached 
Europe in the eighth century, had a profound effect 
on warfare during the Middle Ages. “The use of 
stirrups enabled an armored horseman, carrying 4 
lance at rest under his right arm, to brace himself 
in the saddle so firmly that the shock of his attack 
could combine the momentum of horse and rider. 
This feature made the mounted knight the most 
powerful instrument of medieval warfare, render- 
ing obsolete the older Roman and Germanic mili- 
tary tactics of fighting on foot in close order 
(Homans 1962, p. 396). 

The second major aspect of culture is the organ- 
izational. While it is logically distinguishable from 
the technological aspect, in actuality it is closely 
related to it. Indeed, White maintains that “social 
systems are social means of operating technological 
systems” (1947, pp. 182—183). It is certainly true 
that cooperation in human societies, especially sim- 
pler ones, exists principally for the purpose of pro- 
curing food, providing shelter, and meeting the 
demands of offense and defense. Accordingly, all 
forms of social organization involved in achieving 
these ends must be regarded as adaptive. Going 
beyond this, however, it can be argued that forms 
of organization developed to deal with an increased 


population are also adaptive, since they contribute 
to a society's integration and, therefore, to its 
survival. 

Plains Indian cultures afford a striking example 
of how seasonal aggregations of population con- 
nected with subsistence can lead to an elaboration 
of social organization. During most of the year, 
bands of a typical Plains tribe were dispersed over 
a wide area, subsisting separately from other bands 
and acting independently of them, The band chief 
had little authority, and the socioceremonial life of 
the band was simple. For the summer buffalo hunt, 
however, the bands came together and assumed a 
form of tribal organization that was distinctly more 
complex. Band chiefs, previously without superiors, 
now formed themselves into a council from which 
a paramount tribal chief was selected. The tribal 
chief wielded considerably greater authority than 
he had as band chief; it was now his responsibility 
to coordinate and direct the activities of the tribe 
as a whole. 

Men's associations, inactive during most of the 
year, re-formed when the bands came together and 
carried out activities that took place only when the 
entire tribe was assembled. One of these men’s soci- 
eties was designated by the tribal chief to serve as 
a police force charged with punishing violations of 
the strict rules that prevailed during the buffalo 
hunt, as well as with preserving order during the 
march and on the occasion of the sun dance. 

That the emergence of these structural features 
was an adaptation to the problems created by a 
supraband aggregation is demonstrated by the fact 
that all the features—the council, the tribal chief, 
the men’s societies, the police force, the sun dance 
organization—lapsed when the tribe broke up into 
its constituent units in the autumn. 

The adaptiveness of a society's internal organi- 
zation, tested repeatedly by the food quest, may be 
tried even more severely by warfare. Survival is 
the ultimate measure of fitness, and however well 
adapted a society may be in other respects, if it is 
unable to stand up to its enemies, it must be found 
wanting in its over-all adaptation. Superiority in 
military technology may be the decisive factor in 
armed conflict, but where the weapons of war are 
alike on both sides, success may well go to the 
society better organized to wage war. 

Continual and successful involvement in war 
tends to leave its stamp on all segments of a soci- 
ety, sometimes to a remarkable degree. The ancient 
Spartans afford a classic example of this. Every- 
thing in the life of a Spartan was subordinated to 
his military obligations. The institutions of war 
were paramount, and other institutions were ad- 
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justed accordingly. Among preliterate peoples, the 
Zulu and Masai of Africa and the Northern Cayapé 
of South America provide examples of the same 
phenomenon. 

The ideational aspect of culture also serves to 
adapt a society to the prevailing conditions of exist- 
ence. For example, it has been noted of nomadic 
peoples like the Eskimo, Tehuelche, and Lengua 
that even when the food supply is unusually plenti- 
ful, after a band has spent more than a few days 
or weeks in one spot, everyone becomes restless 
and anxious to be on the move. Ecological necessity 
has developed in them a psychology of nomadism. 
By being mentally adapted to normal ecological 
conditions, the individuals in a culture are always 
ready, and even eager, to make what is ordinarily 
the most appropriate response. The matter may be 
put this way: a sociocultural system works best 
when it makes people want to do what they have 
to do. 

A question that remains to be answered is how 
cultural adaptation is related to cultural evolution. 
The two are not synonymous: adaptation is the 
adjustment of a society to its external and internal 
conditions of existence, while evolution is change 
by which a society grows larger, more heterogene- 
ous, and more coherent. While adaptive changes 
are usually evolutionary, they are not necessarily 
or always so. Adaptation may sometimes involve 
simplification, as when deteriorating environmental 
conditions force a society to split into smaller 
groups, move its settlements more often, and give 
up some of its ceremonies. Here, although the 
chances of survival under worsened conditions have 
increased, complexity has decreased, so that while 
the changes are adaptive, they are not evolutionary. 
Nevertheless, most adaptive changes undergone by 
a society do tend to render it more complex and 
better integrated—in a word, more evolved. 

An evolutionary advance characteristically be- 
gins with an adaptive solution to an ecological 
problem and is followed by a series of readjust- 
ments whereby cultural elements successively fur- 
ther removed from the source of the change are 
affected by the change and accommodate them- 
selves to it. Wittfogel (1957) has suggested that 
the need to establish or extend irrigation systems 
in certain heavily populated and arid parts of the 
world brought about basic changes in the economic 
and political institutions of several of the early 
civilizations and that these basic changes eventu- 
ally ramified throughout their entire social systems. 

During the course of these kinds of readjust- 
ments within a culture, new traits appear which 
are alternative to, and therefore competitive with, 
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existing ones. In this competition traits are sub- 
jected to the cultural equivalent of natural selec- 
tion: the better adapted traits survive and expand, 
and the less fit decline and disappear. 

Competition, selection, and displacement among 
traits can readily be illustrated from contemporary 
American culture. We see them exemplified by the 
diesel engine replacing the steam locomotive and 
by the corporation replacing other forms of large- 
scale business enterprise. The operation of this 
process was recognized and expressed years ago 
by E. B. Tylor when he wrote: “the institutions 
which can best hold their own in the world gradu- 
ally supersede the less fit ones, and . . . this inces- 
sant conflict determines the general resultant course 
of culture” ([1871] 1958, vol. 1, p. 62). 


ROBERT L. CARNEIRO 


[Directly related are the entries EcoLocy; EVOLUTION, 
articles on CULTURAL EVOLUTION and SOCIAL EVOLU- 
TION; TECHNOLOGY.] 
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v 
CULTURE CHANGE 


Culture change is the conceptual formulation 
that refers to the many ways in which societies 
change their patterns of culture: Internal factors 
such as new inventions may lead to an increased 
food supply and population growth, or external 
factors such as conquest by another society may 
bring about culture change. We know from the 
records of prehistory and history that the patterns 
of culture of every human society are constantly 
changing. The rate and type of change may be 
slow and gradual, as it was during the Paleolithic, 
or fast and drastic, as it has been in contemporary 
societies. The basic problem is a question of how 
and why there are shifts in rate and type of 
change, rather than a question of static versus 
changing cultures. 

Our knowledge about culture change may be 
summarized in four basic questions: (1) What are 
the internal or external factors that generate shifts 
in rates and types of culture change? (2) What 
are the processes by which culture change takes 
place? (3) What models and methods are now 
available for the study of culture change? (4) 
How is the concept of culture change related to 
the closely associated phenomena of diffusion, in- 
novation, evolution, acculturation, and nativism? 

Factors influencing culture change. The data 
on change do not yet permit any easy generaliza- 
tions concerning the relative primacy of various 
factors that may generate significant shifts in rates 
and types of culture change. Some anthropologists 
favor a basically Marxist, or Neo-Marxist, interpre- 
tation and give primacy in their theories to such 
factors as “the amount of energy harnessed per 
capita per year” (White 1949). Some, such as 
Steward (1955), place emphasis upon ecology and 
Stress the “adaptation of a culture to its environ- 
ment” as the primary factor. Other scholars em- 
phasize the importance of “religious ideology 
(Weber 1922), cultural “themes” (Opler 1945), 
and “cultural focus” (Herskovits 1955) or, like 
Geertz (1957), point to inherent incongruities and 


tensions in social and cultural systems that gen- 
erate constant pressure for change. Still others 
have developed more specialized theories, such as 
Murdock (1949, p. 199), who concludes that “. . . 
social organization is a semi-independent system 
comparable in many respects to language, and 
similarly characterized by an internal dynamics of 
its own,” or Hallowell (1955) and Wallace (1961), 
who stress the psychological aspects of culture 
change, 

Whatever special emphasis may characterize a 
theory, it is useful to isolate three general factors 
that can influence change in a given culture: 

(1) Any change in the ecological niche occupied 
by a society influences culture change. Such a 
change may occur as a result of either (a) natural 
environmental changes, e.g., the gradual desicca- 
tion of the great basin of Utah and Nevada follow- 
ing the retreat of the last glaciation; or (b) the 
migration of a society from one ecological niche to 
another, e.g., the movement of southern Atha- 
bascans from the subarctic territory of interior 
Canada and Alaska to the arid southwestern United 
States. New cultural adaptations are always re- 
quired for survival in the case of such shifts in 
ecological niche. 

(2) Any contact between two societies with dif- 
ferent cultural patterns influences change in both 
societies, The diffusion, or borrowing, of cultural 
elements that occurs has made these exchanges 
important in culture change throughout history. 
More profound and sustained cultural contact, usu- 
ally called “acculturation,” generates even more sig- 
nificant shifts in the rates and types of change in 
the two cultures. 

(3) Any evolutionary change occurring within 
a society is obviously a factor of critical impor- 
tance. Following Murdock (1949, p. 184), “evolu- 
tion” is used here simply to designate “processes of 
orderly adaptive change.” For example, when a 
society with a food-gathering economy has domes- 
ticated its plants and animals and has thereby in- 
creased its food surplus potential, the requirements 
of the new food-producing technology and the 
ensuing population growth pose critical problems. 
The need for division and specialization of labor, 
social control, and distribution of the surplus leads 
to adaptive changes in the cultural patterns. Or if 
a society (for whatever reasons) shifts from 
matrilocal to patrilocal residence, then again there 
are adaptive changes in a whole range of cultural 
patterns, as Murdock (1949) has demonstrated. 

Processes of culture change. The study of the 
Processes of culture change refers to the actual 
social mechanisms by which the change takes 
place. Some scholars take the position that the 
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basis of all culture change is located in changes 
in the attitudes and behavior of individual mem- 
bers of a society. This point of view is found in 
Barnett (1953), who focuses upon the individual 
innovator in a society and analyzes the cultural 
conditions that stimulate him to innovate, the in- 
centives that motivate him, and the mental proc- 
esses he experiences in innovation. In this study 
Barnett also treats the basic innovative processes 
in terms of recombinations, identifications, sub- 
stitutions, etc., in mental configurations. He finds 
those who accept innovations are likely to be in- 
dividuals who are “dissident,” “disaffected,” or “re- 
sentful.” 

Other scholars take the point of view that al- 
though individuals are the carriers of a culture, 
there are processes of change in social and cultural 
systems which have dynamic properties of their 
own that can be isolated and studied. Thus, we 
can study the mechanisms of “cultural evolution” 
(White 1949; Sahlins & Service 1960), the “con- 
figurations of culture growth” (Kroeber 1944) 
over long time spans, or the “evolution of social 
organization” (Murdock 1949) over at least several 
generations. 

Systematic theories interrelating processual phe- 
nomena observed in changes in both individual 
behavior and social and cultural systems are cur- 
rently one of the outstanding problems for future 
research and analysis. Firth (1951, p. 40) dif- 
ferentiates between “social structure,” which he 
views as providing the principle of continuity in 
society, and “social organization,” in which lies 
the principle of variation or change “. . . by allow- 
ing evaluation of situations and entry of individual 
choice.” Herskovits (1955, pp. 497-514) empha- 
sizes “individual variation,” which is found even 
in the most isolated and primitive societies, and 
the importance of this variation in the mechanisms 
of culture change. Wallace (1961, pp. 143-152) 
attempts to combine individual personality and 
cognition with changes in cultural systems in his 
study of “revitalization” processes, in which he has 
described “. . . a common process structure which 
can be conceptualized as a pattern of temporally 
overlapping, but functionally distinct, stages.” 

Other recent contributions to the conceptual 
mapping of processes of culture change include 
Herskovits’ formulation (1955, p. 492) of “rein- 
terpretation” as the process by which old mean- 
ings are ascribed to new elements or by which 
new values change the cultural significance of old 
forms; Firth’s description (1951, p. 86) of the 
process of “social convection,” by which individ- 
uals not directly involved in a change in the first 
instance tend to modify their behavior to adjust 
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to the change, and the process of “social conduc- 
tion,” by which an innovation brings unforeseen 
results that must be adapted to; and Vogťs dis- 
tinction (1960, pp. 21-22) between “recurrent 
processes,” found in micro time-scales, which 
characterize the daily, seasonal, annual, and gen- 
erational life of a society, and “directional proc- 
esses,” found in macro time-scales, which involve 
nonrepetitive, cumulative shifts in the structures 
of social and cultural systems. 


Models for the study of culture change 


Culture change can be studied as it occurs over 
long time spans (e.g., the analysis of cultural evo- 
lution in human societies from the Lower Paleo- 
lithic to the twentieth century) and in the micro- 
scopic sense, as it occurs in short periods of time 
(e.g., the analysis of how individual American 
Indians altered their behavior patterns during one 
generation of education in Indian Service schools). 
The more specific methods for the study of culture 
change always require a comparative framework 
of some type. Sequences of change in individual 
behavior, in social structure, or in stages of cul- 
tural development are compared in order to yield 
systematic statements reaching beyond the con- 
crete case. 

Stable equilibrium versus constant change. 
From the writings of earlier social theorists such 
as Durkheim and Radcliffe-Brown, students of 
culture inherited a conceptual image of human 
society based upon an organismic analogy that 
emphasized the ways in which a custom, institu- 
tion, or social activity preserved or maintained the 
social order. This conceptual image has led us to 
assume that social and cultural systems tend to 
maintain equilibrium unless they come into con- 
tact with some force from the outside, or develop 
some inner strain that disturbs the equilibrium. 
The problem then is to discover how the equilib- 
rium is restored. 

From the writings of other theorists, notably 
Max Weber (1922), students of change inherited 
a more dynamic image of society. The adequacy 
of the stable equilibrium model as a conceptual 
framework for the analysis of culture change has 
been questioned by such writers as Firth (1951), 
Leach (1954), Herskovits (1955), and Vogt 
(1960), who all maintain that the basic tenden- 
cies in social and cultural systems are toward 
change, rather than toward states of equilibrium. 
Furthermore, it can now be demonstrated from 
our accumulated archeological and historical data 
that a culture is never static, but rather that one 
of its most fundamental properties is change. 


The problem for students of culture change has 
therefore become one of describing, conceptualiz- 
ing, and explaining a set of ongoing processes 
of change that occur at varying rates, move in 
varying directions, and that are triggered and main- 
tained by complex interactions of ecological, tech- 
nological, social, cultural, and psychological vari- 
ables. 

Macroscopic models. The most inclusive mod- 
els, with respect to both cultural coverage and 
time span, are those utilized by White (1949) and 
Childe (1951) for the study of cultural evolution. 
Both deal with trends and stages in the culture of 
mankind as a whole. White's most significant prop- 
osition is that technological development expressed 
in terms of man’s control over energy underlies 
social changes and cultural achievements (1949, 
pp. 363-396). Childe is especially noted for his 
formulation of the important shift from food- 
gathering to food-producing societies in early 
human history. These models, dealing with what 
Steward (1955) has called “universal evolution,” 
advance propositions about world culture as a 
whole but cannot explain particular features of 
particular cultures. An attempt to solve this con- 
ceptual and methodological difficulty has more 
recently been proposed by Sahlins and Service 
(1960), who differentiate between “general” and 
“specific” cultural evolution. They write: 


General cultural evolution . . . is passage from less to 
greater energy transformation, lower to higher levels 
of integration, and less to greater all-around adapt- 
ability. Specific evolution is the phylogenetic, rami- 
fying, historic passage of culture along its many 
lines, the adaptive modification of particular cultures. 
(p. 38) 


In his model, Steward proposes a different meth- 
odological solution described as “multilinear evo- 
lution,” in which “regularities in culture change” 
are studied by comparing sequences of change in 
a series of particular cultures. His classic paper 
(1955, pp. 178-209) utilizing this model is his 
comparison of the development of “irrigation civi- 
lizations” in five arid or semiarid areas of the 
world; Peru, Mesoamerica, Mesopotamia, Egypt, 
and China. Although more recent empirical evi- 
dence has not supported his generalizations coni 
cerning irrigation works as the crucial factor in 
all these areas, the comparative framework he 
proposes is a fruitful one that can be utilized on 
other ranges of culture change data. 

Another model that involves social and cultural 
changes over relatively long time spans is that 
of Redfield, whose classic study (1941) of the 
shift from “folk” to “urban” society has stimulated 


many further studies of culture change, including 
the present interest in the emergence of and 
changes in “peasant society.” Drawing upon the 
earlier ideas of Durkheim, Maine, Morgan, and 
Tonnies, Redfield designed a field study that com- 
pared four communities in Yucatan: a tribal vil- 
lage, a peasant village, a town, and a city. Using 
degree of isolation and degree of homogeneity in 
culture patterns as the independent variables, he 
shows how three types of culture change accom- 
pany lessening isolation and homogeneity: disor- 
ganization of the culture, secularization, and indi- 
vidualization. 

Redfield’s model generates a number of inter- 
esting propositions. Perhaps the major methodo- 
logical difficulty in his Yucatan study was that 
two quite different types of phenomena were inex- 
tricably intertwined. Although his cases certainly 
range from “folk” to “urban” types of society, they 
are also cases that have involved a massive con- 
frontation between the Mayan Indian and the 
Spanish cultural traditions. It is difficult to dis- 
tinguish between changes that have derived from 
a shift in type of society from folk to urban and 
changes that have derived from the contact be- 
tween Mayan Indians and Spaniards. 

A more recent model dealing with sweeping 
changes in societies over long time spans is that 
of Rostow (1960), whose formulation of the stages 
of socioeconomic growth ranging from “a tradi- 
tional society” to a stage of “durable consumers’ 
goods and services” is currently influencing anthro- 
pological thought about the processes of culture 
change in the modern nations of the world. Other 
aspects of this long-range process of growth and 
development (at least in our own Western society ) 
are treated in such studies as Riesman’s charac- 
terization (1950) of the sequence of “tradition- 
directed” to “inner-directed” to “other-directed” 
systems. 

Microscopic models. Among the many models 
that are less extensive in scope, but still focused 
upon changing social and cultural systems, are 
those of Murdock (1949) and Leach (1954). 
Murdock (1949, pp. 201-221) is mainly concerned 
with changes in social organization, and he singles 
out the “rule of residence” as the aspect of social 
structure that acts as a filter, capable of respond- 
ing to a variety of quite diverse external stimuli 
but in only a limited number of ways. He proposes 
that change in a social system regularly begins 
with a modification of the rule of residence, and 
other changes in kinship structure and terminology 
follow in predictable order. Leach, on the other 
hand, utilizes what might be called an “oscillation 
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model” in his description of Kachin society, where 
there is a regular oscillation between two polar 
types of political value systems to which the people 
can appeal and still be considered members of the 
society. 

On a still more modest scale of comparative 
analysis, there are models for the study of changes 
in individual and group behavior within one cul- 
ture. An example of such a model is that of Spin- 
dler and Goldschmidt (1952), who present an ex- 
perimental design for the study of sociological and 
psychological variables in the changes that are 
occurring in individual and group behavior among 
the Menominee Indians of Wisconsin. 

Finally, mention should be made of the “genetic 
model” proposed by Romney (1957), which pro- 
vides for the analysis of culture change a com- 
parative framework that offers the possibility of 
controlling geographical and historical factors to 
the maximum extent and of utilizing the full range 
of linguistic, archeological, and ethnographic data 
as these become available. The model, building 
upon earlier suggestions by Sapir (1916) and 
Eggan (1954), assumes that genetically related 
tribes, as determined by related languages, com- 
mon physical type, and shared systemic patterns, 
form a “genetic unit” that derived from a small 
protogroup with a protoculture at some time in the 
past. The term “genetic” implies nothing beyond a 
concern with origins and mode of development of 
a unit of culture history. It is not assumed that all 
the people in the genetic unit necessarily descend 
from the ancestral group in a strict biological sense; 
biological mixture is expected wherever people of 
the genetic unit are in contact with other groups. 
What is required is a distinguishable physical type 
that converges, rather than diverges, as one goes 
back in time. 

The model resembles that of the zoologist who 
views a certain species of animal as evolving and 
making an adaptive adjustment to a given ecologi- 
cal niche and then radiating from this point as the 
population expands into neighboring ecological 
niches. As the population moves into different 
ecological settings, further adaptive variations 
occur in the species. But these variations are 
traceable to the ancestral animal, or, in other 
words, back to the prototype. In applying the model 
to human populations, we may assume that a small 
protogroup succeeds in adapting itself efficiently 
to a certain ecological niche and in developing 
certain basic systemic patterns, which constitute 
the basic aspects of the protoculture. Once the 
adaptation proves to be efficient, the population 
expands, and the group begins to radiate from this 
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point of dispersal. As members split off from the 
protocommunity, move into neighboring ecological 
niches, and come into contact with alien cultures, 
they make appropriate adaptations to these new 
situations and begin to differentiate—that is, there 
are adaptive variations from the prototype as the 
members of the genetic unit spread from the dis- 
persal area. The use of this model not only makes 
possible the controlled comparative study of pre- 
historic and historic sequences of culture change 
in a geographically and historically meaningful 
unit but also sets the stage for more precise 
structural-functional comparisons of the various 
branches of the genetic unit as these continue to 
undergo change in the contemporary world. An 
example of a recent application of the model may 
be found in Vogt (1964). 

Evon Z. Vocr 
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CULTURE AND PERSONALITY 


Culturally constituted social groups are as neces- 
sary for human existence as are any of man’s vital 
organs. Nevertheless, as even cursory observation 
of children reveals, the acquisition of culture is 
often accompanied by conflict and struggle; and 
conformity with cultural rules and norms is fre- 
quently associated with frustration and tension. 
These two facets of culture—its capacity for both 
gratifying and frustrating human needs—consti- 
tute the basis for the two major axes of social 
science inquiry, stability and change; together, they 
constitute the generic problems of culture-and- 
personality research: What are the psychological 
conditions that promote persistence and innovation 
in human social and cultural systems? 


Historical background 

Despite the evident importance of its problems, 
culture-and-personality, as a formal discipline, is 
the youngest and smallest branch of anthropology: 
With some few exceptions, it hardly exists outside 
of the United States. Culture-and-personality, like 
biochemistry, falls between two academic stools, in 
this case anthropology on the one hand and the 


psychological sciences on the other. Thirty-five 
years ago there was no way of bridging the gap 
between them, Anthropology, to the extent that it 
was theoretically oriented, was concerned with such 
macroproblems as historical developments and evo- 
lutionary trends, for which the concepts of aca- 
demic psychology were, or were deemed to be, 
irrelevant. The psychological sciences, on the other 
hand, were tied to the ahistorical, acultural labora- 
tory; their molecular concepts were obviously 
unrelated to the sociocultural concepts of anthro- 
pology. Only psychoanalytic psychology—not part 
of the academic establishment—seemed to see a 
bridge between the data obtained in the clinic and 
the social sciences. Earlier, Freud had made peri- 
odic forays into anthropology, but attempts to link 
the disciplines met with strong opposition on the 
part of social scientists. 

The raw data of anthropological field work ex- 
posed living human beings in all their complexity, 
acting within a specified historical—cultural matrix. 
It is no accident that it was the field anthropolo- 
gists, confronted with the necessity of relating 
traditional, descriptive cultural constructs to the 
ongoing social life about them, who first attempted 
to build this bridge in their search for the psycho- 
logically “genotypic” bases of their “phenotypic” 
cultural constructs. It is no accident, either, that 
psychoanalytic theory, however modified, was seized 
upon first as a possible theoretical tool. For psycho- 
analysis, unlike academic psychology, is concerned 
with molar rather than molecular psychological 
variables. Despite its ahistoricism, psychoanalysis 
anchors its patients in their own ontogenetic his- 
tories, thereby providing a link with the anthropo- 
logical notion of culture as a configuration of 
learned customs. Hence, Edward Sapir, the gen- 
erally acknowledged founder of the field, was influ- 
enced by his personal contacts with the psychiatrist 
H. S. Sullivan; Margaret Mead, by her contacts with 
E. H. Erikson; Ralph Linton and Cora DuBois, by 
collaboration with Abram Kardiner. 

Personality studies 35 years ago were unsystem- 
atic; and cultural studies, with their heavy de- 
scriptive and empirical emphases, were self-con- 
sciously nontheoretical. Hence, a large share of 
culture-and-personality theory was, and continues 
to be, devoted to the exploration of empirical and 
analytic relationships. At that time, anthropologists 
had few techniques for the study of personality. 
The time-honored techniques of gross observation 
and informant interviews were inadequate for per- 
sonality investigation. Again anthropology looked 
to the psychological sciences for assistance: depth 
interviews, Rorschach tests, doll-play, dreams, life 
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histories, systematic observation of family interac- 
tion, and other techniques were borrowed in whole 
or in part from clinical psychology and psychiatry. 
Where disciplinary boundaries are rigidly drawn, 
this influx of exotic tools—combined with the 
exotic concepts of psychoanalytic theory and learn- 
ing theory—could hardly have been expected to 
attract discipline-bound researchers. 

Some anthropologists avoided culture-and-per- 
sonality because of the imprecise and inconclusive 
nature of its findings. This is not to say, as is 
sometimes charged, that its studies are “soft,” or 
lacking in rigor—the best of its studies are cer- 
tainly as rigorous, conceptually and methodologi- 
cally, as their counterparts in other branches of 
social and cultural anthropology. It is, rather, that 
the very nature of its problems, given the research 
tools available, results in findings that sometimes 
lack the unqualified conviction characteristic of 
research in more structured fields. 

As the field has matured, its initial period of 
heavy borrowing has come to an end. Methods and 
theories borrowed from other fields have become 
transformed by the new discipline for its own prob- 
lems and its specific theoretical aims—and, al- 
though it continues to be catholic in its sources of 
stimulation, it has been developing its own. In- 
creasingly, data cited in favor of a theory, and 
theories used for hypothesis derivation, originate 
in culture-and-personality research, rather than in 
the psychiatric clinic or the experimental labora- 
tory. Moreover, as data accumulate, it is becoming 
increasingly possible to test hypotheses statistically 
by means of large-scale, cross-cultural samples. 
Although these studies, pioneered by Whiting and 
Child (1953), have been subjected to serious criti- 
cism, their findings have been sufficiently sugges- 
tive to hazard the prediction that such studies will 
increase both in number and in importance. 

Despite its small number of practitioners, culture- 
and-personality has been extraordinarily diversified 
in its research interests. Although culture-and- 
personality has explored many topics that have 
been neglected by other anthropologists, it has also 
explored many of the topics that are of greatest 
interest to them. In both cases, psychological vari- 
ables have played a prominent role in its research. 
The rationale for this emphasis must be sought 
within the explanatory framework of anthropologi- 


cal thought. 


Explanation in anthropology 

Anthropological theory has, in general, been cast 
in four explanatory modes—historical, structural, 
causal, and functional—that, when analyzed, can 
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be reduced to the causal or the functional mode. 
Thus, historical explanations, to the extent that 
they are scientific explanations rather than uncon- 
trolled speculations, are really causal explanations. 
The mere listing of a series of events that are 
chronologically prior to the appearance of the cus- 
tom to be explained does not constitute explana- 
tion, unless it can be shown that one or more of 
these events was a condition—either necessary 
and/or sufficient—for the appearance of the cus- 
tom. If this can be demonstrated, the explanation 
of the custom’s origin is causal, the fact that it 
originated in the past being incidental to the theo- 
retical aim, which is to provide an explanation for 
a certain type of social or cultural innovation. 

Structural explanations can also be shown to be 
either causal or functional. Those which show the 
configuration (or interrelationships) in a set of 
customs are essentially descriptive. The data are 
ordered according to a coherent plan. If a theory 
is offered to explain the configuration (structure), 
then this theory is necessarily either causal or 
functional. Structural explanations that purport to 
explain a custom or set of customs in terms of 
some “principle” that it embodies are either verbal 
labels, serving to classify a set of data according 
to a heuristic scheme (such as the “principle” of 
the unity of the sibling group), or phenomenolog- 
ical principles of the actors (cognitive maps), in 
which case they are members of a cognitive subset 
in a set of causal variables, Similarly, explanations 
that stipulate either the structural requirements of 
a system or its structural “implications” are either 
causal or functional. Causal explanations are con- 
cerned with the antecedents of a system; functional 
explanations, with its consequences. 

If anthropological explanations can be classified 
as either causal or functional, what role has culture- 
and-personality played in either or both types of 
explanation? 


The human social order as a normative order 


Social systems are characterized by a configura- 
tion of reciprocal roles that are shared by the mem- 
bers of a social group and are acquired from a 
previous generation. These roles serve to satisfy the 
three functional requirements of any society— 
adaptation, adjustment, and integration. Within 
zoological perspective, the unique feature of human 
social systems does not reside in the fact that their 
constituent roles are acquired through Jearning— 
for this is probably characteristic, although not to 
the same degree, of all mammalian social systems 
—but that many of the customs comprising these 
roles are based on rules and norms. 


A custom, as this term is used by anthropologists, 
refers to any socially acquired behavior pattern that 
is widely, if not uniformly, performed by the mem- 
bers of a society or by one of its constituent social 
groups. In general, it is possible to distinguish two 
types of customs. First, there are those customs 
whose occurrence is based on prescriptive or pro- 
scriptive norms. Their performance may thus be 
characterized as being isomorphic with a norm or 
set of norms, These may be termed Type-1 customs, 
Second, there are customs whose occurrence, al- 
though not based on norms, nevertheless reflects 
them. If a norm, or set of norms, is proscriptive 
with respect to one subclass of behavior patterns 
but permissive with respect to all other members 
of the class, the latter, if they occur, reflect the 
norms but are not isomorphic with them. For exam- 
ple, marriage with any female not included in 
incest proscriptions reflects, but is not isomorphic 
with, the proscriptive norms. These customs may 
be termed Type-2 customs. 

Although the occurrence of Type-1 customs is 
mandatory and the occurrence of Type-2 customs 
is voluntary, the performance of the latter is also 
normative in the sense that behavior falls within 
a normative, i.e., permissible, range of variability. 
Type-2 customs are also governed by socially 
shared rules that define proper performance. In the 
absence of such rules, behavior patterns are habits 
that describe individual action; they are not cus- 
toms, which describe social interaction. Knowledge 
of habits and customs enables us to predict 
behavior. 


The human social order as a cognitive order 


Culture, as a normative system, is a functional 
requirement of a human social order. Because of 
the enormous degrees of biological plasticity and 
cognitive ingenuity that, together, produce the 
broad variability characteristic of human behavior, 
the absence of this normative dimension would 
render human social systems impossible. The range 
of behavior patterns potential in any individual is 
much broader than the limited range required for 
the performance of any custom or set of customs. 
Beyond a certain critical point, whose limits are 
still unknown, variability in behavior precludes the 
very possibility of custom. In other words, human 
societies have had to set limits (by means of pre- 
scriptive and proscriptive norms, and of rules) to 
the range of permitted variability in customary be- 
havior. The cultural dimension of human social 
systems is made possible by the very capacity for 
symbolization. It is to human societies what lim- 
ited plasticity and biological determination are to 


insect and mammalian societies. By selecting the 
optimal range required by the operation of a par- 
ticular social system, from the potential range of 
variability characteristic of the species, it ensures 
an important degree of uniformity and predicta- 
bility, thereby rendering social order possible. In 
short, the invention of culture allowed man to com- 
bine the short-run adaptive value of social order 
with the long-run advantage of flexibility. 

A human social order, then, is a cognitive order. 
The set of customs that comprise the constituent 
roles of a human social system consist, in the first 
instance, of a set of norms and rules—cognitive 
variables—that either prescribe or regulate be- 
havior, These norms and rules must be cognized 
by the members of the society if the social system 
is to be maintained. The acquisition of cognitions, 
then, as well as of complete cognitive maps, is a 
necessary psychological condition (cause) for the 
performance of single customs and for the main- 
tenance of an entire social system, Cognitions can 
regulate behavior, but they are not identical with 
it. The conventional statement “Social behavior is 
learned” implies two kinds of learning, cognitive 
and behavioral. That is, the performance of a cus- 
tom requires that the actors learn about the cus- 
tom (acquisition of cognitions) and that they learn 
to perform what they have learned (acquisition of 
behavior). In general, culture-and-personality has 
devoted less attention to the former type of learn- 
ing than to the latter. Nevertheless, the recent in- 
terest in cognition and in ethnoscience, and the 
development of such techniques as componential 
analysis, will probably redress the balance. 

Since customary behavior is governed by rules, 
proper performance of customs requires that the 
actors be capable of evaluating and regulating their 
own behavior in terms of these rules. Evaluation, 
then, is yet another cognitive basis for the main- 
tenance of social systems, although it has received 
little attention from culture-and-personality. 

Despite their importance, cognition, learning, 
and evaluation do not constitute necessary and suf- 
ficient psychological prerequisites for the perform- 
ance of customs. For, although anthropological in- 
formants are entitled to explain their own behavior 
by the ubiquitous “It is our custom,” anthropolo- 
gists cannot abdicate their scientific responsibility 
by this obviously redundant explanation. Customs 
cannot compel the occurrence of behavior; they 
can only channel behavior once it occurs. Unless 
the actors are more highly motivated to perform a 
custom than to perform competing, alternative be- 
havior patterns that compose their behavioral reper- 
toire, the learning of a custom, however normative, 
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will not ensure its performance. In short, the per- 
formance of customs, like other forms of behavior, 
must be instigated by the intention or expectation 
of satisfying a need or set of needs. A great deal of 
culture-and-personality research has been devoted 
to the study of motivation and its ontogenesis in 
the socialization process. 

In sum, cognition, learning, motivation, and 
evaluation—all psychological variables—constitute 
necessary and sufficient conditions (causes) for the 
proper performance of customs and, hence, for the 
maintenance of social systems. This analysis be- 
comes much more complicated when it is observed 
that the relationship between the performance of 
customs and the satisfaction of needs is not always 
a simple one. For Type-2 customs—in which “per- 
formance” means the practice of personally pre- 
ferred behavior patterns—this analysis, with proper 
qualifications, will serve as a simplified model. For 
Type-1 customs, in which “performance” means 
compliance with prescriptive or proscriptive norms, 
further elaboration is required. 


The human social order as a moral order 


A normative order, it has been indicated, consists 
of two related, but discrete, variables: norms and 
rules. “Norms” prescribe the occurrence of behav- 
ior; proscriptive norms prescribe avoidance behav- 
ior. “Rules” regulate or govern behavior (whether 
it is prescribed or permissive) once it occurs. 
Norms and rules ensure the uniformity, and hence 
the predictability, of behavior. Rules are ethically 
neutral; norms implicitly incorporate a set of moral 
values and, therefore, also function to preclude the 
overt expression of motives that are, or are deemed 
to be, socially harmful or disruptive. Norms also 
ensure the performance of activities that are, or are 
deemed to be, socially desirable, if not necessary. 
In terms of the moral order, deviation from norms 
is viewed as an offense to the social order. The 
limits that norms impose on behavioral variability 
are intended to ensure the occurrence of culturally 
normative behavior, as well as to regulate the per- 
formance of socially normative behavior. 

Because of man’s plasticity and cognitive in- 
genuity, what any actor must do, as defined by cul- 
tural norms, in order to participate in his social 
system is not the same as what he can do; thus, 
what he must do may conflict with what he would 
like to do. Hence, tension between personal needs 
and cultural norms, as well as the internal pres- 
sures to eliminate this tension, are omnipresent in 
any society. In general, tension is resolved in favor 
of the norms, resolution being mediated by anxiety. 
This will be moral anxiety, if the norms have been 
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internalized (superego); or social anxiety, if pun- 
ishment is known to be the consequence of devi- 
ation. Although anxiety can serve to motivate 
compliance with Type-2 customs, the source of 
deviation (the original need) is inhibited but not 
extinguished. For the existence of these norms im- 
plies that the motivational systems of the actors 
are either indifferent to, or opposed to, the prescrip- 
tive or proscriptive behavior. If the latter is the 
case, the frustration of the proscribed motive has 
important consequences for cultural stability and 
change. 


Frustration, sociocultural persistence, change 


If the expectation of gratifying drives, including 
anxiety, constitutes a necessary (although not a 
sufficient) condition for the persistence of social 
systems, the frustration of drives constitutes a 
necessary (although not a sufficient) condition for 
the disruption of and change in these systems. Cul- 
tural norms, which proscribe the satisfaction of 
certain drives, constitute one importance source of 
frustration. The social structure is another impor- 
tant source. For example, if a structural arrange- 
ment such as stratification prevents certain groups 
in society from satisfying culturally approved needs 
by denying them access to those roles whose per- 
formance can satisfy these needs, the attendant 
frustration may be as great as frustration induced 
by proscriptive norms. Both types of frustration 
constitute potential sources of social and cultural 
deviation, innovation, and change. 

Paradoxically, they also constitute sources of 
cultural and social persistence. If motives cannot 
be satisfied directly, because of cultural or social 
impediments, need-frustration may be averted by 
distortions in any of the motive’s four dimensions— 
drive, goal, act, and agent. These changes may suf- 
ficiently alter the original meaning of the motive 
so that, in its disguised form, the need may seek 
gratification in indirect, but culturally approved, 
ways. Frequently, the means for such distortion of 
motives, and consequent gratification of needs, are 
found in elements of the social and cultural sys- 
tems. Although customs—and even roles—making 
up the social system are often used in this fashion, 
it is the cultural system that, par excellence, serves 
the function of gratifying these forbidden motives, 
For example, forbidden dependency or hostility mo- 
tives are often disguised and, consequently, grati- 
fied in subservient or dominant political roles, re- 
spectively. Religion, art, folklore, ritual, and the 
like may function as culturally constituted defense 
mechanisms by which frustrated motives may be 
disguised and, hence, gratified in a culturally ap- 


proved manner. This, of course, is not their only 
function. They have other latent, not to mention 
manifest, functions of a nondefensive character, 
When elements of the social and cultural systems 
serve to gratify otherwise frustrated and potentially 
disruptive needs, the needs, in turn, constitute 
powerful motivational causes for the persistence of 
these systems. Culture-and-personality research has 
only begun a systematic exploration of this 
important field of research, 

Sometimes, however, socially and culturally in- 
duced need-frustrations are not resolved in this 
manner; and this frustration results in deviation, 
either cultural or psychological, In defending the 
self against internal conflict, some defense mech- 
anisms are adopted in which the cognitive distor- 
tion of motives is of sufficient magnitude to cause 
behavior that, psychologically bizarre and/or so- 
cially disruptive, may be diagnosed as mental ill- 
ness. In other cases, defense mechanisms may be 
avoided in favor of direct expression of forbidden 
motives, resulting in crime and delinquency. This 
type of deviation has received less attention from 
culture-and-personality research than has psycho- 
logical deviation, probably because its incidence is 
much less frequent in the primitive societies whose 
study has engaged traditional anthropological at- 
tention. 

Need-frustration is a necessary (but not a suf- 
ficient) condition for social and cultural change, as 
well as for psychological and cultural deviation. 
Indeed, the border line between change and devia- 
tion is often a tenuous one. If certain cultural 
norms or structural arrangements systematically 
frustrate important needs, indigenous or borrowed 
innovations in norms or in social structure, which 
might otherwise have been ignored or labeled as 
deviation, become the basis for sociocultural cumu- 
lation and change. In general, culture-and-person- 
ality research has paid more attention to dramatic 
change than it has to gradual change. (This is 
found, for example, in studies of nativistic and 
revivalistic movements. ) 

If motivation is a necessary stimulus to change, 
cognition is a necessary basis for change. Any 
change entails cognitive restructuring; the maS 
dramatic the change, the greater the restructuring 
required. Although cognitive maps, or perceptual 
sets, seems to be highly resistant to change, it 18 
frequently observed that the acceptance of innova- 
tions is impeded by cognitive blocks. If innovations 
are accepted, they are sufficiently assimilated to 
traditional cognitive orientations, often rendering 
dramatic changes that are more apparent than real. 
Culture-and-personality has devoted relatively little 


attention to cognitive variables in studies of 
sociocultural change. 

Psychological variables, such as learning, cogni- 
tion, perception, and motivation, are as important 
in causal explanations of change in sociocultural 
systems as they are in explanations of structural 
persistence. 


Functionalism in culture-and- personality 

Culture-and-personality theory, by the very na- 
ture of its explanatory concepts, is both causal and 
functional in character. If customs are performed 
because of the expectations of satisfying needs, at 
least one of their functions (manifest and/or 
latent) is the satisfaction of the biological and 
psychological requirements of the actors. However, 
customs are units within a sociocultural system; 
functionalism, as a theory that purports to explain 
the maintenance of social and cultural systems, is 
concerned with the functional requirements of 
social groups taken collectively. Since the func- 
tional requirements of group existence are satisfied 
not by the existence of customs but by their per- 
formance, and since performance is caused by the 
expectation of satisfying needs (personal func- 
tions), social functions are served in the process 
of serving personal functions. Obviously, an ex- 
planation of the latter is contingent upon an 
explanation of the former. Thus, when personal 
functional requirements are not satisfied by the 
performance of customs, these customs may not 
occur, When this happens, their functions are not 
served, and changes and/or disruptive influences 
may take place in both the social and the cultural 
systems. 

Personality variables, although always causal, 
may not always be important for understanding 
social functions, If all of the functions of a custom 
are recognized (manifest functions), motivation 
may be taken for granted; an understanding of a 
custom’s social functions does not entail explicit 
knowledge of the needs that instigate its perform- 
ance. This is not true of unrecognized (latent) 
functions, especially of those which are served by 
the performance of customs whose motivation is 
at least partially unconscious. Since unconscious 
motives are frequently expressed in the perform- 
ance of customs whose manifest functions are un- 
related to these motives, and since the motives are 
disguised because of the disruptive sociocultural 
impact of their undisguised expression, the latent 
social functions served by the performance of these 
customs are of the utmost importance for an under- 
standing of sociocultural persistence. 
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[Other relevant material may be found in COGNITIVE 
THEORY; CULTURE; LIFE CYCLE; SOCIALIZATION; and 
in the biographies of BENEDICT and SAPR.] 
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CULTURE AREA 


Culture areas are geographical territories in 
which characteristic culture patterns are recogniz- 
able through repeated associations of specific traits 
and, usually, through one or more modes of sub- 
sistence that are related to the particular environ- 
ment. As one formulation within the general school 
of historical particularism that has developed in 
anthropology in the United States, the concept of 
culture area reflects the theoretical position that 
each culture, on whatever level it may be analyzed, 
must be examined with regard to its own history 
and also with regard to the general principles of 
independent invention, culture borrowing, and cul- 
tural integration, Although many factors at the 
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base of any recognizable culture area are ecological 
in nature, the culture-area concept is one that con- 
forms to the doctrine of limited possibilities rather 
than to a simple geographic determinism. 

Viewed in this light and assessed according to 
the size and character of the geographic units and 
the degree of complexity of cultural similarities 
within, and differences beween, the units, the cul- 
ture-area concept takes shape as a classificatory 
device of marked utility in describing the cultural 
regions of the world. Since “culture” and “area” are 
both generalized terms, their use in combination 
gives no real clue as to precise meaning, which 
must be specified. When contrasting one culture 
area with another, the level of abstraction must be 
the same. 

In its original formulation the culture-area con- 
cept applied primarily to the ethnographic present 
and occupied an important niche in the natural- 
history phase of anthropology that was concerned 
with the orderly description of the cultures of the 
world. The geographic distribution of culture traits 
within such areas served as indirect evidence for 
the reconstruction of cultural histories. The formu- 
lations for each of the major continents were used 
for convenience in the ordering of ethnographic 
descriptions but were otherwise ignored or dis- 
carded as being too limited in time, too static in 
concept, and too generally conceived to be of much 
use to the developing trends of concern with inter- 
personal and social dynamics. The steady expan- 
sion of archeological research, which furnishes 
direct evidence for the construction of the historical 
chronicle in local terms, reduced the role of indirect 
evidence furnished by contemporary data in the 
reconstruction of culture history. Although the 
culture-area concept went into temporary eclipse 
as a tool for theoretical research, it was still re- 
tained for the arrangement of museum collections, 
for which it was originally devised, and for the 
presentation of descriptive data on the classroom 
level (e.g., Herskovits 1955; Keesing 1958). It 
should, however, be noted that efforts to sketch a 
culture-area map of Asia persisted into the 1950s, 
as a move to complete the world picture. The or- 
ganization of data in culture-area terms persists in 
standard anthropological works of the present day 
(e.g., Gibbs 1965; Murdock 1959). The utility of 
the concept with regard to cultural dynamics and 
other current interests appears in Service’s discus- 
sion of differences in acculturation in colonial Latin 
America that were conditioned by the aboriginal 
culture-area patterns (1955) and in such studies 
as those of Hallowell (1946) and Devereux (1951), 
which deal with personality types characteristic of 


specific culture areas and their survival through 
time and acculturation. 

The culture-area concept can add insight to the 
processes of culture history by filling in the archeo- 
logical record (see, for example, Steward 1955, 
chapter 11); in the mapping of culture areas or of 
trait or trait-complex distributions for successive 
periods, the same general areas or boundaries show 
tendencies to survive (Bennett 1948; Kroeber 1944; 
Smith 1952) or recur (Ehrich 1956; 1961). Cul- 
ture-area mapping must initially be done with re- 
gard to single periods, but it is the repeated geo- 
graphical and distributional patterns that give some 
intimation of physiographic and ecological influ- 
ences, and the dynamic processes of cultural for- 
mation and adjustment must in each case be 
separately analyzed and evaluated. 

Wissler is generally considered to have formu- 
lated the culture-area approach during the course 
of arranging the ethnological exhibits of the North 
American Indians for the American Museum of 
Natural History; his first major work on the sub- 
ject appeared in 1917. Kroeber (1939, pp. 4-8), 
although describing Wissler’s approach as of grad- 
ual, empirical, almost unconscious growth, gives 
him full credit for the codification and develop- 
ment of then current usages, the recognition of the 
stabilizing effects of environment on cultural pat- 
terns, and the foundation of the idea of temporal 
culture climax by his enunciation of spatial culture 
centers. 

Driver (1962), however, points out that as early 
as 1904, Kroeber himself dealt with areal subdivi- 
sions of California, and that Wissler first men- 
tioned the culture area in 1906, Also in 1904 
Livingston Farrand suggested a seven-part classi- 
fication of North American Indians, including 
considerations of both geography and culture, and 
then discussed them at some length (1904, pp. 
101-194). Holmes (1903), writing on museum 
exhibits, mapped the North American Indians ac- 
cording to 19 geo-ethnic groups, which correspond 
well to the groupings in the later work of both 
Wissler and Kroeber. Furthermore, Kroeber (1939, 
P. 7, note 6) cites an article by O. T. Mason, pub- 
lished in 1896, that recognizes 18 culture areas Or 
environments in the Western Hemisphere. Museum 
exhibits of ethnographic materials had been geo- 
graphically organized for some years (Wallace 
1887), and this approach to ethnographical data 
was clearly derived from zoogeography. 

It is significant that the initial growth and for- 
mulation of the culture-area concept took place 
with regard to the North American Indians, for 
whom the documented ethnographic evidence was 


reasonably full and for whom the environmental 
settings were contrasting and limiting. 

Methodological considerations. The initial ob- 
jective of the culture-area concept as a classifica- 
tory device is the organization of the vast number 
of individual cultures into a coherent system of 
units that can be analyzed and compared. Such an 
ordering of data is a preliminary step in the study 
of cultural dynamics and culture history, and it is 
static only insofar as one wishes to treat its descrip- 
tive categories as ends in themselves. 

Although there is a general tone or pattern to a 
culture area, the distributions of its elements are 
not necessarily uniform, and Kroeber’s concept of 
climax (1939, pp. 4, 5, 222 ff.) refers to peaks of 
intensity. The boundaries between areas are not 
necessarily distinct, for recognizable cultures within 
a given area may contrast with those of neighbor- 
ing ones, and if the boundaries are not sharply 
delineated, zones of composite culture or blended 
traits may make the transition from one to another 
a matter of gradation. Within a single area, how- 
ever, as in the southwestern United States and in 
the Congo region of Africa, quite different ways of 
life may coexist as characteristic patterns. 

In a hierarchical classification the criteria se- 
lected as determinants become more numerous and 
more detailed as the levels of categories become 
more specific. In this sense, Kroeber’s 84 divisions 
in “Cultural and Natural Areas of Native North 
America” (1939) are a more detailed elaboration 
of Wissler’s original major culture areas; and Mur- 
dock’s article on South American culture areas 
(1951), using nine major types of positive infor- 
mation, does not merely attempt to revise Steward’s 
formulation (1946-1959) but also actually in- 
creases the number of recognizable subareas. It 
seems at least in part to be a matter of categorical 
level that leads to divergence in the assessment of 
the theoretical significance of the concept. Thus, 
Naroll, in discussing the major culture areas of 
Asia, writes, in support of Wissler’s theoretical in- 
terpretations, that “while the environment does not 
of itself either produce or determine cultural pat- 
terns, it does have a powerful influence on them; 
it not merely states the economic problems which 
people must solve, but in each ecological region 
it tends to standardize some particular pattern 
which the people have chosen as a solution” (1950, 
P. 186). On the other hand, Murdock, while min- 
imizing its theoretical importance, describes it as 
“nearly as useful in ordering the immense range 
of ethnographic variation as is the Linnaean sys- 
tem in the ordering of biological forms” (1951, 
p. 415). 
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Operationally, it makes little difference whether 
one’s original approach is through the somewhat 
intuitive recognition of similarities and differences 
in integrated patterns viewed as cultural wholes, 
whether it is based on detailed distribution studies 
of traits and trait complexes, or whether it stems 
from a delineation of geographic and ecological 
factors. All three procedures must be brought into 
play, and distributional studies, such as cross- 
cultural surveys and the Human Relations Area 
Files documentation, can serve as controls. 

Kroeber’s recognition of areal distinctions pre- 
ceded the extensive work on trait-element distribu- 
tions in California (Driver 1962). On the other 
hand, the Midwestern Taxonomic System of archeo- 
logical classification (McKern 1939), although not 
originally concerned with spatial considerations, 
did show the areal distributions of “aspects” in a 
methodology that seems clearly derived from the 
California studies. Although distributions of specific 
North American Indian traits, as mapped in Driver 
and Massey (1957), did not result in an automatic 
delineation of culture-area boundaries, correlations 
of traits did fall consistently into areal clusters 
(Driver 1962, p. 23). On the other hand, Naroll 
(1950, p. 186) paid as much attention to ecological 
frontiers as to cultural. An additional factor seems 
to be that geographical entities such as river sys- 
tems or plains areas may focus human contacts 
inward, resulting, on the one hand, in a form of 
isolating mechanism that establishes consistency in 
culture patterning, while at the same time tending 
to delimit independent trait and trait-complex dif- 
fusion (Ehrich 1956). Also to be noted are contin- 
ued attempts to produce temporally flat geographic 
maps of culture patterns at given points in time. 

Time depth. Although much of the work on 
culture-area analysis and delineation has been car- 
ried out with relation to the ethnographic present 
or to particular periods, some attempts have been 
made to consider the concept with relation to time 
depth. These efforts have two major orientations. 

(1) The first of these is the persistence of the 
same culture patterns or configurations in given 
areas over long periods. One aspect of continuity 
analysis is implicit in Kroeber's Configurations of 
Culture Growth (1944), in which he uses con- 
figurations to apply to long-continuing traditions in 
the civilizations of the Old World, particularly with 
regard to their geographical locations. The chang- 
ing interests and bursts of energy that occur at vari- 
ous times, he isolates as temporal climaxes within 
the main stream of the localized configuration. 

Another aspect is the persistence of established 
traditions or substrata that may have conditioning 
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effects upon the direction of acculturation. Writing 
on the northwest coast of North America and on 
Latin America, respectively, Smith (1952) and 
Service (1955) bring into focus the influences of 
traditions typical of particular areas upon the local 
patterns of culture change. 

Into a third category of continuities falls Ben- 
nett’s development of the “co-tradition” as formu- 
lated for Peru, as applied experimentally to the 
archeology of the southwestern United States, and 
as suggested for Mesoamerica (1948). The Bronze 
Age culture of mainland Greece, Crete, and the 
Cyclades would also seem to fall within the “co- 
tradition” pattern. The concept is one of somewhat 
similar and related, although recognizably distinct, 
cultures that persist for reasonably long periods 
within an area. The term “tradition” connotes per- 
sistence, and the significance is that of cultural 
linkage, either of enduring parallels descending 
from common or related origins and remaining in 
contact or of strong acculturation or convergence. 

(2) A second orientation of the culture-area 
concept with regard to time bears no relation to the 
continuity of cultural tradition. It is becoming evi- 
dent, especially in archeological contexts, that the 
mapping of culture areas at different periods re- 
veals regions and boundaries that persist or reap- 
pear, even when peoples with distinctly different 
culture patterns overrun the territory. Sooner or 
later the same geographic lines tend to re-establish 
themselves. A striking example of this thesis is the 
close correspondence between the regionalization 
of contemporary Anglo-American civilization in the 
United States and the culture areas of the North 
American Indians. Here we have the replacement 
of a population by new peoples with a new tech- 
nology, but the geographical and ecological factors 
have reasserted themselves. This would seem to 
indicate a form of geographic uniformitarianism 
that, despite cultural discontinuities, brings about 
the persistence or the re-emergence of areas and 
boundaries at various times, from the period of 
earliest settlement onward. Similar processes have 
marked the culture history of the Mediterranean 
and the Middle East and of Europe (Ehrich 1956; 
1961). 


The culture-area concept is a means of organiz- 
ing a vast amount of variegated ethnographic data 
into comprehensible units within a classificatory 
system. Like all such systems, it depends upon an 
increasing number of criteria or determinants in 
the isolation of units in a descending order of mag- 
nitude. Major considerations in recognizing these 
areas and subareas are ecological zones, patterns of 


cultural integration, and correlations of independ- 
ently diffused traits. Although initial recognition 
may depend in part upon familiarity and intuition, 
distribution studies serve as effective controls. Im- 
portant determining processes seem to be cultural 
adjustments to the environment and the inward 
focusing of contacts within an area, caused by 
regional topographic patterns which produce cul- 
tural isolates. These factors persist through time 
and find expression either in the continuities of 
cultural traditions or in the reappearance of the 
same areas and boundaries, even when the local 
culture history is discontinuous. 

The concept is far from static and orders cul- 


_ tural information in a form that makes it useful 


for comparative analyses and an understanding of 
cultural dynamics, processes, and cultural history. 
Roser? W. EHRICH AND 
GERALD M. HENDERSON 


[See also AFRICAN SOCIETY; CARIBBEAN SOCIETY; ECOL- 
OGY; ETHNOLOGY; INDIANS, NORTH AMERICAN; MID- 
DLE AMERICAN SOCIETY; OCEANIAN SOCIETY.] 
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CUNNINGHAM, WILLIAM 


William Cunningham (1849-1919), British eco- 
nomic historian, was born in Edinburgh, Scotland, 
the third son of James Cunningham, a prominent 
Edinburgh solicitor. After attending Edinburgh 
Academy, Cunningham became a student at Edin- 
burgh University and studied for a short time in 
Tübingen. In 1869 he entered Caius College of 
Cambridge University to read moral science, and 
upon his graduation in 1872 he was bracketed top 
with F. W. Maitland, an honor that earned Cun- 
ningham a scholarship to Trinity College. He was 
a man of “overflowing vitality” (Scott 1920); this 
trait enabled him, after his ordination in 1873, to 
combine, with outstanding success, clerical and 
academic work. Between 1874 and 1878 he served 
in the industrial cities of Yorkshire and Lancashire 
as one of the first lecturers in the Cambridge Uni- 
versity extension scheme. Before this time his aca- 
demic training and interests had been mainly 
Philosophical, but he now turned increasingly to- 
ward economics and history. 

He returned to Cambridge in 1878. The follow- 
ing year he was appointed an examiner in the 
Cambridge history tripos and found that the exam- 
ination included a paper on political economy and 
€conomic history, although the latter subject was 
not being taught. Cunningham decided to teach 
economic history, and the first edition of The 
Growth of English Industry and Commerce (1882) 


was prepared as a textbook for his courses. From 
1884 to 1888 he was a university lecturer in his- 
tory and from 1888 to 1891 a lecturer at Trinity 
College. His appointment as Tooke professor of 
economics at King’s College, London, from 1891 
to 1897, was a tribute to a rapidly established rep- 
utation. He would have resided in London had he 
not been elected in 1891 to a fellowship of Trinity 
College. An honorary fellowship of Caius College 
followed in 1895. In 1899 he was lecturer in eco- 
nomic history at Harvard; he returned to Harvard 
as Lowell lecturer in 1914. He was active in the 
economic section of the British Association (presi- 
dent in 1891 and 1905) and was a foundation 
member of the British Academy, president of the 
Royal Historical Society from 1910 to 1913, and 
proctor for the clergy in convocation (helping to 
write, in 1906, an important report on clerical 
poverty). 

It was at Cambridge that Cunningham’s great 
and enduring work in economic history was done. 
He was, more than anyone else, responsible for the 
recognition and establishment of economic history 
as an independent discipline in the universities of 
Britain, where it is the only branch of history with, 
usually, its own department and, often, its own 
degree, Cunningham’s achievements were, first, to 
demonstrate the importance of economic history 
for both history and economics and, second, to 
stimulate research and teaching in economic his- 
tory by a stream of publications of high quality. 
Cunningham gave economic history in Britain both 
status and content: before him, only J. Thorold 
Rogers could claim to have been an economic his- 
torian in the professional sense; after him, eco- 
nomic history attracted some of the most talented 
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of historians and inspired some of the best his- 
torical writing. 

Cunningham, with his contemporary W. J. Ash- 
ley, outlined the subject matter of economic his- 
tory, established its methods of inquiry, and defined 
its problems for at least the next half century. 
Cunningham’s Growth was the first major histori- 
cal work to sketch systematically the periods and 
features of English economic history. Cunningham 
was responsible for many revisions of accepted 
historical opinion, the most enduring being his 
favorable revaluation of mercantilism. His eco- 
nomic history suffered, however, from being too 
simply conceived as the story of national develop- 
ment within a framework largely established by the 
political historians. Economic legislation, and the 
policy behind it, played the major role in Cunning- 
ham’s economic history. This led him into bias; for 
example, he believed mistakenly that Edward u1 
had pursued a consistent economic policy and, for 
a later period, he exaggerated the extent of the 
transition in England from state control to laissez- 
faire. 

All of Cunningham’s views were carefully estab- 
lished and strongly held, and he was constantly in- 
volved in controversy about history, economics, 
theology, and academic and clerical administra- 
tion. Two controversies, one academic and one 
political, were particularly important. The aca- 
demic controversy concerned method. Cunningham 
believed that history was essential to economics 
in order to prevent economics from becoming an 
abstract branch of social philosophy, and his ad- 
vocacy of the historical method precipitated a 
minor Methodenstreit in England, He argued that 
economists were unable to give guidance on con- 
temporary affairs because of their ignorance of 
facts and of “the empirical study of the phenomena 
of the past.” He believed in “the relativity of eco- 
nomic doctrine” and that economic history was 
“the best propaedeutic to political economy.” In a 
famous article Cunningham attacked the econo- 
mists for what he called “the perversion of eco- 
nomic history” by the neglect of “the serious study 
of facts” (1892); this provoked a pained reply from 
Alfred Marshall and caused him to revise the 
economic history in Book 1 of his Principles of 
Economics. It also established more firmly the 
claims of economic history to a place in the train- 
ing of economists. 

The second controversy concerned protection. 
Cunningham was an early critic of laissez-faire 
and had much sympathy for the Sozialpolitik of 
the German nationalist economists, He was, in- 
deed, a patriot, with a strong sense of Britain’s 


greatness and imperial responsibilities. This great- 
ness, he believed, was based mainly on industrial 
strength, which in the long run could only be 
preserved by trade protection within an imperialist 
framework. He was thus for many years one of the 
few academics who vigorously supported the pro- 
tectionist movement. He did not live long enough, 
however, to see either the resurgence of empirical 
economics or the end of free trade. 

R. M. HARTWELL 


[Directly related is the entry History, article on Eco- 
NOMIC HISTORY.) 
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CUVILLIER, ARMAND 


Armand Joseph Cuvillier was born in Paris in 
1887. He entered the Ecole Normale Supérieure in 
1908 and became agrégé de philosophie in 1919. 
After holding positions at collèges in Montluçon, 
Bourges, Strasbourg, Laon, and Paris, he taught 
sociology at the Sorbonne from 1945 to 1953. 

Cuvillier is primarily a historian of ideas. His 
interest in French socialism at the time of the 
revolution of 1848 goes back to 1919, when he 
published his thesis on L’Atelier, a workers’ periodi- 
cal edited between 1840 and 1850 by followers of 
Philippe Buchez. In 1937 he wrote an essay on the 
differences between Proudhon and Marx (1937a), 
and in his article “Les antagonismes de classes 
dans la littérature sociale française de Saint-Simon 
à 1848” (1956b) he showed to what extent the 
notion that class antagonism is inherent in indus- 
trial society had developed prior to the earliest 
Marxist writings. 

Another major concern of Cuvillier’s has been 
to introduce the work of Durkheim and his fol- 
lowers to a larger public. Anxious as he has been 
to preserve Durkheim’s heritage, Cuvillier has 
found it necessary to separate the properly scien- 
tific part of Durkheim's work from what he has 
called his “sociologism”: thus, he has denounced 
Durkheim's attempt to arrive at moral conclusions 
on the basis of the study of social facts, and he 
has accused Durkheim of transforming society it- 
self into the fundamental value. In spite of such 
criticism, Cuvillier has defended Durkheim repeat- 
edly against more extreme critics. In an article, 
“Durkheim et Marx” (1948a), he concluded that 
Durkheimism and Marxism cannot be simply op- 
posed as conservative and revolutionary sociology, 
respectively, or as idealist and materialist con- 
ceptions: they have common elements, and both 
are forerunners of the sociology of knowledge. 

Cuvillier has done useful work both as an editor 
and as a writer of teaching documents. He has 
edited the works of Proudhon, Spinoza, Roustan, 
Condillac, Malebranche, and Durkheim; particu- 
larly valuable is his reconstruction of Durkheim's 
course on pragmatism and sociology from the notes 
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of two students. His textbook, Introduction à la 
sociologie, was first published in 1936 and has 
since been revised six times and translated into 
several languages. A handbook of sociology in two 
volumes, Manuel de sociologie (1950), has become 
a useful tool for sociologists because of its great 
wealth of references. 

In 1953 Cuvillier warned that the traditional 
French conception of sociology as an objective 
science based on empirical investigation was in 
danger (Où va la sociologie francaise? . . .). Assert- 
ing the importance of the formula attributed to 
François Simiand, “no facts without ideas, no 
ideas without facts,” he focused his attack on the 
work of Georges Gurvitch, whom he berated for 
having created an arbitrary, formalistic sociological 
typology, divorced from facts. Simultaneously, he 
deplored a tendency in French sociology to conduct 
empirical research without theoretical foundations. 


JOSEF GUGLER 


[For the historical context of Cuvillier’s work, see 
Marxist SOCIOLOGY; and the biographies of DURK- 
HEIM; MARX; PROUDHON; SIMIAND.] 
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CYBERNETICS 
The term “cybernetics,” designating a distinct 


field of activity, appeared on the scientific scene 
at the close of World War 11, with the publication 
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of Norbert Wiener’s book Cybernetics: Or Control 
and Communication in the Animal and the Ma- 
chine, Wiener defined the term “cybernetics” as 
“the entire field of control and communication 
theory, whether in the machine or the animal” 
(Wiener 1948, p. 19); he was unaware that the 
term had been used, in a more limited sense, a 
century earlier by André Ampére (1834). 

Since 1948, research and publications related to 
cybernetics have proliferated, unfolding the con- 
tent of cybernetic concepts and their impact on 
fields ranging from psychology and neurophysiol- 
ogy to sociology and philosophy of science, This 
continuing clarification of the meaning and impli- 
cations of cybernetics has influenced attitudes 
toward and usage of the term, as well as our under- 
standing of it, thereby blurring Wiener’s initial 
definition. A brief look at some of the forces that 
shaped its development will help in understanding 
what “cybernetics” means today. 

In his personal review of the subject, Wiener 
recounts that while working on the theory of an 
automatic system for aiming antiaircraft guns he 
and his colleagues were impressed with the critical 
role of feedback in the proper functioning of a con- 
trol system. This led them to conjecture that in 
order for a person to perform motor activities, his 
cerebellum must embody types of feedback and 
associated information processes comparable to 
those used in an artificial control system. If this 
were so, then the brain could be viewed as a com- 
plex communication, computer, and control sys- 
tem; and the concepts of feedback and control 
theory could account for internal homeostatic con- 
trol (for temperature, blood-sugar concentration, 
heart action, etc.), as well as for control of those 
motor actions required for purposeful manipulation 
of external objects. Implicit in these notions was 
the further thesis that those cognitive activities in- 
volved in higher-level problem-solving behavior 
also could be interpreted mechanistically in terms 
of the flow and processing of information. 

The concepts of cybernetics, emphasizing an in- 
formation-processing analysis of the mechanisms 
that generate purposeful behavior, excited the in- 
terest of some psychologists, physiologists, and 
even psychiatrists. Psychologists saw a way of re- 
lating behavior to the underlying information proc- 
esses that control behavior. Neurophysiologists 
found that the brain and nervous system could be 
analyzed as a special-purpose computing machine 
“designed” to generate adaptive, intelligent behav- 
ior. And for psychiatrists, Wiener argued that func- 
tional mental disorders in the human are primarily 
diseases of memory caused by errors introduced in 


the processing of information and are not neces- 
sarily indicative of a physiological or anatomical 
breakdown of the brain mechanism. Thus, Wiener’s 
writings suggested that problems in the psychology 
of behavior, the physiology of the nervous system, 
and the psychopathology of mental disorders could 
all be described in the neutral language of informa- 
tion processing and control. 

Because of the central importance of the concept 
of information, a second major force behind the 
development of cybernetics was the publication in 
1948 of Shannon's paper “The Mathematical The- 
ory of Communication.” Here was a theory that 
explicated quantitatively one measure for the 
amount of information conveyed by messages. The 
theory showed how to determine the capacity of a 
communication channel. One could now compute 
how much more information one channel could 
transmit than another. Shannon's theory clarified 
the important concept of a code and showed how 
to determine the efficiency of a given coding sys- 
tem. The theory also demonstrated how to combat 
the destructive effects of noise by introducing re- 
dundancy into coding schemes. Shannon's mathe- 
matical theory of communication not only expli- 
cated all of these key concepts but also proved some 
surprising mathematical relationships between 
noise, redundancy, channel capacity, and error- 
free transmission of messages [see INFORMATION 
THEORY], 

Clearly, the digital computer was a third force 
pushing and molding the development of cyber- 
netics. The first electronic digital computer was 
completed in 1946, and the following years brought 
swift advances in computer theory, technology, and 
applications. Switching speeds and memory capac- 
ities increased by several orders of magnitude. 
Input-output devices and information conversion 
equipment of great diversity were developed. Theo- 
retical foundations emerged in the form of a theory 
of automata and information machines. More reli- 
able equipment, more flexible programming Jan- 
guages, and a steady decline in costs all contributed 
to the ever widening use of computers. The appli- 
cation of computing machines spread from scien: 
tific calculations to automatic control and business- 
data processing—in fact, into almost every facet 
of government, industrial, and military information 
processing. One of the most interesting applications 
is simulation, where the computer is used as 4 
general-purpose research vehicle to generate the 
logical consequences of arbitrary assumptions about 
a complex process. Thus, one can get new insights 
about a complex process by having a computer 
simulate its behavior, whether the model be for 


some aspects of the economy or for some neuro- 
physiological structure. In this way psychologists 
searching for theories of problem-solving behavior 
(for example, the cognitive behavior associated 
with proving theorems of logic) have attempted to 
simulate aspects of such behavior by using a com- 
puter. Similarly, neurophysiologists seeking an un- 
derstanding of the neural organizational principles 
that give rise to pattern recognition, learning, and 
similar processes have simulated with computers 
the behavior of networks of idealized neurons [see 
CoMPUTATION; SIMULATION]. 

Wiener’s notions about the brain and the com- 
puter, Shannon’s theory of information, and the 
new computing technology created optimism about 
new ways to attack the formidable problems of 
thinking and knowing. This atmosphere of excite- 
ment and ferment, accentuated by hopes of inter- 
disciplinary unification, generated much competent 
work. Unfortunately, it also produced serious in- 
tellectual and semantic misunderstandings about 
computers, brains, and people; and confusions be- 
tween information and meaning, between amount 
of information and entropy. These difficulties caused 
some nonsensical claims to be offered under the 
banner of cybernetics. The more responsible work- 
ers criticized this pseudoscientific fringe, thereby 
contributing to a reversal in attitudes. The new 
tendency was to regard cybernetics with suspicion 
and disdain, 

Thus, there developed—and still exist—conflict- 
ing attitudes toward cybernetics and what its sub- 
ject matter really is. Vagueness about the meaning 
of “cybernetics” has been compounded by the fact 
that since around 1955 the subject of cybernetics 
has enjoyed a wide acceptance and publicity in the 
Soviet Union, where it is now interpreted most 
broadly and used to describe all studies and tech- 
niques that relate even in the most remote way to 
computers, information processing, communica- 
tions, or control systems. 

Today, almost two decades after Wiener’s book, 
“cybernetics” still means different things to differ- 
ent people. For some, cybernetics is not a “new 
science” but merely a collection of techniques, 
studies, and devices clustering around information 
processing. For those who accept this interpreta- 
tion, “cybernetics” is but a fancy name for the 
application of certain techniques to related fields— 
for example, the application of information theory 
to analysis of coding and redundancy in the visual 
Sensory system. 

Others equate cybernetics with automation and 
its accelerating thrust into all facets of human 
activity. They recognize that cybernetics not only 
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changes favorably the face of our society but also 
initiates sociological problems of great magnitude 
—such as technological unemployment and social 
conflict resulting from the increasing replacement 
of men by machines. 

Finally, many interpret cybernetics as a new, all- 
inclusive, and powerful way of analyzing complex 
systems, from machines to society itself, in terms 
of the flow and processing of information. Some of 
these see a deeper significance to the underlying 
logical structure of cybernetics. The concepts of 
cybernetics do, in fact, offer hope for a new unity 
in our understanding of those processes that under- 
lie the activities of knowing. 

For some, the real intellectual wealth of cyber- 
netics lies not in its analogies between the com- 
puter and the brain—though these analogies are 
fruitful—but rather in the realization that both 
systems, natural and artificial, can be analyzed in 
terms of the same cybernetic language, the lan- 
guage of information and control (see MacKay 
1957 for a more detailed discussion). The concepts 
of this language are potentially rewarding because 
they span the traditional gap between the psychol- 
ogy of behavior and the physiology of those mech- 
anisms that generate behavior, including cognitive 
behavior. Thus, cybernetics offers an effective new 
language for analyzing those information mech- 
anisms and processes associated with behavioral 
aspects of thinking and knowing. The language of 
cybernetics may prove rich and versatile enough to 
permit a theory of knowing—a science of knowl- 
edge—to be expressed in terms of cybernetic con- 
cepts. 

Those concepts suggest even more than how to 
grasp and formulate the relationship between the 
information-flow organization of the brain and in- 
telligent behavior. There is no reason to believe 
that the human brain and nervous system is opti- 
mally organized. One might find principles, framed 
in cybernetic language, that show how to design an 
artifact able to learn more quickly, remember and 
associate better, act faster and more reliably, or 
solve problems more ingeniously than humans, One 
might find design principles radically different from 
those embodied in the human, and build (or grow) 
highly intelligent artifacts. All of this presupposes, 
of course, a theory of thinking and knowing (as 
exists for engineering communications) that can 
be used to judge which cognitive system is optimal 
relative to some aspect of information processing. 
Be clear about what this means. Cybernetics today 
offers no laws describing what kinds of information- 
flow structures are necessary to produce various 
dimensions of intelligent (mindlike) behavior. 
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There exist only the faintest outlines of such or- 
anizational principles. Nothing, however, contra- 
dicts the thesis that such design principles exist, 
can be described, and can be implemented. Cyber- 
netics offers both a language and a set of concepts 
to use in molding these principles into a theory 
relating information processing to the activities of 
learning, thinking, knowing, and understanding 
(see Maron 1965). 

M. E. Maron 


[Directly related is the biography of WrenER. Other 
relevant material may be found in CoMPUTATION; 
INFORMATION THEORY; SIMULATION. ] 
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DARWIN, CHARLES 


Charles Robert Darwin was born in 1809 at 
Shrewsbury and died in 1882 in Down House in 
Kent, He was the next to youngest child in a family 
of four sisters and an older brother. His father, 
Robert Darwin, was a physician of imposing pres- 
ence and severe character, His paternal grand- 
father, Erasmus, had also been a doctor by pro- 
fession. By avocation, however, Erasmus Darwin 
was a naturalist, whose Zoénomia, a disquisition in 
verse on the course of nature, has usually been 
considered, quite incorrectly, to have been an antic- 
ipation of Charles Darwin's theory of evolution. 
Darwin’s maternal grandfather was Josiah Wedg- 
wood, founder of the pottery firm and friend of 
Erasmus Darwin in the Birmingham scientific 
circle known as the Lunar Society. Darwin be- 
longed, in short, to a provincial family of means 
and intelligence which associated itself with the 
professions, intellectual pursuits, and commerce, 
Although his father expected him to choose a pro- 
fession, he never needed to earn a living. In 1839 
Darwin, too, married a Wedgwood, his cousin 
Emma, The seven (of ten) children who survived 
infancy and their descendants have sustained the 
notable literary and scientific tradition which dis- 
tinguishes the family. 

Originally intended for medicine in his turn, 
Darwin detested the study of it at the University 
of Edinburgh. His father allowed him to move to 
Cambridge, with a view to his becoming a clergy- 
man. At both universities the hobby of natural his- 
tory absorbed him. He read; he collected beetles 
and prepared specimens; he went on geological and 
entomological forays, He might have become a 


clerical naturalist like Gilbert White, whose Natu- 
ral History and Antiquities of Selborne had charmed 
him in boyhood. Instead, a decisive opportunity 
intervened. In 1831 he was offered the (unpaid) 
post of naturalist aboard H. M. S. Beagle, then out- 
fitting for a voyage to collect geographical informa- 
tion. This was to be no mere trip around the world, 
The expedition lasted five years. It was the making 
of Darwin as a scientist, and it cost him his health. 
Sir Gavin de Beer’s biography has established the 
strong probability that the chronic physical pros- 
tration which debilitated Darwin throughout the 
rest of his life was the manifestation of Chagas’ 
disease, a malady identified only in 1909, which 
involves infestation of the heart muscle and intes- 
tines by Trypanosoma cruzi. These bacteria are 
carried by the great black bug of the pampas, 
benchuca (Triatoma infestans), and while in Peru 
Darwin recorded having watched with careful in- 
terest one of these insects biting his finger and 
engorging itself on his blood. 

Darwin returned to England in 1836, On the 
Origin of Species appeared in 1859. In the interval 
of over twenty years Darwin published descriptive 
accounts of the zoological and geological findings 
of the voyage. His Journal of Researches in 1839 
was followed by a series of geological treatises: one 
on coral reefs in 1842, a second on volcanic islands 
in 1844, and a third on South America in 1846. 
These writings won him the reputation of an accu- 
rate, thorough, and reliable naturalist. In the early 
1850s he published in four volumes a complete 
study of fossil and living barnacles. Here his work 
was that of a descriptive biologist of great finesse, 
capable also in experimental inquiry. Meanwhile, 
he was quietly maturing his views on evolution, 
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these too deriving from observations and reflections 
started on the voyage of the Beagle. 

Theory in biology. The theoretical originality 
of Darwin's central work is more difficult to appre- 
ciate than its content is difficult to understand. 
On the Origin of Species does not on the face 
of it read like a book animated by theory, either 
abstruse or abstract, Paragraphs follow one out of 
another setting forth evidence for the mutability 
of animals. That notion was not in itself a new one. 
Intimations and even evidence of the transforma- 
tion of species had often before been adduced—by 
Lamarck, by Darwin's grandfather, and by many 
others, Certainly Darwin marshaled his materials 
with a fidelity to fact and a range of relevance that 
proved telling. Nevertheless, what was original was 
not the assertion but the explanation of evolution 
in the theory of natural selection. For that theory 
constituted a new notion of biological order. Al- 
though Darwin's mundane style tended to obscure 
its novelty, the Origin is in fact a work of theory; 
it changed the way of seeing the phenomena of life. 

Prior to the assimilation of Darwin's work, no 
consensus existed about the mode of biological ex- 
planation. Only in the early nineteenth century did 
biology as a professional discipline emerge from 
the combination of the tradition of natural history 
with that of the medical or mechanistic study of 
anatomy and physiology. In Darwin’s youth two 
distinct traditions governed the modeling of theory 
—the one organismic and the other teleological. 
The former arose in eighteenth-century romantic 
feeling about nature and harked back to ancient 
Stoicism. According to that philosophy, nature is 
the source of vitality, morality, and destiny, Goethe’s 
vision of universal metamorphosis exemplified this 
cast of mind, and its foremost scientific exponent 
was Lamarck. In post-Darwinian polemics Lamarck 
has often been represented by those hostile to the 
theory of natural selection as the predecessor who 
had formulated whatever was original and valid in 
the doctrine of evolution, That argument is untrue 
to the facts and unfair to the thought of both 
Lamarck and Darwin. In Lamarck’s view, cyclic 
equilibrium obtains in the world between the con- 
tinuously constitutive forms of activity ( electricity, 
fire, life itself) and the perpetually disintegrating 
tendency of the physical environment. The innate 
self-perfecting drive of living beings never alto- 
gether prevails over the dead hand of inorganic 
matter, and the tableau of animals each in its kind 
represents the highest point to which every species 
has attained in the continuous process of becoming. 
Assertions of the inheritance of acquired characters 
and the law of use and disuse (these are later 


terms) may, indeed, be found in Lamarck’s writ- 
ings, although only as incidental to his view of flux 
and process in the progression of life, not as ex- 
plaining actual evidence of variations. 

Over against this conception of life as plastic 
power, novel in emphasis and appeal if not in prin- 
ciple, persisted the strong tradition of teleology. 
Engrained since Aristotle, that habit of mind had 
survived the scientific revolution of the seventeenth 
century with its scope reduced to the degree that 
it had become the mode of explanation appropriate 
to the description of organisms, Within this re- 
stricted domain teleology had if anything been 
fortified by the popularity in eighteenth-century 
natural theology of the argument from design, 
which held that the existence of God is demon- 
strable in the ease and skill with which He had 
specially created each species to fit its own environ- 
ment and manner of life. In scientific practice, 
moreover, teleology answered in detail to the prob- 
lems of functional analysis of anatomical structure, 
exhibiting how in plants and animals the part 
serves in exact respects the purposes of the whole. 
Such was Cuvier’s method in comparative anatomy. 

Background of “Origin of Species.” The prob- 
lem of adaptation, in short, was the crux of bio- 
logical explanation. Aboard the Beagle Darwin al- 
ready saw it to be so, Leaving home a believer in 
the fixity of species, like Cuvier or Paley, he had 
the facts of geographical variation of species con- 
tinuously borne in upon him throughout the exten- 
sive experience of the voyage. He told in his Auto- 
biography what lines of evidence led him first to 
doubt and then to change his mind: 


. .. I had been deeply impressed by discovering in the 
Pampean formation great fossil animals covered with 
armour like that on the existing armadillos; secondly, 
by the manner in which closely allied animals replace 
one another in proceeding southwards over the [South 
American] Continent; and thirdly, by the South Ameri- 
can character of most of the productions of the Gala- 
pagos archipelago, and more especially by the manner 
in which they differ slightly on each island of the 
group. ([1876a] 1958, p. 118) 


Upon these manifold indications of variation of 
species in geographical space the new science of 
geology superimposed evidence of their modifica- 
tion in time. Stratigraphy had become a definitive 
science in the decades between 1810 and 1830. The 
first truly historical science, stratigraphy system- 
atically ordered the geological succession according 
to the fossil population characteristic of the several 
formations and systems of the rocks. While an 
undergraduate, Darwin had worked one summer as 
assistant to one of the founders of stratigraphy, the 


Reverend Adam Sedgwick, a professor at Cam- 
bridge. Sedgwick, like many among the early geol- 
ogists, held that the evidence for geological revolu- 
tions confirmed Biblical accounts of the Creation 
and the Flood. That view proved venial scientifi- 
cally, however, for their intensive study of the fossil 
record made clear the progression of forms of life 
in the course of the ages. 

Catastrophists of Sedgwick’s persuasion took 
those ages to be brief and punctuated by cata- 
clysms. Increasingly, however, skeptical critics came 
to think of the span of the past as very long—long 
enough for all changes to come about gradually. 
Indeed, the mentor to whom Darwin professed the 
deepest of his scientific obligations was not a biol- 
ogist at all, He was Sir Charles Lyell, whose Prin- 
ciples of Geology, published in three volumes 
between 1830 and 1833, was the earliest fully com- 
prehensive work in that science. It combated the 
doctrine of geological catastrophes in the name of 
the uniformity and virtual eternity of nature and 
taught Darwin to think of the cumulative efficacy 
of minute changes operating progressively across 
vast reaches of time, 

Further insights came to Darwin as he puzzled 
over the problem of species in the months follow- 
ing his return to England in the autumn of 1836. 
Soon thereafter he started a notebook on the sub- 
ject, the first in what was to become a voluminous 
series. He worked the while, so he later said, on 
Baconian principles, collecting facts wholesale. One 
of his concerns was inheritable modifications. Such 
modifications are not to be confused with character- 
istics acquired, in the Neo-Lamarckian sense, as a 
consequence of a creature’s individual necessities 
—the biceps in a blacksmith’s arm, for example. 
Any theory of evolution must, of course, suppose 
that the variations which are its subject matter are 
cumulative over generations. From the outset Dar- 
win had the wit to see that if a theory of evolution 
were not to beg the question, variations must be 
supposed random in origin, cumulative in effect, 
and indefinite in extent. In thinking about how 
they may be accumulated, Darwin meditated often 
upon the way in which kennel owners, cattle 
farmers, stablemen, nurserymen, and pigeon fan- 
ciers select stock for breeding—how, operating 
upon successive generations of domesticated ani- 
mals and plants, the breeder picks for reproduction 
those individuals that exhibit the characters desired 
and thus appears to mold the species. 

As early as 1837, then, Darwin had all this: geo- 
graphical evidence of variation; geological proof of 
succession; a uniformitarian sense of the history 
of nature; and the notion of random, heritable, and 
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cumulative modifications in form, modifications 
produced by a process of selection, albeit an artifi- 
cial one. Even so, he saw no prospect of oversetting 
by means of such indirect evidence the well-nigh 
universal conviction that species are fixed in nature 
itself, based as that belief was on the universal 
observation that within the experience of everyone 
like reproduces like, as well as on the plausible sup- 
position that the adaptation of species to all sorts 
of specialized lives argues the creation of each to 
lead just that life we see it living. 

Darwin disposed of a set of facts and ideas, in 
other words, and needed a theory; indeed, he was 
constantly looking for one. It came in one of those 
flashes of insight which is better called catalytic 
than coincidental or subconscious. In October 1838 
he chanced to read for distraction Malthus’ essay 
on population. The argument persuaded him at 
once [see POPULATION, article on POPULATION THE- 
oRIES; and the biography of MALTHus], It must, in- 
deed, have been familiar, for it was one of the 
commonplaces of contemporary political economy, 
requiring only to be brought together with Darwin’s 
problem for its relevance to flash into his mind: 


. . . being well prepared to appreciate the struggle for 
existence which everywhere goes on from long-con- 
tinued observation of the habits of animals and plants, 
it at once struck me that under these circumstances 
favourable variations would tend to be preserved, and 
unfavourable ones to be destroyed. The result of this 
would be the formation of new species. Here, then, I 
had at last got a theory by which to work. ([1876a] 
1958, p. 120) 


He worked for twenty years before publishing, 
although he did write out a draft in 1844 for cir- 
culation among friends and colleagues, notably 
Lyell and J, D. Hooker, the botanist. In May 1856 
he at last began writing on an exhaustive scale and 
completed some ten or eleven chapters in the next 
year and a half. He thought to call the work “Natu- 
ral Selection.” Then on June 18, 1858, Darwin had 
the shock of receiving from Alfred Russel Wallace, 
a young naturalist who had been investigating the 
flora and fauna of the Malayan jungle, the manu- 
script of an evolutionary theory identical with his 
own although based on much less extensive mate- 
rial, together with a request to send it along to 
Lyell if it should seem worthy. This coincidence 
precipitated Darwin’s decision to come before the 
public. On the advice of Lyell and Hooker he pre- 
pared a brief summary of his views to be published 
jointly with Wallace’s paper. Thereupon he settled 
down to compose an account that should be full 
enough to do justice to his materials although re- 
duced in scale from what he had initially intended. 
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It took eight months to write the Origin of Species, 
which appeared in November 1859. 

The circumstances of the publication of the Ori- 
gin belong, of course, to Darwin's personal history, 
not to an account of his science. Suffice it to say 
that his behavior throughout was entirely honor- 
able. It offers no grounds for the imputation re- 
peated by detractors that he slighted Wallace. (For 
a general discussion of multiple discoveries in sci- 
ence, see Merton 1961.) 

“Origin of Species.” The full title summarizes 
the argument: On the Origin of Species by Means 
of Natural Selection, or the Preservation of Favoured 
Races in the Struggle for Life. Addressing himself 
to the general public, Darwin began with the most 
familiar matter, the great variations evident among 
domesticated animals, He thereby called attention 
to the variability which exists within what are com- 
monly taken to be fixed species and observed that 
in the breeder's hands they are malleable: “The key 
is man’s power of accumulative selection: nature 
gives successive variations; man adds them up in 
certain directions useful to him” ({1859] 1964, p. 
105). The second chapter extends consideration of 
the ambiguity of species into nature, and the third 
attributes the function of selection to the struggle 
for existence. “Owing to this struggle, variations, 
however slight, and from whatever cause proceed- 
ing, if they be in any degree profitable to the indi- 
viduals of a species, in their infinitely complex 
relations to other organic beings and to their physi- 
cal conditions of life, will tend to the preservation 
of such individuals, and will generally be inherited 
by the offspring” (p. 145), Within the species the 
proportion of favored individuals will increase in 
successive generations, since by definition these 
individuals have characteristics with survival value 
that enable their possessors, merely by living longer 
than less favored competitors, to produce more 
descendants similarly equipped. Thus will species 
alter as time goes on. 

The middle chapters of the Origin discuss the 
laws of variation. Darwin’s masterful control over 
detail is particularly evident here. There are acute 
observations on the effects of use and disuse of 
organs, on the influence of climate, and on the 
greater variability of species than of genera, the 
last offering a consideration peculiarly difficult to 
comprehend on any basis other than his, Darwin 
expressly excluded the problem of cause, Given the 
evidence that variations have in fact occurred and 
are heritable, the theory of natural selection ex- 
plains the course of variation or change. Indeed, 
the success of the theory turned on dropping the 
question of cause, not as uninteresting, but as cur- 


rently unanswerable, although not necessarily for- 
ever so. 

In the later chapters Darwin anticipated diffi- 
culties and objections. He dealt with instinct, which 
hardly seems a bodily matter. He acknowledged the 
problem of extreme specialization and discussed 
the spectacular beauty of the male in certain spe- 
cies of birds as an extravagance that seems to sur- 
pass the humdrum effects of selection, He ascribed 
the absence of transitional forms to the imperfec- 
tion of the geological record. He treated sexual 
preferences and choices as a special case of natural 
selection rather than as an auxiliary or extraneous 
factor, In summary let it be said that contemporary 
evolutionists bear him out on all essential matters. 
The degree of confirmation of his reasoning and its 
fidelity to vast reaches of fact is extraordinary when 
it is also considered that he was necessarily un- 
aware of the twentieth-century discoveries about 
the fine structure and workings of heredity. 

Publication of the Origin was the great event, 
indeed the raison d'être, of Darwin's career. His 
single-mindedness invested the book with a kind of 
power—a power deriving from the scope as well as 
from the depth of his thought. Iconoclastic critics 
have sometimes described the book as dull, Nor 
does it invite stylistic appreciation. It will repay 
structural analysis, however, and this is the way 
to understand Darwin's merit as a theorist. For this 
is a work of theory, even though any given sentence 
expresses only fact or deduction. The art lies in 
the deployment, not in the expression, and it 
creates its effect in the way a mosaic does rather 
than a painting. Each element is a mere bit of 
Stone, every sentence a mere statement; yet 4 
design governs the arrangement and the disposi- 
tion. There are not many books in just this genre. 
Burckhardt's Civilization of the Renaissance in 
Italy, for example, and perhaps Marx’s Capital 
(setting aside the disgust which often overlays 
statements in Marx) are also works in the mid- 
nineteenth-century mode of factual argumentation 
which, in the literature of science, Darwin repre- 
sents at its most powerful. 

Darwin did not abandon further science in order 
to defend his views in the discussion that followed 
the publication of the Origin, although the pattern 
of the remaining half of his career was inevitably 
determined by the response to his book. He fol- 
lowed criticism closely and modified later editions 
of the Origin in numerous respects which either 
met or accepted objections without abandoning the 
position. Not all the revisions proved fortunate. 
Least felicitous was one he introduced into the 
subtitle as well as into the text. At Wallace's urging 


he substituted Herbert Spencer’s formula, “survival 
of the fittest,” for his own phrase, “preservation of 
favoured races” to describe what transpires in the 
struggle for life in consequence of natural selec- 
tion. Both the tautology—i.e., survival in a struggle 
for life—and the ambiguity of the superlative 
“fittest” confused his proper meaning and obscured 
the point he had originally emphasized: that the 
essential feature in the operation of natural selec- 
tion is not mere survival for its own sake. It is the 
differential reproductivity of individuals favored 
adaptively by chance variation, which effect entails 
an increasing proportion of similarly constituted 
offspring in a population. 

Darwin's later work. Thenceforth, Darwin was 
more effective in his other writings, both in sub- 
stantiation of the theory of natural selection and 
in affirmation of his remarkable virtuosity in re- 
search, He chose problems which combined an 
appeal to his taste with relevance to evolution. 
Three major books develop central evolutionary 
themes more fully than does the Origin. The Varia- 
tion of Plants and Animals Under Domestication 
(1868) tells in full zoological and archeological 
detail how man has bent to his uses very variable 
breeds—dog and horse, for example, sheep and 
goat, peacock and guinea fowl—and attributes the 
contrasting stability of the goose to its never having 
been taken in as a pet or servant and of the donkey 
to its having served only the classes of society 
whose members cannot afford experiments [see 
DOMESTICATION, article on ANIMAL DOMESTICA- 
Tion]. By way of excursion, at the end of his book 
Darwin advanced the hypothesis of pangenesis to 
account for the phenomena of heredity. Here he 
addressed himself to the complaint that he had not 
explained how variations are transmitted. The spec- 
ulation with which he met the criticism has only 
increased his vulnerability before later critics, for 
in imagining that the basis of heredity might be 
generalized throughout the organism (“gemmules” 
in each structure and organ governing the forma- 
tion of its counterpart in the offspring ), Darwin 
appeared to have conceded inheritance of acquired 
characters taken in a Lamarckian sense. It is to be 
appreciated, however, that Darwin suggested this 
speculation as a parenthetical aside. 

In The Descent of Man (1871) Darwin directed 
his theory explicitly to the single species Homo 
sapiens. Much of the book is devoted to developing 
the evolutionary significance of sexual selection— 
i.e., the preferential choice of reproductive partner. 
Apart from that, the argument follows two main 
lines. The first is physical. The immediate evolu- 
tionary forebears of mankind are unknown. Darwin 
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never represented man as deriving from apes. He 
did establish that man must in all probability be 
descended from species that are classified among 
primates, and further, that man and the higher 
apes resemble each other anatomically more closely 
than the latter resemble the lower primates. 

The second line of argument is behavioristic. 
Intellectual and social faculties are themselves 
adaptive and in their variations make for the 
greater or lesser survival of the creatures that pos- 
sess them. Other species besides man subsist with 
the aid of rudimentary or developed forms of social 
organization and communication, and Darwin 
could see only differences of degree between these 
and the characteristics of human community and 
moral awareness. 

The last of Darwin’s expressly evolutionary 
treatises carries behavioristic comparisons even 
further. Probably The Expression of the Emotions 
in Man and Animals (1872) is the most dated of 
Darwin's writings. Nevertheless, in comparing phe- 
nomena like the physical manifestation of hostility 
in dog and master—the similarity of the snarling 
jaw to the drawn lip—and many other states of 
temper, Darwin did carry biology into treatment of 
faculties traditionally reserved for moral studies. 
Reciprocally, he introduced the study of animal 
behavior into psychology [see SOCIAL BEHAVIOR, 
ANIMAL]. 

In the last decade of his life Darwin put in hand 
a train of experimental researches mainly botanical 
in character and published a final series of mono- 
graphs exhibiting a great variety of novel observa- 
tions and phenomena. He had already brought out 
a study of fertilization in orchids—in 1862; in 
1877 he issued a second edition, much enlarged in 
detail, In the same year he published a very acute 
work on hermaphroditic reproduction in trees and 
plants. Two years previously, in 1875, he had is- 
sued two quite new studies, one on climbing plants 
(first published in journal form in 1865) and one 
on insectivorous plants. In 1880 he generalized the 
former in a book about the power of movement in 
plants as it resides in root, leaf, stem, and flower. 
In 1881 he concluded his lifework with an account 
of the process by which earthworms contribute to 
the formation of humus: the originality of the sub- 
ject is characteristic of his acumen, while its choice 
reflects the modesty of his temperament. 

To appreciate the full range of Darwin’s accom- 
plishment as a scientist one must read in these 
experimental writings as well as in the Origin, The 
comparison to Newton is often adduced. It is justi- 
fied. Notable scientists normally excel either in 
theoretical insight or in experimental virtuosity. 
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Newton and Darwin were capable as very few have 
ever been, and perhaps none in equal measure, of 
combining those veins and thereby achieving the 
foremost stature in all aspects that distinguish sci- 
ence. It is true that, given the state of the science, 
comprehensiveness had to do the work in theory for 
Darwin that mathematical power did for Newton, 
which is only to say that the one was suited by his 
abilities to accomplish for biology what the other 
had done for physics, and that both of them made 
the most of those abilities, 

Professional response to the “Origin.” The the- 
ory of natural selection polarized opinion among 
biologists more radically than is the common effect 
of scientific theories. Those who adopted it tended 
to do so straightaway in something like a conver- 
sion, “How extremely stupid not to have thought of 
that,” was T. H. Huxley’s often quoted reaction as 
the pieces suddenly fell into place on reading Dar- 
win. Yet even the professional response was influ- 
enced by philosophical taste and cultural tradition. 
Then, as now, the English-speaking scientific world 
was empirically minded after the manner of Bacon, 
Locke, and Mill. In Darwin’s own milieu, what im- 
mediately persuaded was his resolution of the ques- 
tion of biological adaptation, Not that he solved 
the problem. He did better. He dispelled it, by turn- 
ing adaptation from final cause into present effect. 
Instead of a wonder so intricate as to bespeak Prov- 
idence or purpose, adaptation became merely the 
outcome of the whole life history of the species in 
its relation to circumstance and environment, a 
phenomenon or set of phenomena to be analyzed 
and explained in evolutionary terms, In Huxley’s 
hands, therefore, Darwin became the protagonist 
of the drama of scientific fact undoing theological 
prejudice. 

In Germany, where Darwin’s work enjoyed a 
success if anything more influential than in Eng- 
land and the United States, what appealed was less 
its empiricism than its historicism. The sense of 
history as indwelling process, whether in the suc- 
cessive versions of Herder, Hegel, or Marx, had in 
the course of the romantic movement of thought 
become deeply engrained in the German sense of 
the world. Philosophically speaking, such histori- 
ography treats the past in a mode exactly counter 
to Darwin’s, the one being a realist and the other 
a nominalist approach to nature. Oblivious to these 
distinctions, German historicists misread Darwin 
and took him for their own. It is well known that 
Marx spoke of dedicating the second book of Capital 
to Darwin, an intention which makes a curious con- 
trast to the Lamarckian position of some subse- 
quent Marxists. The professional champion and 


self-appointed spokesman, however, was Ernst 
Haeckel, who tried to make over all biology into the 
study of evolution. He governed his embryology, 
for example, by the doctrine that the development 
of every individual in the womb recapitulates the 
evolution of the species, Not only does evolution 
govern all biology, but all truth too: Haeckel’s 
Riddle of the Universe is the cardinal document of 
the movement to make something like a religion 
out of evolutionary thought itself. 

Only in France was Darwin's work initially 
greeted by indifference (except among anthropol- 
ogists who did take it up with very great benefit to 
their science). Not so the biologists. Among them 
the writ of Cuvier still ran, There was also a prob- 
lem of language. In French, the word évolution 
refers to the development of an individual; trans- 
formisme connotes the doctrine as affecting species 
in general, and at that time involved visionary 
connotations. A self-defeating semantic objection 
also supervened—to explain the origin of species 
in a book which questioned their reality seemed an 
inadmissible mode of discourse, The greatest of 
physiologists, Claude Bernard, never even seized 
the differences between the Darwinian theory and 
its speculative predecessors. “We must doubtless 
admire,” he wrote disdainfully, “those great hori- 
zons dimly seen by the genius of a Goethe, an Oken, 
a Carus, a Geoffroy Saint-Hilaire, a Darwin, in 
which a general conception shows us all living 
beings as the expression of types ceaselessly trans- 
formed in the evolution of organisms and species, 
—types in which every living being individually 
disappears like a reflection of the whole to which 
it belongs” ([1865] 1957, pp. 91-92). In general, 
Darwin expressed himself pragmatically in a man- 
ner characteristically English and characteristically 
unconvincing, not to say irritating, to the French. 

The nonprofessional response. Darwin himself 
took no part in the social Darwinist movement of 
ideas and gave it no countenance [see SOCIAL 
DARWINISM]. Something must here be said, never- 
theless, about the pattern of nonprofessional Te- 
sponse, insofar as it has been hostile, for these 
attitudes greatly amplified for a time certain scl 
entific difficulties left unresolved by Darwin, and 
they have also contributed to persistent misunder- 
standings of the status of the theory in science— 
misunderstandings arising from repugnance to- 
ward its political and social derivations. ; 

The pattern has two distinct aspects, theological 
and moralistic, which turn on different implica- 
tions of the theory of evolution. The former, the 
theological issue between the Biblical story of the 
Creation and the Darwinian account of the origin 


of species, is the more famous. It is also the more 
literal and has proved the more ephemeral, con- 
cerned as it is with the wide historical fact of 
evolution rather than with the theory of natural 
selection, by which Darwin explained and thereby 
established the fact. The science of geology had, 
indeed, already faced theology with the necessity 
of regarding the scriptural account of earth history 
in an allegorical fashion, and the confrontation 
between Darwinists and religious believers settled 
rather than initiated that matter among informed 
persons. The Origin of Species did cause additional 
shock and distress, of course, chiefly in regard to 
the question of whether the reality of a spiritual 
component of human existence requires that man 
be biologically unique and specially created. This 
is a dilemma which may be resolved in one of two 
ways: either by considering the whole universe as 
contingent and composed of physical and spiritual 
aspects; or, in a more restricted way, by appreci- 
ating that the theory of evolution treats man as an 
evolved animal, but it does not and in principle 
cannot assert that he is nothing but an animal, At 
all events, within a relatively short space of time 
theologians learned to contemplate evolution with- 
out dismay, as clearly a matter of fact; significant 
religious minds have not felt impelled to gird 
against the facts, although fundamentalist rejec- 
tion of them persists in diminishing backwaters. 

What has persisted in recurrent and often seri- 
ous manifestations is a second aspect of resistance 
to Darwin’s work. This kind of resistance is serious 
because it obtains among educated persons and 
reveals the occasional inability of highly intelligent 
people to respect the force and limitations of sci- 
entific demonstration, In this tradition the tenor 
has been to accept and even celebrate the phenom- 
enon of evolution while denigrating the theory of 
natural selection, Sometimes this denigration has 
taken the form of an attack on Darwin’s character 
and probity, as in the instances of Barzun (1941) 
and Himmelfarb (1959). Sometimes it has taken 
the form of a revival, more or less garbled, of the 
views of Lamarck, as in the writings of Samuel 
Butler and George Bernard Shaw. Sometimes it has 
taken the form of proposing an alternative formula, 
metaphorical in character, for the purpose of re- 
plenishing evolution with intelligence, will, or 
choice, Such were the notions of “creative evolution,” 
the élan vital of Bergson, or entelechy of Driesch. 
Sometimes it has taken on ideological and political 
dress, as in the refusal of Lysenko's adherents to 
admit the irrelevance to heredity of manipulation 
of the physical and social environment. As, indeed, 
in any complex of opinions, some proportion of 
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these several elements, some mixture of these 
motivations, may be discerned in all types of edu- 
cated refusal to be persuaded by the theory of 
natural selection. 

What unifies these hostile views into a tradition 
of romantic resistance is the common complaint 
that the theory of natural selection deprives the 
history of nature of any sanctions for man as a 
social or moral being. These critics hold a different 
conception of the function of a scientific explana- 
tion. They want more out of nature than Darwin, 
or science generally, has found there. They want 
reasons, and Darwin gave them circumstance. They 
want causes, and he gave them accidental varia- 
tion. They want purpose and direction, and he gave 
them an indefinite sequence of indifferent results. 
They want a science which seizes on unity and 
deep process, and he fragmented nature into a 
congeries of discrete events connected only by prox- 
imity and happenstance. What has proved pro- 
found, therefore, is not the incompatibility between 
evolutionary science and theology, considered in- 
tellectually at least. Much deeper and more per- 
sistent, partly because of common commitment to 
the natural with different understandings of what 
it signifies, has been the incompatibility between 
the Darwinian theory and any form of naturalistic 
social or moral philosophy. For the moralist knows 
what sort of nature he requires of science. And the 
theory of natural selection has succeeded precisely 
in consequence of limiting its purview to the indif- 
ferent event. While the present state of scientific 
understanding is not the touchstone by which to 
judge theories of the past, it is important to em- 
phasize, nevertheless, that the theory of natural 
selection is part of working evolutionary thought. 
To refuse to admit its force is not only to question 
the actual state of scientific knowledge, but it is 
also to impugn the current mode of scientific 
explanation. 

CHARLES C. GILLISPIE 


[Directly related are the entries EVOLUTION; GENETICS; 
Race. Other relevant material may be found in the 
biographies of CooLEY; GALTON; GIDDINGS; Lus- 
BOCK; MORGAN, Lewis HENRY; SPENCER; TYLOR.] 
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DAVENANT, CHARLES 


Charles Davenant (1656-1714) was one of the 
leading economic and political pamphleteers in 
the England of William mi and Queen Anne. His 
writings have a literary distinction far above that 
of the average pamphlet of the time, and they 
show considerable skill in the use of the new tech- 
niques of “political arithmetic.” These qualities, 
combined with the air of judicious detachment that 
the author often succeeded in imparting, have 
misled many commentators into considering Dave- 
nant an independent and objective thinker. His 
pamphlets, however, are essentially occasional 
pieces, concerned with advancing a particular 
partisan cause and indeed frequently concerned 
with advancing the author's own personal interests. 
Nevertheless, they are interesting examples of early 
economic analysis and were influential in promot- 
ing the increased use of quantitative argument on 
political and economic questions. 

From 1678 to 1689 Davenant served as a come 
missioner of excise, but when a new commission 
was named after the accession of William m1, he 
was omitted. This must have been a serious finan- 
cial blow to him, and for the next five years he 
applied, without success, for reinstatement in the 
revenue service. His early writings represent a neW 


approach to the same end. His first pamphlet, pub- 
lished in 1695, An Essay Upon Ways and Means 
of Supplying the War (1771, vol. 1, pp. 1-81), a 
number of unpublished “memorials” of that and 
the following year, and his Discourses on the Pub- 
lic Revenues, and on the Trade of England, Part I 
of 1698 (1771, vol. 1, pp. 125-302) can be re- 
garded as an attempt to bring about an alteration 
in policy likely to redound to his own benefit, These 
works express disapproval of the financial meas- 
ures of the Whig ministry and in particular of its 
use of long-term loans, which Davenant argued 
would prove detrimental to the economy. His 
counterproposal was an increase in excise taxation 
—the branch in which he had had experience. 

But he had also a political objection to the loans, 
for they brought profits to the moneyed men, whom 
Davenant identified with the Whigs, and the great- 
est burden of taxation fell on the landed gentry, 
who were mainly Tories and whose interests Dave- 
nant came to espouse. This political side became 
more prominent, and at the end of An Essay Upon 
the Probable Methods of Making a People Gainers 
in the Balance of Trade (1771, vol. 2, pp. 163- 
382), published in 1699, Davenant gave up pro- 
posing changes in policy and launched an out-and- 
out attack on the ministry and all its works. He 
was presumably convinced that his chances of 
advancement were better with the Tory opposition, 
and he went on to write several purely political 
pamphlets, the most celebrated being The True 
Picture of a Modern Whig (1771, vol. 4, pp. 125- 
180), published in 1701. These established him as 
the leading Tory pamphleteer. He was also identi- 
fied with the extreme Tories in Parliament, where 
he sat from 1698 to 1701. 

When his friends came to power on the acces- 
sion of Anne in 1702, he was rewarded with the 
customs post of inspector-general of exports and 
imports, which he held until his death in 1714. 
This was not, however, enough to keep him out 
of financial difficulties, and he lived his last years 
largely on the charity of friends. His next work, 
in 1704, Essays Upon Peace at Home and War 
Abroad (1771, vol. 4, pp. 267-439; vol. 5, pp. 1- 
69), supported the new ministerial policy of “mod- 
eration” and infuriated his former extremist asso- 
ciates. When they returned to power in 1710, 
Davenant had to conciliate them with two partisan 
dialogues in his old style and with two Reports to 
the . . . Commissioners for . . . Public Accounts 
(1771, vol. 5, pp. 345-463) on the trade with 
France and Holland, which, although in the guise 
of impartial statistical studies, constituted in fact 
an economic argument in justification of the minis- 
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terial policy of deserting the Dutch and other allies 
and making a separate peace with France. 

Davenant's earlier writings on foreign trade are 
less political, but they reveal the same fears of the 
threat of Dutch rivalry to England’s trade. In An 
Essay on the East India Trade (1771, vol. 1, 
pp. 83-123) published in 1696, he argued that 
English textiles need not be protected against 
Indian goods. Some passages in this work have 
given rise to the myth that Davenant was an early 
free trader. His whole argument, however, is con- 
ceived in terms of the balance of trade, and he 
takes the line, as did earlier writers, such as 
Thomas Mun, that even if the “particular” balance 
with India is adverse, this is more than compen- 
sated for through the interdependence of different 
branches of international trade and that it is the 
country’s “general” balance of trade alone that 
matters. Two years later, in 1698, he elaborated 
this argument and applied it also to the “losing” 
trade with France in Discourses on the Public 
Revenues, and on the Trade of England, Part II 
(1771, vol. 1, pp. 343-459; vol. 2, pp. 1-162). He 
obtained some reward for these services from the 
East India Company and may also have been re- 
munerated for his strong advocacy of the case of the 
Royal African Company in Reflections Upon the 
Constitution and Management of the Trade to Africa 
(1771, vol. 5, pp. 71-343), published in 1709. 

Although Davenant’s personal career was not 
very successful, although he was on the losing 
side in almost every controversy he joined, and 
although his enemies had some grounds for re- 
garding him as a self-seeking, mercenary time- 
server, his writings have a place in the history of 
economic thought and analysis; they set out ten- 
able points of view held by significant bodies of 
minority opinion and often suggest a very genuine 
concern for the welfare of his country. 


D. A. G. WADDELL 
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and Balliére 1913 is a bald summary of his economic 
writings. 
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DAVENPORT, HERBERT J. 


Herbert Joseph Davenport, American economist, 
was born in Wilmington, Vermont, in 1861 and 
died in New York City in 1931. His father, a lead- 
ing lawyer, had been a Democrat in Republican 
Vermont, and the father and son shared a good 
measure of rugged individualism. Davenport's fore- 
bears on both sides were members of old New 
England families that included the founders of 
New Haven, Connecticut, and Salem, Massachu- 
setts. 

While in his twenties Davenport went to South 
Dakota, invested his parental legacy in real estate, 
and made and lost a fortune. He studied succes- 
sively at the University of South Dakota, Harvard 
Law School, the University of Leipzig, the Ecole 
des Sciences Politiques at Paris, and eventually at 
the University of Chicago, where he obtained his 
doctorate in economics in 1898. After serving as 
a high school principal in the Midwest, he taught 
at the University of Chicago from 1902 to 1908, 
then at the University of Missouri, where he was 
head of the department of economics and from 
1914 to 1916 dean of the School of Commerce, 
and finally, from 1916 to 1929, at Cornell 
University. 

Davenport was a man of strong personal loyal- 
ties that correspond to the complex nature of his 
position in American economic thought. When he 
published Value and Distribution in 1908, he dedi- 
cated it to J. Laurence Laughlin, soundest of econ- 
omists, who had served as the first chairman of 
the department of economics at the newly founded 
University of Chicago. When, a dozen or so years 
later, Davenport was under consideration for the 
presidency of the American Economic Association, 
his loyalty to Laughlin was strong enough to cause 
him misgivings about accepting the post, because 


Laughlin had never held this high office, No 
less durable was the attachment that connected 
Davenport with the unorthodox and scintillating 
Thorstein Veblen, who was first his teacher and 
later his colleague at Chicago. Veblen considered 
Davenport one of his outstanding pupils, second 
only to Wesley Clair Mitchell. Davenport, in turn, 
stood by Veblen on many occasions when Veblen 
was in difficulties, securing employment for him 
at the University of Missouri in 1910, taking him 
into his home, and in later years more than once 
trying to find a place for him at Cornell. 

Davenport claimed, half in jest, that he was 
drawn to the study of economics because he was 
sure that something was wrong with socialism and 
he wanted to find out what it was. As a young 
man at Chicago, he had sought out Veblen after 
class, eager to hear his frank opinion on this sub- 
ject, but the great skeptic never gave him a direct 
answer. Again, in 1920, in his presidential address 
(1921), Davenport touched on the subject of so- 
cialism, saying that he was not a socialist because 
he was unable to predict what the socialist ideal 
would concretely turn out to be, He did not doubt, 
however, that political democracy was as readily 
possible in a collective as in a competitive eco- 
nomic order. 

In his own way Davenport was as bitter a critic 
of business civilization as his master Veblen. He 
was convinced of the relative character of most 
economic doctrines, but his bent was not an insti- 
tutional one. Rather than tracing changes in insti- 
tutions, he set out to develop a theory that was 
to shed light on the economic environment in 
which he found himself, This theory was to be 
purged of the moralizing and psychologizing tend- 
encies characteristic of the thought of Alfred 
Marshall and of the Austrians. It was to have no 
place for such concepts as utility, real costs, and 
the “social organism.” Instead, it was to be, more 
narrowly, “the science which treats phenomena 
from the standpoint of price” (1913, p. 25). On 
this basis, Davenport developed his economic 
theory in a “sound,” although not entirely conven- 
tional, manner, with such occasional shockers as 
the inclusion of abortionists and prostitutes among 
the ranks of producers and of burglars’ jimmies 
among wealth. At the end of this work, he stated 
the need for a new economics that is not “a system 
of apologetics, the creed of the reactionary, a de- 
fense of privilege, a social soothing sirup, a smug 
pronouncement of the righteousness of whatever 
is” (1913, p. 528). Economists, he claimed, have 
wrongly identified private gain with social welfare. 
There are echoes of John A. Hobson here as well 
as of Thorstein Veblen. It was no wonder that 


Davenport advised his students to take Veblen’s 
course, to which he considered his own work a 
prelude. Although Davenport's views did not pre- 
vent his election as president of the American 
Economic Association in 1920, they scandalized 
not a few of his colleagues. 

Davenport is remembered best as an early expo- 
nent of the concept of opportunity cost, which 
defines cost in terms of opportunities foregone. 
Underlying his system is the notion of a general 
interdependence of all prices, but this idea was not 
connected with the general-equilibrium approach 
of the mathematical school of Lausanne. Daven- 
port's critical view of utility, which, of course, he 
shared with others, prepared the ground for the 
acceptance of the indifference-curve approach, a 
theory developed by mathematical economists. In 
still other ways he was ahead of his time—for ex- 
ample, when he considered the problem of unem- 
ployment as “the most important practically and 
perhaps the most difficult theoretically” (1896, 
p. 355), when he developed what was then one 
of the best explanations of the multiple expansion 
of credit through the banking system (1913, chap- 
ter 17, pp. 260-266), or when he related the inter- 
est rate to the operations of the commercial banks 
rather than to psychological factors (1913, chapter 
8, pp. 349 ff.). 

Henry W. SPIEGEL 


[For the historical context of Davenport's work, see 
the biographies of Hosson and VEBLEN.] 
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DAVIDSON, DAVID 


The Swedish economist David Davidson (1854— 
1942) established his reputation with the publica- 
tion of his dissertation, Bidrag till läran om de 
ekonomiska lagarna för kapitalbildningen (“Con- 
tributions to the Doctrine of the Laws of Capital 
Formation” 1878). Written at the completion of 
his law studies at the University of Uppsala (1871— 
1877), the book made him, with Gustav Cassel and 
Knut Wicksell, one of the founders of neoclassical 
economics in Sweden. 

Although the main part of the book is devoted 
to an analysis of the concept of capital along 
classical lines, the second part is an attempt to 
base an economic theory upon the structure of 
wants. Thus the main part of the book continues 
the tradition of Turgot, Adam Smith, F. B. W. Her- 
mann, and others, whereas the second part shows 
the strong influence of Carl Menger. Davidson 
accepted Menger’s theory of subjective value, a doc- 
trine implying that wants are measurable, that the 
wants of individual consumers are comparable, and 
that economic activity represents a striving to max- 
imize utility (“the value of things”). There is also 
a trace of marginalistic reasoning in the book. 

Davidson’s discussion of capital formation em- 
phasizes the factors that determine saving, whereas 
investment is given little attention. Among the 
factors that influence saving, he mentioned provi- 
sion for one’s family, provision for old age, the size 
of one’s income, and the efficiency of credit institu- 
tions. He did not treat the rate of interest. His 
views on capital have certain similarities with 
those of Eugen von Böhm-Bawerk, who was also a 
student of Menger’s. 

Davidson's next book, Bidrag till jordräntete- 
oriens historia (“Contributions to the Theory of 
Rent” 1880), attempts to show, among other things, 
that the classical form of the doctrine of rent was 
developed before Ricardo (for example, by West). 
This book was responsible for stimulating David- 
son’s strong interest in Ricardo, and his theoretical 
work after this point was, to a large extent, in- 
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volved with analyses of Ricardo. Davidson also 
published works on monetary and fiscal theory 
(1886; 1889). 

In 1899 he made his major contribution to the 
development of economics with the founding of 
the Ekonomisk tidskrift, Sweden's first scientific 
economic journal. He served as editor of the jour- 
nal until 1939, and from 1900 on published almost 
all his own writings in it. Davidson was often in- 
volved in controversies with Cassel and Wicksell; 
Wicksell acknowledged the extent to which his own 
ideas were developed as a result of sharpening his 
wits against Davidson's. 

Davidson's writing was characterized by its ana- 
lytical rigor. An example of the kind of analytical 
work identified with him is his concept of “stable 
money value.” He saw stable money as something 
that could be achieved by decreasing the price 
level during times of increasing productivity. Later, 
he also developed the idea of an “absolute money 
value,” but this does not seem to have been a 
fruitful one. 

Davidson was appointed professor of economics 
and public finance at the University of Uppsala in 
1890. One of his students was Per Jacobsson, later 
president of the International Monetary Fund. 


KARL Gustav LANDGREN 


[For the historical context of Davidson's work, see the 
biographies of CAssEL; MENGER; RICARDO; SMITH, 
ADAM; TURGOT; WEST; WICKSELL.] 
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Georges Ambroise Davy, French sociologist, was 
born in Bernay (Eure) in 1883. He entered the 
Ecole Normale Supérieure in 1905 and became 


agrégé de philosophie in 1908 and docteur és lettres 
in 1922. After teaching at colléges in Nice and 
Lyons, Davy joined the University of Dijon in a 
teaching capacity in 1919 and in 1931 entered 
university administration as rector of the university 
division of Rennes. In 1938 he became general 
inspector of public instruction in France. He re- 
turned to teaching in 1944, occupying the chair 
of sociology at the Sorbonne until 1955 

Davy may be considered a member of the Durk- 
heim school. His early book reviews in L'année 
sociologique (from 1910 on), particularly the re- 
views of books in the fields of the sociology of 
morals and of law, discussed the judicial theories 
of the time in the light of Durkheim's ideas. In 
1922 he published Le droit, l'idéalisme et l'expéri- 
ence (1922a), a collection of essays on Raymond 
Saleilles, Maurice Hauriou, Léon Duguit, Frangois 
Gény, and Emmanuel Lévy, asserting that the 
postulate of the reality of a collective mind permits 
the sociological explanation of the idea-content of 
the law. 

In the debate between sociologists and psychol- 
ogists that took place in France after Durkheim's 
death, Davy's position was a dogmatically “socio- 
logical” one: His article “La sociologie” (1924) 
presented Durkheim's approach without even the 
occasional restrictions imposed by Durkheim him- 
self. Yet Davy later modified his position. In an 
inaugural address at the First World Congress of the 
International Sociological Association, he stressed 
that psychology must also play a part in the ex- 
planation of social facts, and he asserted that the 
way the conscious individual determines his be- 
havior must be related to the way social conditions 
determine his behavior (1950). : 

Davys most significant work, La foi jurée 
(1922b), was an attempt to reconstruct the forma- 
tion of contractual law. Davy argued that there is 
a continual change from statute to contract and 
that even before a contractual relationship comes 
into existence, its function is performed by the 
adaptation of relationships established by statute. 
Specifically contractual relationships appear only 
when certain social transformations have taken 
place: a transformation of totemism that turns 
names and blazons into objects of exchange intro- 
duces a contractual element into social status; 
a change in descent patterns conduces to male 
heredity and supremacy and the emergence of a 
kind of male individualism characteristic of feudal- 
ism. Davy’s analysis, based largely on the accounts 
of the potlatch that became known in France 
through the work of Marcel Mauss, interprets 
Kwakiutl society as being in a state of transition 


and revealing the transformations requisite to the 
development of contractual law. 


JOSEF GUGLER 


[Other relevant material may be found in the biogra- 
phies of DURKHEIM; MAINE; Mauss.] 
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DEATH 


1. DEATH AND BEREAVEMENT John W. Riley, Jr. 


1. THE SOCIAL ORGANIZATION 


oF DEATH Robert W. Habenstein 


I 
DEATH AND BEREAVEMENT 


Death is a personal event that man cannot de- 
scribe for himself. As far back as we can tell, man 
has been both intrigued by death and fearful of it; 
he has been motivated to seek answers to the mys- 
tery and to seek solutions to his anxiety. Every 
known culture has provided some answer to the 
meaning of death; for death, like birth or marriage, 
is universally regarded as a socially significant 
event, set off by ritual and supported by institu- 
tions. It is the final rite de passage. 

The social and psychological aspects of death 
have been studied by anthropologists, sociologists, 
psychologists, and psychiatrists; and the main out- 
lines of their understandings can be summarized 
on three levels—cultural, social, and individual. 
The meanings which have been attached to death 
in most cultures include beliefs in some kind of 
existence after death; most peoples—save the non- 
literate—have entertained theories of personal sal- 
vation; and religion, philosophy, and political ideol- 
ogy have provided some answers to man’s quest for 
the meaning of death. The relationship between 
death and the social structure has received little 
systematic attention from social scientists, al- 
though there is much research on the social pre- 
scriptions for bereavement, especially as these 
relate to ritualistic mourning and individual grief. 
Scattered empirical studies suggest that, for the 
individual in the contemporary Western world, 
matters of death are less salient than those of 
living, although there are clear traces of a latent 
and underlying ambivalence. 

Although between fifty and sixty million people 
die each year, growing proportions of people in the 
world live into the later years. Thus, many people 
have the opportunity to contemplate their death, 
and unknown but even greater numbers of per- 
sons are affected by bereavement. For a phenome- 
non of such wide and pervasive significance, it is 
curious that the most recent systematic bibli- 
ography on the subject of Western social science 
literature on death and bereavement (Kalish 1965) 
does not exceed four hundred entries—many of 
them recent. In our time death has been largely a 
taboo topic (e.g., see Feifel 1959; Fulton 1965). 
But attention is now being directed to various 
social problems involving man’s relationship to 
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death: the problem of death and bereavement for 
the aged, dilemmas faced by the practitioners who 
deal with death, risk taking by both nations and 
individuals, and the social and moral implications 
of scientific advance in the control of death. 


Death and culture 


Death raises two kinds of problems that require 
cultural definitions and norms: those pertaining 
to one’s own death, and those pertaining to the ob- 
ligations imposed upon others by the fact of a 
death. In no known culture is the individual left to 
face death completely uninitiated. He is provided 
with beliefs about “the dead” and about his own 
probable fate after death. Similarly, all these cul- 
tures include norms governing the imperatives 
imposed by death: a corpse must be looked after; 
the deceased must be placed in a new status; his 
vacated roles must be filled and his property dis- 
posed of; the solidarity of his group must be re- 
affirmed; and his bereaved must be re-established 
and comforted (Blauner 1966). 

Death in nonliterate society. Systematic analy- 
sis of the records on nonliterate peoples shows vari- 
ous recurrent components in their belief systems 
(Simmons 1945). Belief in a spirit world inhabited 
by the dead is practically universal among them. 
There is no clear theory of “natural” death; they 
believe that death results from the intervention of 
an outside agent. The culture typically includes a 
conception, implicit or explicit, of a relationship 
between the living and the dead. Death is viewed 
as a crisis through which the deceased enters upon 
a new status, Symbols of power, either malevolent 
or benevolent, are attached to the dead. Among the 
Navajo, for example, actions of the spirits of the 
dead are generally perceived as being hostile to- 
ward the living; while among the Tikopia, where 
cohesion and continuity between the two worlds is 
a central theme, the relationship between quick 
and dead is believed to be benign. Belief in per- 
sonal salvation appears rare; and, in contrast to the 
pervasive concerns of civilized man, primitive man 
seems to have developed no eschatology of rewards 
and punishments in the worlds populated by the 
dead (Bellah 1964). 

In respect to bereavement practices, anthropolo- 
gists report great diversity. The actual bereavement 
period may extend, as it does for the Cocopa, over 
a period of years; or, as among the Pueblo, it may 
be but brief and perfunctory. In some cultures, 
bereavement begins with illness (which may be 
tantamount to death); in others, it begins only 
after the disposal of the corpse. In some cultures, 
the bereaved are required to idolize and placate the 


deceased, who is certain “to return”; in others, the 
deceased is held in such great fear that elaborate 
rituals are required to prevent his taking up his 
former role in the community (Krupp & Kligfeld 
1962). There is, however, no satisfactory general 
theory to account for these cultural variations. 

Historical perspectives. Within recorded history, 
answers to problems raised by death are found in 
religion, philosophy, and, to some extent, in politi- 
cal ideology. The major world religions include 
varying beliefs in a relationship between man’s life 
on earth and his ultimate fate after death. Rabbinic 
Judaism developed a detailed theory of a day of 
judgment. According to the teachings of Zoroaster, 
the soul is directed at death to balance its good and 
evil deeds. In the Islamic scheme, Allah is ready 
to prepare a happy place for the true believer. 
Buddhism postulates that nirvana (a final beati- 
tude, oblivion) is attainable through a long suc- 
cession of reincarnations, each mystically related 
to the karma (deeds) of preceding lives. Similarly, 
Hinduism rests upon a complex and philosophical 
relationship between dharma (civic and religious 
piety) and moksa (the attainment of salvation). 
The teachings of both Confucius and Lao-tzu carry 
an implication of salvation in that man must adapt 
both to the expectations of his ancestors and to 
cosmic moral law. The religions of East and West 
differ fundamentally with regard to death and life 
after death on only two main points. First, for the 
East, the route to salvation tends to be either con- 
templative or mystical; for the West, it tends to be 
ascetic and active. Second, the East views the 
ultimate outcome as an undifferentiated and im- 
personal “oneness” with the universe, while the 
West sees it as the continuation of the integrity of 
the personal self. 

In the history of Western religion, the emergent 
Christian conception of salvation gave emphasis to 
an afterlife in which the individual’s identity con- 
tinues essentially intact. The Roman Catholic 
church institutionalized the problem of salvation 
in the relationship of the individual to the priest; 
anxiety about death is reduced as the individual 
experiences sin, repentance, atonement, and re- 
lease. In contrast, the Calvinist concept of predes- 
tination intensified anxiety about death and the 
afterlife, since it regarded man as powerless to 
control his fate. Thus new forms of conduct and 
social organization evolved, as Max Weber (1904— 
1905, pp. 99-128, 155-183 in 1958 edition ) points 
out, to help the early Protestant deal with this 
increased anxiety; and, to ensure his salvation, the 
individual turned to a life that emphasized method- 
ical, rational conduct in work and a disciplined 


family life. In present-day religious thinking, how- 
ever, the clear connection between death and sal- 
vation has become blurred. Schneider and Dorn- 
busch’s (1958) study of popular religion in America 
(an analysis of the inspirational writings by Nor- 
man Vincent Peale, Joshua Loth Liebman, and 
others) shows a predominant stress on salvation in 
this life rather than in the next and small preoccu- 
pation—since man is assumed to be essentially 
good—with spiritual preparation for death. 

The most active periods of philosophical concern 
with death tend to coincide with periods of relative 
inactivity in formal religious institutions (Choron 
1963). In Greece during the fifth and fourth cen- 
turies B.c., for example, death was a central theme 
of philosophical speculation. Plato developed his 
theories of the interlocking relationship of knowl- 
edge and the immortality of the soul—a juxta- 
position of ideas destined to play a dramatic role 
in the history of Christendom. During the second 
and first centuries B.C. in Rome, the problem of 
death again became a major focus for philosophy; 
to the Stoics, for example, preparation for death 
was considered the only proper end of philosophy. 

Although the history of Christian belief in eternal 
life seems to have largely inhibited widespread 
philosophical treatments of the problem of death 
in the West, a notable development took shape 
toward the end of the nineteenth century in the 
form of existentialism. Contemporary existentialist 
theories, often obscure and contradictory, are of 
special interest because of their emphasis upon 
death. Jean-Paul Sartre, in one view, echoes several 
earlier philosophical traditions in his argument that 
the self is finite, that nonbeing follows death, and 
that the immortality of the soul is a fiction. Sartre 
thus ignores and despises “the stranger.” which is 
death. Martin Heidegger, in another view, wants 
to “disarm” death by taking it into the conscious- 
ness, Hence, the individual’s search for the mean- 
ing of existence (Dasein) points to death as the 
ultimate phenomenon of life (Choron 1963). In 
still another existentialist view, the question posed 
by death has been reinterpreted to ask: Can the 
individual cope with the threat of nothingness by 
replacing his belief in personal immortality with a 
belief in social immortality? 

Various ideologies throughout history have in- 
volved such higher principles as patriotism or work 
in seeking answers to the meaning of death. The 
Homeric singers extolled death for warriors, prom- 
ising that they would not be forgotten. The ideology 
of the Greek polis offered the individual a kind of 
immortality if his life was sacrificed for the com- 
mon good—an element in political ideology that 
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has changed little over the centuries. Man has 
always been willing to die for the state; in the ex- 
treme case, even by his own hand ( for example, 
Durkheim’s conception of altruistic suicide). The 
stress on death for the state typically gains cur- 
rency during wartime. War consecrates the mean- 
ing of death (Warner 1959), and the similarity be- 
tween the soldier and the man who perishes “in 
his calling” foreshadows an important element in 
the ideology of the monolithic state. According to 
communist doctrine, the individual can reduce his 
anxiety about death through work and identifica- 
tion with the party. Thus the Russians, like the 
Puritans, have incorporated work into their ide- 
ology as one answer to the threat of death. 


Death and society 


Death and the changing patterns of mortality 
are reflected in the structure of society. With the 
exception of a number of highly significant and 
institutionalized practices—war, infanticide, canni- 
balism, ceremonial human sacrifice, capital punish- 
ment—social institutions have evolved to facilitate 
life and to prevent death. The demographic history 
of man bears out the generalization that he has 
been more interested in death control than in birth 
control. Thus, mortality rates have tended to fall 
faster than fertility rates. Yet, despite an impres- 
sive literature on the means for controlling demo- 
graphic changes, relatively little attention has 
been paid to the larger problems inherent in the 
relationship between death and social structure. 
Two examples will illustrate the range of develop- 
ing theoretical concern with such problems, al- 
though no general theory is yet at hand. 

First, the recent work of Blauner (1966) points 
to the fact that mortality operates on society as a 
variable, not as a constant. According to this the- 
ory, the higher the mortality, the greater the threat 
to the social system, a threat which is reflected both 
in ritualistic mourning practices and in the social 
prescriptions that are activated when deaths occur, 
In high mortality societies, social relationships 
tend to be diffuse and widely dispersed throughout 
the group (everyone knows everyone else), and, 
when someone dies, the entire community mourns 
along with the next of kin and close associates. 
Similarly, in such societies there tend to be prescrip- 
tions that “solve” the social problems created by 
individual deaths. Thus, elaborate kinship rules 
provide new families for orphaned children, just 
as such customs as the levirate and sororate pro- 
vide new spouses for widows and widowers. 

In societies with low mortality rates, however, 
death poses a greater threat to the personality sys- 


22 DEATH: Death and Bereavement 


tem. In the West today, for example, the small 
family tends to socialize its members for interper- 
sonal competence, giving bereavement an especially 
personal significance. Furthermore, since a person’s 
significant others are concentrated among his close 
relatives and friends, bereavement reactions tend 
to be highly varied and individually therapeutic. 
The most striking exceptions to this tendency are 
the highly ritualistic occasions produced by the 
deaths of heads of state and other prominent fig- 
ures, such as the funeral of President Kennedy. 

Findings of recent studies in the United States 
and Great Britain support such an individualistic 
emphasis in bereavement practices. In the United 
States, the appropriate expressions of grief and the 
length of the bereavement period, rather than fol- 
lowing a widely accepted pattern, are found to vary 
greatly with the circumstances of the death, the 
status of the deceased, the status of the bereaved, 
the nature of their former relationship, and the age 
and sex of both the bereaved and the deceased 
(although women are permitted a greater display 
of sorrow than men, the general prescription is “to 
be brave”). Bereavement, with few social limita- 
tions, is susceptible to individual definition to fit 
individual needs (Bowman 1959). Similarly, a 
study of bereavement in Britain concludes that “the 
majority of the population lack common patterns 
or ritual to deal with bereavement” (Gorer 1965). 

A second theoretical approach, developed by Par- 
sons (1963), calls attention to the changing con- 
text of death in American society; it notes that in- 
creasing proportions of any birth cohort live to the 
approximate completion of the life cycle and that 
death has been largely separated from its long and 
complex relationship to suffering. Thus, the twin 
threats of suffering and prematurity have been 
greatly reduced by medical advances. Death is now 
more often inevitable than adventitious; as early as 
the beginning of the twentieth century, Sir William 
Osler was able to report that few of his dying pa- 
tients died in agony. Within this context, Parsons 
argues, new orientations toward death are devel- 
oping that are less influenced by these traditional 
anxieties, 

Parsons classifies the developing orientations 
into two types: a “normal” or active orientation 
(consistent with the high evaluation placed by con- 
temporary society on science and activity) that 
Stresses the moral significance of death as the 
termination of a completed life cycle of effort and 
achievement; and a deviant orientation that is es- 
sentially regressive and fatalistic. To the extent 
that this “normal” orientation prevails in American 
society, the individual is expected to “face up” to 


death in realistic terms, and his bereaved are ex- 
pected to do their “grief work” quickly and pri- 
vately—within the intimate circle of family and 
close associates. At the same time, the deviant 
orientation to death is also clearly in evidence, and 
to this Parsons relegates the denial of the reality 
of death, which some scholars have regarded as the 
modal American view (as indicated by such phe- 
nomena as the impermeability of caskets, the prac- 
tice of cosmetic embalming, and the lifelike pres- 
entation of the corpse). How widespread each type 
of orientation actually is becomes an empirical 
question to which studies have only recently begun 
to be directed. 

Apart from such special theories, the over-all re- 
lationships between human death and human so- 
ciety have recently been probed by a few writers 
(for example, Choron 1963; Hoffman 1964; Sulz- 
berger 1961; Brown 1959) but have not yet re- 
ceived systematic theoretical attention from social 
scientists. Yet the fact of death raises problems on 
several levels of social structure (Blauner 1966). 
Mortality challenges social continuity—and socie- 
ties are universally characterized by institutions 
for transmitting the heritage from one generation 
to the next. Mortality threatens the orderly func- 
tioning of society—and social structures are uni- 
versally characterized by mechanisms for replacing 
deceased performers of social roles, Mortality 
weakens the group—and groups have traditionally 
established means, in the face of death, for re- 
assembly and restoration. 

Mannheim (1923-1929) pursues one theoretical 
approach that begins to deal with such broader 
issues by asking the disarmingly hypothetical ques- 
tion of what society would be like if there were no 
death. He points to connections between death and 
other basic processes: as participants in society die, 
there are roles to be filled by new participants; 
moreover, since the accumulated heritage can be 
only imperfectly transmitted, there is a continual 
process of transition from generation to generation. 
Consequently, as new participants are able to take 
a fresh look at society, social change is facilitated. 
New approaches and solutions are constantly being 
developed, and old solutions are discarded and 
forgotten when they are no longer necessary Or 
effective. i 

While Mannheim’s provocative essay probes 4 
wide range of social phenomena, a more complete 
theoretical formulation of the adaptive and seles 
tive mechanisms implicit in the relationship of 
death to society might well be possible. Such widely 
used social science concepts as those pertaining t° 
political succession, property inheritance, kinship 


structure, socialization—to list but a few—might 
be transferable to a more general sociological 
theory of death. 


Death and the individual 


The historical shift in bereavement practices 
from a social to an individual emphasis holds im- 
portant implications for the individual, who must 
face not only his own death but also the possible 
loss of close relatives and associates. Despite the 
importance of the topic, empirical studies of the 
individual's relationship to death have been com- 
paratively few and recent. Great obstacles to re- 
search are posed by people’s reluctance to discuss 
so private a matter, as well as by their underlying 
ambivalence toward death itself. Nevertheless, at- 
tempts are now being made to examine different 
aspects of the individual's feelings and attitudes, 
using a variety of research techniques, from pro- 
jective tests and physiological response measures 
to interviews of cross-section samples. Reactions 
have been obtained from several special segments 
of the population—children, the aged, the dying, 
the mentally and the physically ill. Certain char- 
acteristics of the individual (sex, age, religiosity, 
education, health, etc.) have been studied as pos- 
sible factors affecting attitudes toward death. And, 
although some of the first findings appear incon- 
clusive or confusing, efforts are underway to ex- 
plain individual attitudes through their interrela- 
tionships with the norms of the culture (Volkart & 
Michael 1957) and to design new research within 
a broader conceptual framework. 

The image of death. While the empirical 
studies cannot yet support any over-all formulation 
of individual attitudes toward death, a few ex- 
amples will illustrate the many clues and sugges- 
tive findings now beginning to emerge. One set of 
studies focuses on children, indicating, for exam- 
ple, that the child’s conception of death develops 
in stages. Thus, among very young children, prior 
to the development of the sense of causality, death 
is seen as reversible, not final (Nagy 1948). Emo- 
tional involvement with death tends to vary with 
stages in the development of the ego structure and 
with changing cultural pressures and expectations, 
so that involvement is greater during early child- 
hood and adolescence than during the preadoles- 
cent period (Alexander & Adlerstein 1958). Fear 
of death in children (as well as in adults) has been 
related in various studies to such disparate phe- 
nomena as separation anxiety, sex guilt, physical 
restraint, fear of the dark, sibling rivalry, and the 
castration complex. 

Another set of studies emphasizes the impor- 
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tance, for the dying individual, of a secure environ- 
ment and a return to primordial kinship ties. Most 
subjects who know they are to die say they prefer 
to die at home and to be surrounded by families 
and friends (Fulton 1965; Feifel 1959). Elderly 
subjects are less apprehensive about death if they 
live in familiar surroundings and with relatives 
(or even in homes for the aged) rather than alone. 
Such indications point to a need for social support 
that may be out of keeping with present tendencies 
toward hospitalizing and isolating the dying indi- 
vidual (Glaser & Strauss 1965). And the increas- 
ing majority of people do, in fact, die in hospitals 
(Fulton 1965). 

A series of small studies attempting to connect 
a person’s religion with his attitudes toward death 
has thus far produced inconclusive findings—in 
part because of conceptual differences in the atti- 
tudes studied and the specialized populations ex- 
amined. Thus, fear of death is variously reported to 
increase with religious orientation, or to decline 
with religious activity. Some studies report that 
more thought is given to death by the religiously 
inclined. Other studies show no association what- 
soever between religious conviction and attitudes 
toward death. While there are no satisfactory em- 
pirical data at hand to link these apparently con- 
flicting findings, greater consistency will undoubt- 
edly be found as research takes into account the 
differing definitions of death emphasized by the 
several religions and the differing needs met by 
religion in the various sectors of society. 

Two other types of research offer preliminary 
support for Parsons’ argument regarding the de- 
velopment of an active orientation toward death 
as contrasted with the denial of its reality. In one 
strand, a few small but cogent studies suggest that 
many persons fear their own death largely because 
death eliminates the opportunity to achieve goals 
important to self-esteem and that death may appear 
appropriate to the dying under conditions of dig- 
nity and personal fulfillment (e.g., Diggory & 
Rothman 1961). The second type of research deals 
with people’s concerns with death in comparison 
with their concerns about the problems of life and 
studies the modes of their adaptation to death. A 
cross-section study of the adult population of the 
United States by Rosalie Goldwater and John W. 
Riley, Jr. (the results of this study were being 
analyzed in 1966, but had not yet been published ) 
shows that large majorities report frequent con- 
cern with such problems as health (76 per cent) 
and money matters (74 per cent), in contrast to a 
minority who say they think often about the uncer- 
tainty of their own lives or about the possible death 
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of someone else (32 per cent). That this lack of 
concern does not reflect a general “denial” of death 
is indicated by the finding that 85 per cent, in re- 
sponse to a question concerning different ways of 
adjusting to the uncertainty of life, concur that 
people should “try to make some plans about 
death.” Although relatively few adult Americans 
have executed wills (24 per cent) or made funeral 
or cemetery arrangements (28 per cent), eight out 
of ten have purchased life insurance, and half have 
made a point of talking about death with those 
closest to them (for a preliminary account of some 
of these findings, see Riley 1964). 

Further analysis shows connections in this study 
between these views of death and the respondents’ 
educational attainment and age (analyzed jointly). 
The higher the education, the less negative the re- 
spondent’s image of death, the less his expressed 
anxiety about death, and the more active his adap- 
tation to death. This suggests that, as the general 
level of education in the Western world rises, a 
new orientation toward death may be in the mak- 
ing, however many defense mechanisms may be 
operative. Furthermore, older people are more likely 
than their younger counterparts (at any given edu- 
cational level) to reveal an active orientation to 
death and to disavow the idea that one should ig- 
nore death or avoid making plans. Similarly, other 
studies note that, among the aged, approaching 
death seems to provoke less anxiety (Cumming & 
Henry 1961); whereas among the young (adoles- 
cents), there is little structuring of the future and 
low tolerance for the idea of death (Kastenbaum 
1964). Thus, an active adaptation to death seems 
to become greater as individuals come nearer to 
completing the life cycle. 

Bereayement. Death means to the individual 
not only his own demise but also the loss of other 
people who are significant to him. From a psycho- 
logical standpoint, bereavement—generally held 
to signify the emotional state and behavior of the 
survivor following the death of a person who ful- 
filled dependency needs—is a temporary condition 
from which the individual is expected to recover. 
Studies of grief reactions to death have identified 
such syndromes of associated psychological and 
physiological symptoms as somatic distress, pre- 
occupation with the image of the deceased, guilt, 
hostile reactions, and loss of established patterns 
of conduct (Lindemann 1944). Freud (1915), 
whose classic work has afforded the theoretical 
foundation for the psychiatric literature on melan- 
cholia, paranoid reactions, and other emotional 
concomitants of bereavement, argued that recov- 


ery from the grief syndrome requires a process of 
reality testing to demonstrate that the loved object 
no longer exists; only when this process is com- 
plete is the ego free again. Mourning, then, is a 
psychological task to be performed (Krupp & Klig- 
feld 1962), 

From a sociological standpoint, the bereaved in- 
dividual may be aided through rituals and the 
support of family and friends to resume his usual 
social obligations after the mourning period (Eliot 
1932). In this perspective, the task is to re-estab- 
lish the systems of relationships interrupted by 
death or to develop new ones. Durkheim originally 
specified the function of ritual in enabling be- 
reaved persons to cope with death (1912, pp. 445, 
448 in 1961 edition): “When someone dies, the 
family group to which he belongs feels itself less- 
ened and, to react against this loss, it assembles... . 
The group feels its strength gradually returning to 
it; it begins to hope and to live again.” Various 
studies suggest, however, that such social supports 
often work imperfectly. A large-scale British survey, 
for example, shows that the help afforded by family 
gatherings and religious ceremonies is limited to 
the period of initial shock; for the subsequent 
period of intense mourning and physiological 
stress, the bereaved is typically left alone, bereft of 
attention or affect from the external world (Gorer 
1965). Thus societal supports may be ill designed 
to meet the needs of those who must live through 
bereavement and come to terms with grief. 

The psychological response of the survivor and 
his need for social support depend upon many fac- 
tors, and especially upon who has been lost—a 
child, a parent, a distant friend; in particular, 
many studies have focused upon the loss of a 
spouse. The majority of the widowed are older 
people, for whom the death of a spouse can leave 
a void that may never be filled, and research has 
called attention to the associated problems of finan- 
cial support, changes in housing and daily routine, 
and social isolation. To be sure, the most extreme 
sense of desolation occurs with recency of bereave- 
ment and tends to decrease over the subsequent 
years (Kutner et al. 1956). Yet, numerous studies 
comparing widowed with married persons have 
consistently shown that the widowed have reduced 
contacts with their children, intensified feelings of 
loneliness, higher suicide rates, and higher death 
rates. 


Some current issues 


Two specific problems related to the meaning 5 
death are engaging the research efforts of soci 


scientists: the problems of an aging population 
who are approaching death, and the role conflicts 
experienced by those who must deal with death 
(doctors, nurses, ministers, life insurance agents, 
undertakers). There is increasing concern with the 
morale and living conditions of the aged. For in- 
stance, with death imminent, is disengagement 
from social relations to be preferred over continued 
activity (Cumming & Henry 1961)? Should age- 
homogeneous retirement facilities for the elderly 
be gradually developed? What are the relative re- 
sponsibilities of public and private pension plans? 
Of the family? The solutions to such problems (of 
which there are many) are being sought by a wide 
variety of social science researchers. 

Role conflicts among those who deal profession- 
ally with death are also being increasingly identi- 
fied and studied. The clergy ponder the distinction 
between faith and therapy; doctors debate the 
Hippocratic mandate that life must be preserved at 
all costs; and nurses are caught between the de- 
mands of recuperating and dying patients. Life 
insurance agents attempt both euphemistic and 
realistic approaches in their efforts to bring into 
salience the uncertainty of life; while undertakers, 
constant reminders of the certainty of death, are 
berated as “grief therapists” and commercializers 
of ritual (Mitford 1963; Fulton 1965). 

The ambiguity of death is also to be seen in vari- 
ous other fields of scientific advance. Although the 
law generally holds that death occurs when auscul- 
tation can no longer detect a heartbeat, such a 
definition is frequently made obsolete in routine 
medical practice. Distinctions are drawn between 
clinical death (of the organism) and biological 
death (of the organs), so that the time of death is 
increasingly a matter for decision, and moral ques- 
tions arise as to the individual's “right to die with 
dignity.” An important issue in the ethics of birth 
control is also involved—does the intrauterine de- 
vice cause an abortion and hence a death? Further- 
more, while science can neither prove nor disprove 
the hypothesis of some form of communication be- 
tween living and dead, recent research in the field 
of parapsychology, reactivating an old tradition of 
psychic research (Myers 1903), is demanding 
attention from reputable scientists. To be sure, the 
“findings” of such research have not yet earned a 
place in the framework of modern science, but such 
efforts cannot be completely ignored. Finally, with 
the discovery that cells can be kept alive (appar- 
ently indefinitely) in a nutrient medium and that 
such cells can perhaps be reconstituted through the 
process of genetic transformation, biological im- 
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mortality itself can no longer be entirely ruled out. 
Thus science in various ways challenges the social 
definition of death. 

Joun W. RILEY, JR. 


[Directly related is the entry AGING. Other relevant ma- 
terial may be found in Kinsurp, article on DESCENT 
GROUPS; LIFE CYCLE; LIFE TABLES; MORTALITY; 
Rirvat.] 


BIBLIOGRAPHY 

ALEXANDER, IRVING E.; and ADLERSTEIN, ARTHUR M. 1958 
Affective Responses to the Concept of Death in a 
Population of Children and Early Adolescents. Journal 
of Genetic Psychology 93:167-177. 

BELLAH, Rogert N. 1964 Religious Evolution. American 
Sociological Review 29:358-374. 

BLAUNER, Rogert 1966 Death and Social Structure. 
Psychiatry 29:378-394. 

Bowman, LeRoy 1959 The American Funeral: A Study 
in Guilt, Extravagance, and Sublimity. Washington: 
Public Affairs Press. 

Brown, Norman O. 1959 Life Against Death: The Psy- 
choanalytic Meaning of History. Middletown, Conn.: 
Wesleyan Univ. Pi.°s. 

Cuoron, Jacques 1963 Death and Western Thought. 
New York: Collier. 

CHoron, Jacques 1964 Modern Man and Mortality. 
New York: Macmillan. 

CUMMING, ELAINE; and HENRY, WILLIAM E. 1961 Grow- 
ing Old: The Process of Disengagement. New York: 
Basic Books. 

Diccory, James C.; and ROTHMAN, DOREEN Z. 1961 
Values Destroyed by Death. Journal of Abnormal and 
Social Psychology 63:205-210. 

DurKHEm, ÉMILE (1912) 1954 The Elementary Forms 
of the Religious Life. London: Allen & Unwin; New 
York: Macmillan. > First published as Les formes 
élémentaires de la vie religieuse, le système totémique 
en Australie. A paperback edition was published in 
1961 by Collier. 

Eror, THomas D. 1932 The Bereaved Family. Ameri- 
can Academy of Political and Social Science, Annals 
160: 184-190. 

Ferret, HERMAN (editor) 1959 The Meaning of Death. 
New York: McGraw-Hill. 

Freup, Sigmund (1915) 1959 Thoughts for the Times 
on War and Death. Volume 4, pages 288-317 in Sig- 
mund Freud, Collected Papers. International Psycho- 
analytic Library, No. 10. New York: Basic Books; Lon- 
don: Hogarth. > First published as “Zeitgemasses über 
Krieg und Tod.” 

FuLTonN, Rosert L. (editor) 
New York: Wiley. 

GLASER, BARNEY; and STRAUSS, ANSELM 1965 Aware- 
ness of Dying: A Study of Social Interaction. Chicago: 
Aldine. 

Goopy, J. R. 1962 Death, Property and the Ancestors: 
A Study of the Mortuary Customs of the Lodagaa of 
West Africa. Stanford Univ. Press. 

Gorer, GEorFREY 1965 Death, Grief, and Mourning. 
New York: Doubleday. 

Hertz, Rogert (1907-1909) 1960 Death and The Right 
Hand. Glencoe, Ill.: Free Press. > First published as 
“La représentation collective de la mort” in Volume 
10 of L'année sociologique and as “La prééminence 


1965 Death and Identity. 


26 DEATH: The Social Organization of Death 


de la main droite” in Volume 34 of Revue philoso- 
phique. 

Hockinc, WILLIAM E. 1957 The Meaning of Immortal- 
ity in Human Experience. New York: Harper. 

HOFFMAN, FREDERICK J. 1964 The Mortal No: Death 
and the Modern Imagination. Princeton Univ. Press. 

Katisu, RicHarp A. 1965 Death and Bereavement: A 
Bibliography, Journal of Human Relations 13:118-141. 

KasTENBAUM, Ropert (editor) 1964 New Thoughts on 
Old Age. New York: Springer. 

Krupp, Grorce R.; and KLIGFELD, BERNARD 1962 The 
Bereavement Reaction: A Cross-cultural Evaluation. 
Journal of Religion and Health 1:222-246. 

KUTNER, BERNARD et al. 1956 Five Hundred Over Sixty: 
A Community Survey on Aging. New York: Russell 
Sage Foundation. 

LINDEMANN, ERICH 1944 Symptomatology and Manage- 
ment of Acute Grief. American Journal of Psychiatry 
101: 141-148, 

MANNHEIM, Kart (1923-1929) 1952 Essays on the So- 
ciology of Knowledge. Edited by Paul Kecskemeti. 
New York: Oxford Univ. Press. 

MitForp, Jessica 1963 The American Way of Death. 
New York: Simon & Schuster. 

Myers, Freverick W. H. (1903) 1954 Human Person- 
ality and Its Survival of Bodily Death. 2 vols. New 
York; Longmans. 

Nacy, Marta 1948 The Child's Theories Concerning 
Death. Journal of Genetic Psychology 73:3-27. 

Parsons, Tatcorr 1963 Death in American Society; A 
Brief Working Paper. American Behavioral Scientist 
6:61-65. 

RiLry, Jonn W. JR. 1964 Contemporary Society and the 
Institution of Life Insurance. Journal of the American 
Society of Chartered Life Underwriters 18, no. 2:93- 
103. 

SCHNEIDER, Lours; and DornpuscH, SANFoRD M. 1958 
Popular Religion: Inspirational Books in America. 
Univ. of Chicago Press. 

Simmons, Leo W. 1945 The Role of the Aged in Primi- 
tive Society. New Haven: Yale Univ. Press. 

SULZBERGER, Cyrus 1961 My Brother Death. New York: 
Harper. 

Vorxart, EpMunp H.; and MICHAEL, STANLEY T. 1957 
Bereavement and Mental Health. Pages 281-304 in 
Alexander H. Leighton et al. (editors), Explorations in 
Social Psychiatry. New York: Basic Books. 

WARNER, W. LLoyp 1959 The Living and the Dead: A 
Study of the Symbolic Life of Americans. New Haven: 
Yale Univ. Press. 

WEBER, Max (1904-1905) 1930 The Protestant Ethic 
and the Spirit of Capitalism. Translated by Talcott 
Parsons, with a foreword by R. H. Tawney. London: 
Allen & Unwin; New York: Scribner. + First pub- 
lished in German. The 1930 edition has been reprinted 
frequently. 


LLS 
THE SOCIAL ORGANIZATION OF DEATH 


The physical extinction of its members, not all 
at the same time but all eventually, is a contin- 
gency that every human group must face. Each 
death initiates significant responses from those sur- 
vivors who in some way have personally or vicari- 
ously related to the deceased. Inevitably, the 
collectivities in which the dead person held mem- 


bership also react. Despite the social (symbolic) 
ambiguity presented by the dead body, the survivors 
continue to relate to it for some time with predeath 
imagery. At the same time, they must attempt to 
cope with emotions no longer secured within the 
pre-existing balance of interpersonal relations. 

The reciprocal problem for the social group or 
collectivity remains the reassigning and reassem- 
bling of social roles and statuses, optimally in such 
fashion that not only is the social order in some 
measure re-established but the survivors affected 
by the death are re-equipped with images and sym- 
bols appropriate for building and sustaining an 
altered yet viable self-conception. This group prob- 
lem forms the basis for the treatment of mortuary 
behavior that follows. No categorical separation is 
attempted between the personal and organizational 
dimensions of the subject. The emphasis remains, 
however, on the latter. 

Death as passage. No social group socializes 
and controls members with a cosmology that cate- 
gorically holds out death as nothing more than the 
total eclipse of the person. To the primitive and 
preliterate, the opposite orientation is more likely. 
The belief that life is not the end underlies some 
of the ritual behavior of all peoples. It is perhaps a 
necessary premise to the development of human 
culture. 

A corollary premise suggests that the death of a 
society is inconceivable by its members, inasmuch 
as their belief and symbol systems link man and 
society reciprocally. Total obliteration of the person 
would so challenge the grounds for society's exist- 
ence that the very idea constitutes, in effect, a 
sacrilege. 

Possibly the most elementary and universal 
response is found in the conception of death as 
a transition or journey, as a series of happenings 
rather than an event complete in itself. The notion 
of transition implies qualitative changes in time 
and place. Consequent to death, secular time and 
location are replaced by sacred time and existence 
in another world, in which the spirit is either ab- 
sorbed or exists with some measure of individual- 
ity. Recognition cannot help but be given to physi- 
cal dissolution of the dead body, but the force or 
entity that gave the body life is held to be only 
transformed but never extinguished. All great re- 
ligious systems seemingly build on this principles 
its universality directs attention, then, to death as, 
a passage or as stages in the career of some life 
force that for a time inhabits the body but neither 
begins nor ends with it. F 

Channeling of death responses. For the survi- 
vors the death of an intimate has its most imme 
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diate consequence in vaguely or distinctly felt 
ambiguity and confusion. The intensity of the in- 
dividual response will be roughly proportionate to 
the intensity of the interpersonal interaction, vicari- 
ous as well as face-to-face, that the survivor en- 
joyed with the deceased. For these individuals, as 
well as for the group, the response will in great 
measure vary with the difficulty of replacing the 
departed member in an ongoing system of role and 
status relationships. 

The channeling of basic human sentiments is 
never an automatic process, nor is it possible to 
guarantee that the collectivity suffering such rup- 
ture in its affairs will not react so violently as to 
threaten all operating institutions. Whatever the 
rationale or general belief about the nature of 
death, elaboration into a set of operative prescrip- 
tions for behavior proceeds expeditiously in the 
context of symbolically ritualized ceremonials. The 
social prescriptions surrounding death do not un- 
equivocally control the responses of individuals and 
groups to the phenomenon of death, Personal re- 
actions where these prescriptions are embracing, 
as in preliterate, tribal societies, may seem re- 
flexive. But it would be incorrect to assume that 
the emotional responses of the survivors must and 
do coincide exactly with the demands of a socially 
prescribed mortuary etiquette. Death of an intimate 
always results in some loss of the bereaved ego— 
an impoverishment of self—and when the associa- 
tion has been close, whether characterized by posi- 
tive or negative sentiments, the loss will trigger off 
emotional responses that can overflow the chan- 
nels for appropriate mortuary behavior provided by 
the culture. 

The effectiveness of death rituals stems from the 
fact that, through the medium of a sacred-symbol 
system, they assist man in defining his relations 
to himself, his fellow man, and the cosmos. Rites 
are for the most part performed or engaged in col- 
lectively; the representations thus evoked and ex- 
pressed in ritual carry the authority and sanction 
of society itself. Mortuary rites characteristically 
operate to give meaning and sanction to the sep- 
aration of the dead person from the living, to help 
effect the transition of the spirit, soul, or life force 
into an otherworldly realm, and to assist in the 
incorporation of the spirit of the dead into its new 
existence. 

Ritualization and the drama of funerals. Ritu- 
alization of mortuary behavior evokes new or 
changed self-conceptions, insofar as it serves to 
move people from moments of personal confusion 
and ego impoverishment toward a restructuring of 
identity. Through such ritualization the “work of 


grief,” as postulated in dynamic psychology, is ex- 
pedited by the meaningful social interaction of the 
bereaved survivors. Since this interaction involves 
role playing, such rearrangement as occurs through 
the emergence of new or different roles resolves 
the anomaly of the incumbentless role created by 
death. 

The actual disposal of the dead body is generally 
handled in a number of ritual-bearing scenes or 
episodes. Once properly prepared for the funeral, 
the corpse will receive some form of attention from 
the survivors, Family and close kin, friends and 
neighbors, usually have the greatest emotional in- 
volvement, although where kin, sib, and clan bonds 
are strong, more extensive prescriptions for mortu- 
ary behavior channel and sanction the emotional 
and physical behavior of the most closely, as well 
as the most distantly, related. 

Funerals for the dead are matters of dramatic 
and sacred moment. The manner of disposal of 
the body, the role of the corpse in the ritual, and 
the utilitarian care of the dead is highly variable 
from group to group. Despite preliminary magico- 
religious prophylactic and propitiatory acts of the 
survivors, bodies may still be considered so repre- 
sentative of virulence and danger that, as in the 
case of the Kaingang in South America, they may 
be abandoned in terror. In like manner, the Navajo 
and other Indian tribes in the southwestern United 
States quickly bury the body along with many, if 
not all, of its earthly effects; the deceased's dwell- 
ing, if he died there, is abandoned and never re- 
used. At another extreme, common among the 
Malayo-Polynesians, the corpse may for a long 
period of time be kept on display close at hand, 
seemingly benign or positive in its influence, or be 
temporarily sequestered until the remaining burial 
rites are performed. 

Disposal of the dead emphasizes the separation 
of the physical dead from the society of the living. 
The role of the specialist, such as the priest, med- 
icine man, shaman, or spiritual intercessor, is 
crucial at this juncture, since it is through ritual- 
ized actions, organized into episodes or scenes, that 
both the dead and the living are moved on to new 
points of orientation and to new status positions. 
The point to be emphasized is that mortuary cere- 
monials affect the individual's sense of identity, or 
self, and provide entry into and departure from 
the system of roles and status relationships in the 
society. It is for this reason that funerals have the 
basic potential for the highest order of social sig- 
nificance. Within the framework of mortuary cere- 
monies, society-specific patterns of belief and action 
centering on death and burial arise to express or 
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achieve other purposes, among which are the 
descent of property, authority, and sexual privilege 
and the enhancement of a popular aesthetic of 
beauty in death; or the projection of cults of per- 
sonality, rationality, or pragmatism. Dramatization 
of all such purposes—even that of expressing in- 
difference—may achieve a measure of functional 
autonomy, 

ROBERT W. HABENSTEIN 
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DEBT, PUBLIC 


Public debt is an obligation on the part of a gov- 
ernmental unit to pay specific monetary sums to 
holders of legally designated claims at particular 
points in time. The sums owed to creditors may be 
defined in standard monetary units of the debtor 
government or in units of an external currency. 
The United States national debt represents, pre- 
dominantly, an obligation of the federal govern- 
ment to pay specific sums of United States dollars 
to creditors. On the other hand, national debts may, 
and local debts must, be defined in units of external 
currency. The monetary obligation may be that of 
paying either interest or a return of principal, or 
both. Specific issues of debt may or may not be 
characterized by definite maturity schedules. Con- 
sols, which represent obligations to pay interest in 
perpetuity, involve no obligation for a return of 
principal. 

Public debt must be distinguished from currency 
outstanding. The obligation on the part of a money- 
issuing governmental authority to the holder of cur- 
rency is not public debt, since the claim can also 
be met in currency units. 

Public debt is created by the act of public bor- 
rowing, or, in other words, the act of floating 
public loans, the act of selling government securi- 
ties. This is a process through which governmental 
units, in exchange for money (currency or demand 
deposits) give promises to pay, this exchange being 
normally voluntary on the part of the lender. For 
governments the purpose of the exchange is that 
of securing current purchasing power with which 
they may purchase resource services, or final prod- 
ucts. fhe issue of public debt is one means of 
financing government expenditures, alternative tO 
taxation and to direct currency creation. 

Public debt is amortized, or retired, by a reverse 
transfer in which government gives up money for 
debt instruments, either through purchase on the 
open market or through scheduled maturity pay- 
ments, 


Measurement. Public debt is normally measured 
in nominal maturity values. This does not repre- 
sent the “size” of the public debt in such a manner 
as to make comparisons over time and among sepa- 
rate governmental units fully accurate. Varying 
composition of debt can affect its degree of liquid- 
ity for holders, as well as other characteristics. To 
the extent that debt instruments are valued for these 
nondebt features, the effective size of public debt, 
as such, is reduced. A more accurate measure of 
debt is produced by capitalizing the annual inter- 
est charges at some appropriate rate of discount, 
normally that rate which approximates the return 
on risk-free investment in the economy. An exam- 
ple will clarify this point. In terms of nominal ma- 
turity value, national debt may be of equal size, say 
$300,000 million, at two points in time. In the one 
case, however, if the debt is composed primarily of 
short-term issues possessing a high degree of 
“moneyness,” the annual interest charges may 
amount to, say, only $6,000 million. In the other 
case, if the debt is largely funded, the interest 
charges may be as high as, say, $12,000 million. 
Clearly, the “size” of the public debt is not identical 
in the two cases; other dimensions than nominal 
maturity values must be included in any appropri- 
ate measure. For purposes of making intertem- 
poral and international comparisons, the most ap- 
propriate measure is perhaps the ratio of annual 
interest charges to gross national or gross domestic 
product, 


Table 1 — National debt in selected countries, 1960 
RATIO OF DEBT 


NATIONAL DEBT, 1960° TO GNP 

(In millions of dollars) (Per cent) 
Brazil 49 19 
Conada 17,679 49 
France 24,925 40 
Germany 611 1 
India 13,159 35 
Japan 2,551 7 
Norway 1,293 29 
United Kingdom 77,652 124 
United States 286,471 57 


a. Includes intragovernmental debt. Converted to U.S. dollars at official rates. 
b. Rounded to nearest percentage point. 


Source: International Financial Statistics. 


Public debt may characterize both national and 
local fiscal systems. Data are much more readily 
available for national debts, and that of some se- 
lected countries may be introduced for illustrative 
purposes. Table 1 shows the national debts of sev- 
eral selected countries in 1960, defined in nominal 
maturity values, local currency units being con- 
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verted into United States dollar equivalents at of- 
ficial exchange rates. The right-hand column of 
Table 1 indicates the ratio of national debt to gross 
national product of the issuing country. 

The most significant fact that emerges from any 
comparative survey of national debts in the various 
countries, and one that is indicated from the data 
summarized in Table 1, is that no single country 
was, in 1960, dangerously “overburdened” with na- 
tional debt. Among the major countries of the 
world, only for the United Kingdom was the size of 
the national debt larger than GNP. And for rela- 
tively few countries, developed or underdeveloped, 
is the ratio of national debt to GNP more than 50 
per cent. There are several reasons for these results, 
some of which make their significance less than it 
might initially seem to be. 

Countries that are characterized by large na- 
tional debts, measured proportionately to GNP, 
tend to be those that have enjoyed reasonably stable 
government, that have been victorious in wars, and 
that have experienced reasonable monetary sta- 
bility over a relatively long period. Great Britain 
and the United States, both victorious in two world 
wars and both characterized by political stability 
and relatively moderate inflation, carry relatively 
heavier national debts than most other countries. 
Causality seems to work in only one direction 
here, however. Political and monetary stability do 
not result from large national debts; instead, large 
national debts emerge only in conditions of reason- 
able political and monetary stability. In the absence 
of these conditions, public borrowing will rarely be 
important as a means of financing public spending, 
and, even when it is-successfully carried out, the 
creditors of the state may be subject to confiscation 
through inflation or through default. 

Historically, the large increases in national debts 
have been associated with the financing of war 
emergencies. This is illustrated by the growth of 
the United States national debt, shown for selected 
years in Table 2. 


Table 2 — Growth of United States national debt, se- 


lected years 
NOMINAL MATURITY VALUE 
YEAR (In millions of dollars) 
1917 1,023 
1919 26,349 
1930 15,774 
1940 42,376 
1946 277,912 
1949 249,509 
1960 286,471 
1965 317,270 


Source: U.S. Treasury Department data. 
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Composition. Public debt assumes many forms, 
ranging from consols at the one extreme to treasury 
bills of very short maturity at the other, with other 
characteristics not relating to maturity schedules. 
The form in which debt is marketed affects the 
interest rate that must be paid, since various fea- 
tures, desired in themselves, may be offered com- 
plementary to what may be called “pure debt.” 
Consols, which have no maturity and which obli- 
gate the issuing government only to pay a specific 
interest charge periodically and in perpetuity, pro- 
vide a useful bench mark for comparative purposes. 
These come close to representing “pure debt,” and 
as other features are added the cost of carrying 
debt tends to decrease. The fixing of a definite ma- 
turity, which obligates the government to return 
the principal to the holder of claims, increases the 
liquidity of the claim, especially as maturity is 
approached. Hence, as maturities are shortened, 
liquidity is increased to the point at which short- 
term treasury bills take on a high degree of “money- 
ness,” For this reason bills can normally be sold at 
considerably lower rates of interest than long-term 
bonds. 

Certain countries have been successful in float- 
ing issues of debt that obligate the government to 
return monetary sums of fixed purchasing power 
to holders of claims. These constant-value bonds, 
which provide holders with apparent protection 
against inflationary erosion in value, have been 
marketed at very low rates of interest. Great Britain, 
in the years since World War u1, has successfully 
introduced “premium bonds,” which incorporate 
certain features of a lottery. 

The terms “funded debt” and “floating debt” have 
been used, historically, to refer to long-term and to 
short-term debt, respectively. Any definitions are, 
to an extent, arbitrary, but all issues of more than 
five-years maturity are generally called “long-term,” 
and all issues with shorter maturities are called 
“short-term.” The rubric “floating debt” is most ex- 
plicitly used with reference to issues of less than 
one year maturity. 

The names given to specific debt instruments 
reflect the maturity category. Bonds are securities 
issued for long term. Notes and certificates are in- 
termediate-term, and treasury bills are short-term. 
The term “government securities” is used to refer 
to the whole range of public debt instruments. 

Ownership. The pattern of ownership of pub- 
lic debt is an important factor in determining its 
impact on the national economy. In almost all 
countries national debt is held in significant 
amounts by various governmental or quasi-govern- 
mental agencies. For some purposes it is desirable 


to include only debt held outside the governmental 
sector in measuring total debt. For other purposes 
intragovernmental debt should be included. Insofar 
as the agencies holding debt are, in fact, treated as 
being independently accountable units, both in 
their investing and in their paying functions, the 
public debt that these agencies hold is not different 
in effect from that held by the nongovernmental 
public. However, insofar as these agencies holding 
debt are not independent of direct treasury control, 
their holding does not constitute debt at all and 
could as well have been canceled at the outset. For 
the most part, intragovernmental debt represents 
some combination of these two institutional situa- 
tions. 

It is also important to distinguish between the 
debt held by the nonbanking public and that held 
by the banking system and by certain categories 
within the banking system. To the extent that gov- 
ernment securities are held by the central banks, 
they provide reserves for the commercial banking 
system, and changes in central bank holdings are 
indicative of important monetary movements in 
the economy. To the extent that commercial banks 
hold securities, potential reserves are available to 
them. 

Traditionally, debt held internally and debt held 
externally have been sharply distinguished. In a 
national accounting sense, the servicing of inter- 
nally held debt does not require the transfer of 
income resources out of the national economy, 
whereas, of course, such a transfer is required for 
the servicing of an externally held debt. This point 
is relevant, even though it must be kept in mind 
that, other things being equal, the income base 
from which the transfers are to be made must be 
larger in the case of externally held debt, precisely 
by the amount of the necessary interest transfer. 

Management. Public debt management may be 
defined as that set of operations required to main- 
tain an existing nominal debt. Again, consols pro- 
vide a useful reference point; if all public debt 
should consist of consols, management reduces to 
the payment of interest charges. Significant prob- 
lems of management arise because issues must be 
refinanced, must be “rolled over,” at periodic inter- 
vals. These problems become more difficult as the 
proportion of floating debt in the total becomes 
larger and also as economic conditions over time 
become less stable. f 

Recognition of management difficulties explains 
the traditional policy objective of funding the na- 
tional debt to the maximum extent that is possible. 
This objective may conflict with other objectives 
that have been introduced into public policy. AS 


national debts grow in importance, the annual in- 
terest payments assume relatively large shares of 
the governmental expenditure budget. This fact 
prompts recognition of the minimization of interest 
cost as an objective of debt management, one that 
is sharply in conflict with funding. Modern devel- 
opments in the theory of macroeconomic policy 
have also forced the recognition that debt manage- 
ment exerts significant over-all effects on the econ- 
omy. This produces a third possible management 
objective, that of supplementing macroeconomic 
policy in promoting stabilization—growth aims. 

In a period of threatened inflation both the fund- 
ing and the macroeconomic objectives would dictate 
a shift out of short-term into long-term issues, but 
this would increase interest costs. By contrast, dur- 
ing a period of recession both the interest-cost and 
the macroeconomic objectives dictate a shift into 
short-term issues from long-term, but this runs 
afoul of traditional notions about funding. The ex- 
perience of the United States since World War 1 
suggests that the best explanatory hypothesis is 
that management has been directed toward main- 
taining substantially the same maturity structure 
over time. 

Monetization. Interest costs on national debt 
could, of course, be reduced to zero through direct 
monetization of public debt. This is a process 
through which interest-bearing issues are refi- 
nanced through the issue of money, currency, or 
bank deposits. Because of the interest saving 
alone, monetization is always to be recommended 
insofar as it can be accomplished without inflation. 
Only the threat of inflation provides a barrier to 
monetization, 

As the national economy grows, an increasing 
stock of money is required in order to maintain a 
constant level of product and service prices. One 
means of injecting increments to the money stock 
into the system is through debt monetization. Given 
the improbability that modern governments will re- 
tire public debts through the deliberate creation of 
budgetary surpluses, monetization provides the 
primary means through which national debts will 
be reduced, Monetization of debt is possible only 
for governmental units that possess independent 
money-creating powers. 

National debt and fiscal policy. There is no 
simple connection between fiscal policy and the 
issue or retirement of national debt. Budgetary 
deficits need not be financed by the issue of debt, 
defined in a meaningful way. And, if such deficits 
are generated purposefully for the supplementing 
of aggregate demand, national debt should not be 
used as the means of financing. Instead, money 
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should be directly created, which is always an 
alternative means of financing. In such a setting, 
the issue of debt exerts a deflationary, and unde- 
sirable, impact on total demand. 

The elementary confusion about all this arises 
because, institutionally, national governments tend 
to disguise money creation through so-called “bor- 
rowing” from the central banks, and “public debt” 
is, nominally, created in this money-creating proc- 
ess. It is essential, however, that these two methods 
of financing deficits, which are conceptually quite 
different, be distinguished in the analysis. 

The same analysis applies, in reverse, when 
budgetary surpluses are created. If the purpose of 
generating the surplus is that of reducing total 
demand, debt held outside the central banks should 
not be retired. Instead, the surplus should be dis- 
posed of by drawing down treasury balances or 
retiring debt held by central banks, which is the 
institutional equivalent of destroying money. 

Principles of public debt. The preceding sec- 
tions of this article summarize the institutional ele- 
ments of “public debt,” on which there exists broad 
agreement among professional experts. It is the 
“theory,” or “principles,” of public debt that has 
traditionally generated controversy, and the debate 
shows little sign of being resolved, although essen- 
tially the same arguments have been advanced for 
two centuries. For those who remain skeptical 
about the progress of economic science, the debt- 
theory controversy provides ample corroboration. 

Nominally, the central question has been: Who 
bears the burden of public debt, and when? Anal- 
ysis is clarified if this question is more carefully 
stated as follows: Who bears the real cost, or bur- 
den, of the public-expenditure projects that are 
financed by the issue of public loans, and when is 
this real cost incurred? What is sought is the inci- 
dence of debt, and the problem is comparable to 
that of locating the incidence of taxation, should 
this alternative financing device be employed. 

This question is important because only if it is 
answered properly can a rational choice be made 
between debt issue and alternative financing de- 
vices. When should government borrow? This ulti- 
mate policy question cannot be answered until and 
unless the consequences of public borrowing can 
be predicted. Public spending may be financed in 
any of three ways: (1) taxation, (2) public bor- 
rowing, and (3) money creation. The appropriate 
situations for the use of the second alternative can- 
not be identified until the differences between this 
and the remaining alternatives are analyzed. 

For whom is the central question relevant? Fail- 
ure to clarify this point has also led to ambiguity. 


32 DEBT, PUBLIC 


In any broadly democratic political structure, “gov- 
ernment” is the institutional process through which 
people make “public” decisions. Therefore, “When 
should government borrow?” becomes merely a 
shorthand manner of asking, “When should indi- 
vidual citizens, as participants in the governmental 
decision process, as prospective taxpayers, bor- 
rowers, or beneficiaries, borrow?” The question of 
the appropriateness of public debt issue becomes 
analogous to the question concerning private debt 
issue for the individual in his private capacity. 

The answer to the central question posed above 
seems obvious. Public borrowing should take place 
when it is desirable to put off or to postpone the 
incurring of real cost until some later period, in re- 
turn for which, as in any act of borrowing, an inter- 
est charge must normally be paid. The desire to 
postpone the incurring of real cost or incidence of 
the public expenditure project may or may not be 
related to the characteristics of the project itself. 
Public or collective consumption may, in some 
cases, be legitimately financed from public loans, 
just as private consumption may be legitimately 
financed from private loans. A more normal or 
standard reason for public borrowing is found, 
however, in the temporal pattern of benefits that 
are expected to be produced by the public spend- 
ing project. If the project involves a large and con- 
centrated outlay that is anticipated to yield benefits 
over a whole sequence of time periods in the future, 
considerations of both efficiency and equity suggest 
resort to public loans, provided only that direct 
money creation is predicted to cause inflation. Tra- 
ditionally, public debt has been discussed with ref- 
erence to the financing of extraordinary expendi- 
ture needs, and specifically with reference to public 
capital formation. Historically, national debts have 
been created largely during periods of war emer- 
gency, when spending needs have indeed been 
extraordinary, even if the result has not been public 
capital formation of the orthodox variety. 

The controversy. Little more would need to be 
said with respect to the “principles” of public debt 
were it not for the fact that the straightforward 
analysis has been recurrently challenged, and by 
economists of great eminence, Ricardo’s logic led 
him to enunciate the proposition that the public 
loan and an extraordinary tax of equivalent amount 
exert identical effects on the rational individual. 
Under conditions of perfect certainty, perfect capi- 
tal markets, and under the assumption that indi- 
viduals act “as if” they will live forever, the future 
tax liabilities that are inherent in a public debt 
obligation will be fully capitalized at the time of 
debt issue, and the effects on individual behavior 


patterns will be identical to those that would be 
produced by a tax of the same capital sum (Ricardo 
1817). Within his own restricted model Ricardo’s 
basic proposition cannot be challenged, but its 
validity does not contradict the elementary notions 
on public debt outlined above. The individual, as 
prospective taxpayer—borrower—beneficiary, con- 
fronts two financing alternatives—current taxation 
and public debt issue. And to the extent that he 
interprets these two alternatives correctly, he knows 
that each of the two will impose upon him a real 
cost, a burden that in a present-value sense is sub- 
stantially the same. This Ricardian equivalence 
does not suggest, however, that the objective pat- 
tern of cost payments remains the same over the 
two alternatives. Taxation and debt issue remain 
different, not similar, financing institutions, be- 
tween which the individual may choose, for the 
simple reason that taxes require a transfer of re- 
source services from the individual to the fisc dur- 
ing the initial period, whereas debt issue postpones 
such transfer until later periods. Whether or not 
the individual, under debt financing, correctly or 
incorrectly anticipates or “capitalizes” future tax 
liabilities as he “should” in the normative Ricardian 
model, is not relevant to the determination of the 
objective pattern of real cost payments over time. 

A common fallacy. The most pervasive and re- 
curring fallacy in the discussion of public debt has 
been that which summarizes the theory of inter- 
nally held debt by the statement “We owe it to our- 
selves.” The fallacy here is not a new one; it was 
widely held by both scholars and publicists in the 
eighteenth century. It almost faded out of the 
literature during the nineteenth and early twenti- 
eth centuries, only to reappear and to gain pr 
dominance after the so-called Keynesian revolution 
of the 1930s. In the immediate post-Keynesian 
decades the fallacy came to be almost universally 
accepted by economists. Therefore, if for no other 
reason, the importance that this fallacy has held 
in the literature warrants some consideration of it 
here. For purposes of discussion it may be labeled 
the “national accounting fallacy.” 

If the government borrows funds internally, 80 
the theory goes, the public expenditure project 1s 
financed from internal resources. These cannot be 
brought from future time periods into the presents 
hence, the opportunity costs of the public project, 
regardless of the method of finance, must be borne 
during the period in which the resources are actu- 
ally used. Public debt involves no postponement of 
cost or burden in time, and the view that it does 50 
is based on a crudely drawn analogy with private 
debt. In periods after public debt issue, the require 


interest payments represent nothing more than 
transfers from taxpayers to bondholders, and, so 
long as both groups are members of the com- 
munity, no real cost is incurred. External public 
debt, by contrast, because of the necessity to trans- 
fer interest payments to outsiders, is seen as wholly 
different from internal debt and analogous to pri- 
vate debt. 

This conception of national debt contains a 
fundamental flaw in its failure to translate oppor- 
tunity cost or burden from aggregative components 
into something that is meaningful to individual 
members of the community. What is the behavioral 
relevance of the fact that the resources actually 
used up in the public project are taken from cur- 
rent consumption or investment within the com- 
munity until and unless those members of the 
community who must undergo the sacrifice of 
alternatives can be identified? Once this question 
is raised, however, and such identification is at- 
tempted, the fallacy is clearly revealed. 

The members of the group who actually sur- 
render purchasing power, command over current 
resource services, which is used by the government 
to carry out the public purpose, are those who pur- 
chase the public debt instruments, the government 
securities. No other members of the group sacrifice 
or “give up” anything directly during the initial 
period. But do the bond purchasers bear the real 
costs or burden of the project? That they do not 
becomes clear when it is recognized that their sur- 
render of current funds is a wholly voluntary and 
private transaction in which they exchange current 
purchasing power for promises, on the part of the 
fisc, of income in future periods. These bond pur- 
chasers, or persons acting in this capacity, do not 
in any way consider themselves to be exchanging 
purchasing power for the benefits of the public 
spending, which would be the case if they should 
be really bearing the real cost. If, however, it is 
acknowledged that bond purchasers do not bear 
the real cost of the public spending, and if no other 
members of the group bear such cost during the 
initial period, who does pay for the project that is 
debt-financed? If, as the national accounting fal- 
lacy suggests, none of this cost is shifted to future 
periods, public debt might seem to provide for 
“fiscal perpetual motion,” since a means would have 
been located for financing beneficial public projects 
without cost. 

The core of the fallacy lies in the equating of the 
community as a unit, in some aggregated national 
accounting sense, with the individuals-in-the-com- 
munity, in some political sense as participants, 
direct or indirect, in collective decision making. In 
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their capacities as prospective taxpayers—borrowers— 
beneficiaries, individual members of a political com- 
munity can postpone the objective real costs of 
public spending through resort to debt finance, 
even though they may sell the debt instruments, 
bonds, to “themselves,” acting in a wholly different 
capacity as bond purchasers. There are two ex- 
changes, not one, involved, and it is the neglect of 
this basic duality that has clouded much of the dis- 
cussion and analysis. Individuals, as taxpayers- 
borrowers-beneficiaries, through the political de- 
cision process, “exchange” the future tax liabilities 
that debt issue embodies for the promise of ex- 
pected benefits of the public spending, current or 
in future. They are enabled to do this because indi- 
viduals, in their capacities as prospective bond 
purchasers, are willing to exchange current pur- 
chasing power for the promise of future payments. 
The second group makes an intertemporal ex- 
change that is the opposite of that made by the 
first group. And it is grossly misleading to think of 
these two exchanges as canceling in effect merely 
because the two groups may be partially coincident 
in membership. 

The alleged differences between internal and 
external debt disappear in a model that corrects for 
this national accounting fallacy. In either situation 
individuals in their roles as participants in collec- 
tive fiscal choice secure command over resource 
services initially without the necessity of giving up 
purchasing power. Whether other parties to the 
exchange be foreigners or some subset of indi- 
viduals acting in their private capacity as bond 
purchasers makes no essential difference, second- 
ary transfer considerations aside. 

For individuals as they participate in fiscal 
choice, public debt is the institutional analogue to 
private debt as these same individuals act privately. 
Basically, the same principles apply in each case, 
despite all the disclaimers made by economists. 
This is not, of course, to equate public finance with 
private finance in all respects. The important dif- 
ference lies, however, not in the effects of borrow- 
ing but in the fact that national governments pos- 
sess powers of money creation whereas private 
persons, and local governments, do not. The power 
to create money allows national governments to 
generate budgetary deficits without at the same 
time issuing debt. The principles of public borrow- 
ing, which are at base simple, have been obscured. 
by the failure of economists to make this elemental 
point clear. In part, as suggested before, this has 
been the result of institutional complexities present 
in modern fiscal—financial structures. Modern gov- 
ernments create money through issuing what they 
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commonly call “public debt”; they do so by “borrow- 
ing” from central banks. This disguised money 
creation does not, of course, have the same effects 
as genuine debt issue. 

The analysis of public debt which was dominant 
in the 1940s and 1950s, as a result of the Keynesian 
and post-Keynesian emphasis on deficit financing 
combined with the confusion between debt issue 
and money creation, was subjected to critical attack 
in the late 1950s and early 1960s. Predictions as to 
the development of analysis or the acceptance of 
ideas are risky at best, but it seems reasonable to 
suggest that the principles of public debt are on the 
verge of synthesis. Undue optimism is, however, 
surely to be avoided, especially if the history of debt 
theory is to be used as a guide. 

JAMES M. BUCHANAN 


[See also Fiscat poticy; and the biography of Ri- 
CARDO.) 
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PSYCHOLOGICAL ASPECTS 


Men must choose what to do, Often, choices 
must be made in the absence of certain knowledge 
of their consequences. However, an abundance of 
fallible, peripheral, and perhaps irrelevant informa- 
tion is usually available at the time of an important 
choice; the effectiveness with which this informa- 
tion is processed may control the appropriateness 
of the resulting decision. This article is concerned 
with laboratory studies of human choices and of 
certain kinds of human information processing 
leading up to these choices. It is organized around 
two concepts and two principles. The two concepts 
are utility, or the subjective value of an outcome, 
and probability, or how likely it seems to the de- 
cision maker that a particular outcome will occur 
if he makes a particular decision. Both of the prin- 
ciples are normative or prescriptive; they specify 
what an ideal decision maker would do and thus 
invite comparison between performances of ideal 
and of real decision makers. One, the principle of 
maximizing expected utility, in essence asserts that 
you should choose the action that on the average 
will leave you best off. The other, a principle of 
probability theory called Bayes’ theorem, is a for- 
mally optimal rule for transforming opinions in the 
light of new information, and so specifies how you 
should process information, The basic conclusions 
reached as a result of comparison of actual human 
performance with these two principles is that men 
do remarkably well at conforming intuitively to 
ideal rules, except for a consistent inefficiency in 
information processing. 

Utility 

Utility measurement and expected utility. The 
concepts of utility and probability have been with 
us since at least the eighteenth century. But seri- 
ous psychological interest in any version of them 
did not begin until the 1930s, when Kurt Lewin 
wrote about valence (utility) and several proba- 
bility-like concepts. Lewin had apparently been in- 
fluenced by some lectures on decision theory that 
the mathematician John von Neumann had elven 
in Berlin in 1928. But the Lewinian formulations 
were not very quantitative, and the resulting Te 
search did not lead to explicit psychological E 
cern with decision processes. However, in 194 
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yon Neumann and Morgenstern published their 
epochal book Theory of Games and Economic Be- 
havior. The theory of games as such has been re- 
markably unfruitful in psychological research, 
mostly because of its dependence on the absurdly 
conservative minimax principle that in effect in- 
structs you to deal with your opponent as though 
he were going to play optimally, no matter how 
inept you may know him to be. But von Neumann 
and Morgenstern rather incidentally proposed an 
idea that made utility measurable; that proposal is 
the historical origin of most psychological research 
on decision processes since then, Their proposal 
amounts to assuming that men are rational, in a 
rather specific sense, and to designing a set of 
procedures exploiting that assumption to measure 
the basic subjective quantities that enter into a 
decision. 

Since the origin of probability theory, the idea 
has been obvious that bets (and risky acts more 
generally) can be compared in attractiveness. For- 
mally, every bet has an expectation, or expected 
value (EV), which is simply the average gain or 
loss or money per bet that you might expect to 
accrue if you played the bet many times. To calcu- 
late the EV, you multiply each possible dollar out- 
come of the bet by the probability of that outcome, 
and sum the products. In symbols, the EV of the 
ith bet is calculated as follows, where Vj) is the 
payoff for the jth outcome of the ith bet and p; is 
the probability of obtaining that payoff: 


(1) EV; = pi Vu. 


Bets can be ordered in terms of their EV, and it 
seems plausible to suppose that men should prefer 
a bet with a higher EV to a bet with a lower one. 
But a little thought shows that men buy insurance 
in spite of the fact that the insurance companies 
pay their employees and build buildings, and thus 
must take in more money in premiums than they 
pay out in benefits. Thus insurance companies are 
in the business of selling bets that are favorable 
to themselves and unfavorable to their customers. 
Nevertheless, it is doubtful that anyone would call 
buying insurance irrational. This and other con- 
siderations led to a reformulation of the notion 
that men should order bets in terms of EV. The 
seventeenth-century British utilitarian philosophers 
had distinguished between objective value, or price, 
and subjective value, or utility. If the utility of 
some object to you is different from its price, then 
surely your behavior should attempt to maximize 
not expected value in dollars but expected utility. 
That is, you should substitute u(V:;), the utility of 


the payoff, for the jth outcome of bet i, for the pay- 
off itself, V;; in equation (1). Since it is utility, not 
payoff, that you attempt to maximize in this model, 
it is ‘called the expected utility maximization 
model, or EU model. In symbols, 


(2) EU, = Uipju(Vij). 


Von Neumann and Morgenstern proposed simply 
that one should use equation (2) to measure 
utility, by assuming that men make choices ration- 
ally. Several specific implementations of this idea 
will be examined below. 

Freud and psychiatry have taught us, perhaps 
too stridently, to look for irrational motivations be- 
hind human acts, and introspection confirms this 
lesson of thousands of years of human folly. Why 
bother, then, with measurement based on the as- 
sumption that men are rational? Three kinds of 
answers seem clear. First, rationality, as decision 
theorists think of it, has nothing to do with what 
you want, but only with how you go about imple- 
menting your wants. If you would rather commit 
rape than get married and rather get drunk than 
commit rape, the decision theorist tells you only 
that, to be rational, you should get drunk rather 
than get married. The compatibility of your tastes 
with your, or society's, survival or welfare is not 
his concern. So it is easy to be irrational in Freud's 
sense, and yet rational from a standpoint of de- 
cision theory. Second, men often want to imple- 
ment their tastes in a consistent (which means 
rational) way, and when large issues are at stake, 
they often manage to do so. In fact, knowledge of 
the rules of rational behavior can help one make 
rational decisions, that is, knowledge of the theory 
helps make the theory true. Third, the most im- 
portant practical justification of these or any other 
scientific procedures is that they work. Methods 
based on von Neumann and Morgenstern’s ideas 
do produce measurements, and those measure- 
ments provide predictors of behavior, The follow- 
ing review of experiments supports this statement. 

The Mosteller and Nogee experiment. The 
von Neumann—Morgenstern proposal was elabo- 
rated by Friedman and Savage (1948), a mathe- 
matical economist and a statistician writing for 
economists, and then was implemented experi- 
mentally by Mosteller and Nogee (1951), a statis- 
tician and a graduate student in social psychology. 
(No discipline within the social or mathematical 
sciences has failed to contribute to, or use, decision 
theory.) Mosteller and Nogee asked subjects to 
accept or reject bets of the form “If you beat the 
following poker-dice hand, you will win $X; other- 
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wise, you will lose $0.05.” A value of X was found 
such that the subject was indifferent between ac- 
cepting and rejecting the bet. Arbitrary assignment 
of 0 utiles (the name for the unit of utility, as gram 
is the name for a unit of weight) to no transaction 
(rejection of the bet) and of —1 utile to losing a 
nickel fixed the origin and unit of measurement of 
the utility scale, and the calculated probability p 
of beating the poker-dice hand was substituted into 
the following equation: 


pu($X) + (1 — p)u(—$0.05) = u( $0). 


Since u($X), the utility of $X, is the only unknown 
in this equation, it is directly solvable. Mosteller and 
Nogee used two groups of subjects in this ex- 
periment: Harvard undergraduates and National 
Guardsmen. For the Harvard undergraduates, they 
found the expected decreasing marginal utility; 
that is, the utility function rose less and less 
rapidly as the amount of money increased. For the 
National Guardsmen, they found the opposite; the 
larger the amount of money, the steeper the slope 
of the utility function. 

Perhaps the most important of the many criti- 
cisms of the Mosteller-Nogee experiment is about 
the role that probabilities display in it. The prob- 
ability of beating the specified poker-dice hand is 
not at all easy to calculate; still, it is substituted 
into equation (2), which is supposed to represent 
the decision-making processes of the subject. Actu- 
ally, Mosteller and Nogee did display these proba- 
bilities as numbers to their subjects. But they also 
displayed as numbers the values of the amounts 
of money for which the subjects were gambling. 
Why should we assume that the subjects make 
some subjective transformation that changes those 
numbers called dollars into subjective quantities 
called utilities, while the numbers called probabili- 
ties remain unchanged? Mosteller and Nogee made 
the point themselves, and reanalyzed their data 
using equation (2), but treating p rather than 
u($X) as the unknown quantity. But this is no 
more satisfactory. The fundamental fact is that 
equation (2) has at least two unknowns, p and 
u($X). This fact has been recognized by renaming 
the EU model; it is now called the SEU (subjec- 
tively expected utility) model. The addition of the 
S means only that the probabilities which enter 
into equation (2) must be inferred from behavior 
of the person making the decision, rather than cal- 
culated from some formal mathematical model, Of 
course, the person making the decision may not 
have had a very orderly set of probabilities, In par- 
ticular, his probabilities for an exhaustive set of 


mutually exclusive events may not add up to 1. 
Thus there are really two different SEU models, 
depending on whether or not the probabilities are 
assumed to add up to 1. (In the latter case, the 
utilities must be measured on a ratio, not an inter- 
val, scale.) This article will examine the topic of 
subjective probabilities at length later. 

In the SEU model every equation like equation 
(2) has at least two unknowns. A single equation 
with two unknowns is ordinarily insoluble. All 
subsequent work on utility and probability measure- 
ment has been addressed in one way or another to 
solution of this problem of too many unknowns. 
Edwards (1953; 1954a; 1954b) gave impetus to 
further analysis of the problem by exhibiting sets 
of preferences among bets that could not be easily 
accounted for by any plausible utility function for 
money but that seemed to require instead the no- 
tion that subjects simply prefer to gamble at some 
probabilities rather than others, and indeed will 
accept less favorable bets embodying preferred 
probabilities over more favorable bets embodying 
less preferred probabilities. 

The Davidson, Suppes, and Siegel experiment. 
The next major experiment in the utility measure- 
ment literature was performed by Davidson, Sup- 
pes, and Siegel (see Davidson & Suppes 1957), 
two philosophers and a graduate student in psy- 
chology. The key idea in their experiment—the 
pair of subjectively equally likely events—was 
taken from a neglected paper by the philosopher 
Ramsey (1926). Suppose that you find yourself 
committed to a bet in which you stand to win some 
large sum if event A occurs and to lose some other 
large sum if it does not occur. Now suppose it iS 
found that by paying you a penny I could induce 
you to substitute for the original bet another one 
exactly like it except that now you are betting on 
A not occurring. Now, after the substitution, sup- 
pose that I could induce you to switch back to the 
original bet by offering you yet another penny. 
Clearly, for you, the probability that A will occur 
is equal to the probability that it will not, that is, 
the two events are subjectively equally likely. 

If we assume that either A or not-A must hap 
pen, and if we assume that for you (as for any 
probability theorist) the probabilities of any set of 
events no more than one of which can happen an 
some one of which must happen—that is, an aS 
haustive set of mutually exclusive events—add up 
to 1, then the sum of the probabilities of A and 
not-A must be 1. Now, if those two numbers are 
equal, then each must be 0.5. b 

Davidson, Suppes, and Siegel hunted for SU 
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jectively equally likely events, and finally used a 
die with one nonsense syllable (e.g., ZEJ) on three 
of its faces and another (ZOJ) on the other three. 
(They were very lucky that the same event turned 
out equally likely for all subjects.) They fixed two 
amounts of money, —4 cents and 6 cents. They 
found an amount of money, X cents, such that the 
subject was indifferent to receiving 6 cents if, say, 
ZO] occurred and X cents if ZEJ occurred, and 
receiving —4 cents for sure. It follows from the 
SEU model that 


p(ZOJ) u(6¢) + p(ZEJ) u(X¢) = u(—4¢). 
A little algebra shows that 
u(6¢) — u(—4¢) = u(—4¢) — u(X¢). 


That is, the distance on the utility-of-money scale 
from 6 cents to —4 cents is equal to the distance 
from —4 cents to X cents (of course X cents is a 
larger loss than —4 cents). Once two equal inter- 
vals on the utility scale have been determined, it is 
no longer necessary to use a sure thing as one of 
the options. If A, B, C, and D are decreasing 
amounts of money, if the distance from B to Cis 
equal to the distance from C to D, and if the sub- 
ject is indifferent between a subjectively equally 
likely bet in which he wins A for ZOJ and D other- 
wise, and another in which he wins B for ZOJ and 
C otherwise, then the distance from A to B is equal 
to the other two distances. Davidson, Suppes, and 
Siegel used this procedure to construct a set of 
equally spaced points on their subjects’ utility-for- 
money functions. Thereafter, they used the result- 
ing utility functions as a basis for measuring the 
probability that one face of a symmetrical four- 
faced die would come up. (If the four faces were 
considered equally likely and if their probabilities 
added to 1, then that probability would be 0.25.) 

The most important finding of the Davidson, 
Suppes, and Siegel experiment was that a good 
many internal consistency checks on their utility 
functions worked out well, Once a number of equal 
intervals on the utility function have been deter- 
mined, many predictions can be made about pref- 
erences among new bets; some of these were tested, 
and in general they were successful. The utility 
functions were typically more complicated than 
those of the Mosteller-Nogee experiment and dif- 
fered from subject to subject; they were seldom 
linearly related to money. The subjective probability 
of the face of the four-faced die was typically 
found to be in the region of 0.20. 

The Davidson—Suppes—Siegel procedure remains 
intellectually valid, though criticisms of details are 


easy to make. However, it is unlikely that future 
utility measurement experiments will use it. The 
prior determination of the subjectively equally 
likely event is less attractive than procedures now 
available for determination of both utilities and 
probabilities from the same set of choices. 

Among a substantial set of utility-measurement 
experiments, only two others are reviewed here. 
They both embody sophisticated ideas taken from 
recent developments in measurement theory, and 
they both use what amounts to a simultaneous- 
equations approach to the solution of a system of 
equations like equation (2) for utilities and proba- 
bilities, treating both as unknowns. 

The Lindman experiment. Harold Lindman 
(1965) began by giving a subject a two-outcome 
bet of the form that if a spinner stopped in a speci- 
fied region, the subject would win $X, while if it 
did not, he would win $Y. Then he invited the sub- 
ject to state the minimum price for which he would 
sell the bet back to the experimenter. After the sub- 
ject had stated that amount, $Z, Lindman operated 
a random device that specified an experimenter’s 
price. If the experimenter’s price was at least as 
favorable to the subject as the subject's price, then 
the sale took place, the experimenter paid his price 
to the subject, and the bet was not played. Other- 
wise, the sale did not take place, and the subject 
played the bet. Since the sale, if it took place, 
always took place at the experimenter’s price, it 
was to the subject’s advantage to name the actual 
minimum amount of money that he considered 
just as valuable as the bet. Thus Lindman could 
write 

pu($X) + (1— p)u($Y) = u($Z). 


This equation has at least four unknowns, three 
utilities and a probability. If we question whether 
subjects unsophisticated about probability theory 
make their probabilities add up to 1, it may have 
five unknowns, since then both the probability of 
the event and the probability of its complement 
can be treated as unknowns. Thus, in a system of 
such equations there will always be many more 
unknowns than equations, even though the same 
probabilities and amounts of money are used in 
many different bets. However, the system can be 
rendered soluble, and even overdetermined, by tak- 
ing advantage of the fact that if $Z is between $X 
and $Y (as it will be in the example), then u($Z) 
will be between u($X) and u($Y). In more formal 
language, the relation between u($X) and $X is 
monotonic, Lindman exploited this fact by fitting 
a series of line segments to his utility functions; 
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by controlling the number of line segments, he 
controlled both the number of unknowns and the 
amount of curvilinearity (more precisely, changes 
in slope) he could introduce into the utility func- 
tion. 

Lindman’s results are complex, orderly, and 
pretty. He obtained a variety of shapes of utility 
functions for money from his different subjects. 
When he analyzed the data without assuming addi- 
tivity of probabilities, he found that the actual 
sums were very close to 1; the data strongly sup- 
port the idea that his subjects do in fact make 
probability judgments that add to 1. The proba- 
bility and utility functions that were found pre- 
dicted choices among bets very well indeed. There 
were some interactions between probabilities and 
utilities of a kind not appropriate to the EU model, 
but they were not major ones. 

The Tversky experiment. Deviating from the 
nearly universal use of college student subjects, 
Amos Tversky used prisoners at Jackson Prison, 
the largest state prison in Michigan, as subjects 
(1964). He used cigarettes, candy bars, and 
money, rather than money alone, as the valu- 
able objects whose utilities were to be measured. 
Tversky’s research was based on an application of 
simultaneous conjoint measurement, a new ap- 
proach to fundamental measurement which em- 
phasizes the idea of additive structures. Consider a 
bet in which with probability p you win $X and 
with probability 1 —p no money changes hands. 
Suppose such a bet is worth just $Z to you. A 
matrix with different values of p on one axis and 
different values of $X on the other defines a family 
of such bets. If the utility of no money changing 
hands is taken to be 0, then the EU model can be 
written for any such bet in logarithmic form as 
follows: 


logp; + logu($X;;) = logu($Z,;). 


That is, in logarithmic form this is an additive 
model. If all the values of $Z;; for such a matrix 
of bets are known, then the rules for additive rep- 
resentations of two-dimensional matrices permit 
solution (by complex computer methods) of a sys- 
tem of inequalities that give close values of p; and 
u($X;;). Tversky did just this, both for gambles 
and for commodity bundles consisting of so many 
packs of cigarettes and so many candy bars. 

The main finding of Tversky’s study was a con- 
sistent discrepancy between behavior under risky 
and riskless conditions. If probabilities are not 
forced to add to 1 in the data analysis, then the 
form that this discrepancy takes is that the proba- 
bilities of winning are consistently overestimated 


relative to the probabilities of losing. If probabili- 
ties are forced to add to 1, then the utilities meas- 
ured under risky conditions are consistently higher 
than those measured under riskless conditions, The 
latter finding would normally be interpreted as re- 
flecting the attractiveness or utility of gambling as 
an activity, independently of the attractiveness of 
the stakes and prizes. The discrepancy between 
Tversky’s finding and Lindman’s remains unex- 
plained. 

Summary. In all of these studies, the choices 
among bets made under well-defined experimental 
conditions turn out to be linked via some form or 
other of the SEU model to choices among other 
bets made by the same subjects under more or less 
the same conditions. That is, the SEU model per- 
mits observation of coherence among aspects of 
subjects’ gambling behavior. Of course it would be 
attractive to find that such coherence would hold 
over a larger range of risk-taking activity, A sub- 
stantial disappointment of these studies is that the 
individual utility and probability functions vary so 
much from one person to another. It would be 
scientifically convenient if different people had the 
same tastes—but experience offers no reason for 
hoping that they will, and the data clearly say that 
they do not. 

At any rate, these studies offer no support for 
those who reject a priori the idea that men make 
“rational” decisions. An a priori model of such 
decision making turns out to predict very well the 
behavior of a variety of subjects in a variety of 
experiments. Nor is this finding surprising. A very 
general and intuitively appealing model of almost 
any kind of human behavior is contained in the 
following dialogue: 


Question: What is he doing? 
Answer: He's doing the best he can. 


Probability 


Identification rules for probability. The previ- 
ous discussion has probably been somewhat con- 
fusing to many readers not familiar with what is 
now going on in statistical theory. It has consist- 
ently treated probability as a quantity to be inferred 
from the behavior of subjects, rather than calcu- 
lated from such observations as the ratio of heads 
to total flips of a coin. It is intuitively reasonable 
to think that men make probability judgments just 
as they make value judgments and that these 
judgments can be discovered from men’s behavior. 
But how might such subjective probabilities relate 
to the more familiar quantities that we estimate by 
means of relative frequencies? 
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This question is a controversial one in contem- 
porary statistics. Considered as a mathematical 
quantity, a probability is a value that obeys three 
quite simple rules: it remains between 0 and 1; 
0 means impossibility and 1 means certainty; and 
the probability of an event's taking place plus the 
probability of its not taking place add up to 1. 
These three properties are basic to all of the elabo- 
rate formal structure of probability theory con- 
sidered as a topic in mathematics, Nor are they, or 
their consequences, at all controversial. What are 
controversial are the identification rules linking 
these abstract numbers with observations made or 
makable in the real world. The usual relative- 
frequency rules suffer from a number of intellec- 
tual difficulties. They require an act of faith that a 
sequence of relative frequencies will in fact ap- 
proach a limit as the number of observations in- 
creases without limit. They are very vague and 
subjective while pretending to be otherwise; this 
fact is most conspicuous in the specification that 
relative frequencies are supposed to be observed 
under “substantially similar conditions,” which 
means that the conditions should be similar enough 
but not too similar. (A coin always tossed in exactly 
the same way would presumably fall with the same 
face up every time.) Perhaps most important, the 
frequentistic set of rules is just not applicable to 
many, perhaps most, of the questions about which 
men might be uncertain. What is the probability 
that your son will be a straight-A student in his 
senior year in high school? While an estimate 
might be made by counting the fraction of senior 
boys in the high school he is likely to attend who 
have straight-A records, a much better estimate 
would be based primarily on his own personal char- 
acteristics, past grade record, family background, 
and the like. 

The Bayesian approach. Dissatisfaction with 
the frequentistic set of identification rules, for 
these and other more technical reasons, has caused 
a set of statisticians, probability theorists, and 
philosophers led by Leonard J. Savage, author of 
The Foundations of Statistics (1954), to adopt a 
different, personalistic, set of such rules. According 
to the personalistic set of identification rules, a 
probability is an opinion about how likely an event 
is. If it is an opinion, it must be someone’s opinion; 
I will remind you of this from time to time by ref- 
erence to your probability for something, and by 
calling such probabilities personal, Not any old 
opinion can be a probability; probabilities are 
orderly, which mostly means that they add up to 1. 
This requirement of orderliness is extremely con- 
straining, so much so that no real man is likely 


to be able to conform to it in his spontaneous 
opinions. Thus the “you” whose opinions I shall re- 
fer to is a slightly idealized you, the person you 
would presumably like to be rather than the person 
you are. 

Those who use the personalistic identification 
rules for probabilities are usually called Bayesians, 
for the rather unsatisfactory reason that they make 
heavier use of a mathematical triviality named 
Bayes’ theorem than do nonpersonalists. Bayes’ 
theorem, an elementary consequence of the fact 
that probabilities add up to 1, is important to 
Bayesians because it is a formally optimal rule for 
revising opinions on the basis of evidence. Con- 
sider some hypothesis H. Your opinion that it is 
true at some given time is expressed by a number 
p(H), called the prior probability of H. Now you 
observe some datum D, with unconditional proba- 
bility p(D) and with conditional probability p( D|H) 
of occurring if H is true. After that, your former 
opinion about H, p(H), is revised into a new 
opinion about H, p(H|D), called the posterior 
probability of H on the basis of D. Bayes’ theorem 
says these quantities are related by the equation 


_ P(D|H)p(H), 
(3) p(H|D) =D 


An especially useful form of Bayes’ theorem is ob- 
tained by writing it for two different hypotheses, 
H, and Hs, on the basis of the same datum D, and 
then dividing one equation by the other: 


— P(DIH4) 


p(H,|D) = pD) p(H4), 
P(Hy|D) = POT pHs), 


p(H,\D) _ p(D\Ha) p(H4), 
PCH»|D) ~~ p(D\Hn) PCAn) 


or, in simpler notation, 
(4) QELA. 


In equation (4), 9,, the ratio of p(H4) to P(Hn), 
is called the prior odds, ©, is called the posterior 
odds, and L is the likelihood ratio, Equation (4) is 
perhaps the most widely useful form of Bayes’ 
theorem. 

Conservatism in information processing. Bayes’ 
theorem is of importance to psychologists as well 
as to statisticians. Psychologists are very much 
interested in the revision of opinion in the light of 
information. If equation (3) or equation (4) is an 
optimal rule for how such revisions should be 
made, it is appropriate to compare its prescriptions 
with actual human behavior. 
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Consider a very simple experiment by Phillips 
and Edwards (1966), Subjects were presented 
with a bookbag full of poker chips. They were told 
that it had been chosen at random, with 0.5 proba- 
bility, from two bookbags, one containing 700 red 
and 300 blue chips, while the other contained 700 
blue and 300 red. The question of interest is which 
bag this one is. Subjects were to answer the ques- 
tion by estimating the probability that this was the 
predominantly red bookbag. On the basis of the 
information so far available, that probability is 
0.5, as all subjects agreed. 

Now, the experimenter samples randomly with 
replacement from the bookbag, At this point let me 
invite you to be a subject in this experiment. Sup- 
pose that in 12 samples, with replacement, the ex- 
perimenter gets red, red, red, blue, red, blue, red, 
blue, red, red, blue, red—that is, 8 reds and 4 
blues. Now, on the basis of all the evidence you 
have, what is the probability that this is the pre- 
dominantly red bookbag? Write down an intuitive 
guess before starting to read the next paragraph. 

Let us apply equation (4) to the problem, The 
prior odds are 1:1, so all we need is the likelihood 
ratio. To derive that is straightforward; for the gen- 
eral binomial case of r reds in n samples, where 
H, says that the probability of a red is p, (and of 
a blue is q,) and Hy says the probability of a red is 
Px (and of a blue is gn), 


ian 
Pia's 
In this particular case, made much simpler by the 
fact that pa = qn and vice versa, it is simply 


=)" 


Of course 2r — n =r — (n —r) and is the differ- 
ence between the number of reds and the number 
of blues in the sample—in this case, 4. So the 
likelihood ratio is (7/3)' = 29.64. Since the prior 
odds are 1:1, the posterior odds are then 29.64: 1, 
And so the posterior probability that this is the 
predominantly red bookbag is 0.97. 

If you are like Phillips and Edwards’ subjects, 
the number you wrote down wasn’t nearly so high 
as 0.97. It was probably about 0.70 or 0.80, Phillips 
and Edwards’ subjects, and indeed all subjects who 
have been studied in experiments of this general 
variety, are conservative information processors, 
unable to extract from data anything like as much 
certainty as the data justify, A variety of experi- 
ments has been devoted to this conservatism phe- 
nomenon. It is a function of response mode; Phil- 


lips and Edwards have shown that people are a bit 
less conservative when estimating odds than when 
estimating probability. It is a function of the diag- 
nosticity of the information; Peterson, Schneider, 
and Miller (1965) have shown that the larger the 
number of poker chips presented at one time, the 
greater is the conservatism, and a number of 
studies by various investigators have shown that 
the more diagnostic each individual poker chip, the 
greater the conservatism. 

Conservatism could be attributed to either or 
both of two possible failures in the course of 
human information processing. First, the subjects 
might be unable to perceive the data-generating 
process accurately; they might attribute to data 
less diagnostic value than the data in fact have. 
Or the subjects might be unable to combine in- 
formation properly, as Bayes’ theorem prescribes, 
even though they may perceive the diagnostic value 
of any individual datum correctly, Data collected 
by Beach (1966) favor the former hypothesis; data 
collected by Edwards (1966) and by Phillips 
(1965) favor the latter. It seems clear by now that 
this formulation of the possible causes of con- 
servatism is too simple to account for the known 
facts, but no one has yet proposed a better one. 

Probabilistic information processing. If men 
are conservative information processors, then it 
seems reasonable to expect that this fact has prac- 
tical consequences. One practical consequence is 
familiar to anyone who has ever grumbled over a 
hospital bill: human conservatism in processing 
available diagnostic information may lead to col- 
lection of too much information, where the collec- 
tion process is costly. An even more serious con- 
sequence may arise in situations in which speed 
of a response is crucial: a conservative information 
processor may wait too long to respond because he 
is too uncertain what response is appropriate. This 
consequence of conservatism is especially impor 
tant in the design of large military information- 
processing systems. In the North American Air 
Defense System, for example, the speed with which 
the system declares we are under attack, if we are, 
may make a difference of millions of lives. 

Edwards (1962; 1965b) has proposed a design 
for diagnostic systems that overcomes the defi- 
ciency of human conservatism. A probabilistic m 
formation processing system (PIP) is designed in 
terms of Bayes’ theorem, For vague, verbal data 
and vague, verbal hypotheses, experts must esti- 
mate p(D|H) or L as appropriate (usually L rather 
than p(D|H) will be appropriate). They make 
these estimates separately for each datum and ea 
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hypothesis or pair of hypotheses of interest to the 
system. Then a computer uses equation (3) or 
equation (4) to synthesize these separate judg- 
ments into a posterior distribution that reflects how 
all the hypotheses stand in the light of all the data. 
This distribution is of course revised each time a 
new datum becomes available. 

A number of studies of PIP have been performed 
(see, e.g., Schum, Goldstein, & Southard 1966; 
Kaplan & Newman 1966; Edwards 1966). The 
studies have generally found it more efficient than 
competitive systems faced with the same informa- 
tion. In a large unpublished simulation study, 
Edwards and his associates found that data that 
would lead PIP to give 99:1 odds in favor of some 
hypothesis would lead its next-best competitor to 
give less than 5:1 odds in favor of that hypothesis. 
Even larger discrepancies in favor of PIP appear in 
Phillips’ study. Thus a combination of human 
judgment and Bayes’ theorem seems to be capable 
of doing a better job of information processing 
than either alone. 


Are men rational? 


All in all, the evidence favors rationality. Men 
seem to be able to maximize expected utility rather 
well, in a too-restricted range of laboratory tasks. 
There are, of course, a number of well-known 
counterexamples to the idea that men consistently 
do what is best for them. More detailed analysis of 
such experiments (e.g., probability learning ex- 
periments) indicates that substantial deviations 
from rationality seldom occur unless they cost 
little; when a lot is at stake and the task isn’t too 
complex for comprehension, men behave in such a 
way as to maximize expected utility. 

The comparison of men with Bayes’ theorem is 
less favorable to men, The conservatism phenom- 
enon is a large, consistent deviation from optimal 
information-processing performance. Nevertheless, 
it is surprising to those newly looking at this area 
that men can do as well as they do at probability 
or odds estimation. 

The topic of the articulation between diagnosis 
and action selection will receive much more study 
in the next few years than it has so far. What little 
evidence is available suggests that men do remark- 
ably well at doing what they should, given the 
information on hand. But the surface of this topic 
has scarcely been scratched. 

Total rejection of the notion that men behave 
rationally is as inappropriate as total acceptance 
would be, On the whole, men do well; exactly how 
well they do, depends in detail on the situation they 


are in, what’s at stake, how much information 
they have, and so on. The main thrust of psycho- 
logical theory in this area is likely to be a detailed 
spelling out of just how nearly rational men can be 
expected to be under given circumstances. 


WARD EDWARDS 
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n 
ECONOMIC ASPECTS 


The distinction between prescriptive and descrip- 
tive theories of decision is similar to that between 
logic—a system of formally consistent rules of 
thought—and the psychology of thinking. A logical 
tule prescribes, for example, that if you believe all 
X to be Y, you should also believe that all non-Y 
are non-X, but not that all Y are X; and if you be- 
lieve, in addition, that all Y are Z, you should 
believe all X to be Z (a logical rule known as 
transitivity of inclusion), Here, X, Y, and Z are 
objects or propositions. Such prescriptive rules do 
Not state that all people of a given culture, social 
Position, age, and so forth, always comply with 


thern. If, for example, the links in a chain of rea- 
soning are numerous, the rule of transitivity will 
probably be broken, at least by children or by un- 
schooled or impatient people. 

Descriptions, and consequently, predictions, of 
“illogical” behavior, as indeed of all human be- 
havior, are presumably a task for psychologists or 
anthropologists. Such predictions are obviously im- 
portant to practicing lawyers, politicians, salesmen, 
organizers, teachers, and others who work with 
people, just as predictions of the behavior of metals 
and animals are important to engineers and dairy 
farmers. These practitioners are well advised to 
know the frequencies of the various types of il- 
logical behavior of their clients, adversaries, or 
students. But they are also well advised to avoid 
logical errors in their own behavior, as when they 
apply their knowledge of men (and of all nature, 
for that matter) to win lawsuits or elections or to 
be successful in organizing or teaching. For ex- 
ample, their propositions X, Y, and Z may be about 
another's madness, yet should obey the rule of 
transitivity of inclusion. Moreover, if you want to 
train future lawyers, statesmen, or businessmen, 
you are well advised to learn, from your own or 
other people’s experiences, the techniques needed 
to make your pupil not only knowledgeable about 
other men’s logical frailties but also strong in his 
own thinking and able to solve the brain twisters 
his future will offer. These pedagogical techniques 
are, of course, objects of descriptive study. 

To illustrate the prescriptive—descriptive distinc- 
tion in the domain of decisions, let us consider one 
of the rules proposed by prescriptive decision 
theory. We choose the rule called transitivity of 
preferences (other proposed rules will be discussed 
below) because of its special similarity with transi- 
tivity of inclusion. If you prefer a to b and b to c, 
you should prefer a to c. Here, a, b, and c are, in 
general, your actions with uncertain outcomes, al- 
though in the special case of “sure actions,” the out- 
come, e.g., gaining or losing an object, is unique, 
in which case preferring action a is, in effect, the 
same as preferring the outcome of action a. If an 
individual disobeys the transitivity rule and pre- 
fers a to b, b to c, and c to a, we would say that 
he does not know what he wants. Surely we would 
advise or train a practitioner to develop the ability 
for concentrated deliberation, weighing advantages 
and disadvantages in some appropriate way. This 
should result in a consistent ranking, from the 
most to the least preferable, with ties (i.e., indif- 
ference between some alternatives) not excluded, 
of alternative actions from which the decision 
maker may be forced to choose. 
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The statement that “given a set of feasible alter- 
natives, the ‘reasonable’ (rational) man should 
choose the best” and the parallel statement that 
“an actual man does choose the best (and if the 
best is not feasible, the second best, etc. )” are both 
empty if the ranking order is not supposed to be 
stable over a period long enough to make the state- 
ments of practical relevance. If the ranking order 
is stable, economics as the study of the best alloca- 
tion of available resources becomes possible. 

The boundary between descriptive and pre- 
scriptive economics did not worry early writers. 
Gresham’s law, Ricardo’s explanations of rent and 
of the high postwar price of bullion, and Böhm- 
Bawerk’s theory of interest are deductions from the 
assumption of consistent ranking of alternatives 
by reasonable men. But those authors also prac- 
ticed induction and looked for historical facts that 
confirmed their deductions. The two approaches 
are not inconsistent if one assumes that in the 
cultures considered, housewives and adult, gain- 
fully occupied men have by and large been “rea- 
sonable,” at least when deciding not on wars and 
divorces but on matters of major interest to the 
economists of the time—for example, quantities 
such as prices, demands, and outputs, as well as 
less quantifiable choices such as location and in- 
ternal organization (division of labor) of a plant. 
These matters were studied under the assumption 
of a tolerably well-known future. 

The descriptive and prescriptive approaches be- 
came more clearly distinguished as more attention 
was directed to economic decisions under uncer- 
tainty (Fisher 1906; Hicks 1939; Hart 1940) and, 
stimulated by the Theory of Games and Economic 
Behavior (von Neumann & Morgenstern 1944), to 
nonquantifiable decisions traditionally assigned to 
political and military science or to sociology. In 
search of tools for their descriptive work, econo- 
mists were also influenced by modern statistical 
decision theory. [See DECISION THEORY.] It then 
appeared that the reasonable man was not suffi- 
ciently approached even by the ideal type of an in- 
dustrial entrepreneur, who evidently must be aided 
by a sophisticated hypothesis tester, a statistician, 
or, more recently and generally, an expert in opera- 
tions research. [See OPERATIONS RESEARCH.] AS 
such experts, armed with computers, do indeed 
progressively penetrate the economic world, pre- 
dictive and descriptive economics (e.g., the predic- 
tion of aggregate inventories) may again become 
closer to what can be deductively derived from ra- 
tional decision rules. 

To return to the analogy between theories of 
thinking and of decision, prescriptive theories of 
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thinking and decision are concerned with formal 
consistency between sentences or decisions, not 
with their content. If one believes that Los Angeles 
is on the Moon and that the Moon is in Africa, it 
is logical for him to believe that Los Angeles is in 
Africa, although none of the three statements is 
true. Similarly, if a man prefers killing to stealing 
and stealing to drinking strong liquor, he should 
prefer killing to drinking, although none of these 
actions is laudable or hygienic. Hicks (1939) 
used Milton's “reason is also choice” as a motto to 
the theory of consumption, Ramsey (1923-1928) 
called his theory of decision an extension of logic, 
an attitude adopted more recently by other logi- 
cians, for example, Carnap (1962), von Wright 
(1963), and Jeffrey (1965). The mutual penetra- 
tion of logic and decision theory will appear even 
more founded and urgent when we come to discuss 
the cost of thinking and deciding (section 8). 

In sections 1, 3, 4, and 5 we shall discuss some 
prescriptive decision rules and outline some simple 
experiments whose outcomes determine whether 
the subject did or did not comply with these rules. 
Whenever such experiments or similar ones have 
actually been performed, we shall refer to available 
descriptive evidence, namely, the frequency of sub- 
jects’ obeying these rules or obeying some other 
stated behavior patterns, such as the probabilistic 
decision models of section 2. In section 6 the ex- 
pected utility theorem implied by the prescriptive 
rules is discussed. Sections 7 and 8 point to un- 
solved problems on the frontier of decision theory. 


1. Complete ordering of actions 


Before we consider the notion of a complete 
ordering of alternative actions or decisions (we 
need not distinguish between the two here), it is 
convenient to restate the transitivity of preferences 
by defining the relation “not preferred to” or “not 
better than,” written symbolically as <. If a<b 
and b < c, then a < c. It is a reflexive relation, i.e., 
aSa. A relation that is transitive and reflexive is 
called an ordering relation. When a < b and b < a, 
we write a ~b and say that the decision maker is 
indifferent between a and b, even though a and b 
are not identical. For example, an investor may be 
indifferent between two different portfolios of se- 
curities. Thus “<” (unlike the relation “not larger 
than” applied to numbers) is a weak ordering re- 
lation. Further, when a < b and not b < a, we write 
a<b and say that b is preferred to, or is better 
than, a. 

Shall we say that a reasonable man always 
either prefers b to a or a to b or is indifferent? Or 
can he, in addition, refuse to compare them? If 


he can, the ordering of the actions by the relation 
“not better than” is only a partial one. This posi- 
tion has been taken, for example, by Aumann 
(1962), Chipman (in Interdisciplinary Research 
Conference 1960), and Wolfowitz (1962 ). But can 
one escape choosing? As in Pascal's (1670) fa- 
mous immortality bet, “Il faut parier!” The avoiding 
of choice itself is a decision. Like other decisions, 
it is made more or less desirable by the prizes and 
penalties it entails. If this is granted, the transitive 
relation “<” is also connective and induces a com- 
plete ordering. Then any subset of actions can be 
arranged according to their desirabilities (with ties 
not excluded). Rank numbers, also called ordinal 
utilities (priorities), can then be assigned to the 
alternative actions provided that certain weak con- 
ditions are satisfied (Debreu 1959, section 4, note 
2, and section 4,6), 

Tests of transitivity of preferences. As a matter 
of descriptive theory we ask whether people (of a 
given culture, etc.) do act as if there exist com- 
plete orderings of their actions by preferences. In 
actual experiments the penalty for refusing to 
choose was assumed to be strong enough to force 
subjects to make genuine choices in the form of 
verbal statements of preferences or, in some cases, 
actually to stake money on one wager rather than 
another. With connectivity of the preference rela- 
tion thus assured and its reflexivity assumed by 
definition, the transitivity of the relation remained 
to be tested. The alternative actions were repre- 
sented by multidimensional objects—a bundle of 
tickets to various shows (Papandreou 1957), mar- 
riage partners classified by three three-valued char- 
acteristics (May 1954; Davis 1958), a monetary 
wager (Edwards 1953; Davidson & Marschak 
1959), a price policy affecting both the firm’s rate 
of return and its share of the market, an air trip 
varying in both cost and duration (MacCrimmon 
1965), and so forth. The subject responded by 4 
triad of binary choices—a or b? b or c? c or a?— 
but in most experiments these choices were sepa- 
rated in time by choices from other triads, that is, 
“a or b?” was followed by “a’ or b’?” rather than by 
“b or c?” Moreover, neither of the two actions 
“dominated” the other (see section 3), and no sub- 
ject could use paper and pencil. 

In all experiments, the transitivity rule was 
violated with varying frequencies. The proportion 
of the number of violations by all subjects to the 
maximum theoretical number of intransitive triads 
that were possible in the given experimental de- 
signs ranged from .04 in MacCrimmon’s tests 0 
business executives to .27 in May's tests of students. 
Since the experiments record actual choices an 


not preferences, intransitivity might be exhibited 
by a person who, although consistent, is strictly in- 
different among three alternatives. (To approach, 
more generally, the case of “near indifference,” the 
hypothesis tested must be rephrased in probabilistic 
terms. This will be discussed in section 2.) Thus 
the outcome of an experiment depends on the 
nature of the alternatives offered, making the ex- 
periments mentioned not quite comparable. 

It would also be of great interest to study the 
effects of the passage of time and the effects of 
learning, especially if different methods of training 
are applied, such as “post-mortem” discussion (a 
domain opened by MacCrimmon), sequential modi- 
fications of the list of choices, etc. In fact, what 
would be the effect of supplying the subjects with 
paper and pencil and training them to use these 
prerequisites of our culture to tabulate the decision 
problem in an “outcome matrix” as in Table 1 
below? 

It has been suggested (Quandt 1956; Simon 
1957) that intransitivities of decisions occur when 
the subject is unable to pay simultaneous attention 
to the several dimensions of each object of choice. 
This is somewhat analogous to the psychophysical 
finding that discrimination is greater between 
sounds differing in pitch but of equal loudness than 
between sounds differing in both (Pollack 1955). 
This suggests that the probabilistic approach com- 
mon in psychophysics might be tried in a descrip- 
tion of decision behavior. As to prescriptive theory, 
one may recall Benjamin Franklin's suggestion to 
score the pros and cons of each decision or the 
practice of adding scores when grading students 
or judging young ladies in beauty contests. Again, 
these or any other ways of overcoming the multi- 
dimensionality of decisions through deliberate and 
simultaneous marshaling of all the dimensions will 
in general require paper and pencil. 


2. Probabilistic ordering of actions 


Since the prescriptive rule of transitivity is often 
violated, it is necessary to recast descriptive de- 
cision theory in different terms. The hypothesis 
that transitivity of decisions may be achieved by 
learning or training has already been mentioned. 
It is also traditional in experimental psychology, 
especially since Fechner's (1860) studies of per- 
ception, to describe behavior by probability distribu- 
tions, Economists, by contrast, have succeeded in 
the past in making useful predictions of the ag- 
gregate behavior of large numbers of individuals 
based on the assumption that individuals are, by 
and large, “reasonable.” The probabilistic approach 
is rather new in economics, perhaps as young as 
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the econometricians’ attempts to test aggregate 
economic models statistically and to relate these 
models to observable behavior of sampled individual 
households and firms. 

Luce and Suppes (1965) have comprehensively 
surveyed the literature and the experimental evi- 
dence relevant to probabilistic orderings. One model 
that implies all the others suggested so far is that 
presented by Luce (1959). Fixed positive numbers 
Va, Up, Vey +*+, Called strict utilities, are attached to 
the alternative actions a, b,c, +++. The set of ac- 
tions includes all alternatives that may ever be 
offered to a given subject. The strict utilities are 
assumed to have the following property: if the sub- 
ject must choose from a particular subset of alter- 
natives, he will choose each with a probability pro- 
portional to its strict utility. For example, suppose 
ViVi VU = 1:2:3:4. Then, if asked to choose 
from the subset (a,b,c), the subject will choose 
a, b, or c with respective probabilities 4%, %, %; if 
he must choose from the pair (a, b), he will choose 
a or b with respective probabilities 14, 34. The hy- 
pothesis can be tested at least “in principle” (i.e., 
assuming that representative samples from the 
universe of all choice situations can be obtained). 
One would observe the relative frequencies of 
choices from various subsets and infer, for exam- 
ple, that the ratio of the probability of choosing a 
to the probability of choosing b from the pair (a, b) 
does or does not change if a third alternative is 
also offered. 

In another model, constants Ua, Uy, Uc, ***> 
called strong utilities, are associated with the alter- 
natives a, b, c, -+ . The strong utilities are assumed 
to have the following property: in the binary choice 
between a and b, the probability p(a, b) of choos- 
ing a is an increasing function of the difference 
Ua — u, (except when p(a, b) is equal to 1 or 0). 
It follows that p(a,b) 24 according as taZ t. 
Note that the case p(a, b) = } provides an opera- 
tional definition of “indifference” based on actual 
choices rather than on the verbal statement “I am 
indifferent.” This model was used, in effect, in 
Thurstone’s discussion of public opinion and of 
soldiers’ food preferences (1927-1955). It is analo- 
gous to Fechner’s perception model, which associ- 
ates a physical stimulus a with a subjective “sensa- 
tion” u,; the subject perceives a to be heavier (or 
louder or brighter) than b with a probability 
p(a, b) that increases with the difference Ua — Uy 
(provided p(a, b) is neither 0 nor 1). 

It can be shown that if strict utilities exist, then 
strong utilities also exist (putting uw. = log va» etc. ), 
but not the converse. The strong utility model im- 
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plies, in turn, the still less restrictive weak utility 
model, which assigns the ordinal numbers Wwa, ws, 
We, +++ to the alternatives a,b,c, :--, such that 


p(a, b)ž4 according as w, > wy. Clearly, if the 


Strong utilities exist, so do the weak utilities (put- 
ting w, = u,, etc.), but not the converse since dif- 
ferences between ordinal numbers are undefined. 
Both the weak utility model and the strong utility 
model can be tested in principle for possible rejec- 
tion, for they have implications that involve choice 
probabilities only, avoiding the intervening utility 
concept. In particular, the models imply, respec- 
tively, the weak stochastic transitivity, 


if p(a, b) > 4 and p(b,c) > 4, 
then p(a, c) > 4, 


and the strong stochastic transitivity, 


if p(a, b) > 4 and p(b, c) > 4, 
then p(a, c) > max [p(a, b), p(b, c)). 


Of these two forms of stochastic transitivity, the 
strong implies the weak one; it is, in turn, implied 
by the exact transitivity rule treated in section 1 if 
the choice probabilities are assumed to take on only 
three values: 0, 4, and 1, corresponding to the pref- 
erence relations a < b, a ~ b, and b <a. 

Another probabilistic model also implied by, and 
less restrictive than, Luce’s strict utility model is the 
random utility model. It does not imply, nor is it 
implied by, the strong or the weak utility model. 
It asserts, for a given subject, the existence of a 
fixed probability distribution on the set of all pref- 
erence orderings of all alternatives, so that for a 
set of N alternatives, a probability is attached to 
each of their N! permutations. In the nonprobabilis- 
tic economics of consumer’s choice the (ordinal) 
utility function that attaches a constant rank to 
each commodity bundle is a fixed one. In the ran- 
dom utility model this function is, instead, visual- 
ized as drawn at random from a set of such 
functions, according to a probability distribution 
that is fixed for the given subject. Thus the word 
“utility” designates here a random number, not 
a constant as in the other probabilistic models 
described. 

What relevance do these (and related) models 
have for a prescriptive theory of decision? Just as 
not all men are always consistent, not all men are 
always in good health. Doctors and nurses are busy 
measuring the temperatures and blood pressures 
of the sick ones, The probability distributions sup- 
posed to characterize a “stochastic decision maker” 
may vary in the degree of their closeness to the 
ideal limit—consistency. For example, consistency 


is approached in the strong utility model as the 
values of p(a, b) concentrate near 1 or 0 or 4. We 
can thus trace the path of progress of learning or 
training for consistency. 


3. Inadmissibility of dominated actions 


An action’s result (outcome) depends, in gen- 
eral, not only on the action but also on the “state 
of the world (nature or environment),” which is 
not in the decision maker's control. For our pur- 
poses an action (called an act by Savage [1954]) 
can be defined by the results it yields at various 
states of the world. Thus an action is a function 
from the set of states to the set of results. (This 
function has constant value if the action is a “sure 
action.”) In a decision situation, only some such 
functions are available; their “feasible set” (the 
“offered set of alternatives” of section 2) depends, 
for example, on the decision maker's resources, 
technology, and market as he views them, 

An event is a subset of the states of nature. In 
particular, the subset consisting of all states that, 
for any given feasible action, result in the same 
outcome is called an outcome-relevant event. In a 
decision situation the set of states is partitioned 
into such events, with all irrelevant details omitted. 

Even if the well-disciplined decision maker can 
consistently rank multidimensional results accord- 
ing to his preferences, action under uncertainty 
remains multidimensional because of the multiplic- 
ity of events. Actions are bets, or wagers. Yet it 
seems reasonable to prescribe (as in section 1) that 
bets be ordered; indeed, some authors prescribe 
that bets be completely ordered (Ramsey 1923- 
1928; Savage 1954; de Finetti 1937). It will be 
shown in section 6 that a complete ordering oe 
responds to that of numerical “expected utilities 
of actions provided that the decision maker is con- 
sistent in the sense of obeying both the traditional 
tules of logic and certain postulates that are plausi- 
ble enough to qualify as extensions of logic. These 
Postulates will be discussed in this and subsequent 
sections, essentially following Savage. 

Action a is said to dominate action b if the results 
of a are sometimes (i.e., when some events take 
Place) better than the results of b and are never 
Worse. Is it not reasonable that you should prefer 


Table 1 — An outcome matrix (in dollars) 


Event 

Action Zi Za 
a 1,000 1,500 
a 999 1,500 
b 1,000 1,000 


the dominating action? Any action that is dom- 
inated by some feasible action is thus inadmissible. 
Consider, for example, the actions listed in Table 1. 
If action a is taken and the state of the world is 
Z,, a gain of $1,000 will be realized; if action a is 
taken and the state of the world is Z,, a gain of 
$1,500 will be realized; and so on. Such a table is 
called an outcome matrix. 

The inadmissibility postulate would order these 
actions as follows: a’<a,b<a. It does not, by 
itself, induce complete ordering of the actions. Thus, 
it is silent about the preference relation between 
actions a’ and b in Table 1. The “expected utility” 
rule of section 6 will determine this relation roughly 
in the following manner; depending on the “prob- 
abilities” of events and on the “utilities” of results, 
the $500 advantage of a’ over b if Z, happens may 
or may not outweigh its $1 disadvantage if Z, hap- 
pens, This appeals to common sense but remains 
vague until, with help of other postulates, we define 
the concepts of “probability” and “utility.” 


r 


v XXXX 
L hemel- 
v a, x 


Figure 1 — Result r as a function of bid x when the 
asking price is equal to v (solid line) and 
to a, > v (crosses) 


Does the inadmissibility rule describe people’s 
actual behavior? It is difficult to make an individual 
violate the rule when the decision situation is pre- 
sented with clarity, for example, by an outcome 
matrix whose entries have an obvious ordering 
(sums of money, as in Table 1, or good health, 
light illness, and death). But consider the following 
experiment. A subject is given an envelope con- 
taining v dollars; he is permitted to convince him- 
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self that the envelope does contain v dollars. He 
writes, but does not tell the experimenter, his ask- 
ing price, a, for the envelope and its contents. He 
will then receive a bid of x dollars. If x exceeds a, 
he will receive x dollars; otherwise, he will keep 
the v dollars. There will be no further negotiations. 
Subjects often ask more than the true value, setting 
a=a,>v, In Figure 1, the stipulated result r of 
such action is plotted against possible levels of the 
bid x (state of nature). Comparison with the cor- 
responding plot for the “honest” asking price, 
a=a,=v, shows that a, is dominated by a. In 
fact, an asking price a. < v (not shown in Figure 1) 
is also dominated by a,. Hence the honest asking 
price is the only admissible one. It seems that some 
subjects expect further negotiations despite the ex- 
plicit warning. We surmise that they would waive 
uncalled-for associations and habits if trained to 
plot or tabulate payoffs. 


4, Irrelevance of nonaffected outcomes 


Table 2 presents a matrix of outcomes measured 
in per cent of return on a firm’s investment. We 
may interpret a, b, a’, and b’ as the firm’s investing 
in the development of alternative products. The out- 
comes of these actions depend on the mutually 
exclusive events (say, business conditions) Z, , Zi, 
and Z,. Suppose the firm prefers a to b. This pref- 
erence cannot be due to any difference in the out- 
comes if the event Z, happens, for these outcomes 
are identically 5. Therefore, the firm’s preference 
of a to b must be due to its preferring the wager 
“_900 if Z., 100 if Zy’ to the return of 5 with cer- 
tainty. But then the firm should also prefer a’ to DS 
for, again, if Z, happens, the outcome (—200) is 
not affected by the firm’s choice. Its preference as 
between a’ and b’ must depend on its preference as 
between the wager “—200 if Z., 100 if Z,” and the 
certainty of 5, just as in the previous case. By the 
same reasoning, if the firm is indifferent between 
a and b, it should be indifferent between a’ and b’. 
The rule enunciated here can be regarded as a gen- 
eralization of the admissibility rule of the previous 
section, with wagers admitted as a form of out- 
comes so that, for example, action a is described 


Table 2 — Outcome matrix for the return on a firm's 
investment (per cent) 


Zı Za Zs 
5 —200 100 

5 5 5 
—200 —200 100 
—200 5 5 
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as follows: if Z, happens, you get 5; otherwise you 
get a lottery ticket, losing 200 if Z. happens and 
gaining 100 if Z, happens. 

Tests of the irrelevance rule. Most business 
executives violated the irrelevance rule when Mac- 
Crimmon (1965) verbally gave them the choices 
in Table 2—first a versus b, then a’ versus b’. How- 
ever, a large majority of my students, who also 
never heard of the irrelevance principle but knew 
how to draw up a matrix of outcomes, complied 
with the rule. Allais (1953) performed a similar 
experiment, but instead of describing the three 
events as alternative business conditions, he gave 
them numerical probabilities (an extraneous notion 
in the present context), the probability of one of 
the events being only .01. Most of his respondents, 
including some decision theorists, violated the prin- 
ciple (but see Savage's pungent introspective dis- 
cussion of his own second thought [1954, sec. 5.6g)). 

The following type of experiment has been dis- 
cussed widely. Funnel 1 contains an equal number 
of red and black balls, and the subject is invited to 
convince himself of this; but he is not permitted to 
look into funnel 2, which he only knows to contain 
one or more balls whose colors are either black or 
red. When a handle is pulled, each funnel will 
release only the bottom ball. In the following bets, 
he will win either $100 or nothing: 


Bet 1R: bottom ball in funnel 1 is red. 
Bet 1B: bottom ball in funnel 1 is black. 
Bet 2R: bottom ball in funnel 2 is red. 
Bet 2B: bottom ball in funnel 2 is black. 


The outcomes are as indicated in Table 3, where, for 
example, event RB is “bottom ball in funnel 1 is 
red, bottom ball in funnel 2 is black.” Suppose the 
subject always prefers to use funnel 1 rather than 
funnel 2, that is, he prefers bet 1R to 2R and also 
bet 1B to 2B. He will thus treat the results un- 
affected by his choice—namely, those earned when 
events RR and BB occur—as relevant to his choice! 
Yet similar experiments performed by Chipman 
(see Interdisciplinary Research Conference 1960) 
and Ellsberg (1961) Suggest that many people do 
indeed prefer funnel 1 to funnel 2. Some of my 


Table 3 — Outcome matrix for the funnel experiment 
(in dollars) 


RB BR RR BB 
100 0 100 0 
0 100 100 ° 
100 o 0 100 
0 100 0 100 


student subjects who did so motivated their choice 
by stating the sharp distinction between “risk” 
(funnel 1) and “uncertainty” (funnel 2), a dis- 
tinction taught to economists since Knight (1921). 
Others stated that they could base their bets on 
more information when using funnel 1 than when 
using funnel 2. But information, although never 
harmful, can be useless, In experiments of Raiffa 
(1961) and Fellner (1965), subjects were, in effect, 
willing to pay for such useless information. 


5. Definitions of probabilities 


Having defined events Z, , Z, , Z,, -> as subsets 
of the set X of all states of nature, it is consistent 
with current mathematical language and current 
English to require that any numbers, P(Z,), P(Zs), 
P(Z,), «+», claimed to be the probabilities of these 
events, satisfy the following conditions: (1) they 
should be nonnegative; (2) for any two mutually 
exclusive events, say Z, and Z,, the number as- 
signed to the event “Z, or Z,” (meaning the occur- 
rence of either Z, or Z,) should equal the sum of 
P(Z,) and P(Z,); (3) P(X), i.e., P(Z, or Z, or Zs 
or ++- ), should be equal to one. 

Personal probabilities. The numbers P(Z,), 
P(Z,), P(Z;), «+» are an individual's personal prob- 
abilities of these events if, in addition to having 
the mathematical properties just stated, they de- 
scribe his behavior in the following sense: when- 
ever P(Z,) >P(Z,), for example, he will prefer 
betting on Z, to betting on Z,, assuming he wants 
to win the bet. This, too, is consistent with ordinary 
English—he will prefer betting on the victory of 
the Democrats in the next election to betting on the 
truth of the proposition that California is longer 
than Norway if and only if he considers winning 
the former bet “more probable” than winning the 
latter one. Here, “betting on Z,” means taking an 
action that yields a result s (for success) if Z, 
occurs, a result that is more desirable than the 
result f (for failure) if Z, does not occur. The 
probabilities of events must depend on the events 
only, i.e., the individual’s preferences between bets 
must be the same for all pairs of results s,f 
($100, $0; status quo, loss of prestige; etc.) pro- 
vided only that s is better than f. This postulate of 
independence of beliefs on rewards must be added 
to those of sections 1, 3, and 4. Will the subject 
reverse his judgment about the comparative chances 
of any two events—as revealed by his choices be- 
tween two bets—if the prizes are changed? If not, 
he satisfies the postulate. It seems to be satisfied 
by practically all subjects asked to rank several bets 
according to their preferences, the rewards being 
first a pair s, f, then a different pair s’, f’. 


Suppose a subject is indifferent to bets on any 
of the eight horses running a race. His preferences 
would thus imply that P(Z,) = P(Z.)=--- = P(Z,), 
where Z; is the event that the ith horse wins (ties 
are excluded for simplicity). The nonnegativity 
property is satisfied if we put P(Z;) = for all i, 
What about the other two mathematical properties 
required? Suppose the subject considers double 
bets with the same prizes s, f as for single bets. He 
will prefer every double bet, e.g., the bet on the 
event “Z, or Z, ,” to any single bet (here he has, in 
effect, applied the inadmissibility postulate), and 
he will be indifferent between any two double bets 
(here the postulates of irrelevance and of intransi- 
tivity apply). Similarly, he will prefer triple bets to 
double bets and will be indifferent between any two 
triple bets, and so on. Consistent with these prefer- 
ences, we can put 


P(Z, or Zj) = % = P(Zi) + P(Z)) 
for any two horses i and j, 
P(Z; or Z; or Z) = % = P(Z; or Z;) + P(Zx) 


for any three horses i, j, and k, and so on. In gen- 
eral, we can assign probability P(Z) = m/8 to the 
event Z that one of the m specified horses will win 
(where 1 <m <8). Then the numbers P(Z) are 
the subject's personal probabilities, for they agree 
with his preferences between bets and also satisfy 
the three mathematical requirements stated at the 
beginning of this section. 

Instead of a horse race, a subject is asked to 
imagine a dial divided into n equal sectors. Sup- 
pose the hand of the dial is spun and comes to rest 
in the ith sector, We define this occurrence as event 
Z, . If the subject is convinced that the dial’s mech- 
anism is “fair,” i.e., that the events Z,, Zs, °°: , Zn 
are symmetrical (exchangeable), he will be indif- 
ferent among bets on any one of these n events. 
[See Propapiiity, article on INTERPRETATIONS. | 
Following Borel (1939, chapter 5), he can then 
assess his personal probability of any event Jii 
say “rain tomorrow,” by finding a number m 
(1<m <n) such that betting on T is not more 
desirable than betting on “Z, or Z, or --+ or Zn” and 
is not less desirable than betting on “Z, or Z, or 

< or Zm.” Then his personal probability of T, 
P(T), satisfies the inequality 


mt 


m 
<P(T) <=: 


By making n arbitrarily large, one can assess P(T) 
arbitrarily closely, and by using dial arcs which 
represent any fraction, rational or irrational, of the 
dial’s circumference, one can define personal prob- 
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abilities ranging continuously from 0 to 1. Note 
that such assessments of personal probabilities, 
when determined in an experiment, are based not 
on the subject’s verbal statement of numbers he 
calls probabilities but on his actual choices. They 
may therefore be useful in predicting actions pro- 
vided that the subject is consistent. If he is not 
consistent, his violations may or may not be sim- 
ilar in principle to those incurred in any instru- 
ment readings—a theme of probabilistic psychology 
touched upon in section 2. 

Objective probabilities. As a special case, per- 
sonal probabilities of some real-world events may 
be “objective,” i.e., the same for different people. 
This is particularly the case when there is agree- 
ment that the events come sufficiently close, for all 
practical purposes of those involved, to fulfilling 
certain symmetry requirements, Approximate sym- 
metry is assumed for the positions of a roulette dial 
like Borel’s and for the occurrences of death among 
many similar males of age 20. Such requirements 
are strictly satisfied only by idealized, mathemati- 
cally defined events—events that are never observed 
empirically. A “fair” coin, a “fair” roulette dial, a 
“homogeneous” population of males aged 20 (or a 
“random” sample from such a population) are all 
mathematical constructs. The mathematical theory 
of probability applies rules of logic to situations in 
which strict symmetry and the three properties 
stated at the beginning of this section hold (refin- 
ing property 2 in order to accommodate the case of 
an infinite X). If decision makers agree that certain 
events are approximately symmetric, and if they 
apply logical rules, then their choices between 
betting on (predicting) any two events will agree; 
their personal (in this case also objective) prob- 
abilities will coincide with those given by mathe- 
matical theory. Clarity requires us, however, to 
distinguish between mathematical probabilities and 
objective probabilities assigned by decision makers 
to empirical events, just as we distinguish between 
a geometric rectangle and the shape of an actual 
sheet of paper. 


6. Expected utility 

The four postulates discussed thus far—com- 
plete ordering of actions, inadmissibility of domi- 
nated actions, irrelevance of nonaffected outcomes, 
and independence of beliefs on rewards—appear 
about as convincing as the rules of logic (and about 
as subject to transgression by people not trained 
in untwisting brain twisters ). Together with a “con- 
tinuity” postulate (to be introduced presently ), 
they imply the following rule, which is more com- 
plicated and less immediately convincing: The con- 
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sistent man behaves as if he (1) assigned personal 
probabilities P(Z) to events Z, (2) assigned numer- 
ical utilities u(r) to the results r of his actions, 
and (3) chose the action with the highest “expected 
utility.” The expected utility #(a) of action a is the 
weighted average 


(a) = x u(r): P(Z«), 


where the event Z, is the set of all states for which 
action a yields result r. 

The rule is trivially true when the choice is among 
sure actions; if action a always yields result r, then 
P(Z,) =1, so that o(a) = u(r). 

Consider now actions with two possible results 
—Success s and failure f. This is the case, for ex- 
ample, when actions are two-prize bets (as in sec- 
tion 5) or when the decision maker is a “satisficer” 
(Simon 1957) for whom all outcomes below his 
“aspiration level” are equally bad and all others 
equally good. In section 5 we saw that of two 
two-prize bets, a consistent decision maker prefers 
the bet that has the higher probability of success. 
Since s is better than f, we can assign numerical 
utilities 


u(s)=1>0=u(f), 


and we see that the expected utility w(b) of a two- 
prize bet b coincides with its probability of success 
P(Z,,), since 


o(b) =1- P(Z,) +0 - P(Zp) = P(Zy). 


Thus the satisficer maximizes the probability of 
reaching his aspiration level. 

As the next step, we compute the probability of 
success and hence the expected utility w(c) of a 
bet c compounded of n simple two-prize bets or 
lottery tickets b,, «+> »b, on n different (but not 
necessarily mutually exclusive) events T, , -- - > Tes 
Lottery ticket b; is a bet on the event T;, and the 
subject will receive ticket b; if Zi happens. The 
events Z,,+--,Z, are mutually exclusive events 
one of which must happen, and the events “Z; and 
T;” (the occurrence of Z; and T;) are pairwise in- 
dependent in the sense that 


P(Z, and T;) = P(Z;) - P(T,). 


We can thus regard ticket b; as the result yielded 
by action c when Z; happens. Hence P(Z;) = P(Z, e) 
in the present notation. Moreover, we have just 
shown that the expected utility of a simple two-prize 
bet can be measured by its probability of success, so 
that P(T;) = w(b; ). Clearly, the probability of suc- 
cess of the compound bet c is the probability of the 
event “(Z, and T,) or (Z, and T-) or +++ or (Z, and 


T,)"; by mathematical property 2 of probabilities 
(section 5), this is equal to 5}; P(T) - P(Z,), 
Hence, 


olc) = La(b,) - P(Z,,-), 


i.e., the expected utility rule is valid for the special 
case where each result of an action is a two-prize bet, 

To extend this in a final step to the general case, 
let s be the best and f the worst of all results of an 
action. In the preference notation of section iG 

Sr <s for any result r. Consider the continuous 
range of all bets b whose two prizes are s and f 
and whose success probabilities take all the values 
between (and including) 1 and 0. Then, for any b, 
f <b<s, and fora given r, r <b or b <r depend- 
ing on the bets success probability. A plausible 
continuity postulate asserts, for each r, the exist- 
ence of a bet, say b,, such that r~b,. We can 
therefore assign to r a utility u(r) = w(b,). A de- 
cision maker should then be indifferent between 
an action a that yields various results r with respec- 
tive probabilities P(Z,,) and a bet c compounded 
of the corresponding two-prize bets b, just described 
entering with the same probabilities P(Z,,). That 
is to say, P(Z,.) = P(Z,,). The expected utility rule 
follows, since 


(a) =a(c) = Dw(b,) + P(Zi,<) 
= Lu(r) « P(Zn)- 


Some insight into this derivation of the expected 
utility rule is provided to the trainee in decision 
making by letting him rank his preferences among 
the tickets to four lotteries. Each ticket is described 
by prizes contingent on two alternative events, one 
of which must occur. An example of such a deci- 
sion problem is presented in Table 4, where p is 
written for P(Z) for brevity. If the event Z is “a coin 
is tossed and comes up heads” (we refer below to 
this event simply as “heads”) and the subject re- 
gards the coin as “fair,” then p = 4. But Z may also 


Table 4 — Decision problem involving lottery tickets 


Prize 
Prize if event Z 
lottery if event Z does not seein: 
ticket happens happen Probability of gaining $1! 
a $100 $0 pP 
b Lottery 
ticket a $100 p?+1—p>p 
e Lottery 
ticket a $0 P<p 
d Lottery Lottery be: 
ticket b tickete plp? + 1 — p) + (1 —p)p* =P 


be, for example, “the next sentence spoken in this 
room will contain the pronoun T?” In any case, 
when the inadmissibility postulate is applied, it is 
evident from the last column of Table 4 that ticket a 
is better than c and worse than b. Furthermore, the 
decision maker should be indifferent between tickets 
aand d. 

Cash equivalents and numerical utilities. Let 
us define the cash equivalent of ticket a, denoted 
k(a), as the highest price the decision maker would 
offer for ticket a; k(b), k(c), and k(d) are defined 
similarly. If asked to name his cash equivalent for 
each lottery ticket in Table 4, the decision maker 
should name amounts such that k(b) > k(a) = 
k(d) >k(c). If he fails to do this, he is incon- 
sistent, and no scale of numerical utilities can 
describe his behavior. If he is consistent, and if the 
event Z is “heads,” the following utilities for some 
money gains can be ascribed to him: 


($100) = 1; 
o[k(b)] = 4; 
e[k(a)] = of[k(d)] = 45 
ofk(c)] = 45 
($0) = 0. 


Some but not all subjects conform with the re- 
quired ranking of the lottery tickets. Therefore, in 
any empirical estimation of a subject's utilities and 
personal probabilities, one must check whether the 
subject is consistent, at least in some approximate 
sense. As pointed out in the simpler context of 
section 2, probabilistic models of decision and of 
learning to decide are needed for any descriptive 
theory, and they too may fail. 

Behavior toward risk. Again suppose that the 
event Z in Table 4 is “heads.” If the subject has 
named cash equivalents k(a) = k(d) = $50, k(¢) = 
$25, and k(b) = $75 (and similarly for further, 
easily conceived compound lotteries with utilities 
4, dices sae 7+)» we would infer that over the 
observed range he is indifferent between a “fair 
bet” and the certainty of getting its expected gain. 
We would say that he is “indifferent to risk.” His 
utility function of money gain is a straight line. On 
the other hand, if he has named cash equivalents 
k(a) =k(d) < $50, k(c) <3k(a), and k(b) < 
3[100 + k(a)], his utility function of money gain 
will be a concave curve (any of its chords will lie 
below the corresponding arc), and we would say 
that he is “averse to risk.” If the inequality signs 
in the preceding sentence are reversed, the utility 
function is convex (chords are above arcs), and 
we would say that over the observed range the sub- 
ject “loves risk.” When the utility function is either 
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concave or convex, the decision maker maximizes 
the expected value, not of money gain, but of some 
nonlinear function of money gain. 

Daniel Bernoulli (1738) pointed out that the 
utility function is concave in the case of the “Peters- 
burg paradox.” Marshall (1890) also assumed risk 
aversion as an economic fact (or perhaps as a 
prescription from Victorian morals) equivalent to 
that of “decreasing marginal utility of money.” 
Some economic implications of risk aversion were 
given by Pratt (1964) and Arrow (1965). It should 
be noted that this assumption is inconsistent with 
the behavior of a satisficer for whom utility is a 
step-function of money gain and the behavior of 
a merchant for whom the disutility of bankruptcy 
is the same regardless of the amount owed. 

The maxmin rule. In competition with the ex- 
pected utility rule (and the postulates underlying 
it) is the “conservative” or “maxmin” rule, which 
states that the decision maker should maximize 
the minimum payoff; that is to say, the maxmin 
rule proposes that preferences among actions be 
based on each action’s worst result only. Thus in 
the case of Table 1 the maxmin rule would pre- 
scribe that a' < b even if Z, has the probability of, 
say, an earthquake, and presumably b ~ a, contra- 
dicting the inadmissibility postulate. However, be- 
cause of the strong appeal of the inadmissibility 
postulate, it is usually proposed that the maxmin 
tule be applied only when domination is absent; 
this leads to the ordering a’ <b <a rather than 
a <b~a. This proposal is not too satisfactory 
since it violates a plausible continuity principle 
(Milnor 1954). A small modification in outcomes 
—by $1 in Table 1, or by 1 cent for that matter— 
can make or break domination and thus reverse the 
preference ordering. Indeed, should we advise peo- 
ple not to live in San Francisco or Tokyo because 
of possible but not very probable earthquakes? 
Should we advise pedestrians never to cross streets 
because of the possibility of being struck by an 
auto? Such considerations compel us to balance 
the advantages against the disadvantages of com- 
peting decisions, weighing them by appropriately 
defined probabilities—the expected utility principle. 

A historical note, Early theories prescribing 
maximization of expected utility were confined to 
special cases. Two types of restrictions were im- 
posed. First, the decision maker was advised to 
maximize the expected value of his money wealth 
or gain, i.e., utility was in effect identified with 
a money amount or some linear function of a 
money amount. Some other quantifiable good, such 
as the number of prisoners taken or the number of 
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patients cured, might play the same role as money, 
but nonquantifiable rewards and penalties of ac- 
tions were unnecessarily excluded from the set of 
results. Second, probabilities were restricted to 
objective ones—a special case of personal proba- 
bilities. 

It is difficult to imagine that an experienced 
Bronze Age player who used dice that had “tooled 
edges and threw absolutely true” (David 1962) 
would bet much more than 1:5 on the coming of 
the ace. Cardano’s efforts in computing the gam- 
bler’s odds (Ore 1953) suggest at any rate that by 
the sixteenth century the rule of maximizing aver- 
age money gains computed on the basis of objec- 
tive probabilities was taken for granted. 

We have already cited Bernoulli and Marshall 
as having proclaimed utility a nonlinear function 
of money, thus lifting the first of the above restric- 
tions. Marshall also applied the utility concept to 
commodity bundles. Von Neumann and Morgen- 
stern (1944) extended it further to all possible 
results of actions and derived the expected utility 
tule from simple consistency postulates. This was 
simplified by others, especially Herstein and Milnor 
(1953), All these writers dealt only with objective 
probabilities, leaving out the important cases in 
which symmetries between relevant events are not 
agreed upon. 

Bayes (1764) can be credited with the idea of 
personal probabilities. He thought of them as be- 
ing revealed by an individual's choices among 
wagers—not just on cards and coins, but on horses 
and fighting cocks as well! Thus Bayes removed 
the second restriction, but he retained the first in 
assuming that utility was in effect identified with 
money gains, so that betting 9 guineas against 1 
implies a corresponding ratio of probabilities. In 
1937, de Finetti provided mathematical rigor for 
this approach, 

In 1926, Ramsey (1923-1928) stated, perhaps 
for the first time, simple consistency postulates 
that imply the existence of both personal proba- 
bilities (of any events, regardless of symmetries ) 
and utilities (of any results of actions, quantifiable 
or not). Savage (1954) restated the consistency 
postulates and, partly following de Finetti and 
von Neumann and Morgenstern, proved that they 
imply the expected utility rule. For an original ex- 
position, see Pratt, Raiffa, and Schlaifer (1964). 

Reviews of the works of other contemporary 
authors are found in Arrow (1951), Luce and 
Suppes (1965), Ożga (1965), and Fishburn 
(1964). [See also Economic EXPECTATIONS.] Sur- 
veys and bibliographies as well as much original 
material by 21 leading authors on both the prescrip- 


tive and descriptive aspects of decision theory are 
found in Shelly and Bryan (1964). 


7. Strategies 


The existence of numerical utilities describing 
a decision maker's “tastes” and of probabilities de- 
scribing his “beliefs” has been shown to follow 
from rules of consistency, To formulate and solve 
the problem of choosing a good decision, both 
tastes and beliefs must be assumed fixed over some 
given period of time (but see Koopmans [1964] on 
“flexibility”). In general, the probabilities the de- 
cision maker assigns at any time to the various 
states of nature and thus to the results of his ac- 
tions depend on his information at that time, New 
information may also uncover new feasible actions. 
We shall generalize the decision concept accord- 
ingly in several steps. 

A strategy (also called a decision function or 
response rule) is the assignment, in advance of 
information, of specific actions to respond to the 
different messages that the decision maker may re- 
ceive from an information source. If, more gen- 
erally, messages, actions, and results form time 
sequences (the case of “earning while learning”), 
we have a sequential strategy (also called an ex- 
tensive or dynamic strategy). To each possible 
sequence of future messages, a sequential strategy 
assigns a sequence of actions (see, for example, 
Theil 1964). An optimal sequential strategy maxi- 
mizes the weighted average of utilities assigned to 
all possible sequences of results, possibly taking 
into account “impatience” by means of a discount 
rate (Fisher 1930; Koopmans 1960). The weights 
are personal joint probabilities of sequences of 
events and messages. This amounts to the same 
thing as saying that the probabilities of events are 
revised each time a message is received. 

It is still more general to redefine action in 
order to include in it the choice of information 
sources to be used and thus of “questions to be 
asked” at a given time. The resulting problem of 
finding an optimal informational strategy is a task 
of economic theories of information and organiza- 
tion and also of statistical decision theory (¢-8+ 
Raiffa & Schlaifer 1961) where events and in- 
formation sources correspond to hypotheses and 
experiments, respectively. 

Finally, if we allow the decision maker to re- 
ceive messages about the feasibility and outcomes 
of actions he has not formerly considered, we ob- 
tain the still more general concept of exploratory 
strategy. True, it has been almost proverbially tenu- 
ous to assign probabilities to the results of indus- 
trial or scientific research; yet these undertakings 


are not different in principle from many other ven- 
tures and bets, such as those discussed in section 5. 

The complex strategies noted here are hardly 
maximized by the actual entrepreneur, although 
the penetration of industry by professionals may 
again bring descriptive and prescriptive economics 
to their pristine closeness. Today many descriptive 
hypotheses in this field use the concept of aspira- 
tion level—the boundary between success and fail- 
ure, The actual decision maker is said to revise his 
aspiration level upward or downward depending on 
whether he has or has not reached it by previous 
action; exploration for actions not previously con- 
sidered is triggered by failure (Simon 1957). The 
sequence of actions generated by the dynamic 
aspiration-level model will, in general, differ from 
that prescribed by dynamic programming. [See 
PROGRAMMING.] Yet, with utilities assigned to re- 
sults of actions in a particular way, it is possible 
that the aspiration-level mechanism is indeed op- 
timal. It has been inspired, in fact, by adaptive 
feedbacks observed in live organisms; such feed- 
backs presumably have maximized the probability 
of the survival of species. 


8. Cost of decision making 


One action or strategy may appear better than 
another as long as we disregard the toil and trouble 
of decision making itself, i.e., the efforts of gather- 
ing information and of processing it into an optimal 
decision. The ranking of actions may be reversed 
when we take these efforts into account and deal, 
in this sense, with “net” rather than “gross” ex- 
pected utilities of actions. A small increase in 
expected profit may not be worth a good night's 
sleep. In statistics, we stop sequential sampling 
earlier the higher the cost is of obtaining each ob- 
servation. As an approximation to some logically 
required, but very complicated, decision rule, we 
may use a linear decision rule (e.g., prescribe in- 
ventories to be proportional to turnover) in order 
to lessen computational costs. The “incremental- 
ism” observed and recommended by Lindblom 
(1965) in the field of political decisions corre- 
sponds to the common mathematical practice of 
searching for a global optimum in the neighbor- 
hood of a local optimum or possibly in the neigh- 
borhood of the status quo. How many local search 
steps one should undertake and how often he 
should jump (hopefully) toward a global optimum 
will presumably depend on the costs of searching 
(see, for example, Gel’fand & Tsetlin 1962). In- 
deed, some strategies may be too complex to be 
computable in a finite amount of time, even though 
they respond to each message by a feasible action. 
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There is, after all, a limit on the capacity of com- 
puters and on the brain capacity of decision makers. 
Hence, some strategies may have infinite costs. 

The net utility of an action is often represented 
as the difference betwen “gross utility” and “de- 
cision cost.” A prescriptive theory would presum- 
ably require that personal probabilities be assigned 
not only to the outcomes of the actions but also 
to the efforts of estimating the outcomes and of 
searching for the action with the highest expected 
utility. 

On a more general level, it is not strictly per- 
missible to represent net utility as a difference be- 
tween gross utility and decision cost. Even if the 
two were measurable in dollars, say, utility may be 
a nonlinear function of money gains. Yet the as- 
sumption that net utility is separable into these 
components simplifies the theory—it reduces the 
cost of thinking! Almost all prescriptive theory to 
date deals with gross utility only. Little attention 
has been given to decision costs that might be sub- 
tractable and thus definable in arriving at a net 
utility concept. Still less attention has been given 
to a net utility concept that cannot be decomposed 
into gross-utility and decision-cost components. 

The elements for a theory of the expected cost 
of using inanimate computers, given the (statis- 
tical) population of future problems, are probably 
available. However, the current classification of 
computation problems as “scientific” and “business” 
(differing in their comparative needs for speed and 
memory) is certainly much too rough. A theory of 
mechanical computation costs would resemble the 
theory of the cost of manufacturing with complex 
equipment and known technology when capacities 
and operations are scheduled optimally. On the 
other hand, so little is known about the “technology” 
of human brains! If economists would join forces 
with students of the psychology of problem solving, 
insights would undoubtedly be gained into both 
the descriptive and prescriptive aspects of decision 
making. 

Jacos MARSCHAK 


[Directly related are the entries DECISION THEORY; 
GAME THEORY, article on THEORETICAL ASPECTS; 
Uriniry.] 
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nt 
POLITICAL ASPECTS 


Decision making is a social process that selects 
a problem for decision (i.e., choice) and produces 
a limited number of alternatives, from among 
which a particular alternative is selected for im- 
plementation and execution (Snyder et al. 1962, 
p. 90). Some writers use the term synonymously 
with policy making, although others distinguish 
the two, reserving decision making for choices that 
involve conscious action and are subject to sanc- 
tions and policy making for a collectivity of inter- 
secting decisions that has no choice-making unit 
in a position to decide for all parties involved 
(Braybrooke & Lindblom 1963, p. 249). For ex- 
ample, one may refer to decision making by the 
presidency or by Congress, but together these insti- 
tutions constitute part of the total policy-making 
process of the United States. Decision making is 
also distinguished from problem solving, which 
may refer either (1) to tasks in which both the 
problem for solution and the alternative solutions 
are given (Kelley & Thibaut 1954) or (2) to more 
abstract higher mental processes of thinking and 
information processing (Newell et al. 1958). Po- 
litical decision making, however, is conceived of as 
involving the search for both problems and alterna- 
tives. Voting, however, is an exception, because 
electors have little, if any, power over the timing of 
elections and for all practical purposes only in- 
direct, if any, participation in the selection of 
alternatives (candidaves). 

History. The intellectual origins of decision- 
making analysis are twofold. One line of its devel- 
opment, mathematical economics, has its begin- 
nings in the eighteenth-century work of Bernoulli 
(1738) and in the modern formulations of theo- 
ries of games by von Neumann and Morgenstern 
(1944). An important political successor was 
Downs’ formulation of governmental decision mak- 
ing in terms of economic theories (1957). The 
origins of mathematical, economic, and game- 
theoretic decision making have been codified by 
Luce and Raiffa (1957) and give promise of in- 
cluding many further hypotheses for empirical in- 
vestigation, as is illustrated by Riker’s work on 
coalitions and coalition formation (1962). This 
root branched off into experimental studies, re- 
viewed in two papers by Edwards (1954; 1961), 
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to test hypotheses deduced from the mathematical 
models [see GAME THEORY]. 

The other historical root of decision-making an- 
alysis is in public administration, which so far has 
been the more influential strain in political de- 
cision making. Alexander Hamilton in America and 
Charles-Jean Bonnin in France identified the field 
at the turn of the nineteenth century (White [1926] 
1955, pp. xiii, 10), and within a hundred years 
Woodrow Wilson in the United States and Max 
Weber in Germany had inaugurated academic 
studies of organizational decision making. The next 
major original works on organizations were those 
of Chester I, Barnard (1938) and Herbert A. Simon 
(1947), who were pioneers in calling for and intro- 
ducing social scientific techniques to the study of 
the subject and also in indicating the relevance of 
psychological and sociological knowledge to the 
understanding of organizations and administration. 
Richard C, Snyder and his colleagues (1954; 1962) 
followed with a conceptual scheme that was de- 
signed for the study of foreign-policy making or- 
ganizations but is applicable to organizational 
decision making in general [see PUBLIC ADMINIS- 
TRATION]. 

In political science, decision making—or, more 
broadly, policy making—has been studied in elec- 
toral voting, legislative roll calls, judicial opinions, 
public opinion, and virtually every other kind of 
political situation or setting. 


Approaches 


Among the most influential modern work on 
decision making is that of Simon. Beginning with 
his logical critique of “proverbs of administration,” 
Simon challenged eighteenth-century assumptions 
about decision behavior. Classical economic theory 
assumed that decision makers know all alternatives, 
that they know the utilities (values) of all alterna- 
tives, and that they have an ordered preference 
among all alternatives. For such a demand for 
“rationality” Simon proposed to substitute the con- 
cept of “bounded rationality,” which would more 
nearly comport with what is known about the psy- 
chological and physiological limits of decision 
makers. For the model of optimizing decisions, he 
substituted satisficing, that is, the adoption of a de- 
cision when an alternative seems to meet minimal 
standards or is good enough and is not dependent 
on the availability of all alternatives from which 
the best is chosen. These concepts and propositions 
have been researched primarily in industrial rather 
than in governmental organizations (see the sys- 
tematic codification of theory in March & Simon 


1958), although there is no theoretical reason to 
expect them to be any less applicable in the latter, 
[See ADMINISTRATION, article on ADMINISTRATIVE 
BEHAVIOR. ] 

Another important stimulus to studies of polit- 
ical decision making is the work of Snyder et al. 
(1954; 1962). The first presentation of Snyder's 
conceptual scheme was an outline of categories on 
which data for studying foreign policy decisions 
should be gathered. Although distributed initially 
only in a privately published paperback format, 
Snyder's study was soon cited in publications rang- 
ing from work on disturbed communication to 
studies in judicial behavior. Before Snyder's work 
was published in book form eight years after its 
first appearance, excerpts of it had been widely re- 
printed. Much of its impact, quite apart from its 
attention to decision-making analysis, undoubtedly 
stemmed from its explicit concern with a number 
of issues in methodology and the philosophy of sci- 
ence that were current in American political science 
during the 1950s. 

Because Snyder's approach consisted largely of a 
conceptual scheme that identified clusters of varia- 
bles for study without containing theory about their 
interrelations, propositions for empirical study could 
not logically be derived from the formulation as 
they had from more formal models, such as those 
of von Neumann and Morgenstern. However, Sny- 
der and his associates formulated a number of hy- 
potheses for empirical work based on the concep- 
tual scheme that were capable of being studied in 
a number of contexts by different researchers 
(Snyder & Paige 1958; Snyder & Robinson 1961). 

One of the great merits of the conceptual scheme 
originated by Snyder et al. was that it joined Bye 
chological and sociological levels of explanation; 
that is, it proposed to combine data and theory 
about both individual decision makers and the 
group or organizational context in which they oper- 
ate. It offered a means of explaining group behavior 
in terms other than those strictly of personality. The 
aim was to combine the social and the psychological 
levels of analysis in order to increase the ability to 
predict variance. However, some critics felt that 
neither the state of psychology nor that of soci- 
ology permitted such hypothetical combinations. 
This point undoubtedly had merit, owing to the fact 
that few political scientists pursued Lasswell’s in- 
itiatives in studying political personality (1930; 
1935; 1948) and to the continuing separation of 
experimental social psychologists from field-oriented 
political scientists. One may expect that among the 
most active lines of future research on decision 


making will be studies of the interrelation of indi- 
vidual and organizational factors in producing 
decision outcomes. 

A more descriptive and intellectualized model of 
the decision process is that of Lasswell (1956), 
who has identified seven stages or functions in the 
making of any decision. The first function is that 
of intelligence, which brings to the attention of the 
decisional unit problems for decision and informa- 
tion about these problems. There follow the rec- 
ommendation function, in which alternatives are 
proposed; the prescription stage, in which one 
alternative is authoritatively selected; the invoca- 
tion of the prescribed alternative; its application in 
particular cases by administrative or enforcement 
officers; the appraisal of its efficacy; and, finally, 
the termination of the original decision. This con- 
ception of decision making was designed on the 
basis of numerous investigations of judicial proc- 
esses and has since been employed in a variety of 
legal contexts, including sanctioning systems in 
civil and criminal law, law of outer space, law of 
the seas, and public order [see POLICY SCIENCES]. 
Its usefulness has been demonstrated also in studies 
of legislative and foreign policy decisions. Like 
Snyder’s conceptual scheme, Lasswell’s descriptive 
model does not immediately generate hypotheses 
for empirical investigation. Many such hypotheses 
have been formulated and researched, however, 
among them propositions relating power advantages 
to decision makers who dominate the intelligence 
and recommendation functions. 

Voting studies have only recently been cast in the 
language of decision making. Downs considered 
electoral decisions in terms of the postulates of 
economic behavior and deduced 25 propositions 
about party and governmental decision making 
(1957, pp. 296-300). For many of these data were 
independently available to confirm or disconfirm 
the predictions by their consistency or inconsis- 
tency with his model. In contrast to such quasi- 
mathematical models, the more typical voting study 
has emphasized the social-psychological variables 
acting to produce the voter's (and voters’) de- 
cisions. An inventory of voting surveys ( exclusive 
of ecological and gross data analyses) found 209 
hypotheses on which some empirical evidence was 
available in one or more studies (Berelson et al. 
1954). Contemporary voting theory considers the 
decision to vote and the direction of the vote to be 
products of party affiliation, orientation toward 
candidates, and orientation toward issues. Party 
affiliation is a stable factor, usually inherited from 
one’s family, and generally subject to change only 
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by dramatic social events (e.g., an economic de- 
pression). In the absence of unusual salience of 
either candidates or issues, party affiliation will 
determine the individual and aggregate vote. [See 
VoTING.] 


Generic characteristics of decision making 


Whether these different uses of decision termi- 
nology have anything in common remains to be seen. 
The answer depends on two kinds of effort. One is 
to search for congruence among specific, narrow- 
gauged propositions from contrasting kinds of de- 
cisions. For example, if organizations have rela- 
tively little information and have a deadline for 
decision, their decision makers tend to rely more 
heavily than otherwise on fundamental value orien- 
tations (Snyder & Paige 1958). Similarly, if voters 
have relatively little information about issues and 
candidates, they tend to rely more heavily on rather 
enduring and stable party affiliations (Michigan 
. . . 1960). These separate hypotheses suggest a 
transcendent one; that if information is low, evalu- 
ative criteria are likely to be more important than 
empirical or factual criteria. Another example of 
possible complementarity is the similarity between 
the dimensionality of attitudes in legislative voting 
and leadership and power in legislative bodies and 
the dimensionality of attitudes and roles among 
community decision makers. Legislative attitudes, 
at least among U.S. congressmen, seem to “scale” 
around several dimensions rather than along a 
single liberal—conservative continuum (MacRae 
1958; Miller & Stokes 1963); that is, if an observer 
knows a legislator’s vote on one bill, he can predict 
his vote on other bills in the scale, e.g., social wel- 
fare, but not on bills outside the scale, e.g., foreign 
affairs. Similarly, leadership on decisions is typi- 
cally multidimensional; one set of leaders prevails 
on issue A, e.g., civil rights, and another set domi- 
nates issue B, e.g., education (Matthews 1960; 
Robinson 1962b). This finding is supported in 
studies of influence and decision making in com- 
munities of various size (Katz & Lazarsfeld 1955; 
Dahl 1961). [See LEGISLATION, article on LEGISLA- 
TIVE BEHAVIOR. ] Still another apparently transcend- 
ent proposition is one that incorporates the “cross- 
pressures hypothesis” in voting and the frequent 
characterization of committee decisions, Voters re- 
ceiving conflicting appeals from family, church, 
work group, and other sources tend to compromise 
by not voting or by split-ticket voting (Michigan 
. . . 1960). Committee or group decisions are simi- 
larly said to be compromised and ambiguous ver- 
sions of originally clear and consistent alternatives 
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(Kelley & Thibaut 1954). Whether such apparently 
similar propositions have anything more than super- 
ficial commonality and whether many such con- 
gruities exist are questions for research that would 
help determine whether decision studies of various 
units and levels have much in common. [See Cross 
PRESSURE. | 

The second kind of effort that would clarify the 
generality of decision phenomena would be the 
construction of models of decision that would both 
generate new hypotheses for empirical investiga- 
tion and accommodate existing, more or less con- 
firmed propositions. Such attempts are apparent, 
for example, in the recent books by Downs, Riker, 
and Campbell et al. Downs, as already noted, has 
parsimoniously derived from his model a large 
number of propositions that had previously been 
investigated as parts of less general theories. Riker, 
too, has found that electoral and legislative de- 
cision studies fit his theory of coalition formation. 
And Campbell et al. have incorporated previous 
ecological and sociological voting hypotheses into 
their model of the American voter. However per- 
suasive postdictive studies may be, that is, however 
consistent ad hoc propositions about historical 
events may be with the post hoc models, the 
stronger and more compelling evidence comes from 
predictive models from which derived hypotheses 
may be researched and verified. Decision models of 
this kind are less apparent in political science than 
in economics and psychology (Edwards 1954; 
1961; Simon 1959). 

The concept of occasion for decision. A con- 
ception of decision making that includes the iden- 
tification of a problem or situation and is not 
confined to choice making treats the occasion for de- 
cision as a variable and not as a constant. Different 
Kinds of decision situations involve participants and 
activate organizational structure in different ways. 
A number of dimensions of occasion for decision 
have been identified: uncertainty, risk, routine, un- 
programmed (Snyder et al. 1962, p. 81; Simon 
1960). Among typologies of decision situations is a 
three-dimensional one that identifies a range of 
situations varying between “crisis” and noncrisis. 
Crisis is a situation that (1) is regarded by decision 
makers as threatening to their organizational goals, 
(2) requires a relatively short response time, and 
(3) is unanticipated (Robinson 1962a; C. F. Her- 
mann 1963, pp. 63-65). This conceptualization re- 
sembles social-psychological concepts of stress and 
threat but is more particular to the domain of 
decision making. It also is a more narrow definition 
than usually appears in historical and political sci- 
ence literature, which often defines a crisis as only 


an important event. Further empirical work will be 
required to validate a workable concept; more than 
logic or a priori definitions are required [see Crisis], 


Personality and decision making 


The relation between “personality” and decision 
making, historically of interest to political analysts 
but long confined to anecdotes and speculation, still 
awaits sustained systematic study. Lasswell’s ad- 
vocacy of the application of psychoanalytic and 
psychological concepts in the study of politicians in 
the 1930s remains to be heeded, except by a few 
political scientists. Research falls into two cate- 
gories: (1) that dealing with political socialization 
and recruitment into decision-making roles, and 
(2) that relating personal characteristics of de- 
cision makers to the content of their decisions. 

Political socialization studies are founded on 
Lasswell’s dictum that everyone is born a politician, 
but some outgrow it. Three psychologically oriented 
biographies relate adolescent and preadolescent 
events to the eventual political careers of President 
Woodrow Wilson (George & George 1956), Mayor 
Anton Cermak of Chicago (Gottfried 1962), and 
James Forrestal, the first U.S. secretary of defense 
(Rogow 1963). Wilson illustrates the hypothesis 
that childhood deprivations of affection lead to low 
self-esteem, for which in adolescence and adult- 
hood one compensates by seeking and exercising 
power over others. Wilson’s compensation came 
first as speaker of college debating societies and 
later as chief executive of Princeton University, the 
state of New Jersey, and the United States. Efforts 
to find psychological characteristics to distinguish 
political types from apolitical types have been 
small in number and discouraging in results. For 
example, McConaughy’s study of South Carolina 
legislators (1950), Hennessy’s survey of Arizona 
party activities and nonactivists (1959), and 
Browning and Jacob's tests of politicians’ motiva- 
tions (1964) confirmed few of a number of hy- 
potheses that psychological instruments would dis- 
tinguish politicians from nonpoliticians. However, 
acknowledged methodological shortcomings in some 
of these studies make their results indecisive and 
inconclusive. Wahlke et al. (1962), in a compart 
son of four state legislatures, found that political 
interests of the elites they studied were activated 
at almost any stage of the life cycle, but that the 
most frequently crucial phase of political socializa- 
tion occurs at a relatively early age—for many, 2S 
early as childhood. Similar conclusions are docu- 
mented in Hyman (1959), Greenstein (1960), 
and Easton and Hess (1961; 1962). [See Social 
IZATION, ] 


The other line of research on personality and 
decision making relates personal characteristics to 
decision content. One class of such studies searches 
for correlations between social backgrounds and 
prior experiences on the one hand and decisions 
or policies on the other. Matthews (1960) found 
that legislators who adhere more closely to the in- 
ternal norms or folkways of the Senate are most 
successful in getting their bills adopted. And, in 
turn, senators who adhere most closely to the folk- 
ways tend to come from noncompetitive, homo- 
geneous states. Nagel (1962) investigated the rela- 
tions of more than fifteen variables, such as party 
affiliation, education, occupation, ethnicity, and 
group affiliations of judges, to a large number of 
different kinds of judicial decisions, including those 
involving administrative regulation, civil liberties, 
taxation, family relations, business, personal in- 
jury, and criminal cases. Statistically significant 
results of correlations of varying sizes have been 
obtained for many variables. 

Another class of such studies relates unconscious 
motivations of decision makers to the outcomes of 
their decision process. A. George and J. George's 
biography of Wilson (1956) is a case in point, for 
the president's ambitions for power obscured his 
perception of the actual power situations and led 
him into self-defeating strategies. Almond and 
Lasswell (1948) reported that the interaction be- 
tween the dominance or submissiveness of clients 
and welfare administrators was predictive of the 
decisions of the administrators; dominant clients 
were more likely to obtain favorable decisions from 
submissive agents than vice versa, and submissive 
clients were more likely to receive favorable de- 
cisions from dominant agents than vice versa. 
Margaret Hermann (1963) revived content analy- 
sis of public speeches as an indirect measure of 
legislators’ motivations toward power, personal se- 
curity, other people, tolerance of ambiguity, and 
ethnocentrism, and from knowledge of some of 
these unconscious motivations of twenty legislators, 
she successfully predicted their votes on a scale of 
foreign policy issues around the dimensions of 
nationalism-internationalism. Milbrath found that 
highly sociable citizens tended to make more con- 
tributions to political campaigns than nonsociable 
citizens, but he found that sociability was not asso- 
ciated with the behavior of Washington lobbyists 
(1963). 

However elemental these personality studies are, 
they are more advanced than efforts to relate per- 
sonal attitudes and values to political decisions. 
This line of research is only beginning to be opened 
up in American studies. However, the cross-cultural 
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work of McClelland (1961) suggests strongly that 
motivations toward and values of achievement are 
related to a society's economic development; this 
finding is supported by an investigation of more 
than forty societies that is based on content analy- 
sis of educational materials and projective tests. 

However discouraging the outcomes, to date, of 
research on personality and decision making may 
be, the disappointment owes more to the neglect of 
extensive effort than to any evidence that the task 
is fruitless or unpromising. The long-standing aca- 
demic separation of political science from psy- 
chology and the diverse kinds of training given to 
political scientists have hindered the research. Re- 
cently completed and published studies once again 
reveal evidence that the interaction of personality 
and political decision is a fruitful subject of study. 
[See PERSONALITY, POLITICAL.] 


Organizational context of decision making 

With the possible exception of voting, all political 
decision making occurs in an organizational con- 
text. Organization is not easily defined [see ORGAN- 
IZATIONS], but it embraces a number of agreed- 
upon characteristics, including relatively formal 
rules of procedure, certain impersonal norms, in- 
direct and mediated communication, relatively 
stable role expectations, and durability beyond the 
life of its members. Decision makers in organiza- 
tions act under some influence from the rules, 
norms, and expectations of the large group to 
which they belong and in which they participate. 
Thus, organizational decision making is not merely 
individual or small-group decision making writ 
large. 

The distinguishing characteristics of organiza- 
tion constitute dimensions that may vary from one 
organization to another. That is, rules of procedure 
in some organizations are different from those in 
others (for example, compare the voting procedures 
in Congress with decision rules in more hierarchi- 
cally arranged organizations ), and these procedures 
contribute to variations in the substantive content 
of policy or decision outcomes (e.g., Shuman 1957; 
Riker 1958). Communication patterns also vary 
from organization to organization. Mulder ([1958] 
1963, pp. 65-110) has shown experimentally how 
centralized and decentralized groups differ in their 
approach to problem solving. Robinson (1962b, 
pp. 168-190) has related variations of frequency, 
source, mode, and kind of communication, together 
with satisfaction with communication processes, to 
the policy attitudes and votes of legislators. 

Political scientists (and also industrial manage- 
ment analysts) have emphasized extent of central- 
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ization (and decentralization) as an important 
dimension of organizations. Some element of hier- 
archy accompanies all organizations (Simon 1962, 
pp. 468-469), although there may be variations in 
the organizational levels at which hierarchy is con- 
centrated and these may be variations in the ac- 
curacy, frequency, and receptivity of upper eche- 
lons to feedback from lower ranks and vice versa. 

Centralization is but one specific characteristic 
or dimension of organization. Others, such as di- 
vision of labor and the criteria of recruitment, are 
increasingly being made operational and are re- 
searched in governmental organizations (Hall 
1963). Former efforts to construct a priori gross 
typologies of organizations are yielding to empirical 
search for individual dimensions, which may or may 
not correlate highly enough to be thought of as 
constituting distinguishable types of organizations. 

Political scientists studying decision making are 
concerned with whether differences in organization 
make any differences for decision outcomes. Prob- 
ably the most studied problem of this type is the 
comparison of bureaucratic (i.e., executive or ad- 
ministrative) decision making with legislative de- 
cision making. Bureaucracies tend to be more 
impersonal and to have more “programmed” de- 
cision rules and more complex and hierarchical 
structures than legislatures. In general, executive 
agencies tend to be more innovative than legisla- 
tures, to be more capable of obtaining and process- 
ing larger amounts of technical information rele- 
vant to the problem for decision, and to exercise 
greater influence over policy (Robinson 1962b; 
Banks & Textor 1963, especially characteristics 174 
and 179). 


Methodology 


In political science, most decisional studies have 
been cases of particular decisions or individual de- 
cision makers. Bailey (1950) inaugurated legisla- 
tive case studies with the detailed history of the 
passage of a single bill in Congress. The Inter- 
university Case Program pioneered in developing 
more than fifty cases of administrative, legislative, 
election, and party decisions. However, most case 
studies have been atheoretical and have not been 
directed at building a body of confirmed proposi- 
tions across cases, Indeed, the possibilities of the 
case method usually have been regarded as limited 
to illustrating known principles or generating new 
ones, but as of little or no value for testing hypothe- 
ses. Efforts to use cases for testing hypotheses by 
generalizing across instances have been begun by 
Munger (1961) and Robinson (1962b, pp. 23-69). 

Case studies of individual decision makers 


through standard biographical techniques continue, 
but only a few have followed Lasswell’s early ex- 
amples of psychological inquiry. 

Many national and some international political 
bodies produce large numbers of decisions and 
record many of them in a form readily susceptible 
to quantitative analysis. Roll-call data are espe- 
cially useful for identifying voting blocs among 
members of these bodies and for identifying pat- 
terns of voting for individual members, whether 
persons or nations. In decision making analysis, 
such roll-call data may be regarded as dependent 
variables or as outcomes of decision-making proc- 
esses. Hypotheses can readily be formulated that 
link these dependent variables to such independent 
variables as personality, social backgrounds, eco- 
nomic characteristics, and organizational variables. 
However, because legislatures, courts, and inter- 
national organizations are essentially noncrisis or- 
ganizations (as the term is used to designate one 
kind of decision occasion), these data are relevant 
only to a limited number of decision situations. 
Specific techniques used in organizing and inter- 
preting roll-call data include cluster bloc, Guttman 
scaling, index of cohesion, factor analysis, party 
unity scores, and similar techniques familiar 
throughout the social sciences. 

Survey techniques (i.e., a sampling of a popula- 
tion of respondents with whom personal interviews 
are conducted) have been used largely to study 
nonelite decision making (e.g., voting). The most 
ambitious of these studies include “panels” of re- 
spondents who are reinterviewed periodically dur- 
ing and/or after an election, and perhaps through- 
out a series of several elections. Although surveys 
have been confined mostly to nonelite studies, they 
are also applicable to investigations of elite decision 
making by interviews with samples of decision 
makers on one or more aspects of their processes 
Robinson 1962b, pp. 168-190, 220-234). [See 
SURVEY ANALYSIS.] r 

Simulation of political decision making has in- 
cluded both man-simulation (e.g., Guetzkow et al. 
1963) and computer-simulation (e.g., Pool & Abel- 
son 1961). Both forms constitute operating noe 
of large-scale social processes. The functions © 
simulation are both heuristic and hypothesis-testing: 
To formalize models of decision processes requires 
logical and rigorous statements of the relationships 
among relevant variables and also of the relato 
ships among propositions containing the variables. 
And, like any laboratory technique, simulation 
makes it possible to test hypotheses by controlling 
some factors while varying others. The use of sme 
lation to study decisions developed later in politic 


science than in the other social sciences, but it 
promises to be one of the increasingly used methods 
of the future. [See SIMULATION. ] 

Mathematics as a method in political science and 
decision-making analysis is also likely to increase 
in use, although, to date, its applications to this 
field have been less numerous than those in other 
parts of social science, The next generation of po- 
litical scientists will have more opportunities to 
obtain training in mathematical analysis, and their 
training will include new developments in mathe- 
matics which make the tools of that subject more 
helpful for the social sciences. The functions of 
mathematics for studying political decision making 
are the same as its functions for studying other 
aspects of social or political phenomena. First, 
mathematics constitutes a formal language for 
making explicit statements of the relationships be- 
tween variables and hypotheses. Second, it provides 
for the logical deduction of hypotheses from the 
rigorous statements that may be empirically re- 
searched by other methods, either experimental or 
field-observational, In short, mathematics serves 
both to integrate theory and to generate new hy- 
potheses. After the deduction of new hypotheses, 
however, other techniques must be employed for 
conducting empirical tests of the hypotheses. 


JaMEs A. ROBINSON 


[See also ADMINISTRATION; POLITICAL BEHAVIOR; Po- 
LITICAL PROCESS.] 
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DECISION THEORY 


Statistical decision theory was introduced by 
Abraham Wald (1939) as a generalization of the 
classic statistical theories of hypothesis testing and 
estimation. Ostensibly designed to put these theo- 
ries in a common mathematical framework, it ac- 
complished much more. It eliminated fundamental 
gaps in the previous theories and widened the scope 
of statistics to embrace the science of decision 
making under uncertainty. Although statistical 
decision theory represented a novel synthesis, many 
of its individual elements of novelty had been 
anticipated in spirit by Gauss and by the Neyman- 
Pearson theory of testing hypotheses. Furthermore, 
in formal structure it has much in common with 
game theory [see GAME THEORY]. 

In a decision problem the statistician must select 
one action from a set of available actions. A strat- 
egy (also called a decision function) is a plan that 
tells how to use the data to select an action and is 
evaluated by reference to the cost of its expected 
consequences. The difficulty in selecting the best 
Strategy derives from the fact that the conse 
quences of an action depend on the unknown state 
of nature. k 

The proper role of decision theory is the subject 
of considerable controversy. Because it makes es- 
sential use of costs, many statisticians feel that 
decision theory may be suitable for problems of the 
marketplace but not those of pure science. Another 
issue is the difficulty in assigning costs or values 
to the consequences. 5 

Crosscutting this controversy is another, which 


concerns criteria for selecting a strategy; such 
criteria may be regarded as principles of statistical 
inference. 

An example. To illustrate the decision-theory 
formulation, consider a simplified and artificial 
example. An archeologist has recently uncovered 
a skull that may belong to one of two populations, 
A or B. A skull positively identified as A would set 
off an extensive and costly research effort; a skull 
positively identified as B would not justify any 
research effort. The archeologist proposes to take 
one of three actions, the extensive research effort 
(action a,), a lesser research effort (action a»), 
or dropping the inquiry (action a,). The possibili- 
ties that the skull belongs to either population A or 
to population B are called states of nature and 
labeled 6, and @, respectively. Assume that after 
considerable introspection the archeologist has 
attached values to the consequences. By tradition, 
statisticians think in terms of losses, so the best 
situation (action a, and state 0,) will be treated as 
having zero loss. The losses are represented in 
Table 1. 


Table 1 — Losses 


ACTION 
a) a as 
STATE OF #1 O 2 5 
NATURE fa 4 3 2 


One may ask why the loss corresponding to 6: 
and a, is not zero since a, is the best action under 
state 0.. If the situation (@:, @,) is compared with 
the conceivable alternative (6: , a), the latter seems 
preferable and it is reasonable to assign a positive 
loss to (0,, a+). To compare (62, ds) with the alter- 
natives under 0., one uses regret. The regret for 
(6, a+) would be zero, while the regrets for (0: , a) 
and (0:,a,) are 2 and 1 respectively. 

Thus far three of the basic elements of decision 
theory have been presented: the available actions, 
the possible states of nature, and the consequences, 
or loss table. Because the state of nature is not 
known, the selection of a best action is not trivial. 
Ordinarily a scientist will perform some experi- 
ment to diminish the costly effect of ignorance. 
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Suppose that the archeologist decides to measure 
the cranial capacity of the skull and classify it as 
either large (zı), medium (z+), or small (z). 

There would be no difficulty in using the data 
if all members of population A had large skulls 
and all members of population B had small ones. 
Unfortunately the experiment is not so informa- 
tive: while skulls from A tend to be large and 
skulls from B tend to be small, there is appreciable 
overlap. Table 2 gives the fraction of skulls of 
population A and of population B that are large, 
medium, and small; this information, supposedly 
based on the results of extensive prior work, is not 
always readily available. 


Table 2 — Probability of observing z given @ 


z za Zs 
br 6 3 J 
ba 2 3 5 


There are many ways in which the data can be 
used. Any rule that prescribes an action for each 
possible result of the experiment is called a strat- 
egy; Table 3 lists the 27 such rules for this prob- 
lem. A discussion of the concept of strategy is given 
by Girshick (1954). 

A typical reasonable strategy is s,, which may 
be abbreviated (a, a», a). This strategy calls for 
action a, if z, is observed, a if z: is observed, and 
a, if z, is observed. The strategies s,, Su, and Sz; 
ignore the data; S», seems to be unreasonable. 

To determine how good a strategy is, it must be 
evaluated. How to evaluate the expected costs of 
S, = (@:, A», dy) will be illustrated. If 0, is the true 
state of nature, z, is observed and a, taken with 
loss 0 and probability .6, z. is observed and a, 
taken with loss 2 and probability .3, and zs is ob- 
served and a, taken with loss 5 and probability .1. 
The expected loss is 


(.6)0 + (.3)2 + (.1)5 = 1.1. 
Similarly, if @, is the true state, the expected loss is 
(.2)4 + (.3)3 + (.5)2 = 2.7. 


The expected losses for the 27 strategies are 
given in Table 4. Justification for the relevance of 


Table 3 — Actions for the 27 strategies 


STRATEGIES 
e a AS HspBiawtis $1017, 1518, 59) io, fa, Tax. faa Pk, Fab ae ay 
ep art E A enon CE aena E E a e i 23 (Cal Ge. A te a3 a3 
BATA HED [kGice A PE ito masa on O a a edad os 8; Calica) ay Sy E Fe a as o, as 


% a a3 a G3 0, G2 a, a3 


a G2 G3 Œ% G2 G3 Gy G2 Oa a a3 a a3 
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Table 4 — Expected losses 
STRATEGIES 


S Sa S3 Sa Ss S% S57 Sa So Sio S Sia Sia Sis Sis Sie Siy She Sie $20 $2) Saa San 424 Sas S26 Say 


STATE 


h 0.0 0.2 05 06 08 1.1 1.5 1.7 20 1.2 14 17 1.8 20 23 27 29 3.2 30 32 35 36 38 41 45 47 50 


Uai Oa 40 35 30 37 32 27 34 29 24 38 33 28 3.5 3.0 2.5 3.2 27 22 36 31 26 33 28 23 30 25 20 


expected loss derives from the theory of utility 
(Chernoff & Moses 1959, chapter 4; Luce & Raiffa 
1957, chapter 2), [See UTILITY.] 

Table 4 may be regarded as an extension of 
Table 1 with s: , s,,, and s», corresponding to a; , a:, 
and a,. The increased size gives the appearance of 
greater complexity but implies that the archeologist 
has greater choice after performing the experiment 
than before. Since s, is obviously an improvement 
on s,,, having smaller expected losses under both 
9; and @,, the experiment may be regarded as hav- 
ing succeeded in reducing the costly effects of 
ignorance, 

The strategies and their expected losses are rep- 
resented in Figure 1. Here each strategy is identi- 
fied by a point whose coordinates are the expected 
losses under 0, and 6, respectively. 


L, — EXPECTED LOSS UNDER 0, 


2 3 4 5 
L, — EXPECTED LOSS UNDER 4, 


wL,+%Ll,=0 %L,+%L,=0 


Figure 1 — Strategies and expected losses 


The archeologist is not restricted to the 27 strat- 
egies mentioned above. He may decide to toss a 
fair coin and select s, if it falls heads and Ss if it 
falls tails. Such a strategy is called a randomized, 
or mixed, strategy in contrast to the original non- 
randomized, or pure, strategies. For the above mixed 
strategy the expected loss under 6, is 0 with prob- 
ability 4 and .5 with probability 4 or .25, which 
is halfway between 0 and .5. In like manner, the 
expected loss under 0, is 4(4.0) + 4(3.0) =3.5, 


which is halfway between 4.0 and 3.0. It follows 
that the point representing this randomized strat- 
egy is halfway between the points representing s, 
and s, in Figure 1. By replacing the coin with a 
more general randomizing device, it is possible to 
find a strategy corresponding to any point on the 
line segment connecting any two strategies, Thus 
the set of available strategies (randomized and 
nonrandomized ) has the property that every point 
of the line segment connecting two points of the 
set is in the set. Such a set is called convex. The 
fact that the set of strategies is the smallest convex 
set containing the nonrandomized strategies plays 
a fundamental role in the mathematics of decision 
theory. 

To minimize loss, the statistician seeks a strategy 
whose representative point in Figure 1 is as low 
and as far to the left as possible. A strategy is said 
to dominate another strategy when the first is never 
more costly than the second and sometimes less 
costly. In Figure 1, s dominates s* if s is below 
and to the left of s*, directly below s*, or directly 
to the left of s*. A strategy that is not dominated 
is called admissible. The admissible strategies lie 
on the heavy line s,5,5,59S»8;,S2;. The intuitively un- 
reasonable strategy s., is dominated by ss; the 
Strategy s,,, which ignores the result of the experi- 
ment, is also dominated by s,. i 

Choice of good strategy. Clearly, the statisti- 
cian can profitably confine his attention to the 
relatively few admissible strategies. How one 
should select from among these strategies is the 
subject of considerable controversy (Luce & Raiffa 
1957, pp. 278-306). One criterion tentatively sug- 
gested by Wald was the minimax criterion, which 
may be described as follows: If nature is malevo- 
lent and the statistician picks s, nature will select 
the state @ that gives the greatest loss for s. Thus 
each strategy is evaluated by its maximum 1oss, 
and the statistician selects that strategy s for which 
the maximum loss is as small as possible. For the 
example, the minimax is found as the intersection 
of the line L, = L, with the segment s,5,,, and con- 
sists in selecting s, with probability 5; and si with 
probability 2%, yielding expected losses 2.34 under 
both @, and 0+. For illustration of minimax strate- 
gies that do not lie on the line L, = L., see Luce 
and Raiffa (1957, p. 404). 


The minimax criterion is generally regarded as 
too pessimistic. In the theory of two-person, zero- 
sum games, where a player has a malevolent oppo- 
nent who is anxious to ruin him, minimax has 
justification; it seems unreasonable, however, to 
regard nature as a malevolent opponent. 

An alternative view is that if the archeologist is 
in complete ignorance he ought to act as though 
each state of nature were equally likely and select 
the strategy that minimizes the average of the two 
losses (Luce & Raiffa 1957, p. 284), This strategy 
is s, and may be derived graphically by moving 
the line 3L,+4L,=0 parallel to itself until it 
touches the convex set of strategies. This objective 
approach has been criticized on the grounds that 
rarely, if ever, does a scientist approach a problem 
in complete ignorance. 

A third approach (Savage 1954) consists in 
proving, under relatively weak assumptions de- 
scribing restrictions of consistent and rational be- 
havior, that such behavior leads one to act as 
though he had certain prior subjective or psycho- 
logical probabilities for the states of nature. Thus 
if the archeologist had prior probabilities 4 and 
ł for @, and 0., he would minimize 3L, + §L:. 
Graphically, this may be achieved by moving 
the line 4L, +łL,=0 parallel to itself until it 
touches the set of strategies. In fact this line first 
touches the set of strategies along the line segment 
connecting s, and sọ, which means that Ss, So, and 
all their mixtures are equally good for the archeolo- 
gist. 

Other statisticians refuse to be bound by such 
restrictions in matters as sensitive and complex as 
inductive inference. 

The decision-theory controversy. The main ob- 
jection to decision theory involves the losses. Some 
feel that in scientific problems where the only ob- 
ject is the satisfaction of scientific curiosity, it 
makes no sense to attach values or costs to the 
actions. Others, who are willing to accept a prag- 
matic approach, are overwhelmed at the difficulty 
in finding precise numbers to insert in the loss 
table. Still others feel that a statistician should not 
consider losses or actions but should confine him- 
self to analyzing the inferential content of the data. 

I feel that some conception of the value of con- 
sequences, be it based on personal gratification or 
on selfless consideration of the welfare of human- 
ity, is necessary when faced with the problem of 
designing efficient experiments or when consider- 
ing what data to gather and when to stop accumu- 
lating data. Since small deviations in the loss table 
have little effect on the relative merit of strategies 
when there is a substantial amount of data, precise 
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values of losses are not vital; on the other hand, 
some rough conception of costs is required to un- 
derstand the problem. By closing the gaps of previ- 
ous formulations, decision theory has the advantage 
of focusing attention on all the major issues of a 
problem. 

Another objection to decision theory is based on 
the requirement of a detailed model representing 
the preconception of the totality of possible states 
of nature. Common sense is necessary to avoid 
foolish results in the face of data indicating that 
the preconception was wrong. This issue is not 
peculiar to decision theory and lies at the heart of 
some of the current controversies over the com- 
parative method in the social sciences. 

Decision theory and classical statistics. Testing 
hypotheses, estimation, and confidence intervals 
are the three main classical theories of statistical 
inference. A hypothesis-testing problem corre- 
sponds to a two-action problem. The possible ac- 
tions are to accept a hypothesis or to reject it. The 
considerable difficulty that troubled novices over 
the choice between one-tailed and two-tailed tests 
was largely resolved by Neyman-Pearson theory. 
Decision theory added clarification by indicating 
that some problems traditionally formulated as 
testing problems really involve three actions. 

In estimation problems one attaches a loss to 
estimating @ by t. A popular special case is the loss 
(t — 6)*. Then a good procedure tends to minimize 
the mean squared error, E(t —6)*, which is the 
traditional measure of the goodness of an estimate. 
(See Chernoff & Moses 1959, pp. 207-233.) 

The theory of confidence intervals is mathe- 
matically elegant, appealing, and popular but does 
not fit naturally into the decision-theory frame- 
work. The introduction of costs in a reasonable 
fashion would undoubtedly lead to substantial 
modification of the theory of confidence intervals. 

Decision theory is wider than these classical 
theories. For example, in the archeological ex- 
ample, decision theory permits the consideration 
of the compromise action a», which is not optimal 
for any state of nature. This possibility does not 
exist in hypothesis testing. Decision theory also 
accommodates formulations involving sequential 
design of experiments. 

Complete classes and Bayes solutions. A com- 
plete class of strategies is one in which every 
strategy not in the class is dominated by some 
strategy in the class. A Bayes solution for certain 
prior probabilities is a strategy that minimizes the 
weighted average of the expected losses, with those 
prior probabilities as weights. In Figure 1, chang- 
ing the prior probabilities changes the slope of the 
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line that produces the Bayes solution; and by 
changing this slope, all admissible strategies are 
obtained. Thus, in the example, the class of all 
Bayes solutions is complete. 

In many decision problems, in particular in all 
decision problems with a finite number of states of 
nature and a finite number of actions, the class 
of all Bayes solutions is complete. A consequence of 
this result is that one may restrict oneself to Bayes 
solutions whether one believes in psychological 
prior probabilities or not. The importance of this 
result is related to the fact that Bayes solutions are 
easily characterized using Bayes’ theorem for pos- 
terior probabilities. If this theorem is used, the 
tedious task of listing all possible strategies before 
observations are obtained is no longer required to 
compute Bayes solutions [see PROBABILITY]. 

Compound decision problems. The Robbins 
theory of compound decision problems (Robbins 
1951) can be partly illustrated by the following 
example: A subject is given a true-false question 
on an examination. An investigator wishes to clas- 
sify the student as knowing the answer or not 
knowing it. The cost of misclassifying the subject 
is 1, It is assumed that a subject who knows the 
answer will answer correctly and that one who 
does not know the answer will guess at random. 
The traditional strategy of treating the subject as 
knowing the answer if he answers correctly seems 
reasonable. 

Suppose now that several hundred students take 
the examination and about 50 per cent answer the 
question correctly. If a substantial Proportion of 
the group knew the answer, one would expect con- 
siderably more than 50 per cent correct answers. 
Thus one may conclude that almost everyone 
should be treated as not knowing it. 

This exotic approach illustrates that it is pos- 
sible to use the data from many unrelated prob- 
lems of the same type to improve over the results 
attained by analyzing each problem separately. 

A variation of this approach yields Robbins’ 
theory of empirical Bayes procedures (Robbins 
1956), where past data on different examples of a 
problem are used to estimate empirical prior 
probabilities, 

Further problems. Three other branches of de- 
cision theory in which interesting work has been 
done are comparison of experiments (Blackwell & 
Girshick 1954, chapter 12); invariance (Blackwell 
& Girshick 1954, pp. 223-236); the study of mono- 
tone procedures (Blackwell & Girshick 1954, pp. 
179-195) and the extension of monotone pro- 
cedures to problems involving Pólya type distribu- 
tions (Karlin 1957-1958), 
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[Directly related are the entries ESTIMATION; HYPOTHE- 
SIS TESTING; and the biography of Waro.) 


BIBLIOGRAPHY 

BrackweLL, Davip; and Gimsnick, M. A. 1954 Theory 
of Games and Statistical Decisions. New York: Wiley. 

Bross, Irwin D. J. 1953 Design for Decision, New 
York: Macmillan. + A paperback edition was pub- 
lished in 1965 by Free Press. 

Cwernorr, HERMAN; and Moses, L. E. 1959 Elementary 
Decision Theory. New York: Wiley. 

Gmsuicx, M. A. 1954 An Elementary Survey of Statisti- 
cal Decision Theory. Review of Educational Research 
24: 448-466. 

KARLIN, S. 1957-1958 Pólya Type Distributions. Parts 
2-4. Annals of Mathematical Statistics 28:281-308, 
839-860; 29: 1-21. 

Luce, R. Duncan; and Rarrra, Howarp 1957 Games 
and Decisions: Introduction and Critical Survey. New 
York: Wiley. 

Ronuins, Hersert E. 1951 Asymptotically Subminimax 
Solutions of Compound Statistical Decision Problems. 
Pages 131-148 in Berkeley Symposium on Mathemati- 
cal Statistics and Probability, Second, University of 
California, 1950, Proceedings. Berkeley and Los Ange- 
les: Univ. of California Press. 

Rossms, Hersert E. 1956 An Empirical Bayes Ap- 
Proach to Statistics. Volume 1, pages 157-163 in 
Berkeley Symposium on Mathematical Statistics and 
Probability, Third, University of California, 1954- 
1955, Proceedings. Berkeley and Los Angeles: Univ. 
of California Press. 

Savace, Leonard J. 1954 The Foundations of Statistics, 
New York: Wiley. 

Von NEUMANN, JoHN; and Morcenstern, Oskar (1944) 
1964 Theory of Games and Economic Behavior. 3d 
ed. New York: Wiley. 

Wat, ABRAHAM 1939 Contributions to the Theory of 
Statistical Estimation and Testing Hypotheses. Annals 
of Mathematical Statistics 10:299-326. 

WALD, ABRAHAM 1950 Statistical Decision Functions. 
New York: Wiley. > Reprinted in 1964. 


DEFENSE MECHANISMS 


The concept of defense mechanisms was origi- 
nally proposed by Freud to explain the origins of 
Socially handicapping symptoms, such as unrea- 
sonable fears or unjustified anger. Freud pictured 
symptoms as “derivatives,” or indirect expressions, 
of impulses that cannot be expressed directly be- 
cause they are incompatible either with other im- 
pulses or with one’s moral standards. The blocking 
of expression elicits the unpleasant experience of 
anxiety, which varies with the strength of the 
impulses and tends to increase progressively until 
the conflict is resolved. Defenses represent predi- 
lections for particular distortions of unacceptable 
impulses. Torn between his impulse to hurt his 
father and his guilt, a son may project his anger 
to someone else. The boy is consequently relieved 
of both guilt and anxiety. If he displaces his anger 
to a safer target, he develops irrational angers at 


the scapegoat and is again saved from the discom- 
fort of guilt and anxiety. If he represses the anger, 
he forgets about it; if he isolates it, he is aware of 
the idea that he wants to hurt someone but does 
not experience the feeling. He can also cancel the 
results of previous aggression by employing de- 
fenses like undoing, reparation, and reversal, the 
meanings of which are conveyed by the terms 
themselves. 

What was first a clinical construct proved to be 
so fruitful in clarifying a large variety of social 
phenomena, both normal and abnormal, that it can 
be regarded as one of the most revolutionary influ- 
ences on twentieth-century thought. Freud himself 
employed the construct in discussing such different 
topics as the content of art, the meaning of dreams, 
slips of the tongue, sources of prejudice, psycho- 
therapists’ errors, universal myths, the appeal of 
humor, religious ritual, and the child’s view of his 
world. His papers on such topics have provided 
the foundations for numerous professional special- 
ties in the arts and humanities as well as in medi- 
cine and the social sciences. It is difficult to picture 
what the novel, the play, or the painting of the 
twentieth century would be like if creative artists 
had not been exposed to the concepts of conflict, 
unconscious motives, and defensive distortions. 

The ubiquitous interest in defenses is a tribute 
to their usefulness in explaining otherwise incom- 
prehensible behavior in terms of the resolution of 
conflict, particularly internal conflict from which 
the person cannot escape. The clinician typically 
observes patients torn between moral standards 
and either sexual or aggressive impulses. The 
antagonistic forces are intense and have approxi- 
mately the same strength. The conflict often de- 
velops a stable equilibrium characterized by pro- 
tracted indecision and mounting anxiety. The 
inclination to forego either impulse tends to 
strengthen that impulse, by emphasizing the im- 
pending frustration, and to weaken the antagonist, 
which seems less important once it can be gratified. 
Anxiety may become so great that it interferes 
with the objective appraisal of events. At such 
times one can resort to the defenses, which permit 
resolutions without awareness of the nature of the 
choice or even of the conflict. 


Definition and identification 


In the more than half a century since the con- 
cept of defense was proposed, it has been subjected 
to considerable scrutiny both by partisan clinician 
and rigorous experimenter. People who study de- 
fenses empirically are immediately faced with a 
paradox. To conduct research on a mechanism, 
they have to be clear about ways of identifying it. 
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The attempt to attain clarity often makes the aver- 
age investigator skeptical about the widespread 
enthusiasm for the concept, an enthusiasm but- 
tressed more by faith in clinical observation than 
by a substantial body of experimental data or even 
by a theoretical system. What is available is a 
group of loosely defined descriptive labels. There is 
little agreement either about the definition of the 
mechanism or about the ways in which it affects 
perception. The label has been applied to such dif- 
ferent phenomena as scotoma to visual details, the 
inability to report material whose recall is mani- 
fested indirectly, and the reinterpretation of events. 

Critics often object to the “explaining” of for- 
getting by the invoking of the mechanism of re- 
pression, a practice they correctly label as a reifica- 
tion of the term, Postman (1953) makes another 
cogent criticism when he objects to the presumed 
assumption of a “Judas eye, scanning incoming 
concepts in order to decide what shall be permitted 
in consciousness” and apparently acting as a trig- 
ger that releases mechanisms employed to distort 
the unacceptable concepts. [See PERCEPTION, arti- 
cle on UNCONSCIOUS PERCEPTION.] 

In the face of such phenomena as multiple per- 
sonality, “forgotten” material that can be recovered 
under special conditions like hypnosis, and moti- 
vated errors in judgment, most skeptics concede 
that Freud was asking legitimate questions and 
that the perceptual system must often be split into 
subsystems, some of which are not accessible to 
awareness. But there is considerable disagreement 
about ways of describing such phenomena, and 
sympathetic critics often plead for hypotheses that 
are phrased in the language of the social scientist. 

The vagueness of the concept of defense is sug- 
gested by Freud’s definition: “. . . all the techniques 
of which the ego makes use in the conflicts which 
potentially lead to neurosis” ([1926] 1936, p. 144). 
This definition provides little help in discriminating 
between defense and other reactions that occur 
during conflict and ignores much of what Freud 
himself observed about the connections between 
defenses and normal phenomena, such as crea- 
tivity, thinking, and humor. 

Enumeration and criteria. The mechanisms 
labeled by Freud include repression, regression, re- 
action formation, reversal, isolation, undoing, intro- 
jection, identification, sublimation, denial, projec- 
tion, and turning against the self. Many other 
psychoanalysts have proposed lists of defenses, few 
of which agree with Freud’s in either content or 
length. The lack of consensus reflects some basic 
theoretical disagreements. Melanie Klein (1959), 
who founded her own psychoanalytic group, was 
very concerned with splitting and reparation, which 
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are ignored by members of other groups. A more 
common reason for the lack of consensus is a 
casualness about definition of which Freud him- 
self was often guilty, Later writers have been even 
less sensitive to the necessity of clear definition, 
extending the meaning of defense from a disposi- 
tion to remotely related phenomena, such as neu- 
rotic symptoms; to stylistic trends, like resistance 
to change; and even to cultural products, like 
myths, which are neither defenses nor derivatives 
of impulses. 

Anna Freud (1936) provided some useful cri- 
teria for definition when she noted that the “ego's 
defensive operations” are unconscious and that 
they repudiate “the claims of instinct” by keeping 
“ideational representatives of repressed instincts” 
from becoming conscious and by “creating a trans- 
formation” in the associated emotions. She sug- 
gested the additional functions of regulating needs 
by appropriate substitutions, altering the inappro- 
priate strengths of needs, and preventing the recog- 
nition of painful events over which there is no 
control, 

Miller and Swanson (1960) propose a group of 
specific criteria for recognizing defenses. They 
think that the observer must first identify an event, 
usually an impulse, that the person in conflict can- 
not acknowledge because the awareness of it 
would create objective difficulties or anxiety. The 
observer identifies the event on the basis of such 
indirect evidence as the content of dreams, sub- 
liminal perception, and unpleasant emotional 
states, such as excessive anxiety or guilt. Second, 
he must locate the substitute for the original im- 
pulse. The type of defense is inferred from the 
substitute, particularly the nature of the discrep- 
ancy between wish and act. If a son is constantly 
aggressive and his father responds by being very 
helpful, he is viewed as reversing his aggressive 
impulses if he has dreams that symbolize a murder 
of the son. If the father replaces aggression by 
fears that the son will be killed in an accident, 
these are interpreted as projections of the father’s 
hostile wishes to auto drivers. 


The nature of defense mechanisms 


Of course the definition of terms is only a prel- 
ude to the raising of questions about the nature of 
defense mechanisms. How do the different de- 
fenses operate? Specifically, how does one project? 
Are there different kinds of projection? How does 
one repress? Do certain defenses have common 
characteristics? To what extent are different de- 
fenses learned? The literature on defense contains 
few deliberate attempts to answer such questions 
but many suggestive contributions, 


Attempts at classification. Freud suggested a 
possible system of classification when he postulated 
that defenses are specific to particular pathologies; 
projection to paranoia, repression to conversion 
hysteria, turning against the self to depression, 
Such associations occur frequently but they are 
still not very strong. The same defense may be 
found in different pathologies; projection con- 
tributes to the irrational fear of the phobic, the 
obsessive’s overconcern with his potential con- 
demnation by others, and the depressive’s feelings 
that he has lost all status in the eyes of others. 

Another basis of classification was proposed by 
Anna Freud, who classified defenses with reference 
to eliciting conditions, such as anxiety, impulse, 
actual danger, forbidden emotion, and the pressure 
of conscience. Even though a single condition may 
sometimes elicit a large number of different de- 
fenses, there is some validity to the presumed as- 
sociation between the two. Turning against the 
self seems most appropriate to the distortion of 
aggressive impulses. Reversal, undoing, and repa- 
ration also seem appropriate to conflicts about 
aggression. Turning against the self seems inap- 
plicable to anxiety, external threat, and sex. It 
seems more appropriate to think of denying anxi- 
ety and external threat or of repressing sexual 
impulses. 

Employing a variety of criteria, Miller and 
Swanson (1960) propose two families of defense, 
differentiating them on the bases of complexity, 
distortion in derivatives, specificity to particular 
problems, and social effectiveness. The use of de- 
fenses in the first family, which is presumed to 
have developed early, leads to considerable dis- 
tortion in the interpretation of events; they tend 
to be applicable to most problems; they are often 
socially handicapping; they are simple in that they 
require little previous learning. A defense like 
denial, for example, only requires that the person 
close his eyes and conjure up a wishful picture in 
his imagination. Common to such mechanisms is 
an obliteration of awareness and many kinds of 
information associated with it. 

The use of defenses in the second family creates 
only minor distortions in the interpretation of 
events; they tend to be suited to particular prob- 
lems; they are conducive to socially acceptable, 
even rewarded, behavior; they are relatively com- 
plex. A defense like projection requires that the 
person be able to recognize his impulses, that he 
have an awareness of self, and that he be able to 
apply a code of values. 

Defenses in the second family have in common 
a displacement or shift on one or more dimensions 
of the forbidden impulse, such as the objects, 


affects, actions, agents, and goal states. When a 
boy who is angry at his father displaces his ag- 
gression to another object, he chooses something 
similar in certain respects to the original object. 
Sometimes the choice may be very remote as in the 
classical case of Freud’s little Hans, who shifted 
his problem from his father to white horses, both 
being large, light-skinned, and strong and having 
dark structures around the eyes (glasses and 
blinders) and a dark area around the mouth 
(beard and snout). 


Projection—an example 

Research on the defense mechanisms has pro- 
liferated to such an extent as to require that a 
survey of the literature be restricted to one mech- 
anism. Projection has been chosen because it has 
been the subject of the most fruitful investigations. 

One problem that has plagued investigators is 
the variety of phenomena that have been labeled 
as projection. Freud recognized that, in addition 
to the unconscious attribution of traits, there is 
also a more “normal” kind of projection in which 
we “refer the cause of certain sensations to the 
external world instead of looking for them inside 
ourselves. . .” (1912-1913). As a result of his 
observations of psychotics, Federn (1953) identi- 
fied still another type in which confusion about 
the body image leads a person to be uncertain 
about the location of either acceptable or unaccept- 
able impulses, which might be attributed to self, 
another person, or some object in the intervening 
space. The literature contains references to still 
other phenomena that were called projection. [See 
BODY IMAGE.] 

In designing research, most investigators begin 
with a conception of ways in which the mechanism 
affects perception. The commonest conceptions 
attribute projection to the breakdown in discrimi- 
nation caused by the loss of ego boundaries, to 
displacement of unacceptable impulses from self 
to outside world, to a combination of increased 
sensitivity to the forbidden impulses in self and 
other and a denial of the impulses in self, to a 
diminution of the seriousness of one’s problems 
by exaggerating them in others, and to displace- 
ments on dimensions of the agent of action. Some 
of these conceptions are compatible with one an- 
other; others are not. The version that attributes 
projection to poor discrimination is consistent with 
the version that explains it as a displacement of 
the experienced event from inside to outside the 
self, But one cannot postulate both an intact body 
image and a breakdown in its boundaries or ex- 
plain the distortion as pathological and basic to all 
cognition. 
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Depending on their assumptions about the 
nature of projection, various investigators have be- 
come concerned with different problems, Assum- 
ing that the mechanism is learned during the 
child’s struggles with toilet training, a number of 
investigators have devoted their efforts to study- 
ing the associations between anality, projection, 
and aggression. Others have studied the social im- 
plications of the defense by observing its use in 
different societies. Among the Dobuans (Fortune 
1932), whose members are very competitive, poi- 
soning, witchcraft, and other types of treachery 
are frequent enough, even in relations between 
close kin, to elicit universal suspicion of one’s 
fellow man, and the person who does not project 
his aggression may not survive very long. Projec- 
tion is also helpful to the Hopi (Goldfrank 1945), 
who have a highly organized cooperative economic 
system which could not function even in the pres- 
ence of mild aggression, They project their anger 
to evil spirits. Although citizens of both communi- 
ties favor the same defense, the difference in tar- 
gets results in one group being homicidal and the 
other being exceptionally peaceful. 

A concern with the social consequences of pro- 
jection in American society has led a number of 
social scientists to study the vicious cycle that 
reinforces the paranoid’s projection of aggression. 
He initially has little evidence to justify his sus- 
picions, but if his fear leads him to withdraw or 
to be hostile, others soon begin to shy away. On 
the second encounter they are more wary, which 
confirms his suspicions of their intentions and en- 
hances his fear and hostility. Each encounter 
reinforces the elements in the cycle, which consists 
of increased alarm and self-protective hostility on 
his part followed by puzzled retreat and, finally, 
anger on the part of others. 

Epidemiological research on the incidence of 
different psychoses provides indirect evidence 
about the learning of defenses. Paranoid psychoses 
occur more often in the poorer than in the more 
affluent areas of large cities (Faris & Dunham 
1939). Some investigators reason that the hard- 
ships faced by the poor reinforce the tendency to 
project; others that people with paranoid tenden- 
cies are economically handicapped and their down- 
ward mobility finally forces them to live in poorer 
sections. The disagreement has not yet been re- 
solved, but evidence that psychotics in poor areas 
have not generally drifted there from other sections 
casts doubt on the latter hypothesis. [See PARANOID 
REACTIONS. ] 

A relatively new field of investigation, which 
represents a merging of Freudian theory, the so- 
ciology of communication, and certain epistemo- 
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logical principles, starts with the assumption that 
projection is an inherent part of all perception. 
On observing an object, a man organizes his visual 
sensations into meaningful categories and adds 
additional meanings and values in terms of the 
implications of the incident for his condition and 
his goals. The more concerned he is with what an 
object is like or how it ought to be, the greater is 
the likelihood that he will view it within a particu- 
lar rubric of meanings and that he will impose 
qualities on it that others may not see. 

Projection has a prominent role in many studies 
of the development of moral standards. It is as- 
sumed that the child first resists parental demands, 
such as those prohibiting attacks on his siblings. 
He then conforms because of fear, But then, each 
time he is tempted to misbehave he anticipates 
how the mother will judge him by projecting to 
her the feeling he would have if he were in her 
place. He then identifies with the disapproving 
mother and gradually comes to inhibit his impulses 
or to punish himself just as she might have done. 

Underlying this type of speculation is the prem- 
ise that we can never know what is in another 
person’s mind on the basis of direct experience. 
We must project what we would feel if we were 
in his place in order to guess what he is experi- 
encing, After we project, we usually test our result- 
ant impression by comparing it with subsequent 
events. If these show the impression is wrong, we 
modify it and test the new one in turn. It is as- 
sumed that the paranoid projects no more than 
anyone else; he differs from other men in the in- 
efficiency with which he tests the veridicality of 
his projections, 

Experimental research on projection has helped 
to develop the outlines of a potential theory. Studies 
reveal, for example, that the incorrect attribution 
to others of unpleasant traits is a function of the 
situation as much as the predilections of the per- 
son. Bramel (1962) threatened his subjects by in- 
dicating that their homosexual tendencies were 
more marked than they had reported in their self- 
ratings. On obtaining his reports the subjects in- 
creased their ratings of the homosexual tendencies 
of people with whom they had been paired but 
whom they did not know very well. The more 
favorable were the original estimates, the greater 
was the tendency to see others more unfavorably. 
Posner (1940) asked a group of children to rate 
the attractiveness of pairs of toys, to give one of the 
pair to a friend, and then to Suess which of the toys 
the friend would have given the subjects. They 
usually chose the preferred toy for themselves and 
Projected the selfish choice to the friend. This pro- 


jection was much less evident in a control group 
whose members did not have to give away one of 
the toys before estimating the friends’ choices, 

In a study of the contents and targets of projec- 
tion Zimmer (1955), like a number of investi- 
gators, found that subjects project their acceptable 
personality traits to liked people and unacceptable 
traits to disliked people. Moreover, the degree of 
projection is related to conflict about the trait. Ac- 
cording to Harvey (1958), people feel positively 
about raters who agree with their self-rating but 
deprecate the raters when they disagree. The sub- 
jects also show poor recall of the ratings, the dis- 
tortions increasing with the discrepancies from 
self-rating up to a critical point. 

The results of some studies help to explain the 
conditions under which people project similar and 
complementary traits to others. At a party arranged 
by Murray (1933) preadolescent girls rated a group 
of pictures before and after a frightening game 
called “murder.” The maliciousness attributed to 
the people in the pictures increased markedly fol- 
lowing the game. Similar rather than complemen- 
tary projection was elicited by Bramel, whose study 
has been described above. A comparison of the two 
investigations suggests that subjects project com- 
plementary traits when they find their own quali- 
ties acceptable and similar traits when they dislike 
their qualities and need to de-emphasize them. 

The projection of complementary attributes 
seems to require a particular cognitive style, Leven- 
thal (1957) divided his subjects into two groups, 
depending on the number of different concepts 
they used in describing their friends. The complex 
perceivers used many concepts and typically re- 
acted to others in terms of differences between 
them; simple perceivers used few concepts and 
were more prone to assume likenesses in others. 

Still another determinant of the contents of pro- 
jection is its appropriateness to the target. Menzies 
(1960) reports that nurses project irresponsible 
impulses to juniors and severe disciplinary atti- 
tudes to seniors. The resultant impressions tend to 
be accurate because they are consistent with the 
nurses’ roles. The “social defense system” of the 
nursing service enables the women to “project and 
reify” relevant elements of their psychic defense 
systems. Some features of the social system that 
help to defend nurses against their many anxieties 
include the splitting of tasks in a manner that pre- 
vents intensive contact with any one patient, the 
denial of the patients’ individuality by such prac 
tices as referring to them by number or by diag- 
nostic category rather than by name, the encour- 
agement of ritualistic performance of tasks, and 


the discouragement of discretion and initiative. 
Unless a new nurse can restructure her defenses 
in ways that are congruent with this social defense 
system, she has considerable difficulties in her 
professional relationships. 


A number of recent papers have been devoted to 
the hypothesis that people with incompatible de- 
fenses have more difficulty getting work done than 
do people with less incompatible combinations. 
Cohen (1956) asked his subjects to work together 
on a task and to judge each other's interpersonal 
ability. As anticipated, people inclined to favor pro- 
jection were more negative and hostile in inter- 
acting with one another than when paired with 
other subjects. 

Recent research on the defenses has been domi- 
nated by the assumption that the mechanisms rep- 
resent predilections for particular styles of interpret- 
ing one’s environment. This cognitive viewpoint 
has led investigators to think of defenses as com- 
ponents of a general style of solving emotional 
problems. It has also concentrated people’s interest 
on the self as a focus for the kinds of problems that 
elicit various defenses, Finally, it has prompted in- 
vestigators to think of the possibility that all de- 
fenses, and possibly much of the process of think- 
ing, can be analyzed into certain basic components. 
Some of these that are common to the defenses 
listed in the psychoanalytic literature include the 
tendency to do or think something that is opposite 
in meaning to the unacceptable event, to obliterate 
one’s total awareness of a problem, and to displace 
awareness from one location on a dimension to 
another. Displacements from inside to outside the 
body may represent a special instance of opposite- 
ness, other examples of which are reversal and 
denial of beliefs and undoing and reparation of 
action. It seems probable that within the next dec- 
ade, research on defense mechanisms will permit 
close integration of the concept with theories of 
cognition and motivation. 

DANIEL R. MILLER 


[For further general discussion of defense mecha- 
nisms, see ANXIETY; PREJUDICE; PSYCHOANALYSIS; 
and the biographies of Freup and KLEIN. More 
specific treatment may be found in AESTHETICS; 
Dreams; FANTASY; HUMOR; LITERATURE, article on 
THE PSYCHOLOGY OF LITERATURE; MYTH AND SYM- 
BOL; RELIGION. ] 
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DELEGATION OF POWERS 


Delegation of powers is the act whereby a politi- 
cal authority invested with certain powers turns 
over the exercise of those powers, in full or in part, 
to another authority. Accordingly, the powers of 
the delegate are precisely those that belonged to 
the delegant, and the actions performed in virtue 
of the delegation have the same juridical nature 
as if they had been performed by the delegant him- 
self. Delegation should not, therefore, be regarded 
as permission or authorization; rather, it is a trans- 
fer of power. The fundamental problem then is to 
find out whether, and to what extent, that transfer 
is legitimate in the realm of public law. 

When delegation is legally provided for there is 
no difficulty. This is often the case on the adminis- 
trative level; the organizational regulations of a 
bureau authorize its head to turn the exercise of 
his powers over to another official. It should be 
noted, however, that even when delegation is au- 
thorized by existing law, it is subject to very pre- 
cise conditions, In the first place, the right to 
exercise delegation cannot be presumed. Further- 
more, those actions for which the right of delega- 
tion is granted must be clearly indicated. Finally, 
delegation must necessarily be limited in time. 

In constitutional matters, the problem is more 
delicate. Since it arises in relations between legis- 
lative and executive branches of government, it 
has political implications that can lead to devia- 
tions from the strict application of legal princi- 
ples. In practice the question is whether, in the ab- 
sence of constitutional provisions authorizing the 
legislature (parliament or congress) to strip itself 
of its competence, it can entrust the executive with 
the right to take regulatory measures that will have 
the force of law. In cases where a constitution 
reserves certain areas to the competence of a legis- 
lative body, delegation would have the effect of 
bringing about a transfer of functions from the 
legislative branch to the executive; and in every 
case, delegation would achieve a transfer of powers. 

Critique. Theoretically it is impossible to dele- 
gate the legislative power (or any other preroga- 
tive) given by a constitution to a legislature. This 
position is based both on a legal argument and 
on a consideration of common sense. Legally, one 
can only delegate a power that one possesses. But 
the power to legislate is not a right of the legisla- 
tive houses; it is a function entrusted to them by 
a constitution, to exercise and not to dispose of at 
will. Common sense reinforces the legal principle, 
Locke was the first to show that when the People, 
by means of a constitution, grant the power of 


making laws to a given agency, it is because they 
have confidence in that agency. They feel that the 
way the agency is constituted and the procedures 
it has to follow will guarantee that the rules made 
will merit obedience. “The people,” Locke wrote, 
“can[not] be bound by any Laws but such as are 
Enacted by those, whom they have Chosen, and 
Authorised to make Laws for them” (Two Treatises 
of Government [1690] 1960, 11, sec. 141), 

In opposition to delegation, one could also invoke 
the principle of the separation of powers, saying 
that it would be violated if, under cover of an 
invitation by parliament, the executive could adopt 
measures that, by their nature and object, were 
veritable laws. Finally, for those who, especially 
in France, identified democracy with the omnipo- 
tence of the houses of the legislature, delegation 
would endanger the very idea of democracy, since 
on the one hand it appeared to be a means of 
imposing obligations on individuals that their rep- 
resentatives would not have consented to; and on 
the other hand, by reinforcing the scope of the 
government's actions, it could justly be suspected 
of favoring the views approved of by the govern- 
ment. 

Historical development. The theoretical force 
of these arguments opposing the concept of the 
delegation of power could not prevail against the 
actual necessities that have compelled governments 
to resort to it in almost every country. These neces- 
sities arose in two areas, war and economic catas- 
trophe, and in two successive waves, World War I 
and the depression. The war of 1914-1918, when 
the bitterness of the struggle entailed mobilization 
of all the forces of the nation, made it necessary 
to concentrate all powers in the hands of an agency 
able to use them promptly. Since this agency could 
only be the executive, the latter was granted power 
by the parliaments to regulate affairs that in nor- 
mal times would have called for a legislative vote. 
In France, the first laws extending the regulatory 
powers of the government were adopted on Au- 
gust 3 and 5, 1914; they regarded only matters of 
limited scope. Subsequently, although special laws 
had enlarged the government's power to act, it 
promulgated, on its own accord, measures called 
décrets-lois, which properly belonged exclusively 
within the competence of the parliament. 

In England it has always been held that, in the 
absence of a written constitution, Parliament 1S 
sovereign and can therefore delegate to an agency 
of its choice any or all of its legislative competence: 
Nevertheless, although such instances of delega- 
tion were known for a long time (for example, by 
the Mutiny Act of 1717, Parliament transferred to 


the crown all regulation of discipline in the army), 
they remained exceptional and, furthermore, did 
not entirely divest Parliament of those powers. In 
point of fact, the historical practice of delegated 
legislation that had become fairly widespread dur- 
ing the nineteenth century came down to this: 
Parliament itself established the broad principles 
of regulation; the subordinate authority was em- 
powered to adapt them to actual situations. In 1914 
there was a clear-cut change; the Defence of the 
Realm (Consolidation) Act gives the government 
the broadest powers and introduces crisis legisla- 
tion into the framework of delegated legislation. 
This category of delegation goes much farther since 
it does not in any way limit the liberty of the 
executive. 

In the United States, the principle that Congress 
cannot delegate its legislative powers can be modi- 
fied by a broad interpretation of the function of 
the president. It is admitted that in a period of 
crisis or war, the president can do anything that 
is needed to preserve the Union. Applying this idea, 
originally held by Lincoln and Theodore Roosevelt, 
President Wilson, during World War 1, took steps 
that normally would come under the competence 
of Congress, Even among the neutrals, the interna- 
tional crisis led to a transfer of legislative powers 
from the legislative branch to the executive branch, 
Thus, in Switzerland, on August 3, 1914, the fed- 
eral assembly granted full powers to the federal 
council, 

Since the motive for the extensive delegation of 
power was war necessity, it might have been 
thought that the delegation would end when the 
war ended. Nothing of the sort happened. After the 
conflict, a new wave of delegation appeared, this 
time provoked by economic difficulties. In France, 
financial emergencies led successive governments 
to ask parliament for the power to legislate by 
decrees; in England, delegated legislation became 
a normal governmental procedure; in Switzerland, 
the economic crisis of 1930 led to a new extension 
of the powers of the federal council. In the United 
States, President Roosevelt had recourse to his 
statutory powers, i.e., those that a president has 
under an express delegation from Congress, to 
regulate by executive order matters that are nor- 
mally reserved to formal law. The international 
tension from 1948 on has led American presidents 
to take similar actions. 

Since that time it has become impossible to re- 
gard delegation of the legislative power as a mere 
expedient, legitimate only to meet a crisis situation. 
The volume of legislative measures taken by the 
executive in many states often exceeds the number 
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of laws adopted by their legislatures. Experience 
has shown that even under normal conditions the 
legislative bodies can no longer claim a monopoly 
of legislation. In the ideal liberal state, the right 
to make laws is reserved solely to the national rep- 
resentatives because laws are few in number and 
very general in their content, being only a last 
resort intended to help overcome the inadequacies 
of the social order. But the modern conception of 
democracy calls more and more frequently for 
intervention by the state. The number of regula- 
tions required and their technical nature make 
legislatures increasingly incapable of issuing them. 
Furthermore, the executive, being obliged to act 
and to act quickly, can no longer wait for the 
legislative branch to decide whether to grant the 
government the laws it needs for its policies. 
Governments need to have the power to work out 
general policy and to issue freely the regulations 
necessary for its implementation. 

These facts were recognized in England in 1932 
by the Committee on Ministers’ Powers (Donough- 
more Committee), which was charged with study- 
ing the legality of delegated legislation. In the 
United States, these facts were recognized in 1949 
by the Commission on Organization of the Execu- 
tive Branch of the Government (Hoover Commis- 
sion). In both countries, it was held that legislation 
by the executive was not unconstitutional so long 
as it left room for control a posteriori, either by 
parliament (in England) or by the courts (in the 
United States). This control can indeed be effec- 
tive, as was shown in 1952 by the Supreme Court 
decision declaring unconstitutional President Tru- 
man’s seizure of the steel mills. In Europe, some 
constitutions drafted between the two wars granted 
the executive the right to legislate by decree under 
exceptional circumstances (Polish constitution of 
April 23, 1935, articles 55 and 57; Austrian con- 
stitution of December 7, 1929, article 18; Spanish 
constitution of December 9, 1931, article 80; etc.). 
After World War n, the possibility of the delegation 
of the legislative power was expressly recognized 
by the Italian constitution (article 77) and the 
fundamental law of the German Federal Republic 
(article 80). In France, on the other hand, the 
obloquy that the use of decrees had cast on parlia- 
ment, which was accused of evading its responsi- 
bilities, led the authors of the 1946 constitution to 
insert, in article 13, the rule that parliament alone 
makes law. In fact, however, from 1948 on, by 
more or less disguised procedures, all the govern- 
ments of the Fourth Republic asked parliament for 
the power to legislate, and obtained it. The 1958 
constitution, taking cognizance of what has become 
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an unavoidable necessity in a modern state plainly 
incorporated legislation by the executive branch. 
Not only can the executive, by virtue of article 37, 
legislate by decree on any matters not reserved to 
parliament by article 34, but article 38 gives it the 
power of asking the houses of parliament for a 
delegation of power to legislate even on those mat- 
ters that are reserved to parliament. 

G. BURDEAU 


[See also CONSTITUTIONAL LAW; CRISIS GOVERNMENT; 
LEGISLATION. ] 
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I 
THE STUDY OF DELINQUENCY 


Juvenile delinquency is that behavior on the part 
of children which may, under the law, subject those 
children to the juvenile court. As such, it is a rela- 
tively new and legal term for a very old phenom- 
enon. 

The term has both precise and diffuse referents. 
When a child is designated a juvenile delinquent 
by the court, this is a precise definition of his legal 
status. He is, by this act, a ward of the court, sub- 
ject to its discretion. By contrast, except in a strictly 
legal sense, the term refers only vaguely to actual 
behavior, since what is delinquent varies greatly 
over time and from one part of the world to an- 
other. 

Delinquency as legal status. Concern with mis- 
behavior by children is at least as old as recorded 
history. The earliest known code of laws (the Code 
of Hammurabi) took specific note of the duties of 
children to parents and prescribed punishments 
for violations. As legal systems were elaborated, 
the age of offenders continued to be important in 
defining responsibility for criminal behavior. An- 
cient Roman law and English common law, for 
example, held that children under the age of seven 
were incapable of criminal intent and, therefore, 
of responsibility for crime; between age seven and 
the time of puberty (approximately), criminal re- 
sponsibility was a matter for determination by 
the courts. When responsibility was assumed or 
“proven,” however, children of any age were sub- 
ject to the same laws and the same courts as were 
adults. It was this fact, and the related concern for 
the welfare of children rather than for specific 
types of behavior engaged in by errant youngsters, 
that motivated the early reformers to urge the pas- 
sage of juvenile court legislation. y 

The establishment of the first juvenile court in 
Cook County, Ilinois, in 1899 climaxed many 
years of legal and humanitarian concerns for the 
welfare of children held to be in violation of the 
law and concerns with the criteria by which they 
might be so adjudged (Van Waters 1932). This 
legislation created a new kind of machinery, out- 
side the criminal law, for handling juvenile of- 
fenders. Every state in the United States and vir- 
tually all “modern,” or “developed,” societies have 
since established special legal procedures for han- 
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dling juvenile offenders. Juvenile delinquency has 
emerged as a social problem only recently in some 
areas of the world, but there is evidence even in 
“new nations” and “underdeveloped countries” of 
increased concern with problems of youth under 
the law. 

The defining features of the juvenile court are 
its informality, as compared to the rigorously for- 
mal procedures of the criminal court with its rules 
of evidence and adversary system, and its primary 
concern with the welfare of the child rather than 
with guilt or innocence. Under the law, and follow- 
ing the ancient doctrine of parens patriae, the 
court, as the agent of the state, has the right and 
the responsibility to intervene in cases of child 
need, including those where violation of the law is 
involved or likely to be involved. However, there 
is no unanimity over the world concerning the 
precise conditions under which courts may inter- 
vene. United Nations reports suggest that there is 
a tendency in many countries to broaden the juris- 
diction of the courts to include children who for 
various reasons are considered to be potentially 
delinquent, as well as those directly in violation 
of the law. This tendency, and the lack of rigor- 
ously defined rules of “due process” by which a 
juvenile may be deprived of his liberty or a family 
of its children, has led to great and widespread 
concern with the rights of children and their 
parents, 

The flexibility urged as essential to the proper 
functioning of the court confers great power on 
juvenile court judges. Concern with the rights of 
children and parents arises in countries with strong 
democratic political philosophies, not because mis- 
carriage of justice is probable but rather because it 
is possible. Hence, a dilemma is built into the struc- 
ture of juvenile courts: how to provide the discre- 
tion and flexibility necessary to the prevention of 
delinquency and the rehabilitation of delinquents 
while at the same time preserving due process 
under the law [see PENOLOGY]. 

Patterns of delinquency. The legal status of 
“juvenile delinquent” is important in defining, but 
does not fully encompass, the social role of “ju- 
venile delinquent.” That is, a youngster who has 
been taken into custody by the police, or committed 
to an institution, or otherwise disposed of by the 
court is likely to be defined as a delinquent by many 
people—his parents, friends, neighbors, teachers, 
or employers. The legal process, whether or not it 
officially defines a youngster as delinquent, may 
be important in establishing and reinforcing both 
the community's and the individual's definition of 
himself as a delinquent (Tannenbaum 1938). 
These, in turn, may be important factors in deter- 


mining the nature of his subsequent experience, 
and hence the likelihood that he will engage in 
behavior that is considered delinquent. 

Many youngsters take part in activities defined 
by the statutes as delinquent but are not detected 
or, if detected, are not brought to the attention of 
legal authorities. And some of these youngsters, 
who have not been legally designated as delin- 
quent, are defined as delinquent by others signifi- 
cant to the community and to themselves. The 
behavioral sciences are concerned with the nature 
of delinquent behavior, whether or not that be- 
havior has resulted in legal action. For purposes of 
scientific inquiry delinquency may be defined as 
behavior that is specified by law as grounds for an 
adjudication of delinquency, and delinquents as 
those young people who engage in such behavior. 

The very broad specifications of behavior defined 
by the statutes as delinquent (some do not define 
delinquency in behavioral terms at all) give rise 
to the question: Who is not delinquent? The ques- 
tion is important, however, only if we assume that 
there are only two classes of children—‘delin- 
quent” and “nondelinquent’—that need to be de- 
scribed and understood. In reality, children vary 
in a great many ways in the types of delinquent 
behavior in which they engage and in the relative 
frequency, regularity, and versatility of such be- 
havior. In other words, there are different patterns 
of delinquent behavior and different degrees of 
involvement in them. Social scientists are inter- 
ested both in processes accounting for the develop- 
ment of the behavior of youngsters and in the 
manner in which behavior comes to be defined as 
delinquent. 

The data of delinquency. Official counts of de- 
linquents, or “offenses known,” are limited in the 
extent to which they can contribute to knowledge 
concerning the extent and nature of delinquency 
or of the processes involved in becoming delin- 
quent. They represent the actions of officials rather 
than of young people, and legal rather than scien- 
tific concepts of behavior. They provide little in- 
formation about the characteristics of offenders or 
their behavior. The very flexibility of legal processes 
concerned with juveniles calls into question the 
comparability of official data from one jurisdiction 
to another. Despite these limitations, for some pur- 
poses official data are useful, and they are in any 
case the only data available for describing certain 
characteristics of the phenomena. They are partic- 
ularly useful as reflections of official concern with 
juvenile delinquency over time, within and between 
societies. Sellin and Wolfgang (1964) have demon- 
strated that among broad groups of individuals 
(police, judges, and college students) there is wide- 
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spread agreement concerning the perceived seri- 
ousness of many crimes; they propose that the 
seriousness of delinquency problems in different 
communities and over time be measured from 
police descriptions of delinquency events that are 
most likely to be officially recorded. Certainly, vari- 
ations in official handling of offenders, by age, sex, 
and other theoretically meaningful categories, re- 
flect varying degrees of social concern with the 
behavior of young people. In addition, they may 
provide crude indexes of the behavior of children, 
which are reliable perhaps only with respect to the 
most serious types of behavior, for these will most 
consistently be officially recorded, 

Because of the limited usefulness of official data 
for treatment and etiological purposes, agencies 
and projects devoted to these ends also generate 
large bodies of data concerning delinquents. These 
take three principal forms: clinical reports, usually 
in the form of intensive case studies, which result 
from detailed interviews in a clinical setting, with 
treatment as the primary object of data collection 
(Hewitt & Jenkins 1947); self-reported behavior, 
obtained by completion of questionnaires by indi- 
viduals or by interviewers, with research the pri- 
mary object of inquiry (Short & Nye 1957-1958; 
Reiss & Rhodes 1961); and reports of participant 
and field observation made both by persons inter- 
ested in treatment or prevention and by those com- 
mitted primarily to scientific inquiry (Whyte 1943; 
Miller 1958). 

The methodology of delinquency studies has 
been subject to extensive criticism (Hirschi & Sel- 
vin 1962). While it is true that most studies in 
any substantive field exhibit methodological weak- 
nesses, the field of juvenile delinquency has per- 
haps been especially vulnerable in this respect. 
The more fundamental problem has not been 
methodological, however. Rather, delinquency stud- 
ies have tended to lack both methodological and 
theoretical rigor because they have not been as- 
sociated with the mainstream of behavioral science 
developments. This was not so much the case in 
the 1920s and early 1930s, when delinquency 
studies were an important part of the “Chicago 
school” of developing urban sociology. Following 
this period of intensive activity, the empirical study 
of delinquency dropped off sharply and was largely 
uninformed by the main body of sociological 
thought. Clinical studies also lacked the discipline 
of systematic inquiry, and their impact on socio- 
logical interpretations was minimal and marginal. 
Sociological theory concerning delinquency was 
greatly stimulated by the publication of Albert 
Cohen’s Delinquent Boys in 1955. Still, empirical 


inquiry lagged until grants from private and fed- 
eral agencies made possible investigations of suf- 
ficient scope to contribute markedly to the develop- 
ment of knowledge in this field. Passage of the 
Juvenile Delinquency and Youth Offenses Control 
Act by the Congress of the United States in 1961 
focused unprecedented public attention on juvenile 
delinquency and provided additional impetus for 
large-scale social action and research programs di- 
rected at the acquisition of new knowledge con- 
cerning juvenile delinquency and its control, 

Age and sex differences. Throughout the world 
juvenile court cases tend to be in the older age 
categories specified by law. Official cases of delin- 
quency thus tend to be a phenomenon of adoles- 
cence and young adulthood rather than of child- 
hood. In countries that have experienced increases 
in delinquency, however, the average age of court 
appearance has tended to decrease. As services to 
delinquents and “potential” delinquents become 
more elaborate, younger children are brought be- 
fore the court in the interest of delinquency pre- 
vention as well as rehabilitation. 

In most countries young people are involved in 
a high proportion of property offenses. In the 
United States, offenses for which persons under 
the age of 18 contribute more than one-half of the 
arrests annually are typically auto theft, burglary, 
and larceny. Concern is great in some countries 
over the involvement of juveniles in crimes of vio- 
lence, in the general disregard for social order, as 
in cases of property destruction and mass rioting, 
and in the use of alcohol and narcotics. 

Boys and girls are not equally involved in delin- 
quency, and sex ratios are not the same for all 
types of delinquency. The ratio of boys to girls 
appearing before juvenile courts is very much re- 
lated to the over-all social structure of a society, a8 
well as to variations within it. As the social status. 
of women approaches that of men and women gain 
greater freedom to participate in the affairs of the 
larger society, socialization patterns in the family 
and other institutions change and more girls come 
to the attention of the courts. Present ratios in the 
United States, for example, are about five boys to 
each girl; this is a considerable decrease since the 
early 1900s, when ratios of 50 or 60 to 1 were 
common. Sex ratios tend to be lower among groups, 
and in areas, with high delinquency rates. Since 
World War 11 increasing proportions of female 
offenders have been noted especially in Japan and 
Turkey, where the emancipation of women has 
proceeded with considerable rapidity. 

Boys tend to be arrested for offenses involving 
stealing and mischief of one sort or another. And 
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while studies indicate that more boys than girls 
engage in illicit sex behavior, far higher propor- 
tions of girls are arrested and brought to court for 
sex offenses and for other offenses, such as running 
away, incorrigibility, and delinquent tendencies, 
which very often involve problems of sex behavior. 

Ecological variations. There appears to be a 
direct relationship between urbanization and ju- 
venile delinquency. Although it is not always true 
that urban areas with the largest populations and 
highest densities have the highest delinquency 
rates, urban areas tend to have higher rates than 
do “semiurban” areas, and these areas, in turn, have 
higher rates than do rural areas. Exceptions within 
the general pattern, at least in the United States, 
modify but confirm it. Thus, rural areas and 
smaller communities that are near urban centers 
have been shown to have higher delinquency rates 
than do those that are further removed from such 
centers. Similarly, delinquency rates in relatively 
less populated areas have been found to be directly 
related to the size of their nearest urban center. 
Other studies have found higher proportions of pre- 
adolescent and female offenders in more urbanized 
areas. 

Within cities, juvenile delinquency tends to be 
concentrated in areas characterized by extreme 
physical deterioration, poverty, and social disor- 
ganization. These areas have been called “delin- 
quency areas” because they have been found to 
be characterized by high delinquency rates and 
high rates of recidivism over time; they are usually 
areas in which major shifts have taken place in 
racial and ethnic residential composition (Shaw & 
McKay 1942). In high-rate as compared with low- 
rate areas, age at first delinquency of youngsters 
appears to be lower, and proportions of female 
offenders are higher. Organized vice and political 
corruption flourish in high-rate areas, but there 
are important variations in these respects between 
delinquency areas (Kobrin 1951). 

Delinquency and race. Racial and ethnic groups 
have been shown by many studies to have widely 
varying delinquency rates. Numerous studies have 
found greater variation within than between these 
groups, however, depending especially on the type 
of community areas in which they are located. 
Newly arrived ethnic groups characteristically lo- 
cate in high delinquency areas, and their delin- 
quency rates reflect this fact. When these ethnic 
groups move to areas with lower delinquency rates, 
their rates become lower. Variations in this pattern 
serve only to confirm the hypothesis that the 
amount of delinquency in a racial group is a func- 
tion of its basic socioeconomic position. Negroes 


in the United States, despite some improvements 
in socioeconomic position (which are reflected in 
variations in delinquency rates within the Negro 
population), for many years have had high delin- 
quency rates. These rates are closely associated 
today, as they have been historically, with high 
rates of unemployment and economic uncertainty 
and with family disorganization. By contrast, the 
Chinese in the United States have had very low 
delinquency rates. These have been associated with 
strong family and community organization and 
with the relative isolation of Chinese Americans 
from the mainstream of life in the United States. 

The association of poverty with high delin- 
quency areas and studies of the socioeconomic 
background of offenders have led to much specula- 
tion concerning the influence of social stratification 
patterns on young people. Studies suggest that 
both delinquency rates and the type of delinquency 
vary in different ecological areas. Delinquency 
rates even of middle-class and upper-class persons 
are higher in high-rate areas than in low-rate 
areas. 

International trends. No source of data is ade- 
quate for the determination of how much delin- 
quency there is in any country, much less for 
comparison of the volume of delinquency today 
with that of the past, or from one country to an- 
other. It is evident that public concern with the 
problem has increased in most countries, especially 
since World War 11. Such published reports as are 
available from countries involved in both world 
wars indicate that delinquency rates increased 
during the conflicts (Middendorff 1960; U.S. Chil- 
dren’s Bureau 1962; U.S. Federal Bureau of In- 
vestigation 1966). 

Although a few countries reported stabilized or 
slightly decreasing delinquency rates during the 
late 1950s, these reports carried the caution of 
uncertainty. By contrast, most European countries, 
Japan, and the United States reported startling 
increases during this period, and reports filtering 
out of Russia, China, and various underdeveloped 
but rapidly changing countries, such as Ghana and 
Kenya, indicated a high level of public concern 
and probably actual increases in the incidence of 
juvenile delinquency. The United Nations has at- 
tempted to promote better standards of data gath- 
ering and reporting and to arrive at some interna- 
tional consensus as to the volume and nature of 
delinquency throughout the world. 

Delinquency rates in the more technologically 
and economically advanced countries appear to be 
the highest in the world. Studies of the relation be- 
tween delinquency rates and cycles of economic 
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depression and prosperity have yielded inconsistent 
results (Bogen 1944; Fleisher 1963; Glaser & Rice 
1959; Gibbs 1966). In large cities of these same 
countries high delinquency rates are associated 
with poverty, as reflected in ecological distributions 
and rates for persons in different social classes, but 
in smaller communities delinquency appears to be 
more evenly distributed throughout the class struc- 
ture, There is growing interest, but little evidence, 
concerning increases in delinquent behavior on the 
part of “middle-class” youngsters in prosperous and 
technologically advanced countries. 


Delinquent subcultures 


Just as age, sex, ethnic, and social class distribu- 
tions of those who come to the attention of au- 
thorities for delinquent behavior vary by ecological 
setting, so also the forms that delinquency takes 
have characteristic ecological distributions. Far 
more than delinquency that occurs in the midst of 
mass rioting, for example, gang delinquency tends 
to be concentrated in high delinquency areas. 

Variations in gang organization and behavior are 
well documented (Thrasher 1927). There has been 
much speculation and some research on the exist- 
ence of regularities amidst such variability. The 
concept of “delinquent subcultures” has been intro- 
duced, in part to describe and in part to account 
for the forms taken by gang delinquency. Here we 
are concerned with classifications of delinquency 
as descriptive categories and with what is known 
about them. 

The term “delinquent subcultures” refers to de- 
linquent behavior, with supporting norms, values, 
and structures, which is traditional among mem- 
bers of a group or several groups of young people 
(Cohen 1955). Although Cloward and Ohlin (1960) 
and others have suggested the following classifica- 
tion of delinquent subcultures, the extent to which 
concrete subcultures can be clearly differentiated 
in these terms is still uncertain, Conflict subcul- 
tures are described as consisting of networks of 
gangs engaged in periodic disputes over local ter- 
ritory (as within a particular neighborhood), real 
or imagined insults, proprietary interests in girls, 
etc. Criminal subcultures are oriented toward eco- 
nomic advantage by illegitimate means, through 
planning and organization, which often involve 
marketing arrangements and protection from police 
interference. The gang and these arrangements with 
adults are characteristic organizational features of 
criminal subcultures, in contrast with the networks 
of gangs making up conflict subcultures. Retreatist 
subcultures seek esoteric experience, or “kicks,” as 
in the use of drugs, or participation in “deviant” 
sexual behavior; here gang organization is sub- 


ordinated to subcultural forms of experience, re- 
gardless of group identity. These three types of 
subcultures are described as involving chiefly 
lower-class males, although females are acknowl- 
edged to participate in each to some extent (Rice 
1963; Short & Strodtbeck 1965). 

Middle-class delinquency of a subcultural nature 
has also been hypothesized; it is said to involve 
deliberate courting of danger and excessive con- 
sumption of alcohol, but with less violence than is 
found in lower-class subcultures. There are also 
collective forms of delinquency that are neither 
gang oriented nor supported by a subcultural sys- 
tem. Of particular note are the mass riots of ado- 
lescents that have accompanied public concerts or 
motion pictures of teen-age “idols” in various 
Western countries [Middendorff 1960; see also 
DELINQUENCY, article on DELINQUENT GANGS]. 
While these are not common, when they have oc- 
curred their “epidemic” character, within countries 
and from one country to another, suggests the 
actions of “expressive crowds” which at times 
have been transformed into “acting crowds” bent 
on destruction [see CoLLECTIVE BEHAVIOR]. Higher 
proportions of leaders than of followers in these 
crowds have been found to have previous records 
of delinquency. Participation has ranged broadly 
across the social class spectrum, however, and 
involves many youngsters without previous or sub- 
sequent participation in delinquency. 

A second type of elementary collective behavior 
involving occasional delinquency has accompanied 
the spring holidays of students in various coun- 
tries. Here property destruction and the defiance 
of adult authority and conventional morality also 
occur with a considerable element of spontaneity 
and collective excitement. Delinquency is not the 
object of those who participate, but it is a frequent 
result. A 

There is little research documenting the exist- 
ence or the nature of delinquent subcultures Or 
other collective forms of delinquency. On the basis 
of reports from several Western countries, the 
following conclusions seem warranted concerning 
subcultures and gangs: a a 

(1) Delinquent gangs exist in many cities an 
are responsible for much theft, property dammik 
and violence against persons (themselves an 
others). 

(2) Most but not all countries report that suc 
gangs are especially common among youngsters i 
the most depressed socioeconomic areas of k 
cities and in low-rent housing projects, which z 
many countries have been built to alleviate hou 
ing conditions for lower-class families. 3 

(3) More boys than girls are members of gangs» 


uch 
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and boys are the acknowledged leaders, but girls 
frequently participate and, when they do, often 
“instigate” delinquent episodes. Sex relations within 
the gang most commonly occur between boys 
and girls who are “going steady,” although shifts 
often occur in such alliances. Marriages, both 
common-law and legally formalized, frequently oc- 
cur between gang boys and girls, but the courtship 
system is not exclusive for either, and many prefer 
mates who have not participated in any gang, or 
at least not in their own. 

(4) It is evident that youngsters who partici- 
pate in gangs gain status within the gang as a 
result of their participation, and in many cases 
they enjoy the notoriety of gang exploits that re- 
sults from mass media coverage of delinquent 
episodes. The mass media thus help to perpetuate, 
and perhaps to create, both gangs and delinquent 
subcultures, for it is partially through their cover- 
age of delinquent episodes that delinquents come 
to know of one another, and gang reputation as a 
status criterion is enhanced. 

(5) Delinquent behavior by gangs is episodic 
in nature, and, except in rare instances, not all 
gang members participate in any given episode. 
Observation suggests that those youngsters who 
are core members of gangs, particularly those who 
occupy positions of leadership or are striving to do 
so, are most likely to be involved in a given 
episode, 

(6) Gang size and structure vary greatly, but 
gangs with more than fifty members are relatively 
rare; structure tends to be weak and fluid; and the 
cohesion of gangs is weak except in times of crisis. 

(7) The elaboration of gang “nations” through 
rough age-grading (senior, junior, midget, peewee 
branches of a larger gang) or across neighborhood 
or community lines (East Side and West Side 
branches, etc.), according to published reports, has 
occurred only in a few large cities of the United 
States (New York and Chicago, in particular). 

(8) Similarly, hypothesized delinquent subcul- 
tures (conflict, criminal, retreatist) appear to exist 
in relatively pure form only in these large U.S. 
cities. European studies do not report them. From 
the United States, fragmentary evidence suggests 
that in rural areas and in small towns, and in 
larger cities without the huge concentrations of 
disadvantaged ethnic populations together with ab- 
ject poverty such as exist in New York and Chicago, 
group delinquency does not assume the specialized 
forms described as delinquent subcultures. 

This is not to say that delinquency in other areas 
may not be subcultural in nature. It seems likely, 
both theoretically and empirically, that in many 
instances it is. Youngsters who “get into trouble” 


in any local institutional context tend to know one 
another and, through such processes as exclusion 
from circles of nontroublemakers, the search for 
solutions to common problems, and mutual attrac- 
tion, they tend to be thrown into interaction with 
one another (Empey & Rabow 1961). Legal defini- 
tions seem crucial, for they provide a common 
identity of those processed, for themselves and 
for others. 

In summary, the concept of delinquent subcul- 
tures requires greater theoretical and empirical 
specification. Whether it will prove to be a viable 
concept will depend upon demonstration of utility 
not otherwise provided by such concepts as per- 
sonality, group and community process, collective 
behavior, and concepts of social class subcultures. 


Class, personality, and group process 

Discussions of collective patterns of delinquency 
are couched largely in terms of the position of 
lower-class adolescents in the social structure. 
Community and group-process variables, which are 
more immediate in the experience of individuals 
and groups, provide a possible linkage between 
theories couched in terms of social structural vari- 
ables and those based on personality considerations. 

Thus, Miller (1958) describes gang delinquency 
as a manifestation of various lower-class “focal 
concerns,” such as trouble, toughness, smartness, 
excitement, fate, and autonomy. Similarly, Cloward 
and Ohlin (1960) describe relations between ado- 
lescents and adults at the community level as 
crucial in determining the nature of collective de- 
linquent behavior on the part of juveniles. But in 
neither case are personality variables integrated 
either theoretically or empirically with social and 
cultural concepts. 

The group has been described as the context of 
much delinquency, and Cohen (1955), especially, 
has stressed its catalytic nature by virtue of which 
more extreme behavior occurs than otherwise 
would be likely on the part of participating indi- 
viduals. More recently, in experimentally manipu- 
lated groups in treatment cottages and in gangs 
in their natural community settings, such group 
processes as role playing and status striving have 
been found to be involved in the precipitation of 
delinquency episodes (Short & Strodtbeck 1965). 
Recruitment to particular group roles may prove, 
in part, to be a function of individual personality 
characteristics. 

Role performance is likely also to be linked in 
important ways to personality. The observation 
that gang leaders often react to status threats by 
leading or attempting to lead the group in delin- 
quency episodes is a case in point. It seems likely 
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that what is considered threatening will in im- 
portant ways be determined by the experience and 
the personality configuration of the individual 
leader, It can hardly be doubted that personality 
considerations, in addition to role and situational 
elements, are involved in a broader spectrum of 
delinquent behavior. 

Psychological types of delinquents. Several psy- 
chological types of delinquents have been proposed. 
The classification with perhaps the greatest amount 
of theoretical and research specification is that 
which describes three major types and discusses 
their social correlates (Reiss 1952; cf. Hewitt & 
Jenkins 1947). The relatively integrated delin- 
quent, characterized by adequate personal con- 
trols, accounts for nearly two-thirds of the boys 
classified. This offender often is a member of an 
organized group and characteristically resides in 
a high delinquency area; his family socialization is 
conventional and reasonably adequate. In contrast, 
the relatively weak ego delinquent is described as 
an insecure, internally conflicted, and anxious per- 
son. His offense pattern characteristically includes 
property destruction and other aggressive acts, but 
not gang membership. The defective superego de- 
linquent has failed to internalize conventional 
standards of conduct and is, instead, oriented to- 
ward deviant groups. His offense pattern, like that 
of the integrated delinquent, typically involves 
property offenses and aggressive behavior in a 
gang context. He is described as having been so- 
cialized in a broken home characterized by much 
conflict and nonconventional behavior. 

Repeated attempts to demonstrate that there is 
a “delinquent” personality type have failed. The few 
studies that have employed control groups have 
compared “delinquents” with “nondelinquents,” 
without regard for more subtle differences in de- 
linquency involvement. On the basis of such stud- 
ies, it appears that delinquents are characterized 
by greater aggressive tendencies and less neuroti- 
cism than are nondelinquents (Glueck & Glueck 
1950). 


The relation between various personality char- 
acteristics and processes, on the one hand, and 
patterns of delinquency involvement, on the other 
hand, is unclear, theoretically and empirically. 
Viewed as a special case of collective adaptations 
by young people generally, this relation is one of 
the most significant unknowns in the behavioral 
sciences. 


JAMES F. SHORT, Jr. 


[See also Crime, article on CAUSES OF CRIME; DEVIANT 
BEHAVIOR. ] 
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H 
SOCIOLOGICAL ASPECTS 


Theories of delinquency causation can best be 
understood as a part of the social context in which 
they are developed. Two features of that social 
context seem crucial to an understanding of cur- 
rent theories, First, most of the systematic explana- 
tory efforts have come from the United States, 
where a pragmatically oriented social science 
developed early in the twentieth century and 
has continued to flourish. Thus, despite the histori- 
cally important theoretical works (Lombroso 1899; 
Bonger 1905) and systematic treatises on crimi- 
nology (Hurwitz 1947) contributed by European 
scholars, as well as the growing number of careful 
empirical studies from Scandinavia (Christie 
1960), England (Morris 1958), and other Euro- 
pean countries, most of the systematic theories 
continue to reflect the dominant trends in Amer- 
ican social science. Second, within American social 
science the subject matter of delinquency has been 
a primary concern of sociologists, who have done 
most of the research and written most of the texts, 
and a secondary but still important concern of 
psychologists; it has received less attention from 
the other social science disciplines. 


Theories of delinquency causation 


The following review of five theoretical emphases 
will illustrate the range of current efforts to under- 
stand the causation of delinquency. It will omit 
work done primarily from within a biological or 
physiological perspective, as well as that done 
within a psychoanalytic framework. 

Psychogenic theories. Since shortly after the 
turn of the century a number of essays and studies 
have located the causal processes of delinquency 
within the individual delinquent. Many of these 
contributions have come from clinical experience 
with delinquent youths. The central argument is 
that delinquency is a “solution” to psychological 
problems stemming primarily from faulty or path- 
ological family interaction patterns. 
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The research and theoretical interpretations of 
Healy and Bronner (1936) provide an excellent 
example of this approach. These investigators sys- 
tematically compared delinquent youths with their 
nondelinquent siblings. The most important dif- 
ference between them was that over 90 per cent 
of the delinquents, as compared to 13 per cent of 
their nondelinquent siblings, had unhappy home 
lives and felt discontented with their life circum- 
stances. The nature of the unhappiness differed: 
some felt rejected by parents; others felt inadequate 
or inferior; others were jealous of siblings; still 
others were affected by a more deep-seated mental 
conflict. But whatever the nature of the unhappi- 
ness, delinquency was seen as a solution. It brought 
attention to those who suffered from parental 
neglect, provided support from peers for those who 
felt inadequate, and brought on punishment for 
those who sought to reduce guilt feelings. This 
study, though published in 1936, is still the proto- 
type of studies following what may be called a 
psychogenic model, that is, studies that search for 
the causes of delinquency in the psychological 
make-up of the delinquent, which typically stems 
from specific styles of family interaction. 

Later studies in this tradition have more clearly 
identified detailed aspects of family relations that 
seem important, but they have not altered the 
fundamental point of view. McCord, McCord, and 
Zola (1959) found that among youths from areas 
of low socioeconomic status, delinquents differed 
from nondelinquents in the extent of parental re- 
jection and in the inconsistency of punishment and 
discipline. Bandura and Walters (1959) gathered 
detailed interview data from adolescents with rec- 
ords of highly aggressive actions and compared 
them with a control sample of nondelinquents. 
Their investigation suggested that the delinquent 
youths differed from the nondelinquent controls 
relatively little in their relationship with their 
mothers but that they had much less identification 
with their fathers. In addition, discipline was 
meted out in a harsher manner within the families 
of the delinquents. Also, the more aggressive, de- 
linquent boys had poor relations with their peers, 
while the boys with more healthy family relations 
had stronger peer relations as well. 

In the interpretations of these studies, the psy- 
chogenic model is clearly in operation: unfor- 
tunate, unhappy family circumstances lead to 
personal psychological problems of adjustment for 
the youth, which in turn are in some way solved 
by the commission of delinquent acts. Attention 
focuses on the boy, his family, and their problems. 
Delinquency is assumed to be a form of reaction 
to the problems. 
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Sociogenic theories. At about the time that 
Healy and Bronner and others were studying the 
psychogenic sources of delinquency, social scien- 
tists at the University of Chicago were impressed 
by the force of culture and social disorganization 
in a rapidly expanding city filled with immigrants 
and other minorities. In a series of investigations of 
crime rates in the urban area, it was found that 
rates of delinquency were stable from one period 
to another, that the largest percentage of youths 
committed their delinquencies in companionship 
with others, and that a series of sociocultural 
characteristics were typically associated with the 
areas where high rates were found. These charac- 
teristics included rapid turnover of population; a 
sizable portion of the land used for industrial build- 
ings; high rates of minority-group membership; 
and high rates of social disruption as reflected in 
broken homes, suicide, alcoholism, and related 
problems, The studies of Thrasher (1927), Shaw 
and McKay (1942), and Shaw and Moore (1931) 
report both evidence and argument in line with this 
tradition. 

In the face of these characteristics, it was natural 
for students of delinquency to see important causes 
residing not in the youth’s family alone but in the 
cultural context of his home and neighborhood. 
In such areas a boy's “natural history” was to par- 
ticipate in delinquent careers (Shaw 1929), No 
specific psychological block seemed necessary; pat- 
terns of delinquent and criminal action were all 
around. Joining in those patterns would be natural 
for all except those restrained by close family life 
or by strong ties to conventional community in- 
stitutions such as the school. 

In this research tradition the key concept for 
understanding why youths became delinquent was 
association with other youths already delinquent. 
This was later put most clearly by Sutherland, who 
developed the theory of differential association 
(Sutherland & Cressey 1960). This theory asserts 
that youths become delinquent to the extent that 
they participate in settings where delinquent ideas 
or techniques are viewed favorably: the earlier, the 
more frequently, the more intensely, and the longer 
the duration of the youths’ association in such 
settings, the greater the probability of their becom- 
ing delinquent. Unlike the psychogenic theories this 
set of ideas focuses on what is learned and who it 
is learned from rather than on the problems that 
might produce a motivation to commit delinquen- 
cies. Support for this theory has been found both 
in studies using the method of self-reported de- 
linquent behavior (Clark & Wenninger 1962; Akers 
1964) and in those using official crime data (Reiss 
& Rhodes 1964), 


Theories of delinquent subcultures. At least 
until the early 1950s the two perspectives reviewed 
above were the dominant forms of theories at- 
tempting to link individuals with delinquency. In 
the decade of the 1950s, however, attention was 
drawn to organized-gang activities more insistently 
than in the past. Partly this was because of an 
apparent increase in the amount of violence and 
crime engaged in by such groups, and partly be- 
cause it seemed to reflect the nearly universal im- 
portance of the gang in industrialized countries, In 
any event, there were enough differences between 
the formally organized gangs studied and reported 
on during the 1950s and the somewhat looser ag- 
gregates usually referred to in the earlier studies 
to generate a new series of theoretical efforts 
focused on the organized street gang. 

The central concept that emerged to encompass 
the phenomenon of the gangs was that of the 
delinquent subculture. The concept “culture” refers 
to the set of values and norms that guide the be- 
havior of group members; the prefix “sub” indicates 
that these cultures often emerge in the midst of a 
more inclusive system, Delinquent subculture re- 
fers, then, to a system of values and beliefs en- 
couraging the commission of delinquencies, award- 
ing status on the basis of such acts, and specifying 
typical relationships to persons who fall outside 
the delinquents’ social world. 

Several variations of the theory of subcultures 
have been formulated, each of which provides a 
distinctive way of locating the nature of the de- 
linquent patterns, the sources that give rise to 
the patterns, and the functions the patterns serve 
for the groups where they are found. Leading state- 
ments of this theory include those of Cohen 
(1955), Cloward and Ohlin (1960), and Miller 
(1958). Variations or modifications have been sug- 
gested by Yablonsky (1962), Bordua (1961), and 
Spergel (1964). Detailed review of the various 
ideas cannot be provided here, but it is important 
to note the central line of argument, for it has 
something in common with both of its predeces- 
sors. Like the more general sociogenic theories, the 
sources of the problem are found in the nature of 
the social structure and the particular patterns of 
neighborhood and community life where delin- 
quent gangs abound: high population density, low 
socioeconomic status, high rates of family disor- 
ganization. No matter how the particular argument 
is framed, there is agreement on the general source 
of delinquent subcultures within the broader Con- 
text of urban slum life. 

Unlike those involved in earlier sociogenic work, 
most subculture theorists have been unwilling H 
rest with the concept of differential association an! 
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the explanatory power it provides. Like the psy- 
chogenic theorists, they have seen the formation 
of delinquent gangs and subcultures as a reaction 
to the problems caused by low social status in a 
world where high status is valued and yet where 
access to the means to high status is relatively 
unavailable to them. In this feature, the explana- 
tory model derives from the more general theory 
of deviant behavior and anomie (Merton 1938). 

The situational delinquent. The theories out- 
lined above have one quality in common. In all of 
them delinquency is viewed as having deep roots. 
In the psychogenic tradition these roots lie pri- 
marily within the individual, while in the sociogenic 
tradition they lie in the structure of the society, 
with emphasis either on the ecological areas where 
delinquency prevails or on the systematic way in 
which the social structure places some individuals 
in a poor position to compete for success. But no 
matter where the emphasis is laid, it is clear that 
in these theories delinquency is a serious business, 
requiring for its prevention or correction a funda- 
mental reorganization of the psyche, the social 
structure, or the culture. 

The conception of the situational delinquent pro- 
vides a different perspective. Here, the assumption 
is that delinquency is not deeply rooted, that the 
motives for delinquency are often relatively simple. 
And if they are not so deep, then perhaps their 
causes do not lie in the deeper recesses of the 
psyche and the social order but are closer to the 
immediate contingencies of action in the everyday 
world. Such a conception of the nature of delin- 
quent motives forces a closer examination of the 
immediate social context of delinquency and of 
the problems of adolescent youth rather than of the 
more distant problems of achieving middle-class 
status from lower-class origins. 

This conception is receiving increased attention 
from students of delinquency. It is partially re- 
flected in the provocative treatment by Matza 
(1964a; 1964b), who restores the concept of “will” 
to the study of delinquency and argues persuasively 
that potential and actual delinquents assess risks 
and engage in rationalizations in much the man- 
ner suggested by the traditional criminal law. He 
notes, for example, that when delinquents offer 
excuses for their acts, the excuses bear a close 
resemblance to those provided within the tradi- 
tional law of crimes. And he emphasizes the shift- 
ing, transitory character of delinquent acts and 
actors. 

Other students reflect a similar perspective in 
varying degrees. In one of the principal empirical 
studies of gang delinquency, sociologists James 
Short and Fred Strodtbeck (1965) began with the 
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orientations suggested by the conception of a de- 
linquent subculture but became increasingly per- 
suaded of the necessity to study immediate group 
process and interaction. Briar and Piliavin (1965) 
provide further argument in favor of a situational 
approach. 

Delinquency and secondary deviation. Tannen- 
baum (1938) suggested years ago that the forma- 
tion of delinquent careers owed much to the stigma 
and labeling associated with the processing of de- 
linquents by the police, courts, and other agencies. 
Since almost all adolescents engage in some form 
of delinquency, the interesting explanatory prob- 
lem may be why some continue and others do not. 
Lemert (1964) speaks of secondary deviation to 
refer to the deviant acts that appear to have as 
their cause the limitations placed on normal func- 
tioning by the prior definition of the person as a 
deviant. The strong suggestion is that the develop- 
ment of a disciplined delinquent career is a re- 
sponse to the labeling and stigma that result from 
official treatment as a deviant. Whatever the source 
of the initial delinquent act, the labeling and stigma 
that result from the official processing provide im- 
portant new stimulus toward delinquency, for the 
actor is invited, implicitly, to assume a delinquent 
role. 

This emphasis directs attention to an under- 
developed area in the general theory of deviant 
behavior, the analysis of the deviant’s response to 
the organized system of social control (Cohen 
1965). In the case of delinquency there are still 
no definitive studies that clearly trace the conse- 
quences of official sanctions per se on the later 
conduct of the delinquent. The theoretical em- 
phasis on secondary deviance points strongly to 
the need for such studies. 

Social definitions of delinquency. In all of the 
theoretical schemes traced thus far, the emphasis 
remains on the delinquent and the forces that move 
him toward or away from delinquency. Yet one of 
the most interesting features of delinquency is the 
way it is socially defined by conventional people. 
Unlike some of the more traditional areas of 
criminal law, the concept of delinquency is only 
vaguely spelled out in legislation. The result is 
that much room is open for discretion in the desig- 
nation of conduct patterns as “delinquent.” This 
discretion is in the hands of police and judicial 
officials, with the result that delinquency is in a 
very real sense a product of the interaction between 
adolescents and officials and cannot easily be un- 
derstood by focusing on only one side of the in- 
teraction (Wheeler 1965). 

Thus, theories of delinquency are beginning to 
be concerned with the conduct of officials, since 
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that conduct is an integral part of the official label- 
ing of persons as delinquent. Political scientist 
James Q. Wilson has found, in comparative 
studies of police administration carried out in the 
mid-1960s, that variations in the juvenile arrest 
rate in two different cities may be largely a func- 
tion of differences in the nature and type of police 
organization in the cities. These differences are not 
merely a function of differing degrees of police 
efficiency; rather, they reflect differences in what 
is conceived to be delinquent conduct. This view 
requires a radical reconstruction of theories of 
delinquency, one that focuses jointly on the be- 
havior, background, and social setting of the de- 
linquent and on the behavior, background, and 
social setting of those who are officially charged 
with responding to delinquency. This perspective, 
of course, is not limited to delinquency. It is equally 
applicable to a broad range of deviant behavior 
patterns (Becker 1963; Cicourel & Kitsuse 1963). 

Limitations of current theories, Among the 
many problems and issues facing all those who 
would develop systematic theories of delinquency 
causation, the following seem most important. 

Adequacy of data. The theories reviewed above 
all suffer heavily from a lack of adequate support- 
ive data. Most of the thedries require for their 
test the gathering of information about adolescents 
that is of a type not routinely collected by the of- 
ficial agencies that deal with youth. This is perhaps 
most true of the theories that emphasize collective 
enterprise among delinquents, as well as situational 
elements and structures. Record-keeping systems 
are designed to report individual acts of delin- 
quency and typically provide very little information 
about the surrounding social context. It is an espe- 
cially demanding and exacting enterprise to gather 
systematic data of a type that bears closely on the 
truth value of the various theories. 

Reconciling divergent perspectives. The several 
perspectives described above may each be: valid, 
but for differing sectors of the adolescent popula- 
tion, Delinquency is a blanket concept that of 
course conceals a variety of different types of de- 
linquent acts. One way of harmonizing the various 
Perspectives is to note their applicability to dif- 
ferent types of delinquents. Hewitt and Jenkins 
(1947) have provided one such typology, Reiss 
(1952) has provided another, and Gibbons (1965) 
has reviewed a variety of such efforts for both 
delinquents and adult offenders. 

Still another way of reconciling the divergent 
perspectives is to assume that they interact under 
appropriate conditions to produce delinquency. 
While most students would agree with Cohen's 


critique (1955) of multiple-causation theories as 
inadequate, most would also agree that the several 
processes described above may interact to produce 
effects that no one of them would produce operating 
alone. One or two empirical studies, for-example, 
show the joint effects of socioeconomic status and 
family pathology on delinquency (Stanfield 1966). 
Yet this area of research is still grossly underde- 
veloped. 

Practical implications. A seemingly important 
practical consequence of a theory of delinquency 
causation is that it might suggest action strategies 
for the prevention or control of delinquent conduct. 
Although the action implications of some of the 
above theories have never been spelled out clearly, 
there are at least a few instances in which preven- 
tive programs have developed (Powers & Witmer 
1951; Miller 1962; Mobilization for Youth, Inc. 
1961). Although it is too early to draw firm con- 
clusions, most investigators would agree that the 
programs to date have not proved effective. A chief 
problem remains to spell out clearly the practical 
implications of the theories, to design preventive or 
corrective programs that embody those ideas, and 
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when the theories are applied. 

Cross-national studies. The theories treated 
here find the causes of delinquency within the 
family, the neighborhood, the community, or in 
one’s location in the social order. They do not relate 
delinquency directly to the forces of mass society 
itself. Yet it is the burden of argument of much 
serious social criticism and broader social theory 
that the problems ought to be couched at the level 
of mass society itself. The modern bureaucratized 
world, it is argued, produces a broad malaise, a 
sense of alienation from work and community, 4 
loss of identity, and a decline in meaningful experi- 
ence. The resulting anomie leads to many deviant 
reactions, among them delinquency. a 

These ideas are too broad and general to consti- 
tute a theory of delinquency that is subject 9 
careful empirical test, but it is fitting to close this 
article by noting that problems of delinquency arè 
indeed common to most of the industrialized world. 
A grave limitation of current theories lies in the 
absence of a genuine cross-national focus upon 
delinquency. Despite the technical problems that 
arise in cross-national research, it seems an e 
tial ingredient if our theories are not to be high y 
limited in their scope and explanatory capacity. 

STANTON WHEELER 
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PSYCHOLOGICAL ASPECTS 


Ironically, as nations grow richer and the oppor- 
tunities for youth multiply, juvenile delinquency 
seems to increase steadily. In almost every society, 
of course, adults have complained about the pre- 
sumed immorality of children. Even Socrates in 
the fifth century B.c. sounded an alarm about the 
“delinquents” of his time. Editorialists today echo 
Socrates’ opinion, but in our era, laments about the 
condition of young people may have more to sustain 
them. 

Incidence of delinquency. While every statistic 
in the measurement of crime remains open to 
doubt, there seems little reason to deny that de- 
linquency grows as a society becomes industrialized 
and urbanized (Bloch & Geis 1962). In England, 
for example, the best studies indicate that bur- 
glaries committed by delinquents climbed almost 
200 per cent between 1938 and 1961; sex offenses, 
by about 300 per cent; and violent crimes by an 
astounding 2,200 per cent (Mays 1963). The 
United States, that classical arena for crime, ex- 
hibits a similar trend: juvenile crimes have gone 
up since 1940 by about 93 per cent (Bloch & Geis 
1962, p. 186). These statistics refer only to of- 
ficially recognized crimes. Many juvenile offenses 
remain undetected or, if recorded, go unpunished. 
Indeed, as John Mays’s studies have shown, virtu- 
ally every child in certain urban areas has partici- 
pated in a wide range of delinquent activities even 
though only 10 per cent or 15 per cent of the 
youthful population are recognized by the courts 
as delinquents (Mays 1963). 

Using official definitions of delinquency, it ap- 
pears that as a society progresses materially, crime 
increasingly becomes the prerogative of youth. 
Thus, in the United States during the 1950s, crimes 
committed by boys under eighteen increased six 
times as rapidly as crimes committed by adults. By 
1961, American children accounted for 8 per cent 
of the arrests for murders, 20 per cent of rapes, 51 
per cent of burglaries, and 62 per cent of auto 
thefts. As many as one-fifth of American boys be- 
tween the ages of ten and seventeen have appeared 
before courts or have been arrested by police 
(Bloch & Geis 1962). 

Every affluent nation faces a similar problem. 
Juvenile crime has grown immoderately during the 
last fifty years and has outdistanced the rate of 
many types of adult crime. To understand this 
phenomenon requires some comprehension of the 
relation of culture, social structure, and early en- 
vironmental experiences as they combine to pro- 


duce psychological types prone to delinquency. The 
first step toward such knowledge is to review briefly 
the great fund of knowledge concerning the char- 
acteristics of delinquents. 


General characteristics 


Psychological studies of delinquents have tra- 
ditionally been concerned with how delinquents 
differ from nondelinquents in intellectual struc- 
ture, physiological constitution, and personality 
characteristics, 

Intellectual structure of delinquents. In 1912, 
Henry H. Goddard initiated research on the intel- 
ligence of delinquents (Goddard 1921), and his 
original endeavors stimulated hundreds of similar 
studies. Goddard administered a crude test of in- 
telligence to incarcerated delinquents and found 
that 50 per cent were “feeble-minded.” Later re- 
search appeared to confirm this result. Edwin H. 
Sutherland, for example, surveyed 340 studies of 
delinquents conducted up to 1928 (Sutherland 
1931). He reported that these studies found that 
from 50 per cent to 20 per cent of delinquents were 
feeble-minded. 

More recent research utilizing more sophisticated 
tests has, however, tended not to support the earlier 
studies. L. A. Siebert, for example, tested 8,003 
court cases. The average IQ of these delinquents 
was 91.4, slightly below the national average (Sie- 
bert 1962). Caplan, who has reviewed contempo- 
rary evidence, concludes that delinquents differ 
from nondelinquents only by approximately eight 
points on standard intelligence tests (Caplan 1965). 
This inconsequential difference can be explained 
in a number of ways: more intelligent delinquents 
may escape detection by the police, or perhaps the 
fact that the tests are normally administered in a 
Stressful situation (such as a juvenile reformatory) 
may affect the performance of delinquents. 

While the general intelligence of delinquents 
does not differ significantly from that of nonde- 
linquents, there may be specific cognitive functions 
where differences exist. Wechsler has found that 
delinquents usually score higher on “performance 
tasks than on those which require verbal skill 
(Wechsler 1939). This characteristic seems, how- 
ever, to be a reflection of the lower socloecon EE 
Status of delinquents, rather than specifically 0 
their criminal activities (Caplan 1965). Baker and 
Sarbin have found that delinquents utilize a rela- 
tively limited number of cognitive categories in 
viewing the outside world (Baker & Sarbin 1956). 
In consequence, they are less able to tolerate am- 
biguities, less able to predict the behavior of others, 
and tend to deal with other human beings as if they 


were simply “mirror images” of the delinquents 
themselves. [See INTELLIGENCE AND INTELLIGENCE 
TESTING. ] 

Physique and physiology of delinquents. A 
number of investigators have long been concerned 
with examining the constitution of delinquents, on 
the assumption that certain physical traits might 
determine the character and actions of the delin- 
quent. Lombroso may be considered the founder of 
this school of thought. He believed that delinquents 
were “moral idiots” who differed from normal men 
in basic and inborn physical characteristics (Lom- 
broso 1899). Goring and others have subsequent- 
ly disproved Lombroso’s specific claims (Goring 
1913), but more sophisticated researchers have 
found evidence to support the belief that the 
physique of delinquents differs from that of non- 
delinquents. These investigators have compared the 
total body structure of delinquents and nondelin- 
quents and have found, generally, that delinquents 
tend more often to be “mesomorphs” (relatively 
muscular, tightly knit people). Work by Sheldon 
(Sheldon 1949) and by the Gluecks (Glueck & 
Glueck 1950) supports this view. The Gluecks, for 
example, found that 60 per cent of their sample of 
delinquents could be typed as mesomorphs. Al- 
though the matter remains in great dispute, there 
seems, on the whole, to be sufficient evidence to 
conclude that delinquents are more often meso- 
morphic. Such a finding may, of course, be inter- 
preted in a number of different ways. Some scholars 
contend that mesomorphy may be considered as a 
reflection rather than a cause of delinquency, since 
obviously most delinquent acts require a fairly mus- 
cular physique. Other scholars would attribute 
greater significance to the research findings and 
would contend that the bodily constitution is a 
basic causal factor (e.g., mesomorphs, at birth, are 
presumably more aggressive; as they grow up, this 
higher level of aggression may push them toward 
delinquency). [See PSYCHOLOGY, article on CONSTI- 
TUTIONAL PSYCHOLOGY.] 

Other investigators have been concerned with 
various specific neurological and physiological 
functions of delinquents. Lindner, for example, has 
noted that a high proportion of delinquents under- 
react to painful stimuli (Lindner 1942/1943). 
Stafford-Clark has found that delinquents exhibit 
more signs of physical immaturities or “develop- 
mental anomalies” than do nondelinquents (Staf- 
ford-Clark et al. 1951). The Ostrows, as well as 
other investigators, have reported that delinquents 
have more indications of neurological disorders 
than does a normal population (Ostrow & Ostrow 
1946). These various results can again be inter- 
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preted in several ways. It could be that the higher 
rate of physical disorders is simply a result of en- 
vironmental deprivation, or it may be that, in some 
cases, a physiological malfunction plays a causal 
role in producing delinquency. 

Personality characteristics of delinquents. Many 
psychologists have been concerned with defining 
the particular personality traits which lead people 
to become delinquents. Siegman in Israel found, 
for example, that delinquents are highly “present- 
oriented” and do not plan for the future (Siegman 
1961). In a series of studies Hathaway and Mona- 
chesi have utilized ihe Minnesota Multiphasic Per- 
sonality Inventory to delineate various personality 
patterns of delinquents (Hathaway & Monachesi 
1953). They have shown that many delinquents 
are emotionally disturbed, but equally important, 
that certain types of emotional disorder are nega- 
tively correlated with delinquency. 

Dinitz and his colleagues have longitudinally 
studied a group of boys who had been exposed to 
social influences which, under usual circumstances, 
produce delinquency (Dinitz et al. 1958). The de- 
linquents in this group were less “socialized,” less 
responsible, and more often perceived themselves 
as likely “to get into trouble.” 

While these studies are highly provocative, the 
evidence suggests that there is no single personality 
pattern which characterizes all delinquents, or even 
a majority of delinquents. In 1950, Schuessler and 
Cressey reviewed 113 investigations and concluded 
that delinquency and specific personality traits 
were not related (Schuessler & Cressey 1950). 
More recently, Quay has examined the literature 
and decided that there are particular types of de- 
linquents who differ from each other and have dis- 
tinctive personality characteristics (Quay 1965). 
To understand fully the psychological aspects of 
delinquency, then, it is necessary to examine each 
of these various types. 


Types of delinquency 

With some precision and a degree of practical 
utility, juvenile delinquents can be categorized into 
three general classes: the socialized delinquent, the 
neurotic delinquent, and the psychopathic delin- 
quent. Each of these types exhibits certain traits 
that distinguish it from the other types and from 
nondelinquents. Since the background and treat- 
ment of each variety of delinquent differs, an 
understanding of youthful crime should commence 
with an examination of these differences. 

Socialized delinquency. The socialized delin- 
quent has been analyzed by such researchers as 
Albert J. Reiss, R. L. Jenkins, and Lester Hewitt 
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(Reiss 1952; Hewitt & Jenkins 1947). Indeed, most 
research on delinquency has concentrated on this 
type—and rightfully so, since socialized delin- 
quents account for the great majority of crimes. 
The socialized delinquent does not suffer from any 
particular psychological disorders, other than those 
which characterize the typical adolescent. His 
crimes are not motivated by deep-seated anxieties 
or unresolved conflicts, but rather by a simple de- 
sire to conform to the norms of his gang. Typically, 
such a boy comes from a transitional urban area 
where a gang subculture has become entrenched. 
In his immediate social environment, delinquency 
has evolved into an accepted, even honored, pat- 
tern of life, a legitimate way of achieving prestige 
in the juvenile community. Often, in early child- 
hood the socialized delinquent joins a gang and 
imbibes the values and habits of a delinquent sub- 
culture, Put most simply, in the fashion of Edwin 
Sutherland, the child learns to become criminal 
through “differential association” with delinquents 
(Sutherland & Cressey 1924). 

One should not conceptualize the process of be- 
coming a socialized delinquent, however, as just a 
matter of chance learning. Only certain children 
seem to be drawn toward a gang culture. Spe- 
cifically, as many pieces of research have demon- 
strated, the socialized delinquent seems to be pro- 
duced by a frustrating and inconsistent familial 
background. Compared with noncriminal children, 
the socialized delinquent has most often been 
raised in a family characterized by parental con- 
flict, rejection, or neglect (Glueck & Glueck 1950), 
In such a family, the child finds that his elemental 
needs for self-esteem, security, and emotional 
warmth are frustrated. Not unnaturally, he searches 
for alternative ways of fulfilling his basic needs, 
and, if he lives in a delinquent neighborhood, the 
gang offers him a very “live” option. Typically, too, 
the socialized delinquent has been reared in a 
home characterized by a lack of supervision and 
erratic discipline. In consequence, he has not inter- 
nalized the usual middle-class standards that could 
immunize him from delinquent influences in his 
broader social environment. 

Such a child adopts the ethic and customs of the 
gang as a means of assuring himself the sense of 
importance and security denied him in other ways. 
The gang can offer him acceptance and guidance. 

As socialized delinquents reach adulthood, the 
majority shed the pattern of criminal behavior: at 
least 60 per cent stop their criminal behavior by the 
age of 21 (McCord et al. 1959), This reformation 
is largely spontaneous and unaffected by society's 
conscious attempts to bring about change. It would 
seem to be due to the simple process of aging and 


entering new social roles. By adulthood, the social- 
ized delinquent typically leaves his original home 
and neighborhood; he undertakes new responsi- 
bilities as a father, husband, and wage-earner. 
Thus, he escapes the influences that led him to 
delinquency and adopts new social roles that are 
inconsistent with membership in a juvenile gang, 

Neurotic delinquency. The neurotic delinquent 
suffers from deep anxiety, intense insecurity, and, 
often, pervasive guilt. For such a boy, criminal be- 
havior is a way of expressing an unresolved con- 
flict and offers a release from anxiety. His behavior 
stems from deeply imbedded psychological causes 
rather than from a simple acceptance of a gang 
culture as a means of winning prestige. Unlike the 
socialized delinquent, the neurotic child often com- 
mits his crimes alone and usually commits only a 
Single type of crime. His behavior exhibits a com- 
pulsive quality that is often absent in socialized 
delinquents. The juvenile arsonist, sexual offender, 
or narcotics addict usually comes from the ranks 
of neurotic delinquents. The motives behind the 
neurotic’s crimes are varied and difficult to com- 
prehend. The arsonist, for example, may set fires 
because of exhibitionistic desires; the neurotic 
burglar may commit his offenses because the act 
of burglary offers him sexual release (Abraham- 
sen 1960). 

Although their specific characteristics and back- 
grounds differ markedly, neurotic delinquents gen- 
erally emerge from a more middle-class, conven- 
tional environment than do socialized delinquents. 
Their families exhibit severe emotional strain, and 
their parents are usually neurotic or psychotic. 
Some studies have described the neurotic delin- 
quent as possessing a “relatively weak ego” and as 
tending to isolate himself from other people, par- 
ticularly other children (Hewitt & Jenkins 1947). 

Because their reformation depends upon a pro- 
found reorientation in character, neurotic delin- 
quents more often continue their criminal behavior 
into adulthood than do the typical gang delinquents. 

Psychopathic delinquency. The psychopathic 
delinquent is relatively rare but, from society S 
point of view, perhaps the most dangerous of young 
criminals. The psychopath’s distinguishing re 
are (1) his inability to form a lasting emotiona 
relationship with other human beings and (2) his 
almost total lack of guilt, remorse, or inhibition. 
Unlike the neurotic delinquent, the psychopath does 
not suffer from internal conflict or anxiety. Ua 
the socialized delinquent, the psychopath does no! 
find emotional satisfaction in gang membersise 
The psychopathic delinquent commits a wide ae 
of crimes and has a remarkably high rate of rec 
vism. Almost all investigations of psychopaths’ €n- 


vironments indicate that they have been raised in 
homes characterized by extreme parental brutality, 
neglect, discord, and intensely severe discipline. 
Many have come from foster homes or orphanages. 
They have seldom, if ever, experienced a warm, 
loving relationship with other human beings, and 
they seem to lack the capacity for affection. Quite 
often, the psychopath suffers from a neurological 
disorder, perhaps of a type that decreases his ability 
to inhibit impulses (McCord & McCord 1964). [See 
PSYCHOPATHIC PERSONALITY.] 

Summary. Each of the above types of delin- 
quent reacts to particular pressures operating in 
his environment: the socialized delinquent seeks 
guidance and security from the gang; the neurotic 
delinquent uses crime as a release from intolerable 
anxiety; the psychopathic delinquent fails to inter- 
nalize the usual social norms, even those of a de- 
linquent subculture. While it is difficult to specify 
the exact forces that produce each type of delin- 
quent, social science has progressed to the point 
where one can propose some generalizations about 
causal forces. 


Modern civilization and delinquency 


As has been noted, all types of juvenile delin- 
quency increase as industrialization and urbani- 
zation proceed. Industrial societies as a group 
greatly exceed “primitive” societies in their rate of 
crime. And within the class of economically afflu- 
ent countries, the rate of delinquency correlates 
rather closely with the degree of industrialization. 
Thus, the United States, as the most economically 
advanced nation, leads all other countries in the 
commission of crimes by juveniles. What is there 
in the nature of modern civilization—particularly 
in North American culture—that creates this in- 
creasing criminal proclivity? Among the many ex- 
planations that have been offered, the following 
theories have gained prominence. 

Depersonalization and anonymity. As urban- 
ization increases, the “face-to-face” social controls 
of traditional society begin to lose their importance. 
In a rural, economically stagnant, village-based 
society, every person is subject to the scrutiny of 
his immediate community. In an urban environ- 
ment, however, human relationships become more 
depersonalized and anonymous. The person feels, 
and is, freer to act as an individual rather than as 
a member of a closely knit community. One con- 
sequence of this greater freedom may well be that 
the individual feels less inhibition about experi- 
menting with various forms of deviant behavior, 
including crime. [See ALIENATION.] 

Increased mobility. Modern societies provide 
greater opportunities for social and geographical 
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mobility. In the United States, for example, one out 
of every five families changes its place of residence 
each year. One obvious result of such mobility is 
that families in modern societies do not maintain 
the same continuous ties with their neighbors as 
do people in more traditional, stable societies. This 
depersonalization of communal relationships may 
also weaken the social controls exercised over chil- 
dren and thus contribute to a high incidence of 
delinquency. [See MIGRATION; SOCIAL MOBILITY.] 

Increased family disorganization. The nature 
of family life apparently changes as industrializa- 
tion goes forward. The extended kinship system 
gives way to a smaller “nuclear” family. For a 
variety of reasons, the stability of the family also 
seems to decline. In the United States in 1900, one 
divorce occurred for every fourteen marriages. In 
the contemporary period, one marriage in every 
five ends in divorce. Because of the well-established 
correlation between familial discord and juvenile 
delinquency, one would expect that the higher rate 
of familial disorganization in modern societies 
would be reflected in more juvenile crime. 

Frustration of goal achievement. As Merton 
has suggested (1949), modern civilizations—par- 
ticularly the United States—are characterized by 
an almost universal goal of individual achievement. 
Everyone is encouraged to seek material success, 
but society does not offer equal means to all for 
fulfilling this goal. The lower-class child, denied 
the chance of competing successfully, may turn to 
deviant channels for achieving his goals. Accord- 
ing to this hypothesis, delinquency will be highest 
in societies that inculcate individualistic aspira- 
tions in all people but fail to provide equality of 
opportunity. 

In modern industrialized civilizations, people live 
in a fast-moving, exciting atmosphere—in a society 
broken into atomized, mobile, and depersonalized 
groups. Traditional social controls lose their influ- 
ence. The emphasis on competition puts extreme 
pressure on those individuals who have the least 
chance of success: for example, ethnic minority 
groups and members of the lower class. In conse- 
quence, an increasing proportion of youths turn to 
delinquent behavior. 


Social factors 

Within an industrialized civilization, the chance 
that a child will become delinquent varies con- 
siderably according to his particular social position. 
Among the more important of these variations, one 
should note the following contrasts. 

Sex differences. Boys have a much higher rate 
of delinquency than do girls. In most modern so- 
cieties, approximately five times as many males as 
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females become juvenile offenders. Aside from the 
obvious differences in physical strength, several 
social factors account for this contrast. 

Parsons (1949) has noted that boys in an indus- 
trialized society are traditionally reared by their 
mothers. Compared with children in peasant fami- 
lies, they have relatively little opportunity to iden- 
tify with their fathers or to copy masculine forms 
of behavior, At the age of puberty, however, they 
are expected to become men and to shift their 
identification from the mother to the father. Par- 
sons has hypothesized that this dislocation creates 
severe strain for boys. In an attempt to assert their 
masculinity, boys may well turn to delinquent be- 
havior. In contrast, girls continue to identify with 
their mothers and do not suffer the same “crisis of 
identity” that boys do. 

Further, it is clear that girls in our society do 
not normally achieve respect by participation in 
the “rough” activities characteristic of delinquents. 
For a girl who searches for warmth and recogni- 
tion, the female role in our society encourages 
passivity rather than the masculine-oriented be- 
havior of delinquents. Thus, a girl who experiences 
the same frustrations that produce a delinquent boy 
might well assert her femininity in promiscuous 
behavior but would be unlikely to turn to burglary 
or aggressive crimes. 

It should be noted that as the female role in our 
society has more closely approximated that of the 
male, the incidence of female delinquency has in- 
creased and the gap in rates of crime between boys 
and girls has narrowed. 

Minority groups. Minority groups, particularly 
in America, contribute disproportionately to the 
number of adjudicated delinquents. Formerly, in 
America sons of Irish or Italian immigrants be- 
came delinquent significantly more often than 
“native-born” children. As the immigrant groups 
have won middle-class status, however, their rate 
of delinquency has declined. Today, Negroes, 
Puerto Ricans, and Spanish-Americans most often 
find their way into the hands of police and courts, 
Many factors account for this. Police bias, particu- 
larly in the southern United States, undoubtedly 
has an influence, The inferior economic status of 
these groups, their high rate of familial disorgan- 
ization, and their experience of cultural conflict 
also help to explain the differential rate of delin- 
quency. Yet certain minority groups, most notably 
the Japanese, have not produced a high number 
of delinquents. While they, too, have suffered dis- 
crimination, their unusually tight familial organ- 
ization has apparently insulated them from the 
usual delinquent patterns. 


Rural and urban comparisons. Rural com- 
munities generally have about one-third the rate of 
delinquency of urban areas. The greater cohesive- 
ness of rural communities, the closer social con- 
trol, and the greater intimacy between the police 
and the citizens probably account for this contrast. 
Rural rates of delinquency have, however, gone up 
at the same pace as urban rates, and as urban pat- 
terns of life infiltrate even the remotest farming 
areas, the rural-urban difference becomes less 
striking. 

Economic differences. Most importantly, delin- 
quency has concentrated heavily in lower-class, de- 
teriorated, slum areas. In such cities of the United 
States as Chicago and San Francisco, the incidence 
of delinquency in the most economically deprived 
areas is 55 times that in the privileged suburbs. 
This distinctive relationship cannot be dismissed as 
simply ephemeral or due to a tendency on the part 
of police to be more lenient with middle-class chil- 
dren. Since the investigations of the “Chicago 
school” of criminology, many theories have been 
proposed to explain this phenomenon (Shaw & 
Moore 1931). Today, the three most accepted ex- 
planations of delinquency’s predominance in eco- 
nomically deprived areas have been presented by 
Cohen (1955), Cloward and Ohlin (1960), and 
Miller (1958). 

Delinquent subculture. Following the work of 
Merton and Sutherland, Cohen has argued that the 
typical working-class youth has not been equipped 
by his family, school, or social environment for 
competition. A boy reared in a lower-class environ- 
ment comes to realize his underprivileged status 
and the many obstacles that he faces in attempting 
to achieve self-respect in conventional ways. Be- 
cause of his frustration, he rejects the usual ethic 
of society and accepts in its place a “delinquent 
subculture”: a gang society that stresses negative, 
destructive, aggressive behavior. Since the typical 
lower-class boy cannot win status in the usual man- 
ner, he joins an “inverted” culture where he can 
express his spite against the larger society. His 
crimes are often nonutilitarian in nature—he steals 
or fights, not for material profit but for the glory 
of it. The most respected member of the subcul- 
ture is the one who can best demonstrate his prani 
ess, aggressiveness, and defiance of law-abiding 
standards. d 

Patterns of delinquent subcultures. Clowar' 
and Ohlin accept Cohen's position that a delin- 
quent subculture arises because of the frustral 
experienced by members of the lower class, bu 
they take the theory several steps further. They a 
tend that there are several varieties of delinquen 


societies, each with its unique properties: the 
“criminalistic” pattern, where gang members accept 
organized crime as a distinctive way of life; the 
“conflict” pattern, where the gang turns to aggres- 
sive, destructive behavior, regardless of its material 
rewards; and the “retreatist” pattern, characterized 
by “beats” who use drugs or alcohol and maintain 
an attitude of nonchalant indifference to the ordi- 
nary world. Perhaps Cloward and Ohlin’s most dis- 
tinctive contribution is their belief that specific 
factors in a neighborhood determine which type of 
“delinquent adaptation” will take place. They em- 
phasize, for example, the role of an adult system 
of criminal values. If adults in an area participate 
regularly in organized crime, Cloward and Ohlin 
assume that a “criminalistic,” nonviolent pattern of 
delinquency will appear rather than another type. 

Working-class norms, To some degree, Miller 
parts company with other sociologists concerned 
with the broad social base of delinquency, for he 
believes that lower-class culture itself generates a 
delinquent response among boys. He argues that 
delinquent subcultures do not have paramount im- 
portance, but rather that the entire environment of 
working-class people is one which supports an 
assertion of masculine “toughness,” “smartness,” 
and the ability to “con” other people. 

General theoretical considerations. The varia- 
tions between the different positions arouse dis- 
pute, but the basic contention appears similar: in 
an industrial society where all groups do not have 
equal access to power and privilege, the children of 
deprived economic groups will absorb an ethic that 
contradicts the society's usual values. For these 
theorists, delinquency represents a normal response 
to a particular social system. 

Obviously, these theories do not explain all 
aspects of the problem of delinquency. They help 
to explain the actions of socialized delinquents 
but are of little aid in clarifying the nature of neu- 
rotic or psychopathic crime. Essentially, the theo- 
ries are confined to explicating why certain areas 
within modern society produce a generally higher 
incidence of delinquency than do others. Neverthe- 
less, even in the most deviant urban areas, the 
majority of boys do not develop a predominantly 
criminal pattern. To explain why one child chooses 
a delinquent adaptation and another does not, one 
must examine other aspects of the person’s life, 
particularly his early familial environment. 


Family experiences 

Numerous studies have demonstrated that par- 
ticular familial experiences are the crucial factors 
differentiating the delinquent from the nondelin- 


DELINQUENCY: Psychological Aspects 91 


quent child. Only certain types of families, clus- 
tered disproportionally in lower-class strata, create 
a delinquent child. Indeed, research in Minnesota 
and California estimates that only 5 per cent of 
American families account for all juvenile delin- 
quency (Buell et al. 1952). The “delinquency- 
prone” family generally has the following features 
(Glueck & Glueck 1950; McCord et al. 1959; Healy 
& Bronner 1936). 

Conflict. Delinquent families are rent with pa- 
rental discord, bitterness, and even hatred. One of 
the first experiences of the potentially delinquent 
child is to see his parents engaged in acrimonious 
quarrels. Not unnaturally, he learns to suspect the 
motives and good will of people around him. Vari- 
ous studies have shown that a boy raised in a 
broken home in the United States has about twice 
the chance of becoming delinquent as a boy raised 
in a cohesive home. If the parents remain together 
but engage in constant conflict, the boy’s chance of 
becoming delinquent is even further increased 
(McCord et al. 1959). 

Paternal rejection. Fathers of delinquents tend 
to reject their sons or at best treat them with dis- 
tant disdain. In one typical study of delinquents, 
87 per cent came from families where the father 
was overtly cruel to the son, absent from the home, 
or severely neglectful—while only a small minority 
of noncriminal boys experienced paternal rejection 
(McCord et al. 1959). 

Paternal delinquency. Fathers of delinquents 
are much more likely to have records of crime, 
alcoholism, or mental disorders (Glueck & Glueck 
1950). Thus, to the degree that the child tends to 
copy his father’s actions, the boy’s chances of de- 
linquency are increased. 

Maternal behavior. Mothers of delinquents gen- 
erally reject and neglect their sons. One study 
showed that seven times as many delinquents as 
nondelinquents had been reared by mothers who 
either felt indifferent toward their children or 
actively rejected them (Glueck & Glueck 1950). 
Another study demonstrated that a boy’s chance of 
serving a sentence in a penal institution was in- 
creased by four times if his mother had neglected 
him in childhood (McCord et al. 1959). In addi- 
tion, mothers of delinquents have a much higher 
rate of crime and other forms of social deviance. 

Parental supervision. The parents of delin- 
quents either fail to give their sons any supervision 
at all or tend to discipline them in a severe, often 
erratic fashion (Glueck & Glueck 1950). Contrary 
to the usual opinion, children raised under a regime 
of strict but inconsistent discipline have the high- 
est incidence of delinquency. 
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General home environment. In general, de- 
linquents come from homes that offer them a por- 
trait of human relationships full of conflict and 
strife. They learn to view the world with suspicion 
and to conceive of other human beings as threat- 
ening, punitive, and aggressive. If the family re- 
sides in an area characterized by a delinquent sub- 
culture, the child not unnaturally turns to the gang 
as a cure for his frustrations. Although this general 
background characterizes most delinquents, the 
specific types emerge from somewhat different en- 
vironments. The neurotic delinquent, for example, 
has more often been exposed to a highly religious 
background, where his sense of personal worthless- 
ness has been reinforced by strict parental disci- 
pline. In contrast with the other varieties, the 
psychopathic child has suffered from almost total, 
continuous parental rejection and neglect. 


Conclusions 


Whether a child chooses delinquency as an ado- 
lescent mode of life depends upon a complex inter- 
action of many factors in his background. It is 
improbable that social science will ever totally 
unweave the skein of influences that operate to 
produce delinquency. Nonetheless, studies have 
produced some generalizations concerning the in- 
terrelation of variables leading to delinquency 
(McCord et al. 1959): 

(1) A cohesive, loving home tends to compen- 
sate for the influence of a delinquent subculture. 
The majority of children raised in a transitional 
slum area (but who live in a normal family) do 
not become delinquents. 

(2) The effects of a father’s absence, neglect, 
or cruelty on his son seem to depend largely upon 
the mother’s attitude. If the mother is loving, the 
child is not likely to become delinquent. 

(3) The nature of the parents’ discipline does 
not affect criminality, but the consistency of its 
imposition does. 

By considering the interaction of these factors 
(and others) in the person’s background, it now 
appears probable that social science will be able 
to predict delinquency with a high degree of accu- 
racy. Recent studies by the Gluecks (1959) have 
indicated that predictions made in early childhood 
and based on the nature of the child’s family have 
an extraordinary degree of accuracy in later years, 
Because the psychological roots of delinquency are 
so deeply enmeshed in modern society and the 
family, it would be utopian to hope that delin- 
quency can ever be totally eradicated. Nonetheless, 
our increasing knowledge of the varieties of delin- 


quency and their sources offers hope that new 
techniques of prevention and rehabilitation may 
substantially reduce the toll of youthful crime, 


WILLIAM McCorp 


[Directly related are the entries Crime and Psycuo- 
PATHIC PERSONALITY. Other relevant material may 
be found in ADOLESCENCE and FAMiLY.) 
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Iv 
DELINQUENT GANGS 


When the second United Nations Congress on 
the Prevention of Crime and the Treatment of 
Offenders met in London in 1960, delegates from 
most countries reported upsurges in adolescent 
delinquency, not only quantitatively greater than 
before but more violent and more likely to be 
group-based rather than individual in character 
(Middendorff 1960). 

In more than a dozen countries, new words have 
been coined to describe adolescents, nondelinquent 
as well as delinquent, who wear distinctive cloth- 
ing and hair styles, listen to special types of music, 
frequent amusement areas and other gathering 
places, and generally annoy adults: bar gangs 
(Argentina), blousons noirs (France), bodgies 
(Australia), chimpira (Japan), Halbstarke (West 
Germany), hooligans (Poland), nozem (Nether- 
lands), raggare (Sweden), stilyagi (Soviet Union), 
tapkaroschi (Yugoslavia), tau-pau (Formosa), 
teddy boys (Great Britain), vitelloni (Italy). In 
short, adolescence has come to be regarded, by 
experts as well as by the general public, as a 
period of potential antisocial behavior. This is a 
recent image of adolescence; the turbulence tradi- 
tionally associated with adolescence was an inner 
state resulting from sexual maturation and the 
search for an adult identity (Erikson 1962). In the 
context of this new perspective on adolescence, 
delinquent gangs tend to be interpreted as extreme 
manifestations of teen-age culture (Mays 1961), 
rather than as juvenile divisions of adult criminal 
gangs. 

Comparability of findings. The term “delin- 
quent gang” means different things to different 
people. An important reason for the confusion is 
that research reports on gangs in different cities 
and countries are not usually prepared with a 
view to rigorous comparisons. Since each investi- 
gator emphasizes those aspects of delinquent gangs 
in which he is most interested, readers of reports 
from different places cannot tell whether disparate 
phenomena are being included within the label 
“gang delinquency,” or whether verbal habits differ. 
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For example, the widespread assumption of Euro- 
pean criminologists that the delinquent gangs of 
American cities are larger, better organized, more 
violent, and more vicious than their European 
counterparts has not been tested by systematic 
comparisons (for one expression of this assump- 
tion, see Centre de Formation 1963, p. ix). 

Gang characteristics—some unanswered ques- 
tions. How large must a group be to constitute 
a gang? Should cliques of four or five boys be 
called gangs, or should the gang concept assume 
a membership of fifty or more? Size is, of course, 
related to internal differentiation. Must a gang 
have a recognized leader or leadership group, or 
can the leadership function shift from time to time 
and from activity to activity? How tightly inte- 
grated must members be with the group for the 
gang concept to be appropriate? One criminologist 
has suggested that delinquent gangs typically have 
moblike as well as grouplike characteristics (Yab- 
lonsky 1959). Perhaps the term “gang” suggests 
a higher degree of integration than is actually 
found. How delinquent must a gang be in order 
to be properly described as a “delinquent” gang? 
Some criminologists argue that a delinquent gang 
must require delinquent behavior of members, not 
merely permit or encourage it (Cloward & Ohlin 
1960, p. 9). Other criminologists point out that 
actual gangs, even highly delinquent gangs in 
cities like Chicago, contain members who are fairly 
conventional in values and behavior (Matza 1964; 
Short & Strodtbeck 1965). This issue is related to 
that of specialization among delinquent gangs. Do 
fighting gangs differ strikingly from stealing gangs 
or drug-using gangs? Research suggests that most 
delinquent gangs engage in a wide variety of law- 
violative behavior rather than a few specialties 
(Cohen & Short 1958; Short et al. 1963). 

Must a delinquent gang have some minimum 
continuity through time? The criminological litera- 
ture contains accounts of gangs lasting a dozen or 
more years. Such continuity is usually the result 
of the simultaneous inclusion within the gang of 
a considerable age range. When 18-year-olds start 
dropping out, 1l-year-olds are being recruited. 
However, all members of some gangs are about the 
same age. Not only does this give the gang a 
shorter life expectancy; it also means that gangs 
differ, one from the other, depending, for instance, 
on whether the members are mainly 14-year-olds 
or mainly 17-year-olds. 

Without a precise description of what a de- 
linquent gang is, criminologists can emphasize 
without fear of refutation one or another of a 
considerable array of causal factors: family disor- 


ganization, blighted neighborhoods, traditions of 
delinquency, early school leaving, employment 
problems, psychopathology. Each of these factors 
is capable of explaining the emergence of some of 
the phenomena included under the rubric “delin- 
quent gangs.” 

Social upheaval and gang delinquency. The 
problem of explanation has been further con- 
founded by factors associated with delinquent 
gangs, which may not be necessary conditions for 
gang formation, World War 1 is an example of 
such a factor. The conspicuousness of the problem 
of adolescent gangs following the war prompted 
some observers to infer a direct connection. It is 
possible, of course, that the violence of the war 
and the personal insecurity it engendered contrib- 
uted directly to attitudes on the part of youth, 
which found expression in gang delinquency. More 
likely, the effect on delinquency was indirect— 
through disruption of family life and the resulting 
interferences with the socialization of children; 
through speeding up processes of urbanization and 
industrialization, thereby tearing people loose from 
their social moorings; and through inordinate 
material desires stimulated by increasing affluence. 

Ethnic minorities and gang delinquency. An- 
other confounding factor is ethnic status. Ethnic 
minorities serve as a catalyst for the development 
of gang delinquency and especially street fighting. 
Puerto Rican gangs in New York City, Negro gangs 
in Chicago, Mexican-American gangs in Los An- 
geles—like Irish gangs in Liverpool, Arab gangs 
in Paris, and Korean gangs in Tokyo—are commu- 
nity problems. Because they are marginally inte- 
grated in the host society, low-status minorities are 
in a weaker position than the majority to instill 
conventional motivations in preadolescents and to 
control adolescents informally. Thus ethnic cleav- 
ages reinforce and intensify the social forces mak- 
ing for gang delinquency. ha 

But gangs form in urban industria] societies 
without war dislocations or ethnic minorities. The 
Swedish experience is instructive. Sweden was not 
a belligerent during World War 11; the country has 
no appreciable minority population; welfare serv- 
ices are highly developed; and slums have been 
eliminated. Nevertheless, Swedish raggare gangs 
have been troublesome to the police, especially in 
Stockholm, Malmö, and Gothenburg. At one time 
raggare automobiles interfered with traffic i® 
Stockholm to the extent that city officials invited 
raggare leaders to the Town Hall to discuss the 
problem. oe 

If neither social upheaval nor ethnic disc 
tion is a primary factor in the development © 


delinquent gangs, what is responsible? Research 
is presently inconclusive, but the probability is 
that the social consequences of urbanism and 
industrialization can explain the conjunction of 
delinquency and affluence. 

Gang formation and weak adult control. With 
the development of modern societies, control of 
adulis over adolescents decreases. This weakness 
of adult control is most obvious under pathological 
circumstances such as slum neighborhoods or 
broken homes (McKay 1949; Toby 1957). Its 
ultimate source, however, is not pathology but the 
increasing social fluidity resulting from the alloca- 
tion of education, recreation, work, and family life 
to separate institutional contexts. These changes 
in social organization affect everyone in contem- 
porary societies, but their impact is especially great 
on adolescents because adolescence is a period of 
transition. Youngsters must disengage themselves 
from the family into which they were born and 
reared and establish themselves in a new family 
unit (Parsons 1942), They must eventually with- 
draw from the system of formal education and 
assume an occupational role. 

While preparing to make these transitions and 
learning preparatory skills, many adolescents are 
socially adrift, except for such solidary relation- 
ships as they form with youngsters in their own 
situation (Cohen 1955). Delinquent gangs there- 
fore represent an autonomous and antisocial devel- 
opment of adolescent solidarity, and they are, of 
course, less pleasing to adults than scouting or 
church-affiliated youth groups. Gang formation is 
always a possibility in contemporary societies be- 
cause adolescents are not typically under effective 
adult supervision—although some adolescents are 
better controlled by adults than others. The auto- 
mobile symbolizes the looseness of the ties between 
adults and adolescents because it is an instrument 
whereby adult supervision can be evaded. 

Individualism. Decreasing control of adults 
over adolescents is reinforced by individualism. 
In contemporary societies, the unit of social par- 
ticipation is usually the individual rather than the 
family group or the community. The child is an 
individual in school and in the neighborhood play 
group. Later, he will participate in the political and 
economic systems as an individual. Besides this 
implicit principle of individualism embodied in the 
social organization of contemporary societies, there 
exists an explicit ideology of individual fulfillment. 
For example, the Declaration of Independence 
refers to the right to “life, liberty, and the pursuit 
of happiness.” And contemporary motion pictures, 
television programs, and mass circulation maga- 
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zines suggest that thrilling entertainment, new 
clothes, cigarettes, automobiles, and liquor are 
necessary for happiness; such ideas are found even 
in societies where traditions emphasize collective 
goals (such as Japan). Some youngsters join delin- 
quent gangs in order to satisfy these hedonistic 
desires. 

Relative deprivation—a hypothesis. Although 
diminished adult control over adolescents and in- 
creased individualism may explain the increase 
in delinquency rates, the principle of relative depri- 
vation is more useful in explaining recruitment 
into gangs. Some adolescents feel more at a disad- 
vantage than others. It is among these alienated 
and uncommitted adolescents that delinquent 
gangs have their greatest appeal (Karacki & Toby 
1962). Thus, the principle of relative deprivation 
helps account for the disproportionate representa- 
tion of ethnic minorities in delinquent gangs. It 
also explains why school failures and those who 
withdraw from school before completing a course 
of study are more likely to become delinquents 
than students doing well in school. In contempo- 
rary societies, the system of formal education is 
an important mechanism for allocating youngsters 
to positions in the occupational system; students 
who do poorly at school have reason to grow dis- 
couraged about their futures. Such discouragement 
is often antecedent to rebellion (Stinchcombe 
1964; Toby & Toby 1961). 

However, the principle of relative deprivation 
does not help to explain the increasing rate of gang 
delinquency, unless one assumes that the sting of 
relative deprivation is increasing. A plausible argu- 
ment can be made that it is increasing. In Sweden, 
for instance, where social and economic leveling 
has reduced poverty to a minimum, relatively few 
adolescents need feel hopeless about their chances 
for the future, but these few feel more deprived 
than Untouchable youths in India. Army studies 
during World War 11 have shown that the individ- 
ual’s expectations for himself—and therefore his 
feeling of deprivation or satisfaction—depend 
partly on the proportion of his reference group 
that is better off than he (Stouffer et al. 1949, 
pp. 250-258). The same reference group phenom- 
enon may underlie adolescent delinquency in afflu- 
ent societies [see REFERENCE GROUPS]. 


If factors related to urbanism and industrializa- 
tion do indeed account for the proliferation of 
delinquent gangs in many large cities of the world, 
the problem will increase in seriousness in the 
years ahead. Countries like Italy and Spain, where 
adolescent gangs are still rare, will find that delin- 
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quency is part of the price industrial societies pay 
for their individualistic affluence and for the free- 
dom of adolescents from close supervision by 
adults. 

Jackson Tosy 


[See also ADOLESCENCE; COLLECTIVE BEHAVIOR.] 
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1. GENERAL 
1. Econometric STUDIES 


I 
GENERAL 


The idea that prices are dependent in some way 
on demand and supply is very old. Long before the 
development of theoretical economics, it was un- 
derstood that a large supply would cause a fall in 
price and a large demand would cause a rise in 
price. A good deal of economic theory revolves 
around the clarification and quantification of this 
simple idea. 

Adam Smith distinguished carefully between 
demand and desire and defined effective a 
essentially as the quantity that would be purchas' : 
at a given price. He had a clear conception of what 
today we would call “equilibrium,” or “normal, 
price, which he called “natural price’—the price 
at which just enough would be forthcoming from 
the suppliers onto the market to equal the effec- 
tive demand at that price. If the price in the 


market were above the natural price for a given 
commodity, there would be unusual incentives to 
produce this commodity and bring it to market, 
and the quantity offered for sale would increase. 
If the amount coming onto the market were larger 
than the effective demand, that is, the amount 
taken off the market, the price would fall. Simi- 
larly, if the price in the market were below the 
natural price, there would be a diminished incen- 
tive to produce the commodity and bring it to 
market; hence the amount coming onto the market 
would decline, and if this were less than the effec- 
tive demand, the price would rise. Much of the 
development of the theory of demand and supply 
since Adam Smith can be regarded as a clarifica- 
tion and elaboration of the fundamental principles 
that he enunciated. 

Functions and curves. The next major develop- 
ment in the theory of demand and supply was the 
development of the concept of demand and supply 
functions and curves, associated mainly with the 
name of Alfred Marshall, although a Scottish econ- 
omist, Fleeming Jenkin, is usually credited with 
the first formulation of these concepts. In its 
simplest form the demand function is a function 
relating the quantity demanded to the price of the 
commodity. The supply function similarly relates 
the quantity supplied to the price. When only these 
two variables are involved, the functions can be 
expressed as demand and supply curves in a two- 
dimensional diagram, as in Figure 1. The quantity 
of the commodity demanded or supplied is meas- 
ured along the horizontal axis and the price is 
measured along the vertical axis. DD’ is the de- 
mand curve; SS’ is the supply curve. Figure 1 
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Figure 1 — Demand and supply: stable system 
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shows the most typical forms of these curves. In 
the case of demand, there is some price OD at 
which nothing will be bought at all. At a zero price, 
a finite quantity OD’ will be bought; this repre- 
sents the point of satiation. The supply curve is 
drawn so that there is some price OS below which 
nothing will be offered and above which, as the 
price rises, a larger quantity will be supplied. 

For many purposes it is convenient to describe 
the demand and supply curves in terms of parame- 
ters, that is, a set of numbers that is sufficient to 
identify each point on them. The simplest assump- 
tion is that of linearity, that is, that the demand 
and supply curves are straight lines. In this case 
the equations may be written: 


(1) qa =d + eupa, 
(2) Qa =S HEP, 


where qu is the quantity demanded, q, is the quan- 
tity supplied, p, is the price at which each quantity 
is demanded, and p, is the price at which each 
quantity is supplied. Each curve or function can 
then be described by only two parameters. In the 
case of demand, d measures what might be called 
the height or extent of demand and is equal to OD 
in Figure 1; e, may be called the absolute elas- 
ticity of demand, which in Figure 1 is negative. It 
measures the absolute change in the quantity de- 
manded, which would result from a unit change in 
the price. In Figure 1 the slope or the gradient of 
the curve DD’ at any point is the “inelasticity,” 1/e.. 
Similarly the parameter s, equal to OS in Figure 1, 
is a measure of the height or extent of the supply, 
and the parameter e, is the absolute elasticity of 
supply, which measures the change in the quantity 
supplied for each unit change in the price. Again, 
1/e, represents the slope of the curve SS’. 

Elasticity. The term elasticity was introduced 
into economic analysis by Alfred Marshall, the 
analogy being the elasticity of a spring. In an elas- 
tic spring, a given increase in the weight exerted 
produces a large increase in the length of the 
spring; similarly, if either demand or supply is 
elastic, a given increase in the price produces a 
large increase in the quantity demanded or sup- 
plied. Another name that might be given to this 
concept is responsiveness, the quantity demanded 
or supplied being thought of as responding more 
or less eagerly to a change in the price. Marshall 
himself did not use the absolute elasticity concept 
but a concept of relative elasticity, defined as the 
proportionate change in quantity divided by the 
proportionate change in the price, or: 


(3) €=pdq/q dp. 
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He apparently did this because it is a dimension- 
less parameter, that is, a number that is inde- 
pendent of the units in which the quantity or the 
price are measured. A demand or supply curve of 
constant relative elasticity would be a straight line 
on double-logarithmic paper. There is no reason to 
suppose in fact, however, that these functions are 
more likely to be logarithmic than linear in abso- 
lute terms, and for many purposes the absolute 
concepts are preferable. A logarithmically linear 
demand curve with constant relative elasticity, for 
instance, would not intersect either axis in Figure 
1, implying that the price would have to be infinite 
before cutting off purchases altogether and that 
at a zero price an infinite quantity would be taken. 
This clearly is absurd. In practice, more than two 
parameters are often needed to describe demand 
and supply functions, but there are only a few 
problems in which the departure from linearity 
seems to have much economic significance. How- 
ever, it is reasonable to suppose that there are 
eventual limitations on increasing the quantity 
supplied that cannot be overcome by a rise in price, 
so that the supply curve will become less elastic at 
high prices, as in Figure 1. It is possible also that 
a similar condition applies to demand. The simplest 
equation that can be used to express this condition 
is a quadratic form, [See ELasticiry.] 

Equilibrium. The equilibrium position of the 
system of Figure 1 is assumed to be the point P, 
where the demand and supply curves intersect and 
where PN is the equilibrium price and ON is the 
equilibrium quantity. Any equilibrium, however, 
must be seen as a special case of a dynamic sys- 
tem, and in this case there are at least two differ- 
ent dynamic systems that have this point of equi- 
librium. The first, associated particularly with 
Adam Smith and Alfred Marshall, is that in which 
the difference between the demand price and the 
supply price, leading to changes in quantity sup- 
plied, is the principal motivating factor of the 
dynamic system. The demand price of a given 
quantity is that price at which the quantity can 
be sold in the market; thus a point on the demand 
curve such as D, indicates that the quantity OQ, 
can be sold at the price Q,P,. The supply price is 
the price that will call forth a particular quantity. 
Thus, if S, is a point on the supply curve, Q,S, is 
the price that could call forth an amount OQ,. If 
then the quantity coming to market is OQ,, when 
the demand price is in excess of the supply price 
by an amount D,S,, this means that the actual 
price received by the sellers, Q,D,, is greater than 
the price that would motivate them to produce the 
quantity involved, Q,S,. They are therefore receiv- 


ing excessive returns, and the assumption is that 
this will motivate them to expand the quantity 
offered for sale, as indicated by the arrow A,. Simi- 
larly, if the quantity coming on the market is 0Q,, 
which is larger than the equilibrium quantity ON, 
the demand price, which is the actual price re- 
ceived, will be below the supply price; the suppliers 
will receive less than is necessary to persuade 
them to put this quantity on the market, and the 
amount coming to market wil] decline, In the cir- 
cumstances of Figure 1 it is clear that P is a stable 
equilibrium, for if the quantity is below ON it will 
increase; if it is above ON it will diminish. 
Another approach to the problem of the equi- 
librium of demand and supply, associated typically 
with the names of Léon Walras and J. R. Hicks, 
interprets the demand curve as showing the quan- 
tity that will be bought at each price and the 
supply curve as indicating the quantity that will 
be sold at each price, with equilibrium achieved 
through price change. That is, at a price of P,, the 
quantity that would be bought is P,D, , the quantity 
offered for sale is P,S,, and there is a “surplus,” 
or excess supply, equal to D,S,. This is the amount 
that is being offered for sale but that can find no 
buyers. Under these circumstances it is supposed 
that some sellers will offer the commodity for sale 
at a lower price; and if the market is competitive, 
the prices in all transactions will fall correspond- 
ingly. If, therefore, there is a surplus, the price will 
fall in the direction of the arrow A,. Similarly, if 
the price is below the equilibrium price, equal say 
to P., the quantity that the buyers wish to pur- 
chase, P.D., is greater than the amount offered 
for sale, P.S,, and there is an excess demand equal 
to S,D.. Under these circumstances there are un- 
satisfied buyers who wish to buy at the price but 
who cannot find sellers; they will offer to buy at 
a higher price, causing the price to rise, as indi- 
cated by the arrow A,. Here again the point P is 
an equilibrium price, for if the actual price is above 
NP it will fall; if it is below NP it will rise. 
These different dynamic systems do not affect 
the position of stable equilibrium itself as long sA 
the demand and supply curves have elasticities © 
Opposite signs. However, if the two curves slope 
in the same direction, as in Figure 2, we are in be 
ous trouble. Here we show a demand curve DD 
and a negatively elastic supply curve SS’ intersect- 
ing in three places, P, , P., and P,. In terms of M 
Smith-Marshall dynamics, P, and P, are stable 
positions of equilibrium; P. is unstable. At quanti- 
ties smaller than implied by P, , the demand price 
exceeds the supply price and hence the quel 
will expand toward P,. Between P, and P: t 


supply price exceeds the demand price and the 
quantity will contract toward P, again. Between 
P, and P, the demand price exceeds the supply 
price and the quantity will expand toward P;. Thus 
any divergence around P, is extended in the ini- 
tiated direction. At quantities larger than implied 
by P., the supply price exceeds the demand price 
and the quantity will contract toward P,. 

If, on the other hand, we look at the equilibrium 
from the Walras-Hicks point of view, we see that 
now P. is the stable position of equilibrium and 
P, and P, are unstable. At prices above P,, there 
is excess demand and the price will rise still fur- 
ther. Between P, and P, there is excess supply, 
and the price will fall toward P.. Between P. and 
P, there is again excess demand, and the price 
will rise toward P,. Below P, there is excess supply 
and the price will fall away from P; . 

The full resolution of this problem requires an 
investigation of the dynamics of the price system 
that we cannot pursue here. In the real world, these 
dynamic factors, of course, operate simultaneously. 
Shortages create pressure for prices to rise, sur- 
pluses for prices to fall. Excess demand-prices 
create pressure for production to increase, excess 
supply-prices for production to diminish. Circum- 
stances can be postulated in which there are no 
stable equilibrium sets of prices and quantities. 
These circumstances are fortunately somewhat 
unlikely. There have been occasions, such as in 
the depression of the 1930s or in hyperinflations, 
in which the relative price system seemed to suffer 
a real breakdown. These occasions, however, have 
always been associated with profound disturbances 
in the absolute level of money prices and incomes; 
there are no clear examples of the inability of a 
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Figure 2 — Demand and supply: unstable system 
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relative price system to move toward an equili- 
brium in the absence of monetary breakdown. 

The demand and supply functions have differ- 
ent meanings, depending on the period of time to 
which they refer and the nature of the system that 
they are intended to describe. They may be used 
either to describe the forces underlying the de- 
termination of a price in a market on a particular 
day, in which case they are usually called “market 
demand and supply curves,” or they may be used 
to describe the forces determining a “normal” price, 
toward which the market price may tend. 

Market curves. In the theory of market price, 
it is supposed that there is no production or con- 
sumption and the stocks of money and other ex- 
changeables in the hands of the marketers remain 
constant during the “market day.” The market de- 
mand and supply functions are then psychologi- 
cally based, describing the state of mind of the 
people in the market. The market demand curve 
shows how much would be offered to purchase at 
each price; the market supply curve shows how 
much would be offered for sale at each price, In 
the course of the day in the market as a whole, 
the equilibrium market price is that at which the 
market is “cleared,” that is, at which the quantity 
offered for sale is equal to the quantity offered 
to purchase. Here it is the shortage-surplus dy- 
namic that clearly dominates the scene, The use 
of market demand and supply curves to describe 
the equilibrium of the market may be criticized on 
the grounds that in this case the two curves are 
not independent. For instance, a psychological 
change in the market that makes people more 
eager to buy will also make them less eager to 
sell. In other words, the same change will move the 
market demand curve to the right and the market 
supply curve to the left. If these moves are similar 
in extent, the price will rise without any change 
in the quantities exchanged. If on the other hand 
there is a decrease in the “divergence” in the 
market, that is, the degree to which the market 
is separated into buyers and sellers, people will be 
both more eager to buy and more eager to sell at 
each price. The market demand curve and the 
market supply curve will both move to the right, 
and the quantity of transactions will increase; the 
price will not change. Even though we recognize 
that market demand and supply curves are not 
independent, they still can be a very useful method, 
among others, of exposition and analysis of 
changes in the market. 

Normal curves. Normal demand and supply 
curves refer not to the equilibrium of the market 
but to the equilibrium of production and consump- 
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tion. For each price, the normal demand curve 
shows what will be the amount consumed in the 
period under consideration. The normal supply 
curve likewise shows for each price what would 
be the quantity produced. At the point of inter- 
section of the two curves, production is equal to 
consumption and also demand price is equal to 
supply price, that is, the production of the com- 
modity is normally profitable. The position and 
shape of the curves will depend on the period of 
time that is considered. Generally speaking, the 
shorter the period of time under consideration, the 
more inelastic will be the demand and supply 
curves. In very short periods, a change in price will 
have little effect on the quantity produced, although 
it may have some effect on the quantity consumed. 
As we move to longer and longer periods, both pro- 
duction and consumption are more likely to make 
more adjustments, and the demand and supply 
curves will become more elastic. 

Derivation of functions. A good deal of price 
theory is devoted to the derivation of demand and 
supply functions from the underlying functions 
that determine them. Supply functions are derived 
from the structure of cost functions, which in turn 
are derived from production functions. Demand 
functions are derived from preference functions. 
In the case of market demand and supply, both de- 
mand and supply curves are derived from the 
preferences of people in the market. Thus in Figure 
3 we show the situation of a single marketer, let 
us say in the wheat market. In the upper part of 
the figure, we measure his stock of wheat along 
OW and his stock of money along OM. The point 
A represents his initial position, with a stock of 
wheat equal to ON and a stock of money equal to 
NA. The dashed lines are his indifference curves, 
each one showing a set of combinations of stocks 
of wheat and money to which he is indifferent. 
They may be thought of as the contours of a 
utility, or preference, “mountain” above the plane 
of the paper. In a perfect market, his exchange 
opportunity is represented by a straight line 
through A, the slope of which is equal to the price. 
Thus if the price of wheat is equal to OM,/OW,, 
his exchange opportunity is represented by the line 
M,W, through A. He will look for the point on this 
exchange opportunity line that has the highest 
utility, which is the point at which it touches his 
highest attainable indifference curve, in this case 
at B,. At this price, therefore, he will move from 
A to B,, meaning that he will buy an amount of 
wheat equal to NN, for an amount of money 
equal to AC, . At a higher price the exchange oppor- 
tunity line would be M.W, and he would move to 


B., selling an amount of wheat equal to NN, for 
an amount of money equal to C.B.. At a price 
OM,/OW,, the exchange opportunity line is M,W, 
and the marketer will neither buy nor sell, The 
locus of points such as B, and B, lies on the heavy 
curve through A, B,, B4. For each price, then, we 
can identify the quantity that the marketer will 
either buy or sell. This is shown in the lower part 
of Figure 3, in which price is measured on the ver- 
tical axis and quantity on the horizontal axis, re- 
flecting the situation in the upper part. At a price 
equal to na, equal to OM,/OW.,,, the marketer will 
neither buy nor sell anything but will remain in his 
initial position. At a price equal to n.b., equal to 
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Figure 3 — The source of market demand and supp'Y 
functions 


OM,/OW,, he will buy an amount equal to nn:. 
At a price equal to mb,, equal to OM./OW., he 
will sell an amount equal to nn,. The curve bab, 
is then his market demand-supply curve, the sec- 
tion ab, being his market demand curve and sec- 
tion ab, being his market supply curve. If we 
summed these for all the marketers, we would get 
the total market demand and supply curves. 

The problem of constructing normal demand 
and supply curves from the determining functions 
is more difficult. A simple model of normal supply 
can be derived if we suppose that we can identify 
the amount that will be produced at each level of 
average cost. Thus in the milk industry we might 
suppose that no milk is produced at an average 
cost below, say, 15 cents per gallon, that 10,000 
gallons is produced at an average cost of 15 cents, 
50,000 gallons at an average cost of 16 cents, 
100,000 gallons at an average cost of 17 cents, and 
so on up the scale, If now the price is below 15 
cents, no milk will be produced at all. At 15 cents, 
10,000 gallons will be produced, at 16 cents, 
60,000 gallons, at 17 cents, 160,000 gallons, and 
so again we go up the scale, cumulating the 
amounts that will be produced at all costs below 
the price in question. The supply curve is then 
seen to be the cumulative frequency distribution 
of those average costs that Marshall called “par- 
ticular expenses.” The situation is complicated by 
the fact that different firms produce different 
quantities at different average costs. Essentially, 
however, a point on the supply curve indicates the 
quantity that can be produced at costs equal to or 
below the price indicated. These cost functions in 
turn are determined by the production functions, 
which show how much of the commodity can be 
produced with given amounts of input. The total 
cost of producing a given quantity is equal to the 
sum of the values of the inputs needed to produce 
it, and the value of each input is equal to its quan- 
tity multiplied by its price. Demand is related in 
a complex way to the preferences and incomes of 
consumers and the prices of substitute or comple- 
mentary commodities. 

Use of functions. Demand and supply func- 
tions are useful mainly in problems involving com- 
parative statics [see STATICS AND DYNAMICS IN 
Economics], that is, in the comparison of one 
position of equilibrium of the price system with 
another such position after change in the parame- 
ters that determine the equilibrium of the system. 

Parametric change. Thus, suppose we have a 
“rise in demand,” meaning by this a movement 
of the whole demand curve to the right, indicating 
that people are willing to buy more at each price 
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Figure 4 — Effect of a rise in demand 


than they were before. The effect of this rise de- 
pends on the elasticity of supply, as shown in 
Figure 4. Here we suppose that DD' is the original 
demand curve and D,D;,, the dashed line, is the 
new demand curve after the rise in demand, Pù is 
the original equilibrium price. If the supply curve 
is perfectly inelastic, such as P,S,, there will be a 
rise in price to P, but no change in the quantity. 
If the supply curve is perfectly elastic, such as 
P,S,, there will be an increase in the quantity from 
P, to P, but no change in the price. In the inter- 
mediate situation, such as a supply curve P,S,, the 
new equilibrium will be P,, where there is some in- 
crease in the quantity and some increase in the 
price. The more elastic the supply curve, the 
greater the increase in the quantity, the smaller the 
increase in the price. If the supply curve is nega- 
tively elastic, such as P,S., there will be a decline 
in the price as a result of the increase in demand. 
Similarly we can show that the effect of an in- 
crease in supply, that is, a movement of the supply 
curve to the right, indicating that people will offer 
or produce larger quantities at a given price, de- 
pends on the elasticity of demand. The more elastic 
the demand, the greater the change in quantity 
and the less the change in price. These proposi- 
tions, simple as they are, are of great importance 
in many branches of economics. They throw light, 
for instance, on the question as to why an increase 
in the money supply produces price and wage 
increases under some circumstances and produces 
increases in output and employment under other 
circumstances, 

Tax and tariff incidence. Another important 
application of demand and supply curves is in the 
theory of the incidence of taxes and tariffs. The 
effect of a tax is shown in Figure 5. Suppose that 
DD’ and SS’ are the demand and supply curves, 
with an initial equilibrium at the point P. Suppose 
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Figure 5 — Effect of a tax 


now that a tax is imposed equal to T,T,. In equi- 
librium the demand price must now exceed the 
supply price by this amount, since a tax is, in 
effect, an addition to the cost of production. The 
output of the commodity will therefore decline 
from ON to ON, , at which point the required con- 
dition is fulfilled. The more elastic the demand and 
supply curves, the greater the reduction in output. 
The more elastic the demand, the less the increase 
in the demand price; the more elastic the supply, 
the less the decline in the supply price. That is, 
the relative incidence of the tax as between de- 
manders and suppliers depends on the relative 
elasticity of the two demand and supply functions. 
The yield of the tax is equal to the tax per unit 
of commodity, T,T, , multiplied by the total output, 
ON,, that is, the shaded rectangle in the figure. 
This will clearly exhibit a maximum value at some 
point, so that if the tax is higher than at this point, 
a reduction in tax will actually increase the yield. 
The incidence of tariffs may be examined in like 
manner and be shown to depend on the elasticities 
of demand and supply curves in various countries. 
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Figure 6 — The cobweb theorem 


The general rule that emerges is that in any such 
situation the most adjustable variable adjusts, 
When supplies and demands are inelastic, this in- 
dicates that price is more adjustable than quantity, 
hence the major impact is made upon price. If 
demands and supplies are elastic, the major impact 
is made upon quantity. 

Dynamic solutions. Demand and supply curves 
can also be interpreted in such a way as to lead 
to certain dynamic solutions. The so-called cobweb 
theorem is a famous example. This is shown in 
Figure 6. Here DD’, the demand curve, shows the 
price that a particular quantity, say the harvest of 
a crop, will fetch. The supply curve, SS’, shows 
how much will be produced, let us say in the fol- 
lowing year, in response to that price. Suppose 
then that we have a crop with the harvest equal to 
OA, The price is AB. As a result of this price, the 
harvest the following year is equal to RC, which 
makes the price that year equal to TE. In response 
to this, the harvest the following year is UF, and 
so we go on, following the path GHI, etc. If the 
demand curve is more elastic than the supply 
curve, the cycle will converge on the point of equi- 
librium, as in the figure. If the supply curve is 
more absolutely elastic than the demand curve, the 
cycle will explode. If the two elasticities are equal, 
the cycle will continue indefinitely with constant 
amplitude, There is some evidence of this phe- 
nomenon in agriculture, for instance in the s0- 
called hog cycle and also in such crops as potatoes. 
These demand and supply curves are not stable 
over long periods, however, and we must be careful 
to avoid predictions from dynamic systems that are 
as simple as this, for the reality is always much 
more complex. 4 

The demand and supply functions can easily 
be extended to include more variables, and for 
some purposes it is very important to do so. The 
demand and supply functions, for instance, may 
be written as functions of two or more commod- 
ities instead of one. We could, for instance, expand 
equations (1) and (2) into a two-commodity sys 
tem as shown in equations (4) through (7), 


(4) Qaa = da + CoaPa + FaaPo» 
(5) as = Sa + CosPat fasPo+ 
(6) Goa = dy + foaPa + CraPos 
(7) Gos = So + fosPa + CvsPo- 


The notation is the same as for equations (1) and 
(2) except that the subscript a refers to commod- 
ity A and the subscript b refers to commodity B. 
The parameters f are the (absolute) cross-elas- 
ticities. For instance, fa shows by how much the 


quantity of A demanded changes per unit change 
in the price of B. Several types of relationships are 
possible. If the commodities are independent in 
demand or supply, the f parameters will be zero. 
If the fı parameter is positive, it means that an 
increase in the price of B will raise the purchases 
of A, which means the two commodities are com- 
petitive in demand. If the fy parameter is negative, 
a rise in the price of B, which presumably di- 
minishes the purchases of B, also diminishes the 
purchases of A, suggesting that the commodities 
are complementary, such as knives and forks. Simi- 
larly in the case of supply, if the parameter fa, is 
positive, it means that an increase in the price of 
B raises the quantity of A supplied. This suggests 
that the commodities are complementary in pro- 
duction, such as beef and hides or wool and 
mutton. If fa is negative, a rise in the price of B 
diminishes the quantity of A supplied, indicating 
that the commodities compete in production be- 
cause, for instance, they use the same scarce re- 
sources. If we now add to equations (4) through 
(7) the two conditions of equilibrium, represent- 
ing the equality of quantities demanded and sup- 
plied for both commodities, as in equations (8) 
and (9), we have a complete equilibrium system, 
with six equations and six unknowns, the four 
quantities and the two prices. 


(8) Yan = Yas 

(9) ua = Gos 

The equations of course do not have to be re- 
stricted to the linear form. 

We can easily extend this system to the general 
equilibrium of n commodities, In doing this, how- 
ever, we have to be careful because there are cer- 
tain constraints on the parameters of such a system 
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Figure 7 — Backward-sloping supply curve 
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imposed by the fact of scarcity. For instance, an 
increase in demand (or supply) for one commodity 
must almost always be counterbalanced by a de- 
cline in demand (or supply) for another. These 
constraints, however, are very complex, and ít is 
virtually impossible to put them into simple mathe- 
matical form. It is possible, for instance, for the 
demand for all commodities to increase if the 
velocity of circulation of money increases. Simi- 
larly, it is possible for the supply of all commodi- 
ties to increase if there is technological change and 
economic development. 

The prices of factors of production, such as 
wages and rents, can be treated like any other 
prices in demand and supply analysis. They pre- 
sent, however, some interesting peculiarities. In the 
case of the supply of a factor such as labor, we 
have the possibility of what is called a backward- 
sloping supply curve. Thus, in Figure 7 we plot 
the wage on the vertical axis and the amount of 
labor offered on the horizontal axis. There is likely 
to be some wage, OA, that is so low that no labor 
will be offered at all, because at this wage the labor 
cannot subsist, or because there are better oppor- 
tunities elsewhere. At the wage that will just in- 
duce labor to come into this occupation, there may 
be quite a large amount of labor offered, AB, be- 
cause the laborer has to work long hours in order 
to make enough to live on. At higher wages, how- 
ever, the amount of labor offered may be less, fol- 
lowing the course BC, for if the laborer has a con- 
ventional standard of living and requires only a 
certain income to live on, the higher the wage the 
less he has to work in order to earn this fixed 
amount. It is also possible, however, that at a cer- 
tain point, such as C, the laborer realizes that he 
can earn much more than subsistence if he works 
harder, and the elasticity of supply again becomes 
positive, from C to D. A similar phenomenon may 
sometimes be observed in the case of land, when a 
rise in rents causes the landowner to employ a 
larger area in parks and gardens for his own pri- 
vate pleasure because he can get an adequate in- 
come from a smaller amount of land rented out. 
Under these circumstances there may be quite seri- 
ous breakdowns in the equilibrium of demand and 
supply, resulting, for instance, in inflation or in 
perpetual labor shortages in cases where custom 
prevents wages from rising. Some of the peculiari- 
ties of traditional societies may perhaps be ex- 
plained in these terms. 


Demand and supply have emerged over the years 
as very useful tools of fairly rough analysis. They 
serve to break down the forces operating on prices 
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and outputs into those that operate mainly on the 
side of the sellers and those that operate mainly on 
the side of the buyers. They should not be expected 
to answer all the problems of price theory. They 
remain, however, indispensable tools. 


KENNETH E. BOULDING 


[See also the biographies of MARSHALL; SMITH, ADAM; 
Watras.] 
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LLS 
ECONOMETRIC STUDIES 


Econometric studies of demand and supply are 
directed toward obtaining quantitative estimates of 
market demand and supply functions from em- 
pirical data. The principal data used have been 
time series of prices, quantities, incomes, and re- 
lated variables; the traditional estimating technique 
has been regression analysis; and the interpreta- 
tion of the resulting equations has been guided by 
the theory of consumer demand and the theory of 
the firm. 

Concepts of demand and supply have occupied 
a prominent place in the development of economic 
theory. The pure theory of consumer demand 
reached a high level of refinement in the hands of 
Pareto, who portrayed the individual consumer as 
attempting to maximize a utility function that in- 
cluded the quantities of every good and service 
purchased by him. The quantities purchased were 
themselves functions of prices and of the con- 
sumer’s income. The resulting demand functions 
of individual consumers were in principle measur- 


able and could evidently be aggregated to yield 
market demand functions. Marshall and others 
specified the conditions of profit maximization for 
an individual firm and showed that the marginal 
cost curves of individual firms producing the same 
product could be aggregated to form an industry 
supply curve. Demand and supply functions for 
all commodities and services were included (con- 
ceptually) in the theory of general economic equi- 
librium formulated by Walras and Pareto. 

Economic theory specified the signs of the co- 
efficients relating prices and quantities in market 
demand and supply functions, but it could not 
specify the magnitudes of these coefficients for 
particular commodities. Yet major policy problems, 
for example, the effects of tariffs, bounties, and 
excise taxes, required quantitative knowledge. The 
policy interests of government agencies and the 
commercial interests of business firms began in 
the 1800s to generate time series data of an aggre- 
gative sort, including wholesale prices of staple 
commodities and quantities of imports, exports, 
production, and consumption of such commodities 
in various countries. The task confronting pioneers 
in the econometric analysis of demand and supply 
was to use market data of this kind as a basis for 
determining the required functions. When the task 
was first seriously taken up, after 1900, it was 
found to involve complex problems of statistical 
estimation and logical interpretation. 

A sophisticated econometric analysis of the de- 
mand and supply relationships for particular com- 
modities requires an analyst with many skills. He 
must be thoroughly grounded in economic theory. 
He must be well versed in the theory of statistical 
estimation (including the estimation of sets of 
simultaneous relationships) and in techniques for 
coping with the problems of autocorrelated dis- 
turbances and multicollinearity, which are fre- 
quently encountered in economic time series. He 
must acquaint himself very thoroughly with the 
conditions under which particular commodities are 
produced, marketed, and consumed, insofar as 
these conditions affect the decision processes of the 
relevant firms and consumers. He must take pains 
to determine the extent to which each of his eco” 
nomic time series corresponds to the variable called 
for by economic theory. 

A person who has made the necessary invest- 
ment in economic and statistical theory will ordi- 
narily be tempted to slight the detailed study of 
data, institutions, and technology pertaining to 4 
particular commodity. On the other hand, a person 
who because of predilection has made a major m 
vestment in learning the technological and insti- 
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tutional characteristics of an industry usually lacks 
sophistication in economic or statistical theory. 
This dilemma may sometimes be resolved by co- 
operation between the econometrician and the 
commodity or industry specialist. 

The literature of demand analysis published 
prior to the late 1950s is much richer in volume 
and interest than that of supply analysis. The first 
and major part of this article will be devoted to 
studies of demand. 


Studies of demand prior to 1939 


The story of statistical demand analysis from its 
beginnings through 1938 can be told in terms of 
developments in economic theory, in statistical 
theory and techniques, and in published economic 
data. 

Economic theory. The economic theory requi- 
site for demand analysis was available at an early 
date, In 1838 Cournot stated the theory of demand 
in a form that lent itself to numerical application, 
and he suggested that it would be easy to learn, for 
all commodities for which statistics had been col- 
lected, whether current prices were above or below 
the value that would maximize the total value of 
the quantity sold during a given period. Marshall 
elaborated the concept of a market demand curve 
in a partial equilibrium context. Walras’ general 
equilibrium theory, published in 1874, expressed 
the quantity demanded of any commodity as a 
function not only of its own price but of all other 
prices, In various articles and books from 1893 to 
1911, Pareto extended Walras’ theory and stated 
it with greater rigor. 

Statistical theory and technique. Gauss’s meth- 
od of least squares was published early in the nine- 
teenth century, but for several decades it was used 
primarily by physical scientists in dealing with 
errors of measurement. The first demand analysts 
were encouraged to adopt statistical methods as a 
result of the work of Galton, Pearson, and their 
fellow biometricians, very late in the century. The 
theory of correlation was elaborated during the 
1890s by Pearson, Yule, and others; and several 
years elapsed before anyone tried to apply it to 
price—quantity relationships. 

The early work on correlation emphasized bi- 
variate or multivariate normal populations for 
which the simple, partial, and multiple correlation 
coefficients had structural significance [see MULTI- 
VARIATE ANALYSIS, article on CORRELATION (1)]- 
This correlation emphasis, carried forward through 
various textbooks, was to prove quite misleading 
to the less sophisticated practitioners of statistical 
demand analysis during the 1920s and 1930s. 
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By the 1920s R. A. Fisher and others concerned 
with the design of experiments and the analysis 
of variance had arrived at a clear-cut distinction 
between the correlation model, emphasized by 
Galton and Pearson, and the regression model, in 
which values of the independent variables were 
predetermined by the investigator [see LINEAR 
HYPOTHESES, article on REGRESSION]. Ezekiel 
(1930), whose book was widely used by demand 
analysts in the United States, captured the new 
regression emphasis of Fisher but continued to 
use some of the older, correlation terminology. 

In the 1930s the theory of statistical inference 
was further developed by J. Neyman and others. 
Frisch (1934) greatly illuminated the implications 
of multicollinearity (high intercorrelation ) of the 
independent variables in regression analyses based 
on economic time series. The problem of auto- 
correlation in the residuals from time series regres- 
sions was largely ignored in empirical demand 
studies prior to the 1940s. 

The problem of identifying a statistically esti- 
mated price—quantity relationship as a demand 
curve rather than a supply curve appears to have 
been recognized first by Lenoir, in 1913, and then, 
in 1914, by the critics of Moore's upward-sloping 
“demand curve” for pig iron [see STATISTICAL IDEN- 
TIFIABILITY]. It was very clearly stated by Working 
(1927), and its ready solution, in what would now 
be called recursive models, was noted by Ezekiel 
(1928). Statistical methods for dealing with the 
general simultaneous equation case were not avail- 
able prior to 1943, although Wright (1934, pp. 
196-201), in a paper relatively neglected by econ- 
omists, successfully estimated what would now 
be called a just-identified model, by a method 
logically equivalent to that of reduced forms [see 
SIMULTANEOUS EQUATION ESTIMATION]. 

Economic data. The availability of data has 
been the most severely limiting factor in the esti- 
mation of demand functions, from Cournot’s time 
to the present. The data resources of most countries 
have been greatly enriched since the 1930s, in con- 
nection with the development of national income 
accounts, index numbers of prices, production and 
consumption, input-output tables, and, in some 
cases, econometric models of their national econ- 
omies. The price and quantity data available for 
empirical studies of demand prior to 1939 were 
much more limited in scope. Data on agricultural 
staples have been particularly accessible in the 
United States, and these commodities were used 
almost exclusively by Moore, Schultz, and others 
in their empirical work. Knowledge of data limita- 
tions fostered a pragmatic attitude on the part of 
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some agricultural economists, whose applied work 
was highly useful for policy and forecasting pur- 
poses. Ignorance or disregard of data limitations 
often vitiated the empirical work of academic 
economists, who were mainly interested in “testing” 
theories or techniques. 

Empirical studies. At various times Cournot, 
Jevons, Marshall, and Pareto had all paid lip 
service to the idea of empirical studies of demand 
(Schultz 1938, pp. 657-658). Even earlier some 
businessmen and men of affairs (as well as econ- 
omists) had no doubt formed judgments about 
price—quantity relationships of particular interest 
to them. Stigler (1954) reports some examples. 
These instances of casual empiricism had no 
cumulative effect. Benini (1907) computed a least 
squares multiple regression of coffee consumption 
on the price of coffee and the price of sugar—the 
first application to demand analysis of what came 
to be the dominant statistical technique. 

Lenoir’s book (1913) in some respects remained 
unsurpassed until the 1930s. Starting from Pareto’s 
theory of consumer demand, Lenoir made an ex- 
plicit transition to market demand curves. He made 
a similar derivation, based on Pareto, of market 
supply curves. A diagram showing three roughly 
parallel (nonlinear) demand functions intersecting 
two roughly parallel (nonlinear) supply curves was 
used to adumbrate, at least, what is now known as 
the identification problem. Lenoir then calculated 
multiple regression equations, involving as many 
as three independent variables, for a number of 
commodities, including coffee, wheat, and coal. 
His descriptions of the time series and the markets 
involved appear to reflect care and sound judg- 
ment. The book as a whole was a most promising 
integration of economic theory, statistical tech- 
nique, and sophisticated attention to the data; but 
it was apparently unnoticed in the United States. 
It is not mentioned in later work by Moore. Schultz 
devotes part of one sentence to “Dr. Marcel Lenoir’s 
important book” (1938, p. 64) but gives no descrip- 
tion or appraisal of Lenoir’s work. 

Moore's work, Schultz wrote, “The statistical 
study of demand is a new field in economics and 
may be said to be the creation of only one man— 
Professor Henry L. Moore” (1938, p. 63). Although 
recognizing that Moore had predecessors, Schultz 
commented that “none of these predecessors of 
Moore attracted much attention, none covered so 
wide a field, and none succeeded so well in wring- 
ing fresh knowledge from the accumulated masses 
of data” (p. 64). 

Moore’s books Economic Cycles: Their Law and 
Cause (1914) and Forecasting the Yield and the 


Price of Cotton (1917) furnished the inspiration 
for much of the statistical demand analysis that 
was carried on in the United States during the 
1920s. Unlike most American economists of his 
generation, Moore was thoroughly familiar with 
the works of Cournot, Walras, and Pareto. He was 
personally acquainted with Walras and corre- 
sponded with him. In 1909 and 1913 he took 
courses in mathematical statistics, including cor- 
relation, with Karl Pearson (Stigler 1962, p. 2). 

Despite his admiration for the general equili- 
brium theory of Walras, Moore was pleased to find 
that in practice he could obtain fairly good fits to 
his data (prices and production of farm crops) by 
using equations in two or three variables. His sta- 
tistical procedures were simple, in keeping with 
the rather crude data that were available to him. 
He expressed his time series variables in terms of 
percentage changes from one year to the next or 
in terms of link relatives (the given year’s value 
expressed as a ratio to the value in the preceding 
year) or as ratios to trend. The adjusted series 
on prices and production were then related by 
the method of least squares, using straight lines, 
second-degree parabolas, and in some cases third- 
degree parabolas. Moore derived statistical demand 
curves for corn, hay, oats, and potatoes in 1914 
and a demand function for cotton in 1917. In sub- 
sequent articles, Moore introduced the concept of 
the flexibility of price (namely, the reciprocal of 
the elasticity of demand), estimated the response 
of cotton acreage to cotton prices in the preced- 
ing year, and described a moving equilibrium of 
demand and supply for potatoes in which the de- 
mand curve expressed current price as a function 
of current production and the supply curve ex- 
pressed current production as a function of the 
preceding year's price. We have here the well- 
known cobweb model, the simplest member of a 
class of recursive models, which assume great im- 
portance in discussions of statistical techniques 
after 1943. 

Moore’s* methods of data adjustment were 
chosen on a common-sense basis. Two of them, 
link relatives and per cent changes from year to 
year, were essentially first-difference transforma- 
tions. Many years later it was shown that first- 
difference transformations not only reduced inter- 
correlation of the explanatory variables but also 
frequently eliminated significant autocorrelation 
in the residuals. Moore’s third method, using ratios 
of actual observations to trends, largely eliminate 
the distorting effects of intercorrelated time trends, 
as did the quasi first-difference transformations. 
The constant term in a regression equation based 
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on first differences of prices and quantities may 
be interpreted as a linear time trend in the level of 
the price—quantity relationship. [See the biography 
of Moore, HENRY L.] 

Other studies. The simplicity of Moore's sta- 
tistical methods and his apparently successful 
applications of them to data for agricultural com- 
modities excited great interest among agricultural 
economists in the United States. Some of them 
wished to provide farmers with dependable pre- 
dictions of prices for the coming year, so that 
the farmers could adjust their production plans 
and increase their expected incomes. Some needed 
demand and supply functions to estimate the prob- 
able effects of alternative price support, supply 
control, tariff, or subsidy policies. Some of these 
men achieved a high level of sophistication in em- 
pirical demand analysis, notably Ezekiel, Waugh, 
H. Working, and E. J. Working, and a number of 
excellent studies were published by them between 
1922 and 1929 (see the bibliography in Schultz 
1938, pp. 779-803). Their work was distinguished 
by knowledge of data, technology, and institutions 
and by competent and in some cases highly cre- 
ative use of economic and statistical theory. There 
were, also, some less competent “price analysts,” 
whose work was empirical in the extreme and 
tended to discredit the whole field of statistical 
demand analysis. With the onset of the economic 
depression of the 1930s, several of those who had 
done significant demand studies were drawn into 
the action programs of the Department of Agricul- 
ture and other government agencies. Their unpub- 
lished analyses continued to be influential in the 
appraisal of programs and contemplated policies. 

European econometricians did important work 
in the 1920s and the 1930s. Frisch in Norway esti- 
mated the marginal utility of money from a statis- 
tical analysis of the demand for sugar in Paris 
(1926), Hanau in Germany estimated the demand 
curve and the supply curve for hogs (1927). Leon- 
tief in Germany published a statistical analysis of 
demand and supply functions for several commod- 
ities, using a highly imaginative but statistically 
unreliable technique (1929). Tinbergen in the 
Netherlands published an important paper on sup- 
ply curves, including a statistical measurement of 
the supply elasticity of potato meal (1929). Mar- 
schak in Germany made econometric studies of 
family budgets and argued that on certain assump- 
tions these data would also yield estimates of de- 
mand elasticities with respect to price (1931). Roy 
in France published a collection of his econometric 
studies (1935), including measurements of de- 
mand elasticities for goods and services whose 
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prices change only infrequently—such as gas, 
postage stamps, trolley fares, and tobacco (a gov- 
ernment monopoly in France), Some of these men 
had high creative potential in theory and methodol- 
ogy and are now well known for their important 
contributions to fields of econometrics other than 
demand analysis. Of those listed in this paragraph, 
Frisch had by far the greatest impact on economet- 
ric studies of demand, both before and after 1939, 

Schultz’s work. Henry Schultz was a student 
of H. L. Moore. Inspired by Moore, he studied the 
works of Cournot, Walras, and Pareto and during 
1919 attended lectures on statistics by A. L. Bowley 
and Karl Pearson. 

Schultz’s monumental work The Theory and 
Measurement of Demand (1938) is the definitive 
statement on econometric analysis of demand prior 
to World War u. Schultz was the “complete de- 
mand analyst” of his generation, and his 24-page 
bibliography is still the best point of departure for 
a survey of the pre-1938 literature. 

Schultz’s life work is essentially summed up in 
his 1938 book (817 pages). Chapters 1, 18, and 19 
together provide an excellent statement of the 
theory of consumer demand, incorporating the ex- 
tensions of Pareto’s basic formulation developed by 
J. R. Hicks, R. G. D. Allen, and E. Slutsky. Chapters 
2 and 3 deal with the logic and theoretical validity 
of various proposed methods for deriving demand 
curves from time series and from family budget 
data. The empirical section (chapters 5-17) pre- 
sents statistical demand analyses for ten major 
crops, along with summary comparisons and inter- 
pretations of the results for the various commod- 
ities, time periods, and functional forms investi- 
gated. Further empirical studies are presented in 
the chapters on related demands (chapters 18 and 
19), and the 50-page appendix on elements of 
curve fitting and correlation provides an excellent 
guide to least squares regression methods. It would 
be difficult to name any econometric work of large 
scope in the 1930s that achieved a better integra- 
tion of economic theory and statistics with pains- 
taking and realistic attention to institutions, mar- 
kets, and data. [See the biography of SCHULTZ.] 

Summary of accomplishments to 1939. Success 
in empirical demand analysis was not held up by 
the need for further development in economic 
theory. Multiple regression analysis was the stand- 
ard estimating technique from 1914 on, although 
a number of interesting experiments (weighted 
regression; regression analysis by short-cut graphic 
methods; confluence, or bunch-map, analysis) 
were tried. 

The leading demand analysts of the 1920s and 
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1930s were more sophisticated about the identifica- 
tion problem than is often supposed. Most of the 
commodities with which Moore, Schultz, and the 
agricultural economists were concerned followed 
the cobweb, or recursive, model. As a result of bio- 
logically necessary time lags, the identification of a 
relation between the current production and the 
current price of a farm commodity, as a demand 
function, was straightforward; a relation between 
last year’s price and this year’s acreage or produc- 
tion could be readily identified as a supply func- 
tion. E. J. Working made a clear statement of the 
identification problem, but Ezekiel pointed out that 
“correlated shifts in demand and supply’ sched- 
ules, which Working feared might completely in- 
validate many price-analysis studies, are not so 
likely to cause trouble as he thought” (1928, p. 
224). Only the “instantaneous” adjustment of sup- 
ply to price within a given time unit could give rise 
to such trouble, and such adjustments (for farm 
products) were generally small, relative to those 
in subsequent time periods. 

A few attempts were made in the 1930s to de- 
rive demand functions for consumer goods. In 
1914 Moore found a positively sloping relationship 
between the price and production of pig iron and 
made the mistake of trying to rationalize it as a 
different kind of demand curve. Although it could 
be argued that the demand for iron, steel, and 
other producers’ durable goods is ultimately de- 
rived from consumer demand, the empirical con- 
nection is tenuous and the subject is best treated 
in contexts other than that of demand analysis. 


Studies of demand since 1939 


Since 1939 there have been important develop- 
ments in methods of estimation, in data networks, 
and in the objectives of demand studies. 

Methods of estimation. Tests for autocorrela- 
tion in the residuals from time series regressions 
were published in the early 1940s and became a 
standard feature of demand studies. Haavelmo’s 
brilliant generalization of the identification prob- 
lem (1943) created in the minds of theoretically 
oriented economists grave doubts concerning the 
validity of any demand function estimated by 
conventional least squares methods. Interestingly 
enough, the economists who were or had been most 
deeply involved in empirical demand analysis, in- 
cluding Wold, Stone, Fox, Waugh, and Ezekiel, 
were not impressed with the implications of Haa- 
velmo’s methods for their own work. Wold made 
the most creative response, and his articles on re- 
cursive models and causal chains did much to 
place the traditional single-equation methods and 


the proposed simultaneous equation methods in a 
common perspective (1953 and articles noted 
therein). Fox (1958) used arrow diagrams, which 
can be readily interpreted as causal ordering dia- 
grams, to express his hypotheses concerning the 
directions and relative magnitudes of the influ- 
ences of each variable upon directly related vari- 
ables in complete demand, supply, and price struc- 
tures for farm and food products. Inspection of the 
arrow diagram for a given commodity frequently 
disclosed that the consumer demand function and 
certain other functions could be estimated appro- 
priately by least squares. Debate concerning alter- 
native estimators of simultaneous equation models, 
which was active in the early 1960s, has some im- 
plications for demand analysis as well as for other 
branches of econometrics. 

Changes in data networks and objectives. The 
Keynesian revolution in macroeconomic theory, by 
demonstrating the important analytical and policy 
uses of estimates of the national income and its 
components, stimulated many countries to develop 
comprehensive national income and product ac- 
counts. This interest reflected a growing recogni- 
tion that governments must take a great deal of 
responsibility for economic stabilization and the 
avoidance of unemployment. Tinbergen’s econo- 
metric models and Leontief’s input-output models 
also required comprehensive coverage (at appro- 
priate levels of aggregation) of all sectors of a 
national economy. 

The prospective uses of economic data in com- 
prehensive models have given a more analytical 
focus to the design of data systems in recent years 
than generally existed prior to World War 11. At 
the same time, comprehensive models not only en- 
courage but logically require certain restrictions 
upon the coefficients of demand functions for dif- 
ferent commodities. These restrictions were gener- 
ally disregarded in the pre-1939 tradition, except 
for a few studies (by Schultz, Waugh, and Ezekiel) 
of commodities closely related in demand, such as 
beef, pork, lamb, and chicken. Hence, empirical 
work in the 1960s has shown a much greater 
tendency to use “extraneous” or a priori informa- 
tion to supplement time series data and to rely on 
theoretical considerations in supplying coefficients 
that cannot be estimated directly from the data. J 

Major empirical works. Wold (1953) publishe 
an excellent exposition of Paretian demand theory, 
of the theory of stationary random processes as z 
affects the analysis of economic time series, and 0i 
the theory of regression analysis. While the scope 
of Wold’s theoretical chapters is similar to that © 
Schultz, Wold incorporates a number of major 
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theoretical developments that had been made after 
1938, by himself and others. Wold’s book requires 
a higher level of mathematical and statistical com- 
petence on the part of its readers than does 
Schultz's, and in the 1950s it largely supplanted the 
latter as a graduate text and reference work. Wold’s 
empirical chapters, however, were not particularly 
distinguished. 

Stone’s comprehensive book (1954) establishes 
him as a “complete demand analyst” in the Schultz 
tradition, and he had indeed been influenced by 
Schultz’s writings. Stone explicitly covers those 
portions of economic theory and estimation proce- 
dures that are relevant to demand analysis. The 
mathematical level of his exposition is less de- 
manding than Wold’s, but the implications he 
draws for empirical work are equally authoritative. 
The volume and quality of the econometric results 
presented by Stone are also impressive. An interest- 
ing feature of his work is the use of income elastic- 
ities of demand, estimated from family budget 
data, to adjust time series of per capita consump- 
tion before regressing them on time series of prices. 

Fox (1958 and articles noted therein) published 
the largest array of empirical demand functions 
that had so far been produced in the United States 
for farm products and foods, taking advantage of 
the many improvements achieved by the Bureau of 
Agricultural Economics between 1938 and 1950 in 
data on retail prices, per capita food consumption, 
disposable personal income, and food marketing 
margins. His demand studies covered both farm 
products and foods, whereas Wold and Stone dealt 
only with foods. Fox used logarithmic first-differ- 
ence transformations and took explicit cognizance 
of identification problems. 

Special adaptations of demand theory. Strotz 
(1957) put forward the concept of a “utility tree,” 
which justified the aggregation of consumer goods 
into budget categories that were assumed to be 
want independent of one another. Cross elasticities 
of demand between individual commodities in two 
want-independent groups could then be ignored 
in applied work; nonzero cross elasticities might 
continue to exist and be recognized between groups 
of commodities. Strotz was motivated by a desire 
“to exchange some realism [in assumptions] for 
greater relevance [in applications]” (1957, P. 
270) and stated that his hypothesis “implies cer- 
tain empirically meaningful and interesting Con- 
ditions on the price coefficients of the demand 
functions” (p. 269). 

Frisch (1959) published a complete scheme for 
computing all direct and cross elasticities of de- 
mand in a model with many sectors. He noted that 
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it was easier to obtain estimates of budget pro- 
portions and Engel (income) elasticities than of 
elasticities with respect to price but that by making 
certain want-independence assumptions the price 
elasticities could be deduced from the knowledge 
of budget proportions and Engel elasticities. Frisch 
noted further that the direct elasticities with re- 
spect to price could, as a rule, be estimated more 
easily than the cross elasticities. If we are willing 
to assume that the behavior of the market can be 
described by the behavior of a representative indi- 
vidual and are also willing to make stipulated as- 
sumptions about want independence, all the cross 
elasticities of demand in a model including many 
commodities could be supplied in a reproducible 
way consistent with the other assumptions of de- 
mand theory. 

Brandow (1961) followed Frisch’s suggestions 
in specifying a complete matrix of price and in- 
come elasticities of demand at retail prices, for 24 
foods or food groups and for the aggregate of all 
nonfood consumer goods and services. This matrix 
represented a careful synthesis of results obtained 
from United States data by other investigators, to- 
gether with additional analyses of his own. He 
presented equations for converting each of his 24 
consumer demand functions into demand func- 
tions at the farm price level and derived other 
equations for determining the internal balance of 
the livestock and feed economy. Brandow did not 
attempt to develop equations for forecasting acre- 
ages and supplies of crops, although adjustments 
of livestock supplies to quantities of feed avail- 
able were endogenous in his model. As of 1965, 
Brandow’s study was the most comprehensive and 
internally consistent application of demand analy- 
sis yet completed. 

Apart from Schultz (who died in 1938), Waugh 
has been the most sophisticated of the American 
demand analysts in his command of economic 
theory and the most imaginative in his applica- 
tion of statistical techniques. His methodological 
and empirical contributions have extended from 
1923 to 1964. His bulletin Demand and Price Anal- 
ysis (1964), while written on an expository level, 
reports many methodological experiments, includ- 
ing a symmetric matrix of direct and cross elas- 
ticities of demand for three competing commodi- 
ties; an empirically derived “indifference surface” 
for beef and pork; an orthogonal regression; an 
appendix on relationships between demand co- 
efficients (along the lines of Frisch’s 1959 article); 
and an appendix on the optimal allocation of com- 
modity supplies among independent markets to 
maximize producer returns. Waugh’s bulletin pro- 
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vides a brief, clear, and interesting survey of the 
theory and techniques of demand analysis, with 
many illustrations. 

Smith (1964) applied linear programming to 
the electronic computation of human diets, draw- 
ing his data from time series of weekly purchases 
of some 600 food products by 176 families in 
Lansing, Michigan [see PROGRAMMING]. The re- 
tail prices of the products were also obtained. 
Smith showed that the solutions for least-cost diets 
meeting stated requirements for nutrients, speci- 
fied complementarity restrictions, and “food-habit” 
or palatability stipulations could, under certain 
very special assumptions, be made to yield esti- 
mates of the marginal utilities of commodities and 
the marginal costs of the effective restraints. It 
remains to be seen whether this approach will con- 
verge with that of Frisch and Brandow. Ladd and 
Martin (1964) used the Lansing data to demon- 
Strate the existence of distributed lags in the ad- 
justment of consumer demand for some foods to 
price and income changes when the units of ob- 
servation were four-week (and in some cases 13- 
week) periods [see DISTRIBUTED LAGS]. 


Econometric studies of supply 


Prior to the late 1950s, econometric studies of 
supply were much less exciting to economists than 
were studies of demand. The theory of consumer 
demand embraces all consumer goods and services 
and embodies principles of rational choice appli- 
cable to consumers in general. But the technol- 
ogy (and geography) of supply may be highly 
specialized to particular commodities, and such 
conceptual unity as the field possesses may be sub- 
merged in these details. 

In some countries and periods, agricultural pro- 
duction takes place under atomistic competition 
and provides data for econometric analysis, In the 
industrial sector, oligopoly and monopolistic com- 
petition are common; many firms operate at less 
than full capacity, and statistical studies suggest 
that their supply curves are nearly horizontal over 
the usual ranges of output. Data on the cost func- 
tions of individual firms are often closely held 
[see PRODUCTION AND COST ANALYSIs]. 

Published econometric studies of supply have 
for the most part dealt with agricultural commodi- 
ties. Moore used least squares regression methods 
to estimate the response of potato yields in a given 
year to prices in the preceding year. Bean (1929) 
and other agricultural economists used regression 
techniques to estimate the responses of crop acre- 
ages to prices lagged one or two years and the 
responses of hog numbers to the lagged relative 
prices of hogs and corn. 


In the United States, interest in econometric 
supply analysis was largely dissipated by the de- 
pression of the 1930s, by government programs to 
control acreages (after 1932), and by the severe 
droughts of 1934 and 1936. Some local interest 
continued in supply response studies for minor 
crops that were not subject to acreage allotments. 
In 1928 Schultz gave considerable attention to 
estimating the supply curve for sugar, in addition 
to the demand curve, It is significant that he had 
little to say about supply functions in his 1938 
book. 

Nerlove (1958) injected a new idea into the 
traditional type of supply response study. He hy- 
pothesized that farmers adjust their planted acre- 
age of a crop in response to the difference between 
the most recently experienced actual price and 
their concept of the “long-run normal price” for 
the same year. The technique proposed for in- 
ferring the long-run normal price was ingenious, 
but it only momentarily dispelled the apathy of 
economists toward regression studies of acreage 
response in an era of government programs con- 
trolling acreage. 

In the 1950s the popularization of linear pro- 
gramming methods among agricultural economists 
by Heady and others opened the way for a norma- 
tive analysis of supply response. An individual 
farmer could be represented as adjusting his 
planted acreages of different crops in an effort to 
maximize net income, subject to his resource con- 
straints and given his expectations concerning the 
prices at which the various crops would sell after 
harvest. The existence of support prices permitted 
accurate predictions in some cases. 

Day (1963) applied recursive linear program- 
ming to a homogeneous eleven-county area in the 
Mississippi Delta, which he treated as though it 
were a single profit-maximizing unit. Day’s model 
included nine possible outputs (counting cotton 
lint and cotton seed as separate products) aan 
about 27 possible production inputs. He specifie 
input and output vectors for a total of 97 different 
production processes, each process representing a 
particular crop grown on a particular soil ge 
with a stated level of application of commerci 
fertilizer and a particular combination of machin- 
ery, labor, and pesticides. 

Day applied his model recursively, year by year, 
from 1939 through 1958. New production activi- 
ties, for example, mechanical cotton picking, Were 
introduced into the profit-maximizing solution only 
when they had become more profitable than a 
previously existing activities on at least one E 
type. The model generated time series for all ou! 
puts and inputs, as well as “shadow prices for 
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the restrictions (including government acreage 
allotments) that were effective in each year’s op- 
timal solution. 

In the early 1960s Heady and others applied 
linear programming to each farm in representa- 
tive samples of farms to arrive at aggregative sup- 
ply response functions. Heady and Egbert (1964) 
developed a linear programming model of agricul- 
tural production in the United States for more 
than 120 regions and for all major crops in each 
region. By 1965 plans were under way to include 
livestock production in the model and to increase 
the number of regions. 


By 1965 the most ambitious econometric studies 
of demand and supply had achieved a (partly 
synthetic) comprehensiveness that would have as- 
tonished the early critics of Moore. But Moore 
himself would not have been astonished. In his 
last book, Synthetic Economics (1929), Moore had 
proposed an implementation of the general equi- 
librium model of Walras, the principal elements 
of which would be comprehensive sets of empirical 
demand and supply functions! 

Kart A. Fox 
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The term democracy indicates both a set of 
ideals and a political system—a feature it shares 
with the terms communism and socialism, “De- 
mocracy” is harder to pin down, however, than 
either “socialism” or “communism”; for while the 
latter labels have found in Marxism an ideological 
matrix, or at least a point of reference, democracy 
has never become identified with a specific doc- 
trinal source—it is rather a by-product of the entire 
development of Western civilization. No wonder, 
therefore, that the more “democracy” has come to 
be a universally accepted honorific term, the more 
it has undergone verbal stretching and has become 
the loosest label of its kind. Not every political sys- 
tem claims to be a socialist system, but even com- 
munist systems claim to be democracies. Since 
World War n, “democracy” encompasses every- 
thing; as stated by a UNESCO report: “. . . for the 
first time in the history of the world . . , practical 
politicians and political theorists agree in stressing 
the democratic element in the institutions they 
defend and in the theories they advocate” (United 
Nations . . . 1951, p. 522). 

One reaction to this state of affairs has been to 
avoid using the term. As has been forcibly stated, 
“, . . discussions about democracy . . . are intellec- 
tually worthless because we do not know what we 
are talking about” (Jouvenel 1945, p. 338 in 1948 
edition), The alternative is, of course, to dissect 
the term as analytically as possible. 


Scope and meaning 


Democratic legitimacy. Democracy is, to begin 
with, a principle of legitimacy [see also Lecrti- 
MACY], So conceived it is both the minimal and the 
sole common denominator of any and all demo- 
cratic doctrine. From the democratic viewpoint no- 
body denies that power is legitimate only when it 
is derived from the authority of the people and 
based upon their consent. Nobody questions that 
democracy is the opposite of autocracy. But this 
agreement is short-lived and indeed rests on fragile 
foundations. For democracy as a legitimizing prin- 
ciple lends itself to two diverging interpretations: 
(1) that the consent of the people can be a mere 
presumption, an untested assumption; or (2) that 
there is no democratic consent unless it is verified 
through ad hoc procedures (which exclude, notably, 
consent by sheer acclamation). And these oppos- 
ing views are related to an even more fundamental 
disagreement over the very meaning of the term 
people—a hazy notion indeed, 

“The people” can be understood as a singular 


term (in fact, peuple, Volk, and popolo are singu- 
lar nouns in French, German, and Italian) or as a 
plural, that is, as a single entity or as “everybody.” 
And, clearly, it is only the latter notion that calls 
for a legitimacy ascertained by means of reliable 
procedures; for “the people” conceived as an entity, 
or as an organic whole, easily combines with a 
legitimacy assumed on the sole basis of acclama- 
tions and plebiscitary approbations. Therefore, on 
the grounds of democracy conceived merely as a 
principle of legitimacy, any and all governments 
can easily claim to be democracies simply by 
switching from verified consensus to presumed 
consensus. By itself, then, popular consent does 
not suffice to qualify any particular political sys- 
tem as a democracy. Such qualification is given 
only by the procedures of consent—and these are 
controversial. 

The normative focus. From a normative stand- 
point the definition of democracy strictly derives 
from the literal meaning of the term—“power of 
the people.” We may say that the ought of democ- 
racy amounts to the etymology of the term, There 
are, however, three different normative approaches: 
oppositional, realistic, and perfectionistic (or uto- 
plan). Used as an oppositional concept, democracy 
indicates what ought not to be; realistic norma- 
tivism points to what could be; while utopian 
normativism presents the image of the perfect so- 
ciety that must be. Moreover, since the normative 
attitude is basically future-oriented, it is easily 
converted into “futurism” in the sense that "democ- 
racy” becomes a long-range projection unrelated to 
current deeds. The use of undemocratic means to 
achieve democratic ends finds its justification pre- 
cisely in this attitude. 

The descriptive focus. A descriptive standpoint 
leads to definitions that bear little resemblance, if 
any, to the normative definitions of democracy. A 
concern with what democracy is in the real world 
seldom, if ever, makes reference to the notion of 
people. As Dahl puts it, in actuality democracies 
are “poliarchies” (1956, pp. 63-89). And the stand- 
ard definition provided by most authors describes 
democracy as a system based on competitive par- 
ties, in which the governing majority respects the 
rights of minorities. The discussion is focused on 
the concepts of representation, majority rule, Op- 
Position, competition, alternative government, con- 
trol, and the like—hardly ever on the notion of 4 
self-governing people. Even descriptively, san 
the approaches can be quite different: structur i 
procedural, or behavioral, These are not clear-cu 
distinctions, for both the structures and the pro- 
cedures of democracy are meant to elicit and t0 


enforce a given behavior. Yet procedures are not 
necessarily related to institutional structures; and 
moreover the behavioral definition may be incom- 
patible with the structural and procedural defini- 
tions, as we shall see. 

The typological focus. Democracy is also one 
type of political system among others, and from 
this viewpoint the problem becomes to define the 
properties that distinguish it from nondemocratic 
polities, When the issue arises, the attempt is often 
made to qualify democracy with reference to an 
ought rather than to the is. Clearly, however, the 
identity of a political system cannot be ascertained 
on normative grounds. It can only be assessed on 
factual grounds, that is, with reference to the pos- 
sibility of verification provided by a descriptive 
account. 

Another source of confusion lies in the inter- 
mingling of three different standards. At times 
democracy is taken to include all the political sys- 
tems falling short of outright dictatorships. This 
identification is purely negative; the standard is 
very low, and we are thus confronted with an 
unspecific type. However, since no political system 
has a definite form at the moment of its inception, 
this minimal standard of democracy may aptly in- 
dicate its “initial” type. In other instances the 
standards are higher, and democracy is identified 
positively by the existence of developed representa- 
tive institutions and by the establishment of “con- 
stitutional government” [see CONSTITUTIONS AND 
CONSTITUTIONALISM; see also Friedrich 1937]. 
Since this is the more frequent case, as well as the 
meaning in which the term democracy is more 
frequently used, we may speak of it as the medium 
or normal type. Finally, when we use a high stand- 
ard and refer to maximum achievement, we are 
confronted with a strict meaning of democracy, 
according to which the term denotes an advanced 
type. 

According to the minimal standard, roughly half 
of the world may be included in the realm of 
democracy; according to the medium standard the 
number of democratic countries dwindles; and ac- 
cording to the high standard a mere dozen or so 
countries have achieved a satisfactory degree of 
democracy. And it requires little effort to imagine 
how easily the label “democratic” can be turned 
into “undemocratic,” and vice versa, simply by 
switching from one standard to another. 

The dimensional focus. A distinction must also 
be made between small-scale and large-scale opera- 
tions, between microdimensions and macrodimen- 
sions. Microdemocracy applies to face-to-face rela- 
tionships, i.e., to small groups. Macrodemocracy 
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applies whenever a collectivity is too large and/or 
spatially too scattered to allow any direct inter- 
change among its members and any kind of face- 
to-face relationships. The distinction implies that 
a macrodemocracy is not some kind of enlarge- 
ment of a microprototype. Their respective proper- 
ties have very little, if anything, in common, at 
least in the sense that voluntary associations and 
small political units provide no clues for under- 
standing a modern political democracy. They are 
perhaps the most essential inner nourishment of a 
democratic political system, but they can neither 
replace it nor dispense with it. In particular, they 
provide no model for macrodemocracy. It may be 
argued that no definite line can be drawn between 
small and large, which are indeed relative con- 
cepts; nonetheless the fact remains that micro- and 
macrodemocracy are inversely correlated: the 
greater the geographical extension of democracy, 
the less its intensity as a real experience of shared 
decision making. 


Secondary meanings 

From the time the term demokratia was coined 
in the fifth century B.C. until roughly a century 
ago, democracy was used as a political concept. 
Tocqueville was struck, however, by the social 
aspect of American democracy, and we thus speak 
of “social democracy”; Marxism has popularized 
the expression “economic democracy”; and guild 
socialism, particularly the Webbs’ book, Industrial 
Democracy (1897), has given currency to the label 
“industrial democracy.” These are the major sec- 
ondary usages of the term democracy; and since 
we shall be concerned mainly with the primary 
political usage, they will only be briefly considered. 

Social democracy. “Social democracy” is gen- 
erally conceived as an endogenous state and style 
of the society and should therefore not be confused 
with “socialist democracy,” which is a policy en- 
forced by the state upon the society. The expres- 
sion social democracy usually points to the democ- 
ratization of the society itself, as expressed by its 
manners and customs, and particularly by the be- 
lief in what Bryce called “equality of estimation,” 
that is, equal treatment and equal respect for every 
man. Social democracy may thus be defined as an 
ethos and a way of life characterized by a general 
leveling of status differences. By implication it may 
also indicate a “multigroup society” in which a 
lively network of microdemocracies sustains and 
implements political macrodemocracy. 

Economic democracy. Since political democracy 
is primarily concerned with political and juridical 
equality, and since the expression social democracy 
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denotes equality of status, it follows that concern 
for the equalization of wealth may be called eco- 
nomic democracy. In this generic and obvious 
sense then, the label denotes a democracy whose 
primary policy goal is the redistribution of wealth 
and the equalization of economic opportunities. 
So conceived, economic democracy presupposes po- 
litical democracy—indeed, it is meant to be the 
ultimate feedback of a democratic form of gov- 
ernment. 

However, in the Marxist sense—which is by far 
the prevalent association of the expression—“eco- 
nomic democracy” does not presuppose political 
democracy; it replaces it. This follows from the 
materialistic conception of history, i.e., from nega- 
tion of the autonomy of politics. In the Marxist 
approach political democracy has no value in itself, 
for it is only a superstructure of bourgeois and 
capitalistic oppression, and “political” democracy 
is thus reduced to “capitalistic” democracy. But 
once the domain of politics disappears from our 
purview, there remains little, if anything, that one 
can say about democracy in constructive terms. 
One can oppose the “false” existing democracies, 
but what can one propose for the sake of rebuilding 
a “true” democracy? Therefore in the Marxist 
meaning “economic democracy” is only an oppo- 
sitional concept, and a distorted one at that, for it 
is not really the obverse of capitalist democracy 
but merely the obverse of capitalist economy. In 
other terms, in this context, democracy only means 
an economic system, and one based on the assump- 
tion that politics can be taken out of politics, 

Industrial democracy. “Industrial democracy” 
is a narrower but more constructive term for pin- 
ning down the problems associated with the idea 
of economic democracy. Basically, industrial de- 
mocracy is democracy within industrial plants. In 
many ways it is an adaptation of the Greek formula 
to an industrial society: it is a microdemocracy in 
which the member of the political community, the 
polites, is replaced by the member of an economic 
community, the worker. In its ultimate form, indus- 
trial democracy calls for self-government by the 
workers in a plant—a direct self-government which 
could or should be crowned at the national level by 
a “functional democracy,” that is, by a political 
system based on functional representation [see 
REPRESENTATION]. In practice, the ideal of indus- 
trial democracy has materialized only at the micro- 
level in a number of schemes concerning the 
workers’ participation in management: “codetermi- 
nation” in Germany and Austria, workers’ councils 
and “self-management” in Yugoslavia, and institu- 


tionalized practices of joint consultation between 
management and trade unions in various other 
countries (reviewed, for example, in Clegg 1960), 

To sum up, nobody will deny the importance of 
social democracy as a vital basis of a democratic 
polity; and it is usually conceded that economic 
equalization and industrial democracy are valuable 
goals. Nonetheless, all these conceptualizations are 
secondary in that they presuppose, explicitly or 
implicitly, a political democracy. In other words, 
these democracies are not sovereign. In particular, 
if the over-all political system is not a democratic 
system, economic equality has little meaning and 
industrial democracy can be eliminated overnight. 
This is the reason why democracy is first and fore- 
most political democracy, with the understanding 
that “the importance of the democratic political 
method lies mainly in its nonpolitical by-products” 
(Frankel 1962, p. 167). 

People’s democracy. The labels “people’s de- 
mocracy,” “progressive democracy,” “Soviet democ- 
racy,” and the like, pose a special problem. The 
difficulty is not simply that they point to a cluster 
of manifold elements but that the components of 
the cluster are so slippery that they defy analysis. 
A communist democracy is a “politico—economic” 
democracy, a “macro-micro” democracy, and a 
“supra-infra” democracy. It is almost impossible, 
therefore, to classify a people's democracy in terms 
of the distinction between the political and extra- 
political meanings of democracy. The notion is 
clearly derivative, however, and may be considered 
in this sense as another secondary meaning of 
democracy. 

For one thing, the expression “people's democ- 
racy” was coined and launched only after World 
War 11, as a transparent response to the “goodness 
of the word democracy. The derivative nature of 
the notion is also revealed by its thinness. The 
range of any discourse about the communist-type 
democracy is basically confined to a normative 
context and leans particularly on normative- 
futuristic democracy. In any case, it remains refrac- 
tory to empirical verification, since the comm a 
theory bypasses both structural and proceda 
arguments and draws exclusively on a behand 
definition of democracy that cannot be disproveđ. 
It follows that the theory of communist democracy 
does not succeed in showing how this demie 
is meaningfully related to the facts. All in all, y 
no criterion can the systems labeled “people's de- 
mocracy,” “Soviet democracy,” etc., be differenti- 
ated from nondemocratic political systems [see 
COMMUNISM]. 


Democracy as a form of government 


Greek and modern democracy. Greek democ- 
racy, as practiced in Athens during the fourth 
century B.C., was the closest approximation to the 
literal meaning of the term. One could argue, in 
effect, that the Athenian demos had more kratos 
(power) than any other people since. At the same 
time Greek democracy represents the maximum 
conceivable enlargement of a microdemocracy. 
When the demos gathered, the Athenian system 
actually operated as a “town-meeting” democracy 
in which some thousands of citizens expressed 
their ayes and nays. 

To be sure, when the demos was assembled “de- 
mocracy” consisted largely of decisions made by 
acclamation. But the town-meeting aspect was only 
the impressive part of the system. Its most effec- 
tive part resided in the mechanism that made “all 
command each, and each in his turn all,” as Aris- 
totle concisely put it; i.e., the exercise of power was 
effectively and largely shared by means of a rapid 
turnover of officials. The sharing in the exercise of 
power was also effective in that it occurred at ran- 
dom, for most officials were chosen by lot. On both 
counts—the collective “self-governing” and the in- 
dividual “governing in turn”—Greek democracy 
was a direct democracy based on the actual partici- 
pation of the citizens in their government. 

Modern democracy is entirely different. It is 
based not on participation but on representation; 
it presupposes not direct exercise of power but dele- 
gation of power; it is not, in short, a system of 
self-government but a system of control and limita- 
tion of government. While Greek democracy can be 
defined literally as a “government of the people over 
the people,” modern democracy cannot, for the 
people who are governed are not the same people 
who govern. Therefore, we should not be misled 
into believing that present-day “electoral participa- 
tion” resembles the real participation of the Greek 
citizen and even less that the devices we call “direct 
democracy” (referendum, initiative, etc.) can 
bridge the gap between the Greek and the modern 
formulas, 

Greek and modern democracy are also entirely 
different in respect to political freedom. In fact, 
only the modern form can be called “liberal democ- 
racy.” The vagueness of the term liberalism and 
the multifarious aspects of freedom make this a 
controversial topic, with some authors flatly deny- 
ing that men of antiquity were free (for example, 
Fustel de Coulanges 1864) and other scholars (re- 
cently, Havelock 1957) affirming the contrary. Yet 
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there is at least one sense, and a very real one, in 
which we can follow Benjamin Constant in con- 
trasting ancient and modern liberty (1819). The 
freedom of the citizen of the polis consisted in his 
part of sovereignty. Moreover, his freedom was not 
conceived as liberty for each individual rooted in, 
and protected by, “personal rights.” The individual 
as such, “each body,” was absorbed in the collective 
“Allbody,” that is, the polites was called to exist for 
the polis; whereas we are likely to hold the op- 
posite, that the state exists for the sake of the 
citizens. And while this does not imply that the 
Greeks called freedom what we consider oppres- 
sion, it does point to the fact that their liberty was 
entirely dependent upon the existence of a diffuse 
and relatively small political community (hardly 
a “state” in our sense of the term) in which the 
liberty of the individual could still be entrusted to 
his share in the exercise of sovereignty [see 
FREEDOM]. 

It is fairly obvious that the Greek type of democ- 
racy is inapplicable to modern conditions. Modern 
political societies are large societies, and the greater 
the number of the people involved, the less their 
participation can be effective and meaningful. Fur- 
thermore, the modern nationwide state confronts 
us with spatial or extensional impossibility, for 
real self-government cannot occur among absen- 
tees; it requires a demos to be present in person on 
the spot. Finally, it should not escape our atten- 
tion that the “directness” of a democracy is strictly 
related to political primitivism: the government of 
all in turn is, in effect, the counterpart of a low 
degree of distinctiveness, explicitness, and special- 
ization of the political functions. 

It would seem that we are confronted with a 
paradox. To the Greeks democracy, literally under- 
stood, was a possible form of government. To us, 
instead, literal democracy is an impossible form 
of government. The query is: Why did we rein- 
state—after two thousand years of oblivion and 
disrepute—a term whose original and literal mean- 
ing calls for a blatant impossibility? 

It does not suffice to reply that we give the Greek 
term a different meaning. Names are important in 
themselves, and the fact is that all over the world 
the common man of the twentieth century under- 
stands the word democracy very much in the same 
way as did the citizen of ancient Athens: its utter- 
ance elicits similar behavior, similar expectations, 
and similar demands. Nor can the issue be evaded 
simply by saying that the choice of the term de- 
mocracy was unfortunate. For the word has gained 
acceptance not despite but because of its utopian 
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bent. It is not a coincidence that while the Greeks 
coined the term democracy to describe a possible 
form of government, we have revived a term that 
prescribes an impossible form. In the modern 
world, then, “democracy” is first and foremost a 
normative word: it does not describe a thing, it 
prescribes an ideal. 

The ideal. Westerners have lived under demo- 
cratic systems long enough to have reached the 
phase of democratic disillusionment. They are 
therefore likely to underestimate the impact of 
ideals and especially the force of the democratic 
“illusion"—no matter under which apocalyptic ban- 
ner—in the rest of the world. Westerners are thus 
inclined to miss the peculiar temper of modern 
politics and—ironically enough—the fundamental 
change that Western rationalism has brought about 
in man’s attitude toward history. 

Until the Enlightenment political forms were not 
conceived as future-oriented paradigms; for the 
paradigm was in the past, in a lost paradise, or in 
a state of nature, For millennia political theorists 
have been concerned with what could be. But from 
the French Revolution onward we have become 
concerned with what should be. Classical liberal- 
ism still belonged to an age of reasonableness in 
which men were content with regulating the tide; 
democracy, socialism, and communism were born, 
instead, out of a Promethean attitude, out of the 
ambition to swim against the tide. The difference 
between the names liberalism and democracy is 
hardly descriptive; it is normative. The latter label 
has absorbed the former one in large part because 
“democracy” has a utopian potential that “liberal- 
ism” lacks [see LIBERALISM]. 

In fact, by any other criterion the term liberalism 
would have been a more advantageous choice. It 
was not associated with a memorable failure, an 
experiment that had rapidly degenerated into both 
sectional government—the “rule of the poor against 
the rich,” as Aristotle realistically put it—and 
“mobocracy,” a lawless rule of the mob. Moreover, 
the term liberalism pointed to the very crowning 
of the long-sought ideal of a mixed and balanced 
form of government. Thus the current success of 
the name democracy draws on the same reason 
that accounts for its previous abandonment, 
namely, that “democracy” points to an extreme 
ideal—no less extreme qua ideal than “commu- 
nism,” and so much so that in a purely normative 
context the two ideals can ultimately be joined. 

This is not to say, of course, that “democracy” 
was deliberately reinstated because modern man 
has fallen into a utopian mood. The adoption of the 
name democracy was also a response to the entry 


into politics of ever-growing masses. The small lit 
erate elites of former times could well dispense 
with miranda and credenda, to use Charles Mer- 
riam’s terms; but the more politics opens up to 
comparatively illiterate masses, the more miranda 
and credenda are needed to feed them no less than 
to mobilize and to manipulate them. 

In a historical purview, then, it is the ought, the 
deontology of democracy, that comes to the fore, 
And a historical approach also helps to place the 
various forms of democratic normativism in per- 
spective. 

During the nineteenth century, the term democ- 
racy was mainly used in progressive circles as an 
oppositional ideal. As Louis Hartz points out, the 
image of democracy depicted by its early advocates 
was basically the negation of what they wanted to 
destroy (Chambers & Salisbury [1960] 1962, p. 27). 
Democracy so conceived is simply the reverse of 
absolutism, a polemical notion whose function is 
to oppose, not to propose. The utterance of “democ- 
racy” is a way of saying no to inequality, injustice, 
and coercion. But once the enemy is defeated the 
problem becomes to specify what ought to be, that 
is, to identify equality, justice, and freedom in a 
positive fashion. Faced with this problem demo- 
cratic normativism splits: it can either adapt itself 
to the real world or consolidate itself into a future- 
oriented perfectionism. 

Realistic normativism follows from awareness of 
the “opposite principle” (Herz 1951, pp. 168-189), 
or of the principle of the “opposite danger” (Sartori 
[1962] 1965, pp. 63-67). Its proponents realize 
that as an ideal is converted into reality, it must 
be continuously adjusted as it approaches fulfill- 
ment. Therefore, the more an actual democracy 1s 
maximized, the more its deontology must be min- 
imized. If within an established democracy the 
democratic ought is maintained in its extreme sine 
it militates against the very system it has produced, 
that is, it produces “opposite” results. Jo 

Utopian normativism, on the other hand, main 
tains an oppositional attitude within an m 
democracy. It refuses to admit that ideals e 
countervailing function and will not allow i 
ideal to fade in victory. The normative attitude y 
to maximize ideals in their purity, in anticipation 
of a future in which the ought will finally over 
come the is. 

Theoretically, one can easily dismiss both gier 
sitional and utopian normativism. But the fact 3 
mains that we live in a time of explosion ‘a 
expectations and in which the high tide of ie 
cratic perfectionism has yet to come in mos d 
Asia, Africa, and Latin America. We are thus face 


with an apparently insoluble dilemma. In order to 
make democracy succeed in the real world we have 
adopted a realistic normativism; but a realistic 
image of democracy, and often a disillusioned one, 
can hardly compete on a world-wide scale against 
the appeal of Utopia. A realistic normativism loses 
the dimension of the future, and this suggests that 
Western democracies may well lose control over the 
explosive potentialities of the democratic ideal. 

The reality. The ought and the is of democracy 
are inextricably intertwined. A democracy exists 
only insofar as its ideals and values bring it into 
being. Therefore, to deal separately with the norms 
and the facts is an analytical device. It is a neces- 
sary device, however, for while the name democ- 
racy is fitting for prescriptive purposes, it can be 
very misleading for descriptive purposes. Only in 
the Greek world did the name and the thing co- 
incide. In our world the descriptive meaning of 
‘democracy” cannot be explained and derived from 
its literal meaning. 

When we come to the how of democracy, a dem- 
ocratic polity is usually identified by the manner of 
selection of its leaders and by the fact (which is 
also a corollary) that their power is checked and 
restrained, As Schumpeter puts it, in a democracy 
“the role of the people is to produce a government,” 
and therefore “the democratic method is that insti- 
tutional arrangement for arriving at political deci- 
sions in which individuals acquire the power to 
decide by means of a competitive struggle for the 
people’s vote” ({1942] 1962, p. 269). The emphasis 
here is definitely procedural. It is also procedural 
in all the definitions concerned with majority rule 
and minority rights [see MAJORITY RULE]. On the 
other hand, when we deal with the institutional 
arrangements and agencies of “constitutional de- 
mocracy” the emphasis tends to be structural, But 
in many cases the structural and procedural focuses 
are so tightly interlinked that we may well speak 
of a combined “structural—procedural” definition. 

A neat distinction may be made instead between 
structural—procedural democracy, on the one hand, 
and the behavioral definition of democracy on the 
other. In this latter focus a democracy is identified 
by the activity of its leaders rather than by the 
manner of their selection. And the contention may 
follow that whenever one finds a “rule for the 
people” one finds a democracy. 

No one denies that governing for the people is 
the very purpose of a democratic government; for 
nobody affirms that democratic structures and pro- 
cedures are an end in themselves. The question is 
whether political altruism should be left to the gods 
or whether it should be secured precisely by struc- 
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tural and procedural means. Furthermore, the way 
in which the rulers happen to rule does not suffice 
to qualify a political system. A benevolent despo- 
tism remains a despotism no matter how benevo- 
lently the despot happens to behave. By the same 
token, ruling for the people is “demophily,” not 
democracy. For democracy is not merely a manner 
of governing; it is a form of government, a political 
system. 

The distinguishing mark of “real” democracy— 
ie., of democracy in the real world—is provided, 
then, by the means that are conducive to the end 
of governing for the people. The step from demoph- 
ily to democracy is indeed a long step. It occurs 
after endless deceptions and failures, and it occurs 
only when leaders are forced to respond to the 
people by means of structural and procedural safe- 
guards [see REPRESENTATION, article on REPRESEN- 
TATIONAL SYSTEMS]. 

The standards. In order to define democracy as 
a type of political system it is essential to ascertain 
what democracy is not. This in turn presupposes 
clarification of the standards of democracy. For, 
depending on the standards, a political system may 
or may not be considered a democracy. Indeed, dif- 
ferent standards have to be used, for democracy is 
not a static entity: democracy in the nineteenth 
century cannot be assessed like democracy in the 
twentieth, and a developed democracy is different 
from a developing democracy. The problem then 
is to make coherent use of the pertinent standard. 

With reference to the developed and successful 
democracies of the Anglo-American or Scandina- 
vian type, the standards are high. Here “democracy” 
denotes more than political machinery; it also de- 
notes a way of living, a “social democracy.” In 
particular, these democracies have gone a long way 
toward the maximization of equality—equality of 
status, of opportunity, and of starting points. We 
may thus speak of “full” or advanced democracy to 
denote the maximum current achievement of de- 
mocracy in the real world. In this sense, then, 
democracy is a polar type, just as totalitarianism 
is the polar extreme of dictatorship. 

In areas in which democracy has never been 
stable or effective—including a number of Euro- 
pean countries—the standard is considerably lower. 
In this instance a polity qualifies as a democracy 
because of its machinery rather than its achieve- 
ment and is more of a political arrangement than 
a state of the society, This more limited political 
character is revealed by the fact that emphasis is 
laid less on equality and more on liberty—as is 
only natural, for liberty has a procedural priority 
over equality. The test is provided by free elections, 
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a competitive party system, and a representational 
system of government. It would be unfair to require 
a more exacting standard; for only the successful 
functioning of the machinery over time allows de- 
mocracy to strike roots in the society. Aside from 
the United States, hardly any country was a full 
democracy before World War 1, not even Great 
Britain; and even currently the existence of con- 
stitutional government, as opposed to arbitrary gov- 
ernment, still represents the highest performance 
of democracy in most of the world. It is fair to say, 
therefore, that the standard provided by a constitu- 
tional government that secures political freedom, 
personal security, and impartial justice is the aver- 
age standard, i.e., that this is what democracy 
“normally” means, 

Up to this point we are able to specify what 
democracy is: the border between a democratic and 
a nondemocratic political system is still definite. 
But no sooner do we apply the word democracy to 
most of the Third World, and in particular to the 
so-called developing nations, than the standard be- 
comes so low that one may well wonder whether 
the word democracy is still appropriate. At this 
point we speak of democracy simply to indicate that 
a given political system is not an overt dictatorship, 
that is, a dictatorship that allows no freedom, no 
Opposition, and no independence to the courts. Some 
scholars are inclined to go even further. Shils 
speaks of “tutelary democracy” ([1959-1960] 1962, 
pp. 60-68), thereby implying that the standard can 
be reduced to the sole condition that the ruling 
elite earnestly profess democratic beliefs and pur- 
sue the goal of future establishment of some kind 
of democratic structure. 

From the point of view of the classification of 
political systems it appears that the category of 
“initial” democracy cannot be stretched to include 
tutelary democracy. For promises are not deeds, 
and an authoritarian method of achieving democ- 
racy has to surmount the additional difficulty that 
the means shape the ends. A tutelary democracy 
may be even less than a mere “behavioral” democ- 
racy; it is only a possible future, only a futuristic 
democracy. Yet, the notion of tutelary democracy 
has its merits. For one thing, to profess democratic 
ideals is better than nothing at all; that is, the 
notion has the merit of singling out the importance 
of a belief system, in contrast to the somewhat 
deterministic and mechanical view that democracy 
follows from a set of socioeconomic conditions. In 
the second place, to raise the problem of democracy 
in the context of formless or transitional societies 
has the virtue of stimulating our imagination. 

The question arises: When we speak of Western 


experience, is the key term “Western” or “experi- 
ence”? In other words, can there be a non-Western 
path to democracy? But it would be premature to 
venture into these speculations without first dis- 
cussing the conditions of democracy. 

The conditions. Conditions are usually divided 
into necessary and/or sufficient. But we really know 
so little about the conditions of de mocracy that in 
most cases the best we can do is speak of facili- 
tating conditions. 

Economic development. Recently the trend has 
been to relate the conditions of democracy to a 
given stage of socioeconomic development. For ex- 
ample, Lipset argues that “the more well-to-do a 
nation, the greater the chances that it will sustain 
democracy.” If the hypothesis is tested by the usual 
indexes of economic development, one finds that 
the average wealth, degree of industrialization and 
urbanization, and level of education is indeed much 
higher for the more democratic countries (Lipset 
1960, pp. 45-76; Almond & Coleman 1960, pp. 
538-544). 

It has been pointed out, however, that if we look 
at specific cases rather than at over-all averages 
the correlation between democracy and economic 
development is weak, and that between the great 
extremes of wealth and sheer poverty is a large 
no man’s land where apparently any political sys- 
tem can exist (Eckstein 1961, pp. 38-40). More- 
over, a correlation is not a causal link, And even 
assuming that some sort of causal link does exist 
between welfare and democracy, we may still won- 
der whether a country became democratic because 
it was prosperous, or prosperous because it was 
democratic. If we abide by the low standard of 
initial democracy, then England surely became 2 
democracy, i.e., a constitutional government, before 
the advent of industrialization, prosperity, and lit- 
eracy, On the other hand, if we judge by the ad- 
vanced type, it is fairly obvious that there can be 
no equalization of wealth until a people becomes 
wealthy. It would seem, therefore, that economic 
growth is a condition for the growth of democracy 
—not for its establishment [see ECONOMIC GROWTH, 
article on NONECONOMIC ASPECTS]. ae 

Two more specific caveats are also in order. mee , 
much of the available evidence is statistically 
biased, in the sense that our findings are narrowly 
confined to the variables that are susceptible R 
quantification, and measurability is not a e 
of relevance. In this respect the objection could 
that indexes of economic development toin 
meaningfully only with the temperature of po 
Granted that prosperity is likely to moderate ee 
tensions of the class struggle and the intensity 


ideology, “cool” politics facilitates any regime, and 
therefore prosperity may help to stabilize a dictator- 
ship as much as a democracy. 

The second caution is that much of the available 
statistical evidence is gathered under categories 
that hopelessly lack discrimination. For instance, a 
strong positive correlation has been found between 
stable democracy and degree of literacy (with the 
perplexing exceptions of Germany and France), 
and it is often argued that the most important 
single factor in promoting democracy is the degree 
of education. In itself, however, literacy is only ex- 
posure to communications; and this implies that 
literacy can be conducive to mass manipulation no 
less than to individual self-realization. It appears, 
therefore, that our faith in education rests on the 
hidden premise that what we really mean is “liberal 
education,” the kind that inculcates, among other 
things, liberal and democratic values. The problem 
is, then, whether a merely technological literacy, 
or a type of education that inculcates illiberal 
values, may not promote autocracy. And the statis- 
tical figures collected under the literacy category 
do not make the discrimination that we need most 
[see MODERNIZATION]. 

Intermediate structures. The foregoing reserva- 
tions remind us of the view held by Tocqueville 
(1856), followed by Durkheim and recently re- 
stated, for example, by Kornhauser, that democracy 
presupposes the backbone of an “intermediate struc- 
ture” of independent groups and voluntary associa- 
tions (1959, pp. 76-90). Unquestionably, the 
support of a vital and active “infrastructure” of 
self-governing organisms and institutions is of great 
help. One may say that political macrodemocracy is 
safer and more authentic the more it reflects and 
presupposes an “infrademocracy.” Once again, how- 
ever, one should be wary of considering this a nec- 
essary condition for any stage of democracy. The 
necessary condition (though in no way a sufficient 
one) should be stated in broader and less exacting 
terms, for example, by pointing to the fact that no 
modern democracy has yet succeeded until the 
development of a middle class bridges the gap 
between the populace and the state. 

Leadership. Furthermore, the fact that we have 
developed a keen interest in the socioeconomic pre- 
conditions of democracy should not lead us to 
underplay the strictly political conditions of democ- 
racy—as Aron has recently underlined (Aron et al. 
1960). Since the current emphasis on the pre- 
political requisites of democracy is partly due to a 
research bias (because of research facilities and 
the greater facility of the research), it is all the 
more necessary to stress the importance of leader- 
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ship. Leadership is a disturbing variable in two 
senses: it disturbs the social scientist because of its 
“subjective” elusiveness, and it has a disturbing 
effect on the “objective” data. On the first count 
the social scientist should eliminate it; but on the 
second count he cannot. Prolonged effectiveness 
gives legitimacy to a political system, whereas in 
a modernizing society no legitimacy can withstand 
prolonged ineffectiveness (Lipset 1960, pp. 77- 
90). The effectiveness of democracy depends first 
and foremost on the efficiency and skill of its 
leadership. This becomes even truer the less favor- 
able the objective conditions [see LEADERSHIP]. 

All in all, the conditions of democracy are still 
largely unknown. On the one hand, whenever our 
hypotheses can be tested empirically—as in the 
case of indexes of economic growth—the findings 
are somewhat circular, for we are told that the 
kind of soil that favors democracy is the soil that 
has been cultivated best. On the other hand, when 
we come to examine the specifically political con- 
ditions of democracy, our assumptions remain 
poorly verified. Although this follows from the in- 
trinsic difficulties of the domain of politics, we 
surely could do better by formulating our queries 
with more precision. 

Clearly the conditions of advanced democracy 
are not those of initial democracy, and the problem 
of implementing a political democracy is different 
from the problem of planting it. For instance, an 
open class system, an equalitarian value system, 
and an industrial society are necessary conditions 
neither for the take-off into democracy nor for 
normal democracy; in fact, these so-called condi- 
tions presuppose the successful performance of 
normal democracy and can be viewed, therefore, as 
consequences rather than antecedents. The reward- 
ing query is, then, “which are the conditions for 
each standard?” and, conversely, “which level of 
democratization canno: be attained under given 
conditions?” 

The point is not simply that there is no one 
factor crucially associated with the success of de- 
mocracy; the point is also that the cluster of factors 
has a historical dimension, in the sense that all the 
relevant factors have to be considered in a sequence, 
with reference to their order of succession, their 
tempo, and their timing. It would seem, in fact, 
that “objective” factors are less important in initi- 
ating a democracy than (1) the will of an efficient 
and capable leadership, and (2) regulation of the 
flow of demands in such a way that the political 
system can process it without getting overloaded. 
For what throws a political system off balance— 
and particularly a democracy—is a sudden im- 
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balance between an outburst of expectations and 
the capacity for meeting them. 


Prospects and alternatives 
pec 


The ways of history are not infinite, but they are 
varied. And the prospects for democracy in most 
of the world are related to the search for new solu- 
tions or, better said, to the search for adaptations 
and substitutions, If the question is whether there 
are “alternative forms of democracy,” the reply can 
only be that this kind of new solution has not been 
discovered. But if the question is whether there 
are alternative ways of achieving democracy more 
quickly, this is surely pertinent and vital. In fact, 
the problem of the developing countries is to catch 
up, a matter of speed and short cuts, And there is 
no better evidence that economizing is possible and 
that elimination of steps is feasible than the evi- 
dence provided by the Western experience itself. 

Let it be recalled that for a long time constitu- 
tional lawyers believed that bicameralism was an 
essential safety, and yet there are unicameral sys- 
tems that do just as well. Likewise, we often claim 
that rotation in office is part and parcel of democ- 
racy; but there are political systems in which one 
predominant party competitively gains and keeps 
an absolute majority, thereby providing an instance 
of democracy without governmental turnover. In a 
similar vein we tend to think that the interplay 
between majority and opposition is the keystone of 
a democratic system, and only recently have we 
started to realize that the argument does not apply 
to any kind of opposition—indeed, the institution- 
alization of the opposition may not improve things 
at all, and an irresponsible and purely demagogic 
opposition is likely to wreck any democracy. This 
does not imply that there can be a democracy 
where dissent and contestation are impeded; it does 
suggest, however, that whenever the stakes are too 
high to allow a peaceful transmission of power to 
the opposer, we should explore the possibility of 
subsidiary forms and mechanisms of contest. 

Finally, most people equate democracy with uni- 
versal suffrage; and it is surprising how little atten- 
tion is being paid to the tempo and sequence that 
enable a political system to process the entry into 
politics of hitherto excluded and far-removed 
masses, that is, to cope with the so-called crisis of 
participation. And this in spite of the evidence that 
a sudden, massive enfranchisement is either a 
sham or is likely to shatter any developing democ- 
racy. Universal suffrage, in fact, seems to be the 
one taboo that we are not prepared to break—again 
a confirmation of the extent to which we are fas- 
cinated by the word at the expense of the sub- 


stance. To be sure, “people” means “all the people,” 
and therefore in a literal sense there is no democ- 
racy until “everybody” is given the power to vote, 
But in our macrodemocracies the power of each 
amounts to a powerless fraction of power; and 
therefore the substance of the matter no longer is 
that everybody should be equally entitled to self- 
government (by virtue of his vote), but that as 
many people as possible should not be misgoverned. 
Thus, as long as free elections do occur, the size 
of the electorate matters far less than the essential 
goal, namely, the establishment of a political sys- 
tem that makes a government responsive and 
accountable, 

The foregoing considerations are meant only to 
suggest that we need to cross-examine our dogmas 
and to acquire a fresh vision. In matters of polity 
building, genuine invention is very rare and very 
slow, and the successful innovations have usually 
been accidental. Before speculating about “new 
solutions” we should reduce to a minimum the 
requirements of the solutions that have been tested. 
Democracy, as Woodrow Wilson said, is the most 
difficult form of government. We cannot hope, 
therefore, to export the “complete” Western type. 
On the other hand, it is equally obvious that the 
new states and developing nations cannot pretend 
to start from the level of achievement at which the 
Western democracies have arrived. In fact, no de- 
mocracy would ever have materialized if it had set 
for itself the advanced goals that a number of mod- 
ernizing states currently claim to be pursuing. In a 
world-wide perspective, the problem is to minimize 
arbitrary and tyrannical rule and to maximize a 
pattern of civility rooted in respect and justice for 
each man—in short, to achieve a humane polity. 
Undue haste and overly ambitious goals are likely 


to lead to opposite results. 
GIOVANNI SARTORI 


[See also CONSENSUS; CONSTITUTIONS AND CONSTITU- 
TIONALISM; DELEGATION OF POWERS; ELECTIONS; 
LIBERALISM; PARTIES, POLITICAL; REPRESENTATION: 
Other relevant material may be found in DICTATOR- 
SHIP; POLITICAL THEORY; POWER.] 
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DEPRESSIVE DISORDERS 


The term “depressive disorders” refers to a large 
group of psychiatric conditions whose main char- 
acteristic is an abnormal feeling of depression. This 
classification is broad as well as vague; it implies a 
definition of depression and of abnormal depres- 
sion. Moreover, the word “depression” refers some- 
times to an ordinary emotion, at other times to a 
symptom, and still at other times to a given clinical 
entity. 

This article will discuss depression as a normal 
and as an abnormal emotion, clinical pictures of 
depressions, psychodynamic and sociocultural as- 
pects of depression, and treatment. 


Depression as a normal emotion 


Depression is a common emotion, second only to 
anxiety in psychiatric concern and study. Often de- 
pression is referred to by such terms as sadness, 
melancholy, despondency, dejection, low spirit, and 
anguish. Although practically all human beings 
have at some time experienced depression, this 
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emotional state is among the most difficult to de- 
scribe and to analyze. It is a pervading feeling of 
unpleasantness accompanied by such somatic con- 
ditions as numbness, paresthesias of the skin, alter- 
ations of muscle tone, and decreases in respiration, 
pulsation, and perspiration. The head of the de- 
pressed person has the tendency to bend; the legs 
flex; the trunk tilts forward. The face assumes a 
special expression because of increased wrinkles 
and decreased mimic play. There is also retardation 
of movements, rigidity in thinking, and a general 
feeling of weakness. 

It is, however, at the mental rather than the 
somatic level that depression has. more specific 
characteristics. Whereas anxiety is characterized by 
an expectancy of danger, depression is accom- 
panied by a feeling that the dangerous event has 
already occurred, that the loss has already been 
sustained. For instance, the loved person has al- 
ready died, the good position has been lost, the 
business venture has failed, a self-concept or an 
ideal can no longer be maintained. There is also a 
more or less marked sense of despair, of unpleasant 
finality, as the lost object is considered irreplace- 
able. The loss is deemed to have repercussions on 
the present as well as the future. From these ex- 
amples it is evident that, at least in conditions con- 
sidered normal, depression seems to be a reaction 
to psychological processes, such as evaluations and 
appraisals, that have occurred at a cognitive level. 
The person who becomes depressed must have been 
able to understand and to assess the significance of 
certain events. Thus, cognitive and, as we shall see 
later in detail, social factors (inasmuch as the lat- 
ter enter into complicated cognitive processes) are 
very important in the engendering of depression. 

Such considerations would make depression a 
uniquely human emotion, different from more 
primitive emotions such as fear, rage, and some 
forms of anxiety that occur also in much lower 
animals. This point, of course, is debatable. Inas- 
much as animals cannot verbalize their feelings, it 
is an open question whether or not the belief that 
animals become depressed is an anthropomorphiza- 
tion. A dog, for instance, may appear depressed 
when his master is away, but it is doubtful that 
this is either a feeling involving the future or a 
sense of loss that transcends the immediate dis- 
comfort. A feeling of deprivation rather than de- 
spair seems to be involved here. On the other hand, 
these uncomfortable feelings of deprivation, which 
even infrahuman animals are capable of experi- 
encing, may be the precursors of human depression. 

According to Arieti (1959a), depression may 
have functional significance. If depression had 


negative survival value, species capable of experi- 
encing it probably would not have procreated them- 
selves in the course of evolution. Other emotions 
have survival value; anxiety, for instance, is a warn- 
ing of forthcoming danger. As with other unpleas- 
ant sensations and emotions, the function of de- 
pression is to urge the person who experiences it 
to alter his behavior, so that the factor engender- 
ing depression will disappear or be removed. 

For example, an individual hears the news of the 
unexpected death of a person he loved. After he 
has understood and almost instantaneously evalu- 
ated what that death means to him, he experiences 
shock, then sadness. For a few days all thoughts 
connected with the deceased person will bring 
about a painful, almost unbearable feeling. Any 
group of thoughts even remotely connected with 
the dead person will elicit depression. The indi- 
vidual cannot adjust to the idea that the loved per- 
son does not live any more. And, since that person 
was so important to him, many of his thoughts or 
actions will be directly or indirectly connected with 
the dead person and therefore will elicit sad 
reactions. 

Nevertheless, after a certain period of time, that 
individual adjusts to the idea that the person is 
dead. By being unpleasant, the depression seems to 
have a function—its own elimination. It will be 
removed only if the individual is forced to reorgan- 
ize his thinking, to search for new ideas so that he 
can rearrange his life. He must rearrange especially 
those ideas that are connected with the departed, 
so that the departed will no longer be considered 
indispensable. 


Depression as an abnormal emotion 


Depression is deemed abnormal when it is €x- 
cessive relative to the antecedent event or events 
that have elicited it; when it is inappropriate in 
relation to its known cause or precipitating factor; 
when it is a substitution for a more appropriate 
emotion, for instance, when it takes the place of 
hostility or anxiety; or, when it does not seem to 
have been caused by any antecedent factor of 
which the person is aware. 

Both constitutional and psychological atiet 
have been postulated to explain the occurrence 0 
abnormal depressions. The constitutional ae 
have been studied particularly in connection Wi 
the depression that occurs in the course of manic- 
depressive psychosis. Slater (1936), for instance, 
believes that the inheritance of manto-deprer 
psychosis depends on a single dominant autona 
gene. To explain why the incidence of the psycho 
is lower than would be expected from his hypothe 


sis, he states that the dominant gene is a rare one. 
Kraines (1957) believes that the depression, as 
well as the manic state of manic-depressive psy- 
chosis, is due to a dysfunction (either increased or 
decreased function) of the diencephalon. Accord- 
ingly, the experience of depression is a “psychic 
response” to this diencephalic dysfunction. Ele- 
vated values for adrenal and thyroid functions, 
blood pressure, and pulse rate are reported fre- 
quently in depressed patients (Reiss 1954). 

Arieti (1959a) has described what happens 
when the process of reorganizing ideas concomi- 
tant with normal depression fails. In such cases, 
depression, rather than forcing a reorganization of 
ideas, slows the thought processes that are carriers 
of mental pain. There appears to be a teleological 
or adaptational mechanism that seemingly acts 
with the goal of decreasing the quantity of thoughts 
in order to decrease the quantity of suffering. At 
times, the attempt to slow thought processes suc- 
ceeds so well (as in the state called “stupor”) that 
only a few thoughts of a general and vague nature 
remain. These are accompanied by an overpower- 
ing feeling of melancholy. Thus, the slowing down 
of thought processes that have an unpleasant emo- 
tional tone is a self-defeating mechanism and, in 
a vicious circle, aggravates the situation instead of 
alleviating it. 

Often, abnormal depression defensively replaces 
seemingly more appropriate emotions, generally 
anxiety or hostility, The patient appears to be more 
capable of coping with a feeling of depression than 
with anxiety or hostility. 


Clinical pictures of depressions 


Depressive disorders have been known since 
ancient times. Some of the great physicians of an- 
tiquity, like Hippocrates and Aretaeus, wrote about 
the conditions that are now called “psychotic de- 
pressions.” In spite of this long record in the history 
of medicine, no agreement has been reached about 
the various classifications of depressions or the 
nature of the various types. Following is a brief 
account of the different clinical pictures that have 
been recognized. 

Reactive or psychoneurotic depression. Reactive 
or psychoneurotic depressions are characterized by 
relatively mild feelings of despondency, guilt, and 
self-depreciation; mild retardation and reduction of 
spontaneity; and the absence of hypochondrial pre- 
occupations. These depressions seem to occur sub- 
sequent to, or as a reaction to, an unpleasant or 
tragic event in the life of the patient. The patient 
is aware of this connection. Some authors empha- 
size the fact that reactive depressions are not endog- 
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enous, that is, are not engendered by internal 
factors, such as organic or hereditary predisposi- 
tions, but are consequent to external factors. In 
addition, the patient does not accept the depression 
as a way of living and, like all other psychoneurotic 
patients, would like to be relieved of his depressed 
condition. 

Depressed type of manic—depressive psychosis. 
The depressive aspect of manic-depressive psy- 
chosis has generally received the most considera- 
tion. Although it was already recognized by Are- 
taeus, it was described more fully and clarified in 
some aspects by Kraepelin in the sixth edition of 
his Psychiatrie: Lehrbuch fiir Studierende und 
Ärzte, published in 1899 (see Kraepelin 1883). 

This type of depression is characterized by a 
triad of symptoms: a pervading feeling of melan- 
cholia, disordered thought processes characterized 
by retardation and unusual content, and psycho- 
motor retardation. In addition, there are, less 
typically and less frequently, some somatic dys- 
functions. 

This type of depression occurs much more fre- 
quently in women than in men. In the majority of 
cases it is periodic, not permanent. Attacks of de- 
pression are separated by normal intervals. 

Feeling of melancholia. The onset of the per- 
vading feeling of melancholia is sometimes quite 
acute and dramatic, at other times slow and insidi- 
ous. At times, it is misunderstood or unrecognized 
at first. When an unpleasant event, such as the 
death of a close relative, has occurred, a certain 
amount of depression is justified. However, when 
a certain period of time has elapsed and the de- 
pression normally should have subsided, it becomes 
instead more intense. The patient has an unhappy, 
sad appearance. He looks older than his age, his 
forehead is wrinkled, and his face, although under- 
going very little mimic play, reveals a despondent 
mood. In some cases the main fold of the upper 
eyelid at the edge of its inner third is contracted 
upward and a little backward (sign of Veraguth ). 

The patient is often unable to describe his feel- 
ing of intense unhappiness. He may say that “his 
eyes have consumed all the tears” and “his life is a 
torment.” He experiences a desire to punish him- 
self by destroying himself and, at the same time, 
to end his suffering. Suicidal ideas occur in about 
75 per cent of patients, and actual suicide at- 
tempts are made by at least 10 to 15 per cent. 

Disordered thought processes. The second im- 
portant symptom concerns the content and type of 
thinking. Gloomy, morbid ideas leave little room 
for other thoughts. The patient convinces himself 
that he is hopeless; he has lost or will lose all his 
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money, his family is in extreme poverty, he is in- 
capable of working or even taking care of himself. 
There is no great variety in his thoughts. He gives 
the impression of selecting thoughts that have an 
unpleasant content so that his melancholic mood is 
retained. Quasi-delusional ideas or definite de- 
lusions may occur, often representing distortions 
of the body-image and hypochondriasis. The patient 
thinks, for example, that he has a tumor, syphilis, 
or tuberculosis, that he has lost his bowels, or that 
his brain is melting. The patient is often self- 
accusatory, in a delusional form. He believes that 
he has committed many crimes or that some crimes 
which have been committed by others are, indi- 
rectly, his responsibility. Especially in some West- 
ern subcultures that are permeated by intense re- 
ligiosity, the ideas of guilt, sin, self-condemnation, 
and eternal damnation are prevalent. Hallucina- 
tions are rare in this condition and occur almost 
exclusively at night. 

Psychomotor retardation. The third important 
sign of depression is retarded hypoactivity. The 
actions of the patient are reduced in number and 
are carried out at a slow pace. Talking is reduced 
to a minimum. Employment and household re- 
sponsibilities are ignored, and appearances are neg- 
lected. When depression reaches a state of stupor, 
the patient cannot talk or move at all and may be 
mistaken for a catatonic patient. 

Somatic dysfunctions. Frequent accessory symp- 
toms are reduction in sleep, decrease in appetite, 
loss in weight, marked decrease in sexual desire, 
dryness of the mouth caused by decreased secretion 
of the parotid glands. A frequent subvariety of this 
condition, called “agitated depression,” is character- 
ized by motor restlessness rather than hypoactivity. 

Manic episodes. Kraepelin emphasized that in 
many patients attacks of depression alternate with, 
or occasionally are followed by, other attacks that 
seem to have contrasting characteristics. These are 
the manic attacks, characterized by a state of ela- 
tion or euphoria, a disorder of thought processes 
distinguished by flight of ideas and pleasant con- 
tent, and an increased mobility. Kraepelin thought 
that when the illness manifests itself in a complete 
form, both the depressed and manic attacks occur, 
although in different sequences. Thus he conceived 
both forms of attack as part of a specific clinical 
entity, to which he gave the name “manic—depres- 
sive psychosis,” 

It was soon noticed that the manic attacks are 
much less frequent than the depressive ones, Per- 
haps the difference in frequency is not as marked 
as it seems. It could be, in fact, that mild attacks 
of mania are better tolerated or unrecognized by 
the patient or his relatives, 


The intensity of the depression is not the only 
characteristic differentiating between psychotic and 
reactive depression. In a psychotic depression the 
patient accepts his depression as his way of life and 
feels justified in feeling as he does. 

Psychotic depressions. Many depressions, even 
of psychotic proportions, are never preceded or fol- 
lowed by manic attacks, Many authors are reluctant 
to include such depressions in the manic-depressive 
syndrome. They feel that these depressions prob- 
ably should have a classification unreélated to the 
Kraepelinian psychosis. 

Except for the fact that it is not followed by 
manic attacks, psychotic depression is no different 
from the depression of the manic-depressive 
psychosis, 

Involutional melancholia. Involutional melan- 
cholia (involutional psychosis, melancholic type) 
occurs in women at least three times as frequently 
as in men and is characterized by an intense feel- 
ing of depression that occurs around the “involu- 
tional” age. In women this age generally coincides 
with the menopause. However, involutional melan- 
cholia may precede or follow the menopause even 
by an interval of four to five years. 

Involutional melancholia presents the same 
characteristics as the depression of the manic- 
depressive psychosis, with the following exceptions: 
(1) agitation and restlessness often substitute for, 
or are superimposed on, the hypoactivity, and 
(2) the prepsychotic personality differs from that 
of the manic-depressive. The patient is described 
as a quiet, retiring, worrisome, frugal, and rigid 
individual. 

Kraepelin, who had originally excluded involu- 
tional melancholia from the manic-depressive cat- 
egory, reconsidered his views and finally included 
it in that group. Kraepelin thought that involutional 
melancholia had an unfavorable prognosis. How- 
ever, as some of his contemporaries showed, this is 
not so, Electric shock treatment, devised by Ugo 
Cerletti and Lucio Bini in 1938 (Cerletti 1950), 
seems to be able to cure practically all cases, if by 
cure we mean a return to the prepsychotic level of 
adjustment, whatever it was. s ia 

Other types of depression. Depressions may a ‘A 
occtr as parts of other syndromes or conditions. 
depression occurring in old age is often referred to 
as “senile depression,” and one occurring in schizo- 
phrenia as “schizophrenic depression.” When 2 
combination of schizophrenic and manic-depressive 
features occurs, the syndrome is often calle 
“schizo—affective psychosis.” 7 $ 

A type of decrees that is often described n 
the French and Italian literatures is the one am 
“Cotard’s syndrome,” after the neurologist J. Cotat@ 


who first described it. Cotard’s syndrome is charac- 
terized by a depression accompanied by a delu- 
sional state of negation. Nothing exists; the world 
has disappeared. After the cosmic reality is denied, 
the physical reality of the patient himself is denied. 

Spitz (1946) has described what he calls “ana- 
clitical depression” in infants who have been sep- 
arated from their mothers between the sixth and 
eighth months for an approximate period of three 
months. This type of depression seems more similar 
to a state or sense of deprivation, as it occurs in 
infrahuman animals, than to an emotional state 
that includes a sense of finality and despair. 


Psychodynamic mechanisms of depression 


Many authors feel that even if constitutional- 
hereditary facts are important in the etiology of 
depressive disorders, some experiential factors oc- 
curing in the early life of the patient have started 
psychodynamic patterns that have finally deter- 
mined these conditions. Psychoanalytic studies have 
contributed greatly to the psychodynamic under- 
standing of depressions. These psychodynamic 
mechanisms apply primarily to the depressions 
labeled as the depressed type of manic-depressive 
psychosis, psychotic depression, and involutional 
melancholia. It was Freud's pupil Abraham (1912), 
not Freud, who introduced manic-depressive psy- 
chosis to psychoanalytic conceptions through a 
comparison of abnormal depression with normal 
grief. Both conditions are at times elicited by a loss 
that the person has suffered; but, whereas the 
normal mourner is concerned about the lost person, 
the depressed patient is disturbed by guilt feelings. 
The unconscious hostility that he had for the lost 
object is now directed toward himself. Abraham 
also assumed a regression to an ambivalent pre- 
genital stage of object-relationship, that is, a return 
to an anal-sadistic stage. 

Freud (1917) accepted Abraham’s idea that 
there is a relation between mourning and melan- 
cholia and pointed out that whereas in mourning 
the object was lost because of death, in melancholia 
there was an internal loss because the lost person 
had been incorporated (or introjected). The sadism 
present in the ambivalent relationship is then 
directed against the incorporated love-object. 

Radó (1928) suggested that melancholia repre- 
sents a desperate search for love. The ego tries to 
punish itself in order to prevent parental punish- 
ment. 

According to Melanie Klein (1948), a “depressive 
position” is a normal event in the life of every child, 
generally at the time of weaning, that is, around 
the age of six months. The mother who before was 
seen by the child as two persons (one good and one 
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bad), according to Klein, is at this time seen as one 
person; the mother continues, however, to be in- 
ternalized as a good or bad object. The child is 
afraid that his instinctual aggressive impulses will 
destroy the good object, and he interprets the loss 
of his mother’s breast and milk at the time of wean- 
ing as the result of the destructive impulses, The 
inability to solve adequately this “depressive posi- 
tion” may, according to Klein, Jead later to a path- 
ological depression. 

Arieti (1959a) found the following mechanisms 
in the cases that came under his attention, Very 
early in the life of people who are prone to become 
psychotically depressed there is a period of intense 
gratification of needs, The mother or mother- 
substitute is motivated by a sense of duty to be as 
lavish as possible in her care and affection. This 
attitude predisposes the child to be very receptive 
to others, to introject the others, and later to be an 
extrovert and a conformist. 

At a later period but still in early childhood, gen- 
erally during the second or third year of life, 
although in some cases even much earlier, the fam- 
ily situation undergoes a drastic change. The mother 
now takes care of the child considerably less than 
before and makes many demands on him. This 
change in the mother’s attitude may be due to the 
fact that in the meantime another sibling has been 
born, and the mother is now lavishing her care on 
the newborn with the same duty-bound generosity 
that she previously had for the patient. More often, 
however, the reason for the change is to be found 
in the personality of the mother, who believes that 
when a child is an infant, he has to be fully taken 
care of, but as soon as he shows the first signs of 
an independent personality, he should start to be 
given increasing responsibilities, a sense of duty, 
of obligation, and the like. No transitional stages 
are allowed. The child who experiences a drastic 
change from an atmosphere of receptivity from the 
others to an atmosphere of expectation by the 
others may undergo a trauma. He tries to find solu- 
tions or pseudosolutions. Generally he adopts one 
of the two following mechanisms, which he will 
repeat throughout his adult life, The first is an at- 
tempt to make himself even more babyish, more 
dependent, and aggressively dependent, so that the 
mother, or the important adult who later takes her 
place symbolically, will be forced to re-establish an 
atmosphere of babyhood and of early bliss. Another 
mechanism consists in trying to live up to the ex- 
pectations of the mother, no matter how high the 
price, how heavy the burden. Love or the early bliss 
will be recaptured only by complying and working 
hard. If love is not obtained, the patient is at fault; 
he must atone or work harder. 
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The patient finds out later in life that these mech- 
anisms do not work. No matter how aggressively 
dependent the individual becomes, he does not re- 
capture the early bliss. No matter how compliant 
and hard working he is, he does not obtain what he 
wants. This realization brings about the depressed 
feeling. The actual manifest symptomatology of 
depression occurs when a symbolic reproduction of 
the early trauma takes place later in life. A loss 
has been sustained and the patient feels that his 
way of living has caused such a loss. The loss may 
be the death of a person important to the patient, 
the realization by the patient that an important 
interpersonal relationship has failed ( generally 
with the spouse), or a severe disappointment in a 
relationship to an institution or work to which the 
patient has devoted his whole life. At times the loss 
is more difficult to evaluate; it may concern such 
abstract concepts as ideals, re-evaluation of the 
meaning of one’s life, and personal significance, 

Claiming and self-blaming depression. The psy- 
chological symptomatology of psychotic depressions 
has been divided by Arieti (1962) into two main 
types: the claiming depression and the self-blaming 
depression. 

In the claiming depression, which has recently 
become more common at least in the United States, 
the symptomatology, although fundamentally one 
of depression, seems to be a gigantic claim. All the 
symptoms seem to have a message; “Help me; pity 
me. It is in your power to relieve me. If I suffer, it 
is because you don't relieve me of this suffering.” 

In the self-blaming type of depression the main 
emphasis is on self-accusation and self-depreciation. 
The patient feels he does not deserve help from 
others. 

When the prepsychotic personality is of the 
aggressive-dependent type, the psychosis tends to 
be the claiming type of depression. When the pre- 
psychotic personality is of the duty-bound, hard- 
worker type, the psychosis tends to be the self- 
blaming type. 

The suicide attempt (often successful) made by 
patients affected by a psychotic depression has 
been the object of various psychodynamic interpre- 
tations. Several possibilities must be considered. 
(1) The suicide attempt may actually be a cry 
for help. The patient wants to convey this message 
to an important person in his life: “You have 
the power to prevent my death, if you wish to.” 
(2) The patient wants relief from suffering. 
(3) The patient wants to punish himself, since he 
feels guilt-ridden and deserving of drastic punish- 
ment. (4) The patient does not really want to kill 
himself but the detested person whom he has in- 
corporated. The last explanation is the one origi- 


nally proposed by Freud (1917) and the one still 
accepted by the orthodox psychoanalytic school, It 
is actually based on little clinical evidence and is 
not unanimously accepted by other schools. 


Sociocultural factors in depressions 


Sociological and cultural factors may be im- 
portant in facilitating those conditions which 
predispose to depression, especially those of the 
manic-depressive type. Some psychiatrists (Bellak 
et al. 1952; Arieti 1959b) have noted a decline in 
incidence of this psychosis. For instance, in a 
period of 20 years (from 1928 to 1947), the per- 
centage of first admissions to hospitals of manic- 
depressive psychosis in New York State was 
reduced from 13.5 per cent of all admissions to 
3.8 per cent. Relevant information has been gath- 
ered in other parts of the world. Gold (1951) has 
found a relatively higher incidence of manic- 
depressive than of schizophrenic psychoses in the 
lands of the Mediterranean Basin, as well as in 
Ireland. He reports that in Oriental countries, espe- 
cially where Hinduism and Buddhism prevail, 
manic-depressive psychosis is less common, but in 
Fiji manic-depressive patients are numerous, Gold 
writes further that whereas in India the incidence 
of manic-depressive psychosis is low and that of 
schizophrenia higher, the reverse is true for the 
Indians who have immigrated to Fiji. 

Recently, however, psychiatrists have noticed, at 
least in the United States, an increase in the num- 
ber of depressive disorders. This increase has not 
yet been subjected to statistical study and seems to 
involve depressions not typically of the manic- 
depressive category. 

In his attempts to understand the decline of 
manic-depressive psychosis Arieti (1959a; 1959b) 
drew upon the concepts of the sociologist Riesman 
(1950). Manic-depressive psychosis occurs more 
frequently in those cultures which Riesman has 
called “inner-directed” and tends to disappear where 
this orientation is also disappearing. It is thus pos- 
sible to formulate the hypothesis that an ier 
directed culture evokes or increases the manic- 
depressive propensity. 

nnenatreest society. In the inner-directed 80 
ciety the mother feels duty-bound and nee 
at the time of the birth of the child is very muc 
concerned with his care, permitting the child Y 
develop strong introjective tendencies. Soon, e 
ever, the mother will start to burden the child wi : 
responsibilities and with a sense of duty and guilt. 
This is often made necessary by the fact that eu 
mother now has to devote herself to another chil h 
The bliss of paradise is lost, and life becomes x 
purgatory. An inner-directed personality develop: 


in the sense that this orientation is implanted early 
in life by the direct and definite influence of the 
significant adults. Family-life and child-rearing 
practices are established that seem to correspond 
to those found in the families of manic-depressive 
patients. 

This hypothesis, suggesting the influence of 
inner-directed society upon manic-depressive psy- 
chosis, receives some validation from the research 
of Eaton and Weil (1955) on the mental health of 
the Hutterites. The Hutterites are a group of people 
of German ancestry who have settled in the Dako- 
tas, Montana, and the prairie provinces of Canada. 
The Hutterite society seems to be an inner-directed 
one. In a population of 8,542 people, Eaton and 
Weil found only 9 persons who sometime in their 
lives suffered from schizophrenia and 39 with 
manic-depressive psychosis; that is, among the 
Hutterites manic-depressive psychosis occurred 
4.33 times more frequently than schizophrenia, 
whereas in the general population of the United 
States the incidence of schizophrenia by far exceeds 
that of manic-depressive psychosis. 


Treatment of depressions 


Depressive disorders are generally treated with 
three types of therapy. (1) Psychotherapy (gen- 
erally psychoanalytically oriented in the United 
States; in Europe, often existentially oriented) is 
generally administered to patients suffering from 
the reactive depressions, the various types of psy- 
chotic depressions provided they are of moderate 
intensity, and patients who are experiencing a 
symptom-free interval between psychotic attacks. 
(2) Drug therapy is employed in the treatment 
of moderately severe depressions. “Psychoanaleptic 
drugs,” such as imipramine hydrochloride, and 
the large group of monoamine oxidase inhibi- 
tors are among those used. (3) Electric shock 
therapy, commonly used prior to the advent of 
drug therapy, is now reserved for the most recal- 
citrant and stuporous cases. It is still the most fre- 
quently used treatment in involutional melancholia. 
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[Other relevant material may be found in ELECTRO- 
CONVULSIVE SHOCK; MENTAL DISORDERS; MENTAL 
DISORDERS, TREATMENT OF; PSYCHOANALYSIS; Psy- 
cHosts; and the biographies of ABRAHAM; FREUD; 
KLEIN; KRAEPELIN.] 
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René Descartes (1596-1650) was born at La 
Haye, in Touraine, near Poitou, into a family of 
merchants and lower civil servants aspiring to the 
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petite noblesse. A second son, he was put to school 
with the Jesuits at the newly founded Collége 
Royale of La Fléche where, until he was sixteen, he 
studied, first, the trivium, then “philosophy’—that 
is, logic, physics, and metaphysics. Later in the 
curriculum, “morality” was added to logic and 
mathematics to physics; in mathematics his talent 
was at once evident. Two aspects of his education 
are notable: he received a thoroughly “modern” 
intellectual training, in which bodily fitness was 
valued, and Galileo's physics was early and nat- 
urally absorbed into Jesuit teaching. 

Between 1612, when he left school, and 1618, 
when he took up the profession of arms, the events 
of Descartes's life are unclear. He was for a time 
in Paris and may then have come into contact with 
the scientific group around the Minim Father, 
Marin Mersenne, at the Place Royale. After some 
hesitation he evidently decided on a military career. 
As a soldier of fortune he took part in the Thirty 
Years’ War, fighting with the Dutch, who in spite 
of their Protestantism allied with France against 
the Hapsburgs. During the night of November 10, 
1619, Descartes had his famous dream of a univer- 
sal and unified science; in his German poéle (“stove- 
heated room") that winter he worked out his 
“rules for thinking,” finally published in 1637 as 
the Discourse on Method. 

From 1620 until his death, Descartes led a life 
with neither a fixed profession nor a fixed abode. 
He was, simply, a thinker; his inherited income, 
his books, and his occasional patrons sufficed for 
his needs. He traveled through Europe, including 
Italy, before 1628, making Paris (and in Paris, 
Mersenne) his center; thereafter, until 1649, he 
lived chiefly in Holland, the first of a stream of 
immigrant visitors to profit from the freedom and 
stimulus of that country. 

Descartes’s fellowship was varied—the scientist 
Isaac Beeckman in Breda, who like him was deeply 
concerned with establishing a method for the 
new sciences; Claudius Salmasius, a French theolo- 
gian at Leiden; the mathematicians Hortensius at 
Amsterdam and Golius and Van Schooten at Leiden; 
among doctors, the anatomist Plempius, with whom 
he studied; Plempius’ disciples Heereboord and Hey- 
danus, responsible for the introduction of “Carte- 
sianism” into the academies; Etienne de Courcelles, 
who translated the Discourse into Latin; the poly- 
math Constantijn Huygens, secretary to the prince 
of Orange (Descartes played a part in the education 
of his scientific son, Christiaan Huygens). His cor- 
respondence was enormous, and he maintained his 
connections with French scientists, He lived in 
Deventer, Leiden, Egmond, and Amsterdam, mak- 
ing several trips to France; in 1649 in the service 


of Queen Christina, he accepted an appointment 
to Stockholm, where he died in 1650. 

Descartes’s published work shows great consist- 
ency and care. In 1629 he was busy with a treatise 
in metaphysics, later expanded and published as 
the Meditations in 1641. From 1629 to 1633 he 
worked on his cosmological study, Le monde 
(1664), a book in two sections, one ostensibly on 
light (or action at a distance), the other on man; 
but, as he frankly reported in the Discourse of 
1637, he was alarmed by the condemnation of 
Galileo's theories and did not publish the work (it 
was posthumously published in 1662 and 1664). 
At this point he evidently turned back to his dream 
and contemplations on method; in 1637 his ver- 
nacular Discourse appeared, written for ordinary 
thinking people, followed by long essays exemplify- 
ing its application to optics, meteorology, and geom- 
etry. Perhaps because of his nervousness over his 
discontinued cosmological speculation, Descartes 
made special efforts to secure approval for his 
book, sending a copy to the Jesuit Père Noel, rector 
of La Fléche, and seeking approval of the book, 
through Mersenne, from the French chancellor 
Pierre Séguier, 

The Discourse was influential in many areas, 
from literary explication de texte to metaphysics. 
Appropriately, in a work in which human certainty 
is reduced to man’s consciousness of his own think- 
ing processes, Descartes described the etiology of 
the treatise itself—what sort of an instrument he 
himself was, how he was trained, where and under 
what conditions he got his idea of universal meth- 
od, how he tested his method, how he came to write 
it. Throughout Descartes’s work there are patches 
of autobiography, presented not as self-indulgence 
but as data relevant to the substance presented. 
Although the Discourse records his approval of his 
own training, it nonetheless, with its appendages, 
revised (or reformed) the Jesuits’ philosophical 
curriculum, as it did the teaching of other groups. 
Logic, physics, and metaphysics all suffered reor- 
ganization at Descartes’s hands, as did mathe- 
matics and morality (the last subject more fully 
developed with the appearance of Les passions de 
l'âme in 1649). In 1641 his Meditations came 
out, expansions both of the Discourse and his 
early metaphysical speculations and, in 1644, his 
Principia philosophiae. Though he was soon em- 
broiled in methodological and theological polemt 
particularly with Dutch Reformed divines, hs 
social success was enhanced by a ages 
friendship with the bluestocking Princess Elizabe a 
daughter of the dispossessed elector palatine, 
well as by the patronage of Queen Christina. i: 

Descartes made substantial contributions 


mathematics, physics, and psychology. In his own 
terms, his work in geometry and optics has re- 
mained viable: Cartesian geometry is still a legiti- 
mate entry to that discipline; his description of the 
operation of the eye, though since greatly refined, 
is the “correct” one. His “method,” important for 
its influence in the social as well as in the natural 
sciences, has the following simple rules: (1) to 
accept nothing as true not clearly recognizable as 
such; (2) to divide up each difficulty into as many 
parts as possible; (3) to carry on reflections in due 
order, from the simplest part to the most complex 
combination; and (4) to make regular enumera- 
tions and reviews, in order to be certain of having 
made no omissions. 

These regulations, striking for their common 
sense, involve two major presuppositions: the first, 
that ideas, fixed and reliable as numbers and 
mathematical ratios, build and follow as inexorably 
and securely upon one another as do mathematical 
processes; the second, that the proposition cogito, 
ergo sum (“I think, therefore I am”) is radically 
true. Another way of stating the first is that Des- 
cartes believed in the truth of mathematics, and 
therefore used it as a model for his thinking about 
thinking, or that he was a thorough rationalist. 
Another way of stating the second is that Descartes 
used skepticism to establish an ontology based— 
again, in sharp break with the philosophical tra- 
dition—upon the selfs irreducible recognition of 
the thinking self. From that conviction and by 
means of his own method, he constructed his (re- 
markably arrogant) ontological proof of God and 
his definitions of the existence of the material 
world and the laws of its nature. 

Accompanying the crucial importance he attrib- 
uted to mind is his notion of its separation from 
body, or matter; the sharp distinction he drew be- 
tween mind and body resulted in part from his 
attempt to relate Galileo's mechanist physics—for 
him, in spite of the ban unassailably true—to the 
immaterial process of rationality and, through that 
process, to the immaterial deity in whose existence 
he believed. One result of that radical separation 
of mind and matter is his extremely abstract con- 
ception of deity (justifiably criticized as deist rather 
than Christian); another is his mechanist psy- 
chology. Animals, indeed, he concluded to be ma- 
chines; man, endowed with soul and mind, is there- 
by no mere mechanism. Les passions de l'âme deals 
with psychological manifestations as functionally 
as possible, translating inward “passion” into out- 
ward “action” and discussing the sensible appear- 
ances of behavior rather than the invisibilia of 
man’s inner life. Insofar as he could, relying 
heavily upon Harvey's work and his own ana- 
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tomical experience, Descartes provided a physio- 
logical explanation of behavior, one example of 
which (unfairly ridiculed) is his location of the 
soul in the pineal gland. Behavior was discussed in 
the “causal” terms of external stimuli and the pas- 
sions classified as predictable reactions to certain 
situations and stimuli (e.g., wonder, desire, re- 
morse, love and hatred, pride and shame). He was 
interested in such physiological manifestations of 
the passions as tears and flight. Psychology thus 
became a branch of “physics,” or physiology, a hy- 
pothesis with great influence upon subsequent psy- 
chological experimentation. The Cartesian beast- 
machine became the model for animal study in the 
eighteenth century; inevitably, in spite of Des- 
cartes’s precautions against such a step, La Mettrie 
presented the argument, firmly based in the Car- 
tesian psychology, that man too is simply machine. 
In his effort to free thought and thinking from the 
debilitating influence of the mutable body and to 
exempt it, theoretically at least, from skewing by 
the passions, Descartes split inward life into two 
quite different and unrelated components, the 
rational and the emotional. 

RosALIE L. COLIE 


[Other relevant material may be found in REASONING 
AND LOGIC; THINKING; VISION; and in the biogra- 
phies of DURKHEIM and KANT.] 
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DETERRENCE 


As the release and utilization of energy from 
atomic nuclei have challenged and intrigued phys- 
ical scientists, so behavioral and social scientists 
have been intrigued and challenged by the intrusion 
of this energy into the domain of their concern. 
Awareness of the potential destructiveness of 
Weapons employing nuclear energy has prompted 
extensive consideration of their impact on inter- 
national relations. 

In its contemporary usage, the concept “deter- 
rence” refers to hypothesized effects of nuclear 
weapons technology on the set of alternatives from 
which national policy makers choose their courses 
of action, By extension, these “effects” affect the 
conduct of international relations, 

The set of alternatives from which national 
leaders make their choices has traditionally in- 


cluded among its elements the “resort to war” as an 
instrument of national policy; the hypothesized 
effects of nuclear weapons are relevant to this 
subset of foreign policy alternatives, 

“Deterrence” refers to the attempt by decision 
makers in one nation or group of nations to re- 
Structure the set of alternatives available to de- 
cision makers in another nation or group of nations 
by posing a threat to their key values. The restruc- 
turing is an attempt to exclude armed aggression 
(resort to war), from consideration. 

The fundamental deterrence hypothesis is: If 
the threat to values is sufficiently large, the exclu- 
sion of armed aggression from consideration is 
probable. 

This hypothesis, taken together with a subsidiary 
hypothesis—nuclear weapons pose a sufficiently 
large threat to values—yields the deduction: Nu- 
clear weapons make probable the rejection of 
armed aggression as a potential policy alternative. 

The ubiquity of this syllogism in the deterrence 
literature (Brody 1960; Lefever 1962, pp. 313-332; 
Halperin 1963, pp. 133-184) argues for the ex- 
amination of the postulates on which it is founded. 
Three central assumptions appear to underlie this 
syllogism; they may be classified under the follow- 
ing themes: (1) the rationality of decision makers; 
(2) the unidimensionality of threat and of re- 
Sponse to threats; and (3) the constancy of sets of 
Policy alternatives (see Deutsch 1963, pp. 71-72, 
for a critique of a somewhat different set of deter- 
rence theory assumptions). T 

The assumption of rationality. The rationality 
of decision makers posited in the deterrence litera- 
ture is both a norm against which behavior can be 
evaluated and a prescriptive guide to choice. The 
rational decision maker, in deterrence theory, is 
presumed to avoid the resort to war in those situa- 
tions in which the cost anticipated from aggression 
is greater than the gain expected from such an 
action. $ 

This notion of rationality concentrates on avoid- 
ance behavior; it predicts the rejection of alterna- 
tives where cost exceeds gain. Empirical studies on 
foreign policy decision making are not abundant, 
but those that have been done raise doubts about 
this characterization of the policy process (Snyder 
& Paige 1958; Holsti 1962). In the light of research 
on domestic and foreign policy making, this posi 
late of deterrence theory is of dubious validity. x 

Conceptions of threat. The fundamental i. 
rence syllogism contains implicit assumptions abo 
the nature of threat and about the relationship © 
threat to deterrence. È f 

Threat is presumed to be a simple function 0 


destructive capacity; the greater the destructive 
capability, the greater the threat. A more complex 
view of threat (for example, Singer 1958) calls 
this conception into question unless “destructive 
capacity” is construed very broadly—beyond num- 
bers of weapons, warhead size (yield), and 
accuracy. 

Experiments on the relationship of power to 
threat indicate that the ambiguity (or, conversely, 
the clarity) of the threat is closely related to the 
amount of threat experienced (Cohen 1959). 

It has been suggested elsewhere (Brody 1963, 
pp. 696-697) that the threatening interaction can 
be conceived of as being determined by three fac- 
tors: (1) perception by one party of the other 
party's hostile intentions; (2) perception of the 
other party's capability of inflicting damage to the 
perceiver; and (3) the credibility of the other 
party's declaratory policy. If threat inhered only in 
capability (and we ignored hostility and credi- 
bility), we would conclude that adversaries and 
allies are equally threatened by the same weapons 
systems, 

Complex conceptions of threat call into question 
not only the proposition that threat varies uni- 
formly with destructive power but also many of the 
assumptions about the relationship of threat to de- 
terrence, The literature is replete with suggestions 
that the effectiveness of deterrence is a direct func- 
tion of the amount of destructive capability. Wohl- 
stetter (1959) has refined this conception by point- 
ing out that deterrence is more properly conceived 
of as a function of the amount of capability poten- 
tially remaining after an attack has been absorbed. 
Both of these conceptions assume that the threat- 
ened decision maker will be “deterred,” that is, that 
he will abandon even the consideration of the 
course of action that worried the threatener. 

“Credibility” has received attention from deter- 
rence theorists (Kaufmann 1956, pp. 12-38), but 
as a quality of weapons rather than of relations 
among nations. However, the possibility of an ag- 
gressive response to a credible threat receives much 
less attention (for example, Kahn 1963). Psycholo- 
gists, on the other hand, have often found aggres- 
sive responses to threat (for example, McNeil 
1959). This suggests at least the potential for such 
responses in threatening international situations. 

Milburn, drawing on psychological research, pro- 
poses that we characterize threats on a number of 
dimensions (for example, symbolic—concrete and 
clear-vague); however, the relationship between 
types of threats, characterized on these dimensions, 
and the inhibition of behavior (deterrence) or ex- 
citation of behavior (provocation) is without em- 
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pirical research, despite the salience of such knowl- 
edge for deterrence theory. 

Availability of policy alternatives. Implicit also 
in much of the literature on deterrence is the as- 
sumption that alternatives to the resort to war are 
available to and perceived by decision makers, 
irrespective of the international situation. 

The validity of this proposition is difficult to 
establish: the historical record is rich with ex- 
amples of national leaders who, in intense crises, 
“saw no way out” and for whom war became the 
only viable alternative. On the other hand, despite 
extremely intense crises, decision makers in the 
nuclear-armed nations apparently have not felt 
that their policy alternatives were reduced to 
one—war. 

Contradictory hypotheses can be entertained to 
resolve this paradox: Because of their destructive- 
ness, nuclear weapons may have transformed the 
international system, thus making the prenuclear 
historical examples simply inapplicable. However, 
it is equally plausible that the international system 
has moved onto a plateau of high intensity (McClel- 
land 1961)—the cold war—and that increases in 
tension (relative to this base), comparable to his- 
torical crises, have not yet been experienced by 
leaders in one nuclear nation when confronted by 
another. Which hypothesis will stand the test of 
research cannot be forejudged, but an answer is 
needed to strengthen the conceptual foundations 
of deterrence theory. 

Deterrence strategies. Despite the lack of con- 
firmation of the premises on which elemental de- 
terrence theory is based, there is widespread a 
priori acceptance of them. The dialogue, which has 
produced a voluminous literature, has largely been 
concerned with how best to deter; thus, the debate 
has been primarily strategic rather than social 
scientific. 

An important impetus to this debate has been 
the rapidly changing weapons technology; each 
new capability has found spokesmen arguing the 
necessity of including it in the considerations af- 
fecting strategic thought. Technology has tended 
to lead strategy, bringing an evanescent quality to 
the deterrence literature. 

The debate among deterrence strategists can be 
characterized by examining the various positions 
along three dimensions: (1) the mission of the 
deterrent; (2) the means by which deterrence is to 
be accomplished; and (3) the values threatened 
and to be threatened. 

There is little disagreement that a goal of foreign 
policy is the dissuasion of other nations from com- 
mitting aggression of any kind—from strategic 
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nuclear attack to guerrilla warfare. Debate arises 
over how much of this spectrum can be deterred 
with nuclear weapons. 

Glenn Snyder (1961) provides a useful distinc- 
tion between the missions of denying access to ter- 
ritory and of punishing aggression (that is, of re- 
taliation); weapons appropriate to one mission may 
not be suited to the other. Thus, for example, the 
mission of denying western Europe to Soviet forces, 
it is argued, requires different weapons than does 
the mission of retaliating against the Soviet home- 
land, should the Soviet leaders choose to attack. 
The counter to this argument, stemming from the 
strategic doctrines of Douhet (Brodie 1959), asserts 
that the massive threat inherent in nuclear retalia- 
tory capability will deter tactical as well as strategic 
aggression. 

Those who argue for specific weapons for specific 
missions—the strategy usually called “graduated 
deterrence”—point to the ineffectiveness of stra- 
tegic bombing as a deterrent in World War r 
(Blackett 1962) and to the reduced credibility of 
an all-purpose deterrent after the Soviet Union 
emerged as a nuclear power. 

This debate overlaps the dialogue about the 
means by which deterrence can best be accom- 
plished. The Strategists who advocate graduated 
deterrence generally argue for the limitation of 
Strategic capability at the minimum needed to 
deter ( Morgenstern 1959). The logic of this posi- 
tion relies heavily on the invulnerability to attack 
of individual units of the deterrent force. The 
critics of minimum deterrence point out, however, 
that the continuation of this invulnerability in view 
of the growth of weapons technology is too uncer- 
tain for so fundamental an element of policy (Kahn 
1960; 1963). The advocates of minimum deter- 
rence argue that sustained effort in producing 
weapons beyond the minimum (in an attempt to 
guarantee at least “statistical invulnerability” of 
the force as a whole) is itself a stimulus to the 
search for countermeasures and to the uncon- 
trolled stockpiling of arms, that is, to arms races, 

Proponents of graduated deterrence argue that 
the absence of military capability to counter a par- 
ticular lower-level or nonnuclear threat, for ex- 
ample, insufficient conventional forces to deny ter- 
ritory in western Europe to the Soviet Army, creates 
an unstable situation fraught with the danger of 
escalation to strategic nuclear war. It is in escala- 
tion, that is, in resort to weapons with much greater 
destructive Capacity than the weapons employed 
by the attacking side, that deterrence theorists see 
the greatest likelihood of limited wars becoming 
general nuclear wars (Schelling 1960; Halperin 


1963). Moreover, the expansion of limited conflicts 
is often seen as the most probable cause of general 
war. 

Concentration on the deterrence of general war 
has led to a debate about what is the best object of 
threat: Is the threatened destruction of cities more 
effective than the threatened destruction of military 
targets in precluding resort to war? 

The dual nature of weapons—they can be used 
to fight as well as to threaten—has brought an 
element of confusion to the debate. If a war is to 
be fought, response weapons capable of destroying 
military capability may be most effective in limit- 
ing damage to targets of high value to the respond- 
ing nation (Kahn 1963). On the other hand, it is 
contended that producing this amount of weaponry 
(because there are many more well-defended mili- 
tary targets than civilian targets) could create an 
unstable arms race that would make war more 
likely, 

The advocates of minimum deterrence have 
tended to opt for targeting population and indus- 
trial centers; the opposition to this point of view 
has tended to argue for preserving the option of 
retaliating against any target, civilian or military. 
The second policy seems to be characteristic of 
United States military posture; the first seems to 
have been adopted by the Soviet Union. 

Deterrence and international relations. The 
lively debates among deterrence strategists has had 
a salutary effect on the field of international rela- 
tions. Social scientists have expressed uncertainty 
about the validity of the principles on which deter- 
rence theory rests and some skepticism about asser- 
tions concerning the way nations act and are likely 
to act; from this doubt and questioning, profes- 
sional students of international politics have begun 
to think of the relationship of military technology 
to foreign policy as a significant area of research. 
Moreover, the subject matter has brought scholars 
from all the social sciences together and has forced 
interdisciplinary communication. P 

The strategic debates will continue as sien | 
technology continues to change, but the appt 
foundations laid by the work of concerned poci 
scientists are the beginnings of the renee 
of military strategy into the study of internation r 
relations; in this context we can expect the patien 
development of an empirical theory of deterrence. 

RICHARD A. BRODY 


[See also NUCLEAR WAR; STRATEGY.] 
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DEVELOPMENT BANKS 


The label “development bank” attaches to a group 
of national financial institutions, in developing 
countries, that vary widely in size, source of funds, 
and scope of activity. Essentially, a development 
bank is a vehicle for mobilizing and channeling 
medium- and long-term capital into the productive 
sector of the economy; most are also a source of 
entrepreneurship and technical assistance, Some 
development banks create and finance public enter- 
prises or plan or carry out national development 
programs, but these are the exception; typically, 
the term “development bank” describes an institu- 
tion created to encourage the growth of the private 
sector, Although limitations of funds and of quali- 
fied personnel have led countries in an early stage 
of development to entrust to their development 
banks responsibilities in several fields—agriculture, 
industry, housing, mining—emphasis on a single 
field is more usual. Among these specialist institu- 
tions the greater and the growing number are con- 
cerned with industry, and it is these that best 
illustrate the possible range of a development 
bank’s operations. 

Among industrial financing institutions may be 
found not only public banks but a number of banks 
that are privately owned, wholly or predominantly. 
Banks that extend agricultural credit or are actively 
engaged in promotional work, however, are nor- 
mally government owned, These activities are rel- 
atively risky, expensive, not immediately revenue 
producing, and hold little appeal for private in- 
vestors. In countries with a low level of domestic 
savings and limited access to external sources of 
finance, any private participation in a development 
bank is likely to be no more than token, at least 
initially. Sometimes even in countries with a rel- 
atively high savings ratio, when government policy 
has called for support of the private industrial 
sector, the government will choose to create a 
public development bank as a vehicle for its assist- 
ance. In any event, a benevolent government inter- 
est is essential to a development bank’s success, 
and most banks function in the context of a na- 
tional development plan, 

The antecedents of the modern development 
bank can be traced back some hundred years to 
industrial financing institutions in Europe and the 
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United States (Diamond 1957, pp. 19-37; Institut 
d'Études Bancaires et Financières 1964, pp. 49- 
66). Since the 1950s the number of development 
banks has increased rapidly, as developing coun- 
tries have sought mechanisms to accelerate their 
economic growth and especially to stimulate indus- 
trialization. Tallies vary, depending on definition; 
moreover, they are soon outdated, as new banks 
come into operation; in 1964 there were well over 
one hundred, to be found in all parts of the less- 
developed world. The International Bank for Re- 
construction and Development and its affiliates, 
the International Finance Corporation and the In- 
ternational Development Association, have encour- 
aged the creation of private development banks in 
appropriate circumstances, extending both tech- 
nical assistance in their establishment and oper- 
ation (including recruitment of management and 
training of staff) and financial support (amounting 
to about $300 million by mid-1964) through loans 
or participation in share capital. Assistance has 
also been provided by the Inter-American Develop- 
ment Bank and through the aid programs of indus- 
trialized countries. The United States by mid-1964 
had committed about $1,000 million to various 
types of development banks, The United Kingdom 
and France have a long history of extensive assist- 
ance to such institutions, the former concerned 
with banks in Commonwealth countries, the latter 
with those in former French Africa. Germany and 
Japan have entered the field more recently. The 
development bank has served as a means of retail- 
ing funds—often foreign exchange—to enterprises 
too small to be financed directly by these public 
sources, 

Development banks are intended to fill a gap 
primarily, though not exclusively, financial. Com- 
mercial banks, traditionally, have not made avail- 
able investment capital; they generally provide 
short-term money only, frequently at a high rate 
of interest, and tend to favor existing enterprises 
and the larger borrowers. Savers in the typical 
developing country prefer investments promising 
a quick return to those offering only the prospect 
of gradual growth; they would rather invest in com- 
mercial ventures or real estate or send funds abroad 
than participate in untried industrial enterprises. 
The financial institutions and capital market mech- 
anisms that in more developed countries serve to 
transfer savings are absent, embryonic, or ineffi- 
cient. Often the business community shows little 
inclination to look for new fields of activity or even 
to act when promising opportunities are called to 
its attention. Finally, the organizational and tech- 
nical competence to launch and sustain a new 
enterprise is frequently lacking or in short supply. 


A development bank appropriately capitalized, 
well managed and staffed, and free to reach invest- 
ment decisions objectively on business and eco- 
nomic grounds can help to overcome these various 
difficulties. It can supply investment capital—loan 
or equity or some intermediate form—from its 
own resources and can recruit capital from other 
sources, domestic and foreign, It can encourage 
and facilitate direct investment in industrial securi- 
ties and help to develop a capital market. It can 
spark industrial initiative, sponsor urgently needed 
new ventures, and provide advice at all stages of a 
project, from planning to operation. 

Each bank is, or should be, designed with a par- 
ticular country’s needs and environment in view. 
In consequence, there is no model for a develop- 
ment bank, and no single description adequately 
fits them all. 

Capitalization. Private development banks are 
organized as corporations, some with several hun- 
dred million dollars of initial capital, some with 
far less. The capitalization of any particular bank 
necessarily reflects a judgment concerning the 
amount that it appears feasible to raise from avail- 
able sources. A second relevant consideration is the 
desirability of launching the bank with resources 
large enough to assure an impact on the economy 
and, assuming the bank is soundly run, earnings 
sufficient to cover operating costs, reserve and tax 
requirements and, before too long, payment of a 
modest dividend, Usually, institutional sharehold- 
ers—commercial banks, insurance companies, in- 
vestment houses—predominate. Many institutions 
in the industrialized countries have invested in 
development banks, primarily to obtain a window 
on events in a country to which they have some 
special tie or in the expectation that the bank will 
ultimately prove beneficial to their own interests. 
Foreign shareholders serve as a channel for the 
managerial skills and modern technology so scarce 
in developing countries and also help the bank to 
resist any undue local political pressure. 

It has proved harder to attract individual share- 
holders, at least in the early stages of a bank’s 
operations; they tend to have a greater concern for 
the probability of an adequate return on their in- 
vestment. A successful development bank can be 
expected to earn moderate returns in time; none 1S 
likely to be highly profitable, Individual investors 
have, however, been attracted by the inducemeni 
of special government assistance in a bank's initi 
capitalization. The most effective device has been 
a long-term loan, interest-free or at low interest, 
subordinate to share capital in the event of the 
bank’s liquidation. Earnings on the no-cost Or Jow- 
cost funds enhance the return on equity; the SU 
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ordination to equity provides shareholders with a 
cushion against losses. Some governments have 
permitted a development bank to manage public 
funds for a fee and without risk, or have extended 
special rediscount or borrowing privileges, granted 
preferential tax treatment for dividends, or guar- 
anteed a minimum dividend, Development-bank 
loan capital has come primarily from public sources 
at home or abroad; a few banks have borrowed 
in the local capital market with a government 
guarantee. 

Operation and policies. Normally, a develop- 
ment bank tries not to compete with other sources 
of funds, particularly the commercial banks. This 
is one reason why most development banks do not 
accept deposits and, in fixing their loan charges, 
take into account not only their own cost of capital 
but also the going rate of interest. Compared with 
commercial banks, the development bank gives 
greater weight in its investment decisions to growth 
potential and quality of management and places 
less emphasis on security and credit standing. As 
a development agency, the development bank is 
mindful of the needs and preferences of applicants 
and the potential benefit of a proposed investment 
to the economy. At the same time, if it is a share- 
holder-owned institution, it cannot ignore the 
source and cost of its own capital and the com- 
position of its investment portfolio and must there- 
fore take profitability into consideration (Boskey 
1959, pp, 49-55). 

Although development banks finance the mod- 
ernization or expansion of established enterprises 
as well as the establishment of new enterprises, 
their characteristic willingness to take risks and to 
combine development and banking criteria in in- 
vestment decisions makes them particularly useful 
to the latter; new enterprises cannot readily borrow 
in the market and have limited earnings to reinvest. 
Equity financing is frequently most appropriate for 
such enterprises: unlike a loan, it imposes no in- 
terest or repayment burden. But the bank is not 
always welcomed as a co-owner. Family-held enter- 
prises are sometimes reluctant to admit outsiders 
to ownership; this may be particularly true where 
the bank is a government institution, Moreover, 
the banks take their own investment portfolios 
into account in determining the nature of their 
financing; they generally limit equity holdings to 
the equivalent of their own share capital plus 
reserves, 

A development bank does not usually extend 
either very small or very large loans. Appraisal, 
administration, and follow-up of small loans is 
costly and time consuming; the loans themselves 
are risky. Unless the bank is a public institution 
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with no pressure to pay dividends or has special 
funds for assistance to small enterprises, it nor- 
mally excludes such enterprises from its clientele 
by setting a minimum on the size of individual 
investments. For different reasons—to diversify 
its portfolio and avoid allocating an unreasonably 
large proportion of resources to a single project— 
the bank usually sets a ceiling on the amount of 
financing to any one client. Thus, development 
banks usually do not cater to very large projects 
either, except as partners or participants with other 
investors (such as the International Finance Cor- 
poration). Normally, a bank will supply no more 
than half of an applicant's financial requirements; 
this mobilizes capital in support of the bank’s con- 
tribution and spreads the impact of its financing. 

There are a number of ways open to a develop- 
ment bank to assist the growth of a capital market 
and to encourage the practice of industrial invest- 
ment, When enterprises financed by the bank have 
“seasoned” and begun to show a profit, the bank 
may sell their securities from its portfolio, This 
adds to the supply of marketable securities in cir- 
culation and broadens industrial ownership and the 
base for future sales; in addition, it releases the 
bank’s resources for new investments, The bank 
may underwrite new securities, normally only 
those issued by enterprises it has itself financed. 
It may offer participations in its investments and 
issue its own obligations. Each of these activities 
in effect invites investors to rely on the bank’s ap- 
praisals and investment decisions. Consequently, 
the extent to which a development bank can be 
successful in these areas depends upon the extent 
to which it has been able to establish a reputation 
for sound business judgment. Finally, the bank 
may be an educator and innovator, introducing or 
making more familiar financial instruments and 
techniques employed in the capital markets of in- 
dustrialized countries. 

Role in stimulating investment. The develop- 
ment bank’s promotional role—the stimulation of 
new industrial investment—is an important one. 
At the same time, since promotion is likely to be 
expensive and unlikely to produce immediate re- 
turns, private banks can afford only limited activity 
of this character unless they have special resources 
for the purpose. The possibilities are varied. 
Through its own staff or contacts available to it, 
the bank can help to translate an idea into a bank- 
able project and a project into an operating enter- 
prise. The bank’s analysis, in the regular course of 
business, of proposals submitted to it may lead to 
suggestions for modifications in the financial, eco- 
nomic, technical, or managerial aspects, render- 
ing the proposal more economic and technically 
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sounder, The bank may find experienced industrial 
partners for new and complex undertakings. It will 
try to bring investment opportunities and private 
capital together. It may conduct a survey of a 
geographic region or an economic sector to find 
promising projects. It may actively sponsor, and 
establish in cooperation with other sources of capi- 
tal and technology, new enterprises to meet specific 
needs of the economy. Once the bank has financed 
an enterprise, it provides continuing guidance to 
its clients on matters of operation and policy. By 
encouraging the adoption of good financial and 
managerial practices, it can help to create and 
maintain a sound industrial sector, 


Although the development bank is a useful and 
versatile instrument of economic development, it 
is only one of many; it cannot by itself overcome 
the host of problems that developing countries en- 
counter on the road to industrialization. Moreover, 
certain conditions are a prerequisite to the success 
of any development bank. Before a bank is created 
the needs it is to meet should be identified, to facili- 
tate formation of an appropriate organization and 
financial design. For a private institution to be 
profitable, there must be within the country enough 
of the basic ingredients of industrialization to 
assure a continuing flow of a fair volume of busi- 
ness. There should be at the outset experienced 
management and a competent and sufficiently 
numerous staff; often this will mean some foreign 
personnel initially, a circumstance which develop- 
ing countries, especially those newly independent, 
sometimes find hard to accept. Finally, realization 
of a development bank's potential requires govern- 
ment policies—fiscal, monetary, commercial—that 
support and facilitate the bank’s activities and that, 
in particular, create a climate congenial to private 
enterprise. Experience indicates that it is a waste 
of scarce development resources, both of funds and 
personnel, to create a development bank where 
these conditions cannot be, or are not, met, 


SHIRLEY BoskEY 
[See also Economic GRowTH.] 
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1, A THEORY or DEVELOPMENT 


1 
THE FIELD 


Developmental psychology is concerned with the 
study of changes in behavior throughout the life 
span. Although, logically, equal emphasis should 
be placed on development during all stages of life, 
so far most research has dealt primarily with in- 
fants, children, and adolescents. Because of this 
emphasis the term “developmental psychology” is 
often used interchangeably with the older terms 
“child psychology,” “adolescent psychology,” and 
“genetic psychology.” These areas are distinct from 
one another, but it is impractical to discuss them 
separately. Child and adolescent psychology tra- 
ditionally include the investigation of behavior at, 
rather than across, particular periods of develop- 
ment, and genetic psychology involves a concern 
for ontogenetic as well as phylogenetic changes in 
behavior. Underlying all the terms, however, is 4 
concern with the emergence of behavior in the 
human being, and, therefore, in this discussion no 
attempt is made to distinguish between the various 
approaches [see ADOLESCENCE; AGING; INFANCY]. 

In the growth of developmental psychology theo- 
retical positions, practical needs, public awareness, 
and support of research have been closely inter- 
related. In various phases different forces we 
been dominant, depending on the psychologic 
and social milieu of the time. The field, therefore, 
has been one of shifting emphases. It is necessary 
to review the historical background of develop: 
mental psychology in order to understand its cur 
rent structure. 


The early period (1880-1920) 


Although for centuries parents and other pee 
had observed children and gathered much “CO! 
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mon-sense” knowledge about them, the information 
was contaminated by the biases of the parent- 
child relationship and the goals and wishes of other 
observers. Interest in systematically investigating 
developmental changes in behavior is, in fact, rela- 
tively recent. Research in this area received its 
initial impetus from the late nineteenth-century 
work in evolutionary biology. Charles Darwin, Wil- 
helm T, Preyer, and other biologists observed the 
development of human infants as a means of gain- 
ing further understanding of human evolution. 
After the publication of these observations, many 
baby biographies were published, They gave some 
indications of the changes that occur in the in- 
fant’s repertoire of behavior, although they were 
typically limited to a few cases and covered only a 
restricted age range. 

The theory of evolution had a stimulating effect 
on other types of research, including that of the 
“founder” of developmental psychology, G. Stanley 
Hall. His belief in the theory that ontogeny reca- 
pitulates phylogeny resulted in his studying the 
behavior of the individual child as a source of in- 
sight into the behavior of man at earlier evolu- 
tionary periods. Many aspects of children’s be- 
havior were observed, including play and social 
relationships. 

Hall introduced the questionnaire method, mark- 
ing the end of a period in which observation was 
the primary method of obtaining data; question- 
naires came to be widely used for studies of chil- 
dren’s thoughts, interests, and wishes. Soon labora- 
tory procedures were applied to children, but the 
early experiments tended to be little more than 
demonstrations that certain phenomena associated 
with lower animals or with adults also occur in 
children. These studies dealt with the traditional 
topics of experimental psychology, such as memory, 
learning, sensation, and perception. In many cases, 
studies originally using animals or college students 
were adapted so that children could be used as 
subjects. 

Intelligence testing. By 1890 there was con- 
siderable emphasis on objective observation and 
laboratory methods. Society was faced with the 
growing number of problems related to children 
that accompanied increased urbanization and ex- 
pansion of public education. When, for example, 
the question of how to handle mental defectives in 
schools and institutions arose, groups of individu- 
als in Great Britain and France began studies of 
how to assess and deal with such children. One of 
the most significant outcomes of these investiga- 
tions was the development by Alfred Binet of a 
scale to distinguish between individuals of normal 
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and subnormal intelligence. Whereas previous 
efforts to assess intelligence had been unsatisfac- 
tory, Binet’s work in constructing tests, which in- 
volved complex verbal and adaptive functions 
rather than simple sensory or motor functions, 
yielded a successful scale. It was soon obvious that 
such a scale had broader use than solely distin- 
guishing between normal and defective children. 
Further work by Binet and others, such as Lewis 
M. Terman, resulted in useful tests for “measur- 
ing” children’s intelligence and investigating the 
effects of variables potentially capable of influenc- 
ing intellectual development [see INTELLIGENCE 
AND INTELLIGENCE TESTING]. 

Psychoanalysis. During this period Sigmund 
Freud was developing a theory of psychic function- 
ing that emphasized the importance of childhood 
experiences as determinants of later behavior. As- 
pects of the theory were presented in 1909 to devel- 
opmental psychologists and others at Clark Uni- 
versity. Although Freud directed attention toward 
the early emotional life of children, his work had a 
minimal effect on early research in developmental 
psychology. Not until the 1930s, when develop- 
mental psychologists were no longer so strongly 
influenced by biological thinking, did the resistance 
to his views decrease and did serious investigations 
of his ideas concerning the effects of early experi- 
ences on psychological development begin. 


The period of expansion (1920-1940) 

Research during the first two decades of the 
1900s was aimed at providing a clearer delineation 
and categorization of children’s behavior, The re- 
search was highly empirical and not significantly 
influenced by theoretical analysis of psychological 
functioning. The work of this period, however, 
provided data necessary for the later development 
of systematic positions. 

By the 1920s developmental psychology was 
rapidly becoming a visible and important area in 
psychology. The belief was widely held that the 
application of scientific methods to the study of 
children would lead to solutions of problems in 
education, pediatrics, child care, and treatment. 
Educators began lecturing parents on how to rear 
children. Clinics were organized to evaluate chil- 
dren’s abilities and to advise parents on such mat- 
ters as discipline, toilet training, and feeding. 
Nursery schools were established in most large 
universities and in many colleges and were quickly 
put to use as a source of subjects for psychological 
research. Most important, funds became available 
for the support of research. The most notable donor 
was the Laura Spelman Rockefeller Fund, under 
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whose auspices institutes devoted to research with 
normal children were established or extended at the 
universities of California, Iowa, Minnesota, and 
Toronto, and at Columbia University. The Fels 
Research Institute and the Yale Child Study Center 
also became leading centers for the study of chil- 
dren’s behavior in the United States. The Institut 
J.-J. Rousseau in Geneva and the Vienna Psycholo- 
gische Institut were the most active European insti- 
tutions for research in developmental psychology. 
The 1920s and 1930s were exciting and productive 
years at these institutes, but because the institutes 
were separated from departments of psychology, 
developmental psychologists tended to be isolated 
from psychologists interested in other aspects of 
behavior. Only during the past few years has the 
relationship with general psychology been firmly 
re-established. 

Divergent viewpoints on how environmental and 
biological factors influence behavior resulted in a 
major controversy over the question of the con- 
tributions of nature and nurture to intellectual 
functioning. It was easy to take sides; it was much 
more difficult to accumulate information that with- 
stood the tests of critical scrutiny. The controversy 
eventually ended in a stalemate, for although scores 
of studies were published, few survived the devas- 
tating attacks that were waged against their design, 
execution, and analysis. Solutions for the enor- 
mously complex issues that were being raised 
would have to await the time when developmental 
psychologists had more sophisticated tools and 
information at their disposal. 

Behaviorism and its critics. The introduction of 
behaviorism into developmental psychology in the 
1930s marked the beginning of an era in which 
schools of psychology, each with its own theories, 
problems, and preferred methods, polarized the 
field. Previous research had been aimed at provid- 
ing information about children’s behavior; now re- 
search was often directed toward establishing the 
validity of theories advanced by various schools. 
Behaviorism had an effective and vigorous advo- 
cate in John B. Watson. Edward L. Thorndike and 
Ivan P. Pavlov had provided a background of in- 
formation and techniques in their studies of learn- 
ing and conditioning in animals, which Watson 
was able to use to develop his approach to psy- 
chology. He espoused an extreme form of environ- 
mentalism and relied heavily on the use of condi- 
tioning as the means for modifying behavior. His 
studies of infant behavior and of the acquisition 

and extinction of fear responses in children be- 
came classics, and his belief that introspection 
should be superseded by objective observation and 


measurement of behavior exerted a healthy influ- 
ence on developmental psychology. it was not long, 
however, before advocates of other views responded 
critically to Watson's position. 

Among the most vocal schools of critics were 
gestalt psychologists and psychoanalysts. Although 
at the time neither group had a large influence on 
developmental psychology, their views provided the 
basis for later work. Another critic who took strong 
exception to the associationistic view of American 
behaviorists was Jean Piaget. As a result of his 
training in biology, Piaget emphasized the adaptive 
functions of behavior in maintaining an equi- 
librium between the individual and the environ- 
ment. Piaget's views were elaborated into a com- 
prehensive theory of cognitive development. The 
contrast between him and the behaviorists extended 
to all aspects of their work, Whereas the behavior- 
ists utilized the experimental method in their in- 
vestigations of simple forms of behavior and 
emphasized the role of controlled experiences in 
the modification of behavior, Piaget preferred the 
clinical method and chose to study the interaction 
of maturation and experience in the production of 
complex behavior and higher thought processes. 

Normative and longitudinal studies. The chart- 
ing of the development of behavior was not con- 
fined to cognitive development. Arnold Gesell at 
Yale and Charlotte Bühler in Vienna had begun 
extensive normative studies of child development. 
For nearly forty years Gesell maintained an active 
program of research in the exploration of motor, 
adaptive, language, and personal-social behavior 
from birth through adolescence. The normative in- 
formation from these studies, along with the re- 
sults of his investigations of feeding, sleep, and 
other topics, became a major source of guidance 
for mothers in many countries. Gesell, like Piaget, 
had a strong biological orientation. While other 
American psychologists were proclaiming the influ- 
ence of the environment on behavior, Gesell con- 
tinued to show how specific types of experiences 
may have differential effects on behavior, depend- 
ing on the maturational level of the individual. 

A series of longitudinal studies of child develop- 
ment was initiated in the early 1930s. The longr 
tudinal method had not been employed extensively 
prior to this time, although such investigations a 
Terman’s famous study of gifted individuals pe 
demonstrated its productivity. At the University © 
California, Nancy Bayley began a study of the 
development of mental abilities in infants am 
children (1940), Herbert Stolz and Harold Jones 
began a study of somatic and psychological dev 
opment during adolescence, and Jean Macfarlan 
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began a study of personality development. At the 
Fels Research Institute, Lester Sontag and his 
associates began a comprehensive longitudinal 
study of growth and development. All these studies 
are still being conducted; data are being collected, 
and old data are being analyzed in new and inter- 
esting ways [see PERSONALITY, article on PERSON- 
ALITY DEVELOPMENT; and SENSORY AND MOTOR 
DEVELOPMENT]. 

Gestalt and field theories. Psychology began to 
be influenced significantly by the work of the 
gestalt psychologists, who, in turn, were influenced 
by field theory from physical science. The exten- 
sion of the gestalt approach by Kurt Lewin had the 
greatest influence on research with children. Dur- 
ing his years in Berlin and in the United States, 
Lewin directed a series of studies aimed at demon- 
strating the important role of situational factors in 
determining behavior, Lewin’s preference for an 
ahistorical approach to the analysis of behavior led 
him and others to a careful consideration of the 
structure of the psychological field and the forces 
operating in it. [See FIELD THEORY; GESTALT THE- 
ory; and the biography of LEw1N.] 


Later developments (1940-1965) 


The remarkable rate of growth in developmental 
psychology was seriously curtailed at the end of the 
1930s. Many factors contributed to a temporary 
decline in activity. The normative, descriptive ap- 
proach had lost its allure; it was no longer of inter- 
est to reaffirm the fact that behavior developed 
systematically. Other areas of psychology appeared 
to be more challenging, and the number of per- 
sons entering developmental psychology decreased. 
World War 11 disrupted research programs. Perhaps 
as important as any factor, however, was the with- 
drawal of supporting funds from the institutes and 
the general unavailability of new funds for re- 
search. For about fifteen years it became more and 
more difficult to initiate new research projects and, 
in many cases, to complete the analyses of data 
obtained from earlier studies. In 1938, for example, 
approximately 500 publications concerned with 
children’s behavior appeared; by 1949 the number 
had dropped to approximately 250. The general 
level of the work also began to decline; writings 
became more speculative and less dependent on 
original observations or experiments (see Barker 
1951). Fortunately, the field did not disintegrate; 
there was enough activity at an appropriately high 
level to maintain the interest in research with 
children during the critical years. 

The period of decline came to a halt in the mid- 
1950s, when interest in developmental psychology 
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was reactivated. Since then, interest has increased 
at a rapid pace and there has been a revitalization 
of the area of study. Research with children has 
been greatly stimulated by the rapid and sophisti- 
cated advances in psychological theories, method- 
ology, and instrumentation. Many of the theoretical 
positions in general psychology have direct rele- 
vance to the discussion of the emergence of be- 
havior and are readily applicable to developmental 
investigations. Funds became available at an un- 
precedented level to support research and training. 
In the United States, for example, in 1962, a na- 
tional Institute of Child Health and Human Devel- 
opment was established to provide leadership in 
training of investigators and research in develop- 
mental sciences, including developmental psychol- 
ogy. More powerful statistical analyses have been 
adopted and developed. Computers have made pos- 
sible many types of studies that could never have 
been carried out without rapid mechanical analyses 
of the data. 

The descriptive approach in developmental psy- 
chology has been replaced by a variable-oriented 
approach. The adoption of systematic positions 
dealing with developmental changes in children’s 
behavior has led to an increased interest in de- 
termining children’s responses to particular speci- 
fiable or manipulable variables and to a decreased 
interest in describing behavior in naturalistic or 
controlled settings. There is a tendency for sub- 
systems to be developed that deal with specific 
aspects rather than with whole domains of be- 
havior. The level of conceptualization has in- 
creased; concepts and processes now dominate 
thinking in developmental psychology. 

There has been a redistribution of emphases 
among the various methods used in studying chil- 
dren. The experimental method has gained in popu- 
larity, primarily because of the opportunity it 
allows for the control of both independent and 
dependent variables, An experiment is no longer 
conceived merely as a controlled situation yielding 
quantitative data but as a method that permits the 
manipulation of the variables under study. Few 
long-term longitudinal studies have been initiated; 
the problems involved in the selecting of appro- 
priate measures at the beginning of the study, the 
maintaining of the sample of subjects over long 
periods of time, and the organizing of the large 
amounts of data accumulated in such studies have 
discouraged most investigators from undertaking 
more than four-year or five-year longitudinal in- 
vestigations. Observations, interviews, and tests 
tend to be used less frequently than they were in 
earlier periods, but they are still employed when 
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they are particularly applicable to the topic under 
investigation. 

Although attention continues to be directed 
toward developmental changes in such areas as 
children’s learning, motivation, perception, and 
language, new interests are developing. The redis- 
covery of Piagets work and modifications in be- 
havior theory have resulted in a rapid development 
of interest in research in cognitive processes. Re- 
search in socialization has been stimulated by 
psychoanalytic and social learning theories. Studies 
of personality development have concentrated on 
specific aspects of behavior, for example, anxiety, 
aggression, and dependence, rather than attempt- 
ing to deal with such global problems as adjust- 
ment and emotional behavior. Interest has been 
reawakened in the study of neonates, especially in 
their sensory, perceptual, and adaptive behavior. 
Cross-cultural studies, once in the domain of social 
anthropology, have been undertaken to ascertain 
the effects of different child-rearing practices on 
children’s later behavior. Studies of the ecology of 
behavior have resulted in detailed analyses of 
children’s behavior in naturalistic settings. 

It is doubtful that twenty years ago anyone could 
have anticipated the remarkable resurgence of 
interest in developmental psychology that has oc- 
curred in the past decade. More research on basic 
processes of behavior is being conducted than in 
any previous period. Programs designed to train 
psychologists for research in developmental prob- 
lems are being initiated and expanded, both in 
departments of psychology and in research insti- 
tutes. A strong interest in applied problems is not 
evident, perhaps as a reaction to the overcommit- 
ment to the solution of social problems that char- 
acterized earlier work in developmental psychology, 
There are indications, however, that the techniques 
and knowledge now being accumulated can be used 
effectively in attacking the practical problems of 
the developing child. As the interest in such prob- 
lems increases, it may be expected that the gap in 
communication between developmental psychology 
and pediatrics, education, sociology, and other dis- 
ciplines will begin to close. 


HAROLD W. STEVENSON 
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n 
A THEORY OF DEVELOPMENT 


Most theories of development and of intelligence 
seek to explain new problem-solving or concept- 
formation behavior by the influence of acquired 
experience, that is, as a function of external rein- 
forcements; and they seek to explain the formation 
of logical structures by the influence of language. 
In this double perspective, knowledge tends to be 
considered as a kind of “copy” of reality: either as 
a sensorimotor copy, because the “associations” on 
which the learning is based are seen to be in one-to- 
one correspondence with a series of relationships 
already existing in the external world, or as a ver- 
bal copy, because language simply describes a 
reality that is considered ready-made. 

Without denying the fundamental importance 
of both acquired experience and language, I would 
include, in addition, the following three central 
notions: 

(1) Knowledge of an object does not consist of 
having a static mental copy of the object but of 
effecting transformations on it and reaching some 
understanding of the mechanisms of these trans- 
formations. An intelligent act consists above all of 
coordinating operations, uniting, ordering (in the 
sense of introducing order), etc. These operations, 
which derive from the subject's internalization of 
his own actions, are the instruments of the trans- 
formations that knowledge is concerned with. 

(2) Logical relationships are, first and above 
all, operational structures. Although their most 
advanced forms are certainly expressed by lan- 
guage, their origins are found in the coordination 
of the subject’s own actions. Even at the sane 
motor, preverbal level, a child is involved in activi- 
ties that include uniting, ordering, introducing cor- 
respondences, etc.; and these activities are the 
source of operations and logico-mathematical struc- 
tures. 4 
(3) Knowledge is not determined strictly by 5 
knower, or by the objects known, but by exchange 
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or interactions between the knower and the objects 
(between the organism and the environment). The 
fundamental relation is not one of simple associa- 
tion but of assimilation and accommodation; the 
knower assimilates objects to the structures of his 
actions (or of his operations), and at the same 
time he accommodates these structures (by dif- 
ferentiating them) to the unforeseen aspects of 
the reality he encounters. 

It is with this emphasis on the subject's own 
activities and transformational operations that I 
look at the stages of mental development and the 
progressive formation of intelligence. In a word, 
a ready-made object knowledge is not assumed, 
because we must transform it in order to know it; 
nor is a fully formed knowing subject assumed, 
because the subject must elaborate his own struc- 
tures through transforming objects. 


Sensorimotor intelligence 

Even at the sensorimotor level, before the emer- 
gence of language, we can see this two-sided de- 
velopment—the child’s thought structures develop 
in the very process of his “construction of reality.” 
Infants assimilate situations into their repertory 
of actions; they find out about new things by acting 
on them in the ways they are capable of, such as 
sucking and grasping. By repetition, coordination, 
and generalization, these actions engender a sys- 
tem of “schemas of assimilation.” For example, a 
child who already pulls things and places things 
comes to coordinate these actions in the form of 
pulling a little rug toward himself in order to reach 
something placed on it. On the other hand, as part 
of this same development, this intelligence struc- 
tures reality. For instance, at first, objects have 
no permanence; the universe is one of moving 
tableaux that appear and disappear and are with- 
out spatial localization once they have disappeared. 
But with the coordination of actions, a “schema of 
the permanent object” develops: an object is 
sought after it has disappeared, and it thus con- 
tinues to exist in the sense of its having a spatial 
localization even when it is actually outside the 
field of perception and action. 

Another basic schema, which, at the practical 
level, is closely tied to the schema of the permanent 
object, is that of the coordination of positions and 
changes of position, what geometers term “the dis- 
placement group”—the coordination of movements 
from positions with the possibility of a return to 
the point of departure and of the use of detours 
or alternate routes to get to the same point. There 
is even a sensorimotor schema of causality. First, 
this consists of attributing phenomena to the sub- 
ject’s own actions, without taking account of physi- 
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cal or spatial contact. Then, as causality is attrib- 
uted to relations between objects themselves and 
some physical interaction is found necessary, there 
emerges progressive objectivation and spatializa- 
tion of causal relations. 

Sensorimotor intelligence thus makes two funda- 
mental advances. (1) Actions are coordinated; and 
this is the source of later logic and operations, 
when these schemas will be internalized and re- 
structured at the representational level. ( 2) From 
the point of view of structuring reality, the sub- 
structure is provided for certain fundamental con- 
cepts—object permanence, causality, space, and 
temporal succession. (For further elaboration on 
these ideas, see Piaget 1936; 1937.) 


Figurative and operative symbolic functions 


At about one and a half or two years of age, 
there emerge the beginnings of the symbolic func- 
tion and representational evocation, or “thought” 
(or, again, internalized intelligence). 

Knowing consists necessarily of a system of sig- 
nification, and signification consists necessarily of 
a signifier and something signified. At the early 
sensorimotor level, the sensorimotor, perceptual, 
and postural actions upon objects—the schemas— 
are the things signified, and the signifiers are some 
aspect of these very schemas. Such signifiers are 
the cues or signals of conditioning, and at this early 
stage they are still relatively undifferentiated from 
what they signify. 

By contrast, the symbolic function has its begin- 
ning (at around one and a half or two years) with 
the development of signifiers that are differentiated 
from the schemas that they signify; that is, some- 
thing can be evoked or represented by something 
other than a part of itself. Such signifiers may be 
symbols that bear some resemblance to what they 
stand for or signs that are arbitrary conventions. 

Language and imitation. The words of a lan- 
guage are signs. In children, at the same time the 
language sign system appears other aspects of 
functioning, which are based on symbols rather 
than signs, emerge: deferred imitation, symbolic 
(or fictional) play, and mental imagery (which is 
internalized imitation). All of these are alternate 
ways of evoking situations and objects in their 
absence. 

Imitation is the medium by which the child 
progresses from sensorimotor to representational 
functions. A form of imitation is already present 
at the sensorimotor stage. This is a material, or 
active, representation and takes place only in the 
presence of the model being imitated. It is non- 
deferred imitation (even though it may keep going 
for a while after the model has disappeared). It 
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does not imply any form of mental representation, 
and it does not necessarily result in any. On the 
other hand, deferred imitation (imitation that 
starts in the absence of the model) does lead to 
representation, as is clear in the case of symbolic 
play. It then becomes internalized as a mental 
image, which permits the acquisition of language. 
(Language is based on this deferred imitation, not 
only on conditioning; otherwise it would develop 
much earlier. ) 

Thought is made possible by the emergence of 
the symbolic function, through the internalization 
of sensorimotor actions and through the recon- 
struction of the early structures on the level of 
representational signification. But before looking 
more closely at this reconstruction, we must intro- 
duce an essential, although often overlooked, dis- 
tinction. The recent research of Piaget and Inhelder 
reveals this distinction more and more clearly. 

Figurative and operative aspects of thought. At 
the heart of cognitive processes are two aspects— 
not separated sectors, but different poles: the 
figurative aspect and the operative aspect. 

The figurative aspect deals with static configura- 
tions of reality and provides adequate figural repre- 
sentations of these states. This aspect is predomi- 
nant in perception, in imitation, and in mental 
imagery that is a form of internalized imitation. 

The operative aspect, on the other hand, deals 
with transformations of these states. This aspect 
is predominant in actions in general and, in the 
special case of operations, in internalized actions 
that have become reversible. The operations of 
adding and subtracting, for instance, are products 
of the actions of uniting and separating. The term 
“operative” refers to both actions and operations, 
while the term “operational” refers to operations 
alone. 

This distinction between the figurative and the 
Operative aspects is an important one from the 
point of view of the development of intelligence. 
In adults, the figurative is subordinated to the 
operative; that is, states are seen as the result of 
one transformation and the starting point for 
another. However, in young children, especially in 
children who have not yet begun to think opera- 
tionally, these two aspects remain uncoordinated; 
the children are not capable of certain elementary 
forms of reasoning because they focus too much 
on fixed states, without seeking to relate them oper- 
ationally. For instance, the majority of four- to six- 
year-olds think that after a liquid is poured from a 
wide glass into a narrow one there is more of it, 
because they note only the fixed levels of the liquid 
and not the pouring action, which links the initial 
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and terminal states. (For further reference, see 
Piaget 1945; Piaget & Inhelder 1966.) 


Preoperational thinking (two to seven years) 


Operations are internalized, reversible actions, 
coordinated in total structures such as classifica- 
tions, seriations, and multiplicative matrices. The 
psychological criterion for the existence of opera- 
tional reversibility is the presence of the notion of 
conservation, a negative example of which is the 
children’s reactions to the liquids, cited above. 

Sensorimotor intelligence manifests the thresh- 
old of operativity, since the displacement group 
itself is a total structure characterized by reversi- 
bility, and there is an invariant, or a conservation 
schema, in the form of the schema of the perma- 
nent object. But internalization is missing here. 
These schemas consist merely of physical, succes- 
sive actions and not yet of simultaneous repre- 
sentations. 

One would be tempted to think that as soon as 
the symbolic function was formed, these sensori- 
motor structures would be internalized as opera- 
tional structures. This is indeed what finally takes 
place but, because it is much more difficult to 
reproduce an action in thought than to execute it 
physically, it takes place much more slowly than 
one might suppose. This internalization requires a 
total reconstruction on a new plane; and during 
the course of this construction, the child must 80 
through the same difficulties he did on the sensori- 
motor level. For example, the four- to six-year-olds 
we studied knew perfectly well the route they took 
from home to school but could, nonetheless, not 
represent this route on a sand table. 

Egocentrism and decentering. The necessary 
condition for operational development at this age F 
a decentering from the child's own actions an 
own point of view. The phenomenon of the child's 
centering on his own point of view and on his own 
actions is what is called egocentrism (a term that 
has often been misunderstood because of confusion 
with affective egocentrism ). By egocentrism, Ime 
nothing more than the absence of cognitive are 
tering. Already at the sensorimotor level the cours 
of development is one of progressive decente 
from an initial state of radical egocentrism. 
newborn infant has no awareness at all of his w 
identity, precisely because of the fact that he o 
not distinguish his own point of view from ct 7 
possible points of view. As a result, he lives Ee 
universe without objects, centered spatially, m 
ally, and temporally on his own body. The oe 
struction of object permanence, the dipr E 
group, and objective causality are characterized, 
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general, by progressive decentering, to the point 
where he becomes one element in a universe of 
objects and of spatial and causal coordinations. 
This Copernican revolution, as it were, must be 
effected all over again in the realm of representa- 
tional thought and in social relations; another 
decentering is necessary to liberate the child from 
his own point of view. One example, from the 
realm of spatial representation, involves a three- 
dimensional model containing three mountains; 
the child is given a series of drawings made from 
various positions and is asked to indicate the angle 
from which each of the drawings was made. Chil- 
dren at the preoperational stage cannot do this. 
A social example is the question of the number of 
brothers in a family. A child will say, for instance, 
that he has two brothers but that each of them has 
only one brother—because he doesn’t count himself 
as a brother. 

Absence of conservation in egocentrism. The 
main feature of egocentrism, however, is the ab- 
sence of reversibility, and therefore of operations. 
None of the tests of conservation is successful at 
this level. We have already seen the example of 
the pouring of liquids. We get the same results 
using small beads instead of a liquid. Similarly, if 
ten blue tokens are lined up opposite a row of ten 
red tokens, the child will think that the number or 
quantity changes if you spread out or push together 
one of the rows. Again, if you give him two similar 
balls of Plasticine and then reshape one as a 
sausage or as a pancake, he will think that the 
quantity has changed. By about 7 or 8 years of 
age, he will affirm that the quantity has not been 
changed; but he will continue to think, until 9 or 
10, that the weight has been changed; and only 
at 11 or 12 will he believe that the volume, as 
measured by water displacement, remains un- 
changed. These experiments, which were first con- 
ducted in Switzerland, have been replicated in 
other countries (for example, in the United States 
by Elkind 1961) with the same results. 

In spatial conception at this age level, there is 
nonconservation of length. Two sticks of equal 
length will be judged equal by the child if they are 
placed side by side with their extremities aligned, 
but if one is moved so that it extends beyond the 
other, the child will now say that it is longer. Simi- 
larly, there is nonconservation of distance: the 
space between two points is considered smaller if 
an object is placed in the middle of it—empty 
space is not equated with filled space. Similarly, 
there is nonconservation of surfaces, volumes, etc. 
(For further reference, see Piaget & Inhelder 1941; 
1948.) 
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The concrete operations 

At seven or eight years of age, the first opera- 
tional structures take form. These first operations 
are limited, dealing with manipulations of objects 
themselves and not with verbal hypotheses. They 
are limited to classes, relations, and number, deal- 
ing either with discontinuous elements or with 
spatial and temporal continuums. They do not 
attain the general level of propositional logic. 

Moreover, these concrete operations do not cover 
the entire logic of classes and relations but only 
certain of the structures involved. These first opera- 
tional structures, to which we have given the name 
of “groupings,” relate to contiguous inclusions and 
relationships, without yet being generally combina- 
torial. They are only semilattices and incomplete 
groups, like seriations, classifications, multiplica- 
tive matrices, etc. 

Seriation. Seriation, or succession of transitive 
asymmetrical relations, has its roots in early preop- 
erational, even sensorimotor, schemas. Very young 
children can arrange a series of blocks in the order 
of their size. But at this level, it is done only by 
successive trial and error, with no use of deductive 
transitivity (knowing that A < B and B < C means 
that A < C). At seven or eight, there is a system- 
atic method, consisting of finding the smallest of 
all, then the smallest of the remaining ones, etc., 
so that each element is understood as being bigger 
than all the preceding ones and smaller than all 
the following ones. This is the basis for an imme- 
diate deductive understanding of transitivity. 

Classifications. Classifications, too, are present 
at the preoperational level, but in a form that re- 
mains elementary because it lacks operational re- 
versibility. When small children, say four-year-olds, 
are asked to make classifications of objects, they 
make “figural collections,” with them; that is, the 
spatial arrangement is made as if the class itself 
was dependent on a configuration in space. Slightly 
older children make nonfigural collections. The 
collections no longer have a significant shape, but 
they are still only material collections and not logi- 
cal classes, because the children do not understand 
the intensive quantification of class inclusions. 
Suppose, for example, that we present seven prim- 
roses (class A) and three other flowers (class A’), 
which together make up a collection of ten flowers 
(class B =A + A’). The child will agree that all 
the primroses are flowers and that all the flowers 
are not primroses, but he does not conclude that 
there are fewer primroses than there are total 
flowers (A < B). He will say there are more prim- 
roses because there are only three others. He can- 
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not reason at the same time with both the whole 
class and a subclass. In trying to compare A with 
B, he must split B into A and A’, and then since 
A is already one term of the comparison, the child 
uses A’ as the other term. At an advanced opera- 
tional level, however, he will understand that A = 
B — A’ and A’ = B — A. Therefore A < (A + A’), or 
A <B, In other words, understanding class inclu- 
sion presupposes operational reversibility. 

The same sort of analysis can be made of chil- 
dren’s notions of correspondences, multiplicative 
matrices, etc. However, the operational mechan- 
isms involved in the construction of the notion of 
whole numbers deserve special attention. We have 
already seen that even though children may use 
number names, it is not until the age of seven, on 
the average, that they are convinced of the con- 
servation of number if the spatial arrangement is 
changed, The operational concept of number is a 
synthesis of class inclusion and ordering. On the 
one hand, there are the inclusions I< (I +1) < 
(I + I +1). On the other, since all the elements are 
equivalent, with all qualitative differences having 
been abstracted, the only way to distinguish one 
element, I, from another is by introducing order. 
This order may be a spatial array, a temporal 
array, or a logical enumerating order of the type 
I+ I> I, where the sign > indicates any sort of 
transitive order. Bertrand Russell maintained that 
the cardinal number was based on one-to-one cor- 
respondence between classes of the same extension, 
without having to introduce order, whereas the 
ordinal number was based uniquely on ordering. 
But the one-to-one correspondence to which he 
referred is not the qualitative correspondence of 
the logic of classes, such as is found in multiplica- 
tive matrices, for example, where an element cor- 
responds to another because of corresponding 
properties, but is rather a correspondence of one 
unity to another unity, with the qualities abstracted, 
And this already presupposes an arithmetical unity, 
that is, number. Cardinal and ordinal numbers can 
be neither understood nor formalized independ- 
ently of one another. Only by seeing number as a 
synthesis of inclusion and seriation can we avoid 
the vicious circle inherent in Russell’s formaliza- 
tion. This interpretation of number, as a synthesis 
of inclusion and seriation, was discovered purely 
through psychology and has since then been for- 
malized by the logician J. B. Grize. 

There is another whole area of operations at 
this level. So far, we have been discussing discon- 
tinuous elements, where the Operations deal with 
likenesses and differences. There is also the area 


of infralogical or spatiotemporal operations, which 
deal with proximity and separation in a continuum, 

Representative space. Representative space (as 
opposed to sensorimotor and perceptual space) is 
based on the following operations: partition and 
addition of parts (corresponding to addition of 
classes), placement and displacement (correspond- 
ing to seriation), and measuring as a synthesis of 
partition and displacement (as number is a syn- 
thesis of inclusion and seriation). There is a point 
of interest here from the point of view of philos- 
ophy of science. These spatial operations apply 
equally to Euclidean (metric) projective, and topo- 
logical space. In the historical development, Euclid- 
ean geometry is considered to have come first, 
followed much later by projective, and later still 
by topological, considerations. The psychological 
construction presented here is the contrary of this; 
topology is the most fundamental, with the other 
two being derived from it. We have found that in 
children, development follows this order. The first 
intuitions are topological, and from this come pro- 
jective and Euclidean, or metric, representations. 

Time, The operations involved in the notion 
of time are again similar. There is the ordering of 
successive events; there is inclusion of time inter- 
vals; and the synthesis of these leads to time meas- 
urement, The notion of time is based on a primitive 
intuition of speed, which determines judgments of 
speed much before there is any metric notion of 
speed as a ratio of space to time. (Speed as a pre- 
requisite to understanding time, rather than vice 
versa, is closer to relativity physics than to classical 
physics.) This primitive notion of speed is ordinal, 
based simply on the intuition that if one moving 
thing overtakes another on the same path, it is 
going faster. This judgment of speed is based on 
spatial order (behind and ahead) and temporal 
order (before and after) but does not require any 
evaluation of length of time or size of interval. 
[For further reference, see Piaget 1946a; 1946b; 
Piaget & Szeminska 1941; Piaget, Inhelder, & Sze- 
minska 1948; see also Tıme, article on PSYCHOLOG- 
ICAL ASPECTS.] 


Formal, or hypothetico-deductive, operations 
The period of formal, or hypothetico-deductive, 
operations begins at about 11 or 12 years of E 
Children start to be able to reason not only aban 
concrete objects but about verbal hypotheses. F 
operations of propositional logic, in a natural, non 
axiomatized form, of course, are added to the oP 
erational structures. n 
There are at least three types of external evi 
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dence of this new development. (1) Children are 
able to reason with verbal material alone, that is, 
without concrete referents, (2) In experimental 
problems they are able to make hypotheses about 
the possible factors and vary them systematically 
to test the hypotheses. (3) In general terms, they 
are now capable of “reflection,” that is, of thinking 
about thinking or of operations on operations. 
Propositional logic is a kind of system of operations 
raised to the second power, since the contents of 
the propositions are class or relational operations 
and since new operations are involved in the inter- 
propositional structure (implications, disjunction, 
etc.). This is why adolescents can construct the- 
ories, naive though they may be, whereas children 
cannot. 

These operations have two main characteristics. 
First of all, they are combinatorial, giving rise, for 
example, to a more complex lattice structure in 
place of the simpler “groupings.” Second, the two 
forms of reversibility found at the concrete opera- 
tional level are united in a single system known 
as the group of four transformations. These two 
types of reversibility are inversion, or negation 
(N), which characterizes classification structures, 
and reciprocity (R), which characterizes relational 
structures. The operations involved in the groups 
of four transformations then are N, R, I (identity), 
and C (correlativity); so that NR=C, NC=R, 
RC =N, and NRC = I. With these operations, 11- 
and 12-year-olds can develop new operational 
schemas, such as proportions, double reference 
systems, etc. (see Inhelder & Piaget 1955). 


The mechanism of intellectual development 


The classical factors that usually are called upon 
to account for intellectual development are biologi- 
cal maturation, environmental influence (expe- 
rience), and social transmission (education ). Each 
of the three does play an important role, but they 
are insufficient to completely account for intellec- 
tual development. Thus a fourth is necessary. 

Biological maturation clearly needs to be sup- 
plemented when we see that the speed of develop- 
ment of operational structures is dependent upon 
cultural settings, individual experience, etc. 

As for experience as a unique explanatory factor, 
it is indeed true that all notions need a basis in 
experience, even logical and mathematical notions. 
But there are two distinctly different types of ex- 
perience, although they often go together. Physical 
experience consists of acting upon objects and 
drawing some knowledge about the objects them- 
selves, for example, weighing things and finding 
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out that weight is not always proportional to vol- 
ume. In logico-mathematical experience, on the 
other hand, the knowledge is drawn not from the 
objects that are acted upon but from the actions 
themselves. Arranging objects in various ways and 
finding out that the arrangement has no effect on 
the total number of objects is an example of this 
second type of experience. Physical experience cer- 
tainly does not suffice to explain the development 
of intelligence. As for logico-mathematical expe- 
rience, this is only a preparation for deductive 
thinking, which is based on the coordination of 
actions. 

Finally, linguistic, educative, or social transmis- 
sion, important though they are, can have an 
influence only to the extent that the child can 
assimilate what is offered to him into his own 
operational structures. 

Progressive equilibration. Another factor thus 
becomes necessary—the progressive equilibration 
of actions as they become coordinated. There are 
two reasons why equilibrium is a satisfactory ex- 
planatory model. First, intellectual equilibrium de- 
rives from the subject's activities as they compen- 
sate for external disturbances. Since compensation 
leads to reversibility, this progressive equilibration 
of the operational structures takes the form of 
progress toward ever-increasing reversibility. Equi- 
libration can thus be seen as a possible definition 
of intelligence, as well as an interpretation of its 
development. Second, an operational structure is 
equilibrated once it has achieved reversibility, but 
before then it passes through preoperational steps 
of successive equilibration. If we follow this pro- 
gressive equilibration, we see that any given step, 
although not necessarily the most probable at the 
outset, becomes the most probable, once the pre- 
ceding step has been reached. Equilibration thus 
includes a model of successive probabilities that is 
capable of dealing with factual detail, and which 
thus provides a learning theory based on internal 
reinforcement and not only on external or empiri- 
cal reinforcement (see Apostel, Mandelbrot, & Pia- 
get 1957). 


Perception and mental imagery 


The research on intellectual development is only 
a part of this line of developmental research. There 
is also work in the fields of perception and mental 
imagery. 

Perceptual development. In the area of per- 
ceptual development, two groups of phenomena 
have been identified: (1) effects that diminish in 
intensity but whose structure remains unchanged 
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and (2) activities that become more articulated 
with age. 

The effects that are not structurally modified 
but whose intensity decreases with age can be 
interpreted by a common probabilistic model. About 
15 classic opticogeometric illusions have been 
studied developmentally, Examples of these are the 
Delboeuf illusion, the Miiller—Lyer illusion, and il- 
lusions of angles, rectangles, parallelograms, etc. 
Each one was studied in a variety of different pro- 
portions; and at every age, positive and negative 
maxima were established for the proportions. The 
maxima proportions were found to be the same for 
every illusion, which permitted the effects of all of 
the illusions to be reduced to a general law, that 
of a “centration” effect: the areas focused on are 
overestimated, and the peripheral areas are under- 
estimated. A study of ocular movement, by Piaget, 
Vinh-Bang, and Matalon (1958), was able to sup- 
port this interpretation in some of the simpler 
situations. For instance, vertical lines are over- 
estimated in comparison with horizontal lines, and 
subjects were in fact found to focus during much 
of the exposure on the summit of the vertical line, 
[See PERCEPTION, article on ILLUSIONS AND AFTER- 
EFFECTS; VISION, article on EYE MOVEMENTS.] 

This centration effect was itself reduced to a 
probabilistic explanation based in part on the dis- 
tribution of the points of “encounters” between ele- 
ments of the figure and elements of the receptor 
organs and in part on the series of correspondences, 
or “couplings,” established among these points of 
encounter. Studies were carried out to establish 
temporal maxima for each of the illusions, in both 
children and adults, for exposures ranging from 
:01 second to 1.0 second. These studies confirmed 
the duality of these two factors, 

The second group of phenomena—activities that 
become more articulate with age—stems from 
exploration activities, such as comparing parts of 
the figure, structuring the figure by introducing 
perceptual coordinates, anticipating parts of the 
figure, etc. These activities give rise to certain sys- 
tematic secondary illusions by exaggerating the 
deformation resulting from Systematic centration. 
Once again, eye movement studies have confirmed 
this interpretation, showing, among other things, 
how much more systematic adult perceptual ex- 
ploration is than that of children. This explains 
why these illusions become more pronounced with 
age. 

These perceptual activities, then, come more and 
more under the influence and direction of intelli- 
gence. Intelligence has no effect on the primary 


perceptual illusions, but it does have an effect on 
perceptual activity, guiding exploration and intro- 
ducing its own structure. It is important to note 
to what extent a figurative function like perception 
lacks independent development and to what extent 
it is subordinated to operative functions. It has its 
beginnings in early motor behavior, and its later 
development is directed by operational intelligence. 

Mental imagery. Another figurative function 
whose development has been systematically studied 
is mental imagery, which has been relatively neg- 
lected in recent years but, indeed, was never much 
studied in children. Once again there are two prob- 
lems to study: (1) to establish whether the evolu- 
tion of imagery is autonomous or whether it is 
influenced by operational development and (2) to 
establish whether imagery paves the way for opera- 
tions or whether, on the contrary, it is modified and 
structured by the operations. 

As far as the first problem is concerned, two 
principal types of imagery have been distinguished. 
Besides reproductive, or “copying,” imagery there 
is anticipatory imagery (which may be as simple 
as imagining the rotation of a stick as it pivots 
about one of its extremities). In children who are 
too young to engage in operations, reproductive 
imagery is adequate, but anticipatory imagery is 
far from adequate. The errors made are, in fact, 
reminiscent of the errors in preoperational thought. 
Anticipatory imagery does not, then, derive directly 
from reproductive imagery but depends on the 
intervention of operations. At the operational level, 
this imagery becomes more flexible and can sarye 
as a useful symbolic auxiliary for operational 
thought. 

The second problem is examined by going back 
to the various operational tests (conservation, for 
example) and having the children anticipate how 
the situation will be, after telling them what trans- 
formation is about to be carried out. For instance, 
for the conservation of number, 12 blue tokens T 
placed in a row 15-20 centimeters long. About 3i 
centimeters away, 12 red tokens are spread out in 
a row 30-40 centimeters long. Each blue token n 
coupled with a red token by means of a channe: 
with walls 1 centimeter high, so that the tokens 
can move toward each other only along the ee 
and must ultimately meet. The children are asked, 
with the tokens in their original positions, whether 
there are as many blue as red, whether there a 
be as many if the blues are all moved up to wh 
the reds are, whether there will be as many if they 
are moved up only halfway, etc. The findings 23 
that small children are perfectly able to represen! 
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the displacement of the tokens, but before six and 
a half or seven years of age this does not lead them 
to assert that the number is the same. They report 
that the number increases as the row becomes 
longer, and decreases when the row is shorter. The 
anticipation remains transversal, focused on the 
length of the row. Later, at the operational level, it 
becomes longitudinal, focused on the paths traveled, 
and serves to verify the one-to-one correspondence. 
Other experiments support the same conclusion, 
namely, that adequate imagery in itself is not suf- 
ficient to bring about operational thinking but is 
structured and modified by the subject’s operational 
level (see Piaget 1961). 


Epistemological conclusion 

This entire body of research bears witness to a 
single broad intention: to put to experimental test 
the principal epistemological hypotheses about the 
nature of empirical and logico-mathematical knowl- 
edge. 

Above all, it reveals the insufficiency of empiri- 
cism as an epistemological theory: the development 
of empirical knowledge itself does not demand an 
empiricist interpretation. In point of fact, the study 
of this development in children reveals that expe- 
rience is never simply “read,” or passively regis- 
tered. It is always assimilated by the subject into 
his own structured schemas. In other words, even 
physical knowledge (in its broadest sense) is al- 
ways relative to a logico-mathematical framework 
brought to an experience by the subject himself. 
As for this framework, it is not simply a linguistic 
expression but is derived from the most general 
system of coordinations of the subjects own ac- 
tions. In short, it is operational. 

However, to say that the subject plays an active 
role in structuring the physical objects of knowl- 
edge does not mean that knowledge is rooted in 
the subject alone, as the a priori epistemological 
theories maintain. (The a priori influence is still 
alive in some forms of gestalt theory. ) Knowledge 
derives neither from objects independent of a sub- 
ject nor from a subject independent of objects; it 
derives from an indissociable interaction between 
subject and object or, in more general terms, be- 
tween organism and environment. This interaction 
leads at first to a lack of differentiation, or confu- 
sion, between objective and subjective, as is evi- 
dent in the child’s egocentrism. Later, it takes two 
related developments. One is decentering, which 
results in the objectivity of experimental knowl- 
edge. (This objectivity is not a given of the know- 
ing process but is a slow and laborious conquest. ) 
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The other is reflective abstraction, which leads to 
the construction of logico-mathematical structures. 
(See Piaget 1950.) 

JEAN PIAGET 


[Other relevant material may be found in CONCEPT 
FORMATION; INFANCY; LEARNING, especially articles 
on DISCRIMINATION LEARNING and LEARNING IN 
CHILDREN; PERCEPTION, article on PERCEPTUAL DE- 
VELOPMENT; PROBLEM SOLVING; REASONING AND 
LOGIC; THINKING.] 
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Both “deviant behavior” and “social disorganiza- 
tion” have been variously defined, but there have 
been few efforts to distinguish between the two 
concepts, In fact, it has been suggested that they 
are not different, that along with “social problems” 
and the somewhat outmoded “social pathology,” 
they signify only a potpourri of conditions that are 
considered undesirable from the standpoint of the 
observer's values, conditions that vary at different 
times and with different observers. According to 
this view, these terms have no scientific value and 
no legitimate status as sociological concepts. 

Such nihilism and counsel of despair are not 
justified. True, there is no consensus on the mean- 
ing of these terms, and they are, indeed, burdened 
with value connotations. However, they point to a 
number of distinctions that sociology must take 
into account, 

Concept of deviance. Turning first to the con- 
cept of deviant behavior, we must distinguish 
among the several definitions of the term, which 
are discussed below. * 

Behavior that violates norms. Deviant behavior 
is behavior that violates the normative rules, un- 
derstandings, or expectations of social systems. 
This is the most common usage of the term and 
the sense in which it will be used here. Crime is 
the prototype of deviance in this sense, and theory 
and research in deviant behavior have been con- 
cerned overwhelmingly with crime. However, nor- 
mative rules are inherent in the nature of all social 
systems, whether they be friendship groups, en- 
gaged couples, families, work teams, factories, or 
national societies. Legal norms are then but one 
type of norm whose violation constitutes deviant 
behavior. It is important to note that although 
deviance, in this sense, and conformity are “oppo- 
sites,” they represent the poles within the same 
dimension of variation; therefore, a general theory 
of the one must comprehend the other. 

Statistical abnormality. There is fair consensus 
that “deviant behavior” does not mean departure 
from some statistical norm, However various the 
definitions and usages, they seem to have in com- 
mon the notion of something that is, from some 
point of view, less “good” or “desirable,” and not 
merely less frequent. 

Psychopathology. For sociological purposes de- 
viance is seldom defined exclusively in terms of 
psychopathology, mental illness, or personality dis- 
organization, although it is commonly assumed 
that these phenomena are at least included within 
the scope of deviance. However, behavior is deviant 


in the first, or normative, sense because it departs 
from the normative rules of some social system, 
whereas behavior is pathological because it pro- 
ceeds from a sick, damaged, or defective person- 
ality. It is probable that most deviant behavior in 
the normative sense is produced by personalities 
that are clinically normal and that most behavior 
that is symptomatic of personality defect or mental 
illness does not violate normative expectations, In 
short, the two are independently defined, and the 
relationship between them is a matter for empirical 
investigation. It seems preferable to keep them con- 
ceptually distinct, retaining for the one the term 
“deviant behavior” and for the other the estab- 
lished terminology of psychopathology. 

It should be made clear that the distinction just 
drawn is not that between the psychological and 
the sociological levels of investigation. In viewing 
any human behavior we can ask, on the one hand, 
how it depends upon the history and structure of 
the personality that authors it. On the other hand, 
we can ask how it depends on the history and 
structure of the social system in which it is an 
event. Such questions can be asked about both 
mental illness and deviant behavior. However, in- 
quiry on the psychological and sociological levels 
cannot proceed altogether independently, for each 
must make some assumptions about the other. 
Durkheim (1897), in his classic treatment of sui- 
cide, made clear the analytical independence of 
the sociological level by demonstrating that varia- 
tions in rates of a given class of behavior within 
and between systems are a reality sui generis that 
cannot be explained simply in terms of the psycho- - 
logical properties of human beings but rather de- 
pend on the properties of the social system itself. 
However, he overstated his case and left the im- 
pression, whether it was his intention or not, that 
psychology has little to contribute to the under- 
standing of suicide. In fact, Durkheim's own treat- 
ment of the sociology of suicide is interlarded 
with assumptions about human motivation and 
other considerations that are ordinarily considered 
“psychological” [see SuicipE, article on sOcIAL 
ASPECTS]. 

Socially disvalued behavior and states. Deyen 
behavior may also be defined as socially disvalu‘ J 
behavior and states in general. This definition In 
cludes mental retardation, blindness, ugliness, 
other physical defects and handicaps, illness a x 
sorts, beggary, membership in ritually un a 
castes and occupations, mental illness, criminal H 
and a “shameful past.” What all these have A 
common is that, if known, they assign one to i 
socially disparaged role and constitute a blemi 


in the self. This blemish, or stigma, is an important 
constituent of all social encounters in which it is 
present. It poses problems to the stigmatized actor 
and his alters and has consequences for the de- 
velopment of personality and for social interaction. 
Goffman (1963) has demonstrated that it is pos- 
sible to generalize about the phenomenon of stigma 
and its consequences on a level that abstracts from 
the diversity of its concrete manifestations. 

Clearly, stigma is a legitimate and important 
object of investigation in its own right. Further- 
more, it is ordinarily an attribute of normatively 
deviant behavior; it may play a part in its genesis 
and control. It must therefore figure in a theory of 
deviant behavior. However, the fact that behavior 
is stigmatized or disvalued is one thing; the fact 
that it violates normative rules is another. Not all 
disvalued behavior violates normative rules; nor is 
it certain that all behavior that violates normative 
rules is disvalued. Explaining stigma is not the 
same as explaining why people violate normative 
rules, In keeping with the more traditional and 
better established usage, it seems preferable to 
limit the reference of “deviant behavior” to the 
violation of normative rules. 

Deviant behavior and deviant roles. Itis neces- 
sary to distinguish between what a person has 
done and how he is publicly defined and categor- 
ized by members of his social world. It is mainly 
the latter—the social role attributed to him—that 
determines how others will respond to him, To 
steal is not necessarily to be defined as “a thief”; 
to have sexual relationships with one of the same 
sex is not necessarily to be defined as “a homo- 
sexual” (Reiss 1961). Behavior that violates social 
rules may or may not become visible and, if visible, 
may or may not result in attribution of a deviant 
role. Furthermore, deviant roles may be attributed 
even in the absence of violations of normative 
rules. 

This distinction mirrors one of the perennial 
dilemmas of criminology. Is criminology concerned 
with all violations of criminal law or only with 
those violations that result in a legal adjudication 
of criminality? The former are infinitely more 
numerous than the latter, and data on their fre- 
quency and distribution are difficult to come by. 
The processes whereby some fraction of all vio- 
lators come to be selected for legal stigmatization 
as “criminals” bear only a tenuous relationship to 
actual histories of criminal law violation. Further- 
more, even legal attributions of criminality do not 
necessarily result in attributions of criminal roles 
in the world of everyday life. So, for example, 
“white-collar criminals” and income tax evaders, 
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even if legally convicted, are not likely to be de- 
fined as criminals in the world outside the courts 
and to experience the consequences of such defi- 
nitions (Sutherland 1949). 

The distinction between violating normative rules 
and being socially assigned to a deviant role is 
important. To explain one is not necessarily to 
explain the other. On the other hand, they interact 
in such ways that each must be taken into account 
in explaining the other. For example, to be adjudi- 
cated as an offender or even to be legally processed 
short of adjudication may have important effects 
on actual careers in criminal behavior (Tannen- 
baum 1938). It seems best to think of the field of 
deviant behavior as concerned with deviance in 
both these senses and with their interaction. 

The relativity of deviant behavior. It is com- 
monplace that normative rules vary enormously 
from one social system to another. It follows that 
no behavior is deviant in itself but only insofar as 
it violates the norms of some social system. This 
implies that the sociology of deviant behavior is not 
concerned with the encyclopedic study of prostitu- 
tion, drug addiction, etc., but rather with the ques- 
tion: “How do we account for the occurrence of 
these and other behaviors in situations where they 
are interdicted or disvalued by normative rules?” 

In fact, practical judgments of deviance in the 
world of everyday life take into account the col- 
lectivity membership of the actor. In general, a 
person comes under the jurisdiction of a system 
of normative rules when he is ascribed or success- 
fully claims the role of member of a collectivity. 
This is equally true of subcollectivities—associa- 
tions, cliques, academic institutions—within a 
larger collectivity. Indeed, to be subject to the 
normative rules of a collectivity comes very close 
to defining the social meaning of “membership” 
in a collectivity. 

More generally, the same may be said of any 
role, not of collectivity roles alone. The expecta- 
tions attaching to a role differentiate it from other 
roles and define the terms on which a person can 
be deviant. That this is true for such roles as hus- 
band and wife, doctor and patient, child and adult 
is elementary. It is equally true, but not so obvi- 
ous, for such transient roles as those of the sick 
and the bereaved. To occupy either of these roles 
is to be exempted from some rules otherwise ap- 
plicable, to be subjected to other rules, and to cre- 
ate special obligations for others in the role set of 
the sick or bereaved person, What it takes to be 
“sick” or “bereaved,” that is, the criteria of the 
roles, depends on the culture of the system. In any 
case, however, membership in those roles must be 
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validated in terms of those criteria. To successfully 
claim membership and then, in some manner, to 
betray oneself as “not really sick” or “not really 
bereaved,” as these are defined in one’s culture, is 
to lose the exemptions that go with that role, as 
well as to incur the special contumely of falsely 
claiming membership in a role for which one lacks 
the true credentials. 

In speaking of deviance one must specify the 
system of reference. The same behavior may be 
both deviant and nondeviant, relative to different 
systems in which the actor is implicated. However, 
we are still left with the question: “For any given 
system, who is to say what is deviant? Whose no- 
tions of right and wrong define the rules of the 
system?” This has been one of the most trouble- 
some issues in deviance theory. It is not entirely 
satisfactory to say that the rules of the system are 
those which are institutionalized—that is, agreed 
upon, internalized, and sanctioned (Johnson 1960, 
p. 20). This definition provides no criterion for a 
“cut-off point” defining the degree of institutional- 
ization necessary to determine deviance; in fact, 
the criteria of institutionalization are themselves 
multiple, and to some degree they vary independ- 
ently. 

Alternative responses to normative rules. The 
difficulty may arise partly from a failure to recog- 
nize the importantly different ways in which peo- 
ple may be oriented to normative rules. People 
sometimes seem to violate rules without guilt and 
without even the necessity for some mechanism 
for neutralizing guilt. The inference is typically 
drawn that such people do not recognize the rules, 
that—as far as they are concerned—these are not 
the rules of the system, except, perhaps, in the 
sense of a probability that others will react in a 
hostile way to certain behavior. Then the question 
does indeed arise: “Who is to say what is deviant?” 
We have perhaps been too quick to assume that to 
“accept,” “recognize,” “internalize,” “approve,” and 
“feel bound by” normative rules all mean the same 
thing. We suggest, on the contrary, that one may 
recognize a rule and even insist upon its propriety 
and necessity; one may accept the legitimacy of 
efforts to enforce the rule, even against oneself; 
and one may appraise the “goodness” of people in 
terms of conformity to the rule—but see the job of 
securing compliance with the rule as essentially 
somebody else’s job. One takes one’s chances and 
either “wins” or “loses.” It may be, for example, 
that “delinquent cultures” do not, in general, either 
“repudiate” (Cohen 1955) the rules of the “larger 
society,” “deny their legitimacy” (Cloward & Ohlin 
1960), or “neutralize” (Sykes & Matza 1957) them, 


but somehow institutionalize this “gamester” atti- 
tude toward the rules. 

Furthermore, it is necessary to distinguish be- 
tween what may be called the attribution of “va- 
lidity” to a rule and what might be called its “good- 
ness” or “propriety.” One may consider a rule 
stupid or unreasonable and yet recognize that it 
is the rule and therefore that it properly may or 
even ought to be enforced until it is changed. This 
would indicate that, within a given social system, 
there are criteria of what constitute the rules of 
the system that transcend individual differences 
about what the rule ought to be or differences with 
respect to depth of “internalization” of the rule. 
This distinction suggests a distinction made by 
Merton ([1949] 1957, pp. 359-368) with respect 
to two kinds of deviants: those who violate rules 
for any of a number of reasons but do not ques- 
tion the rules themselves; and those who violate 


rules in order to activate certain processes that, in 
that system, are necessary to effect “repeal” of a 
rule or to replace it with another. However, not all 


who would change a rule necessarily feel justified 
in doing so by violating it. In fact, it could be 
argued that the basis of social order is not con- 
sensus on what ought to be the rules, indeed that 
dissensus in this regard is the normal state, espe 
cially in modern society. Rather, the basis of order 
is agreement on the criteria of what the rules are 
and on the mechanisms for changing them. The 
intention of this discussion is to suggest that if we 
take account of these different ways of orienting 
to normative rules, disagreement on what the rules 
are is not so great as is commonly assumed. 

The sociology of normative rules. Acts are de- 
viant by virtue of normative rules that make them 
so. Therefore, the forms and rates of deviance 
change as the rules themselves change. In conse- 
quence of such changes, acts may move from n 
matively approved to forbidden; from one devian 
category to another; from some category of P ; 
ance to the category of “sickness” or in the 0 T 
direction. And some categories of deviance, sa 
as “heresy,” may become virtually extinct as Be 
of the functioning conceptual equipment of a a 
ciety. The study of such changes has been stae 7 
neglected, with some noteworthy exceptions in A 
sociology of law (Hall 1935). It should be ee 
that changes in normative rules cannot be a 
fully investigated apart from the study of behavi A 
oriented toward these normative rules. On the pa 
hand, normative rules shape behavior; on the oe 
hand, behavior is always testing, probing. and aa 
lenging normative rules, and in response ue sl = 
behavior normative rules are continually being ™ 


defined, shored up, or abandoned (Mills 1959; 
Cohen 1965). The study of this interaction process 
is an integral part of the sociology of deviance. 

Deviant behavior of collectivities. Whatever 
may be the metaphysical status of collectivities, for 
sociological purposes they are actors. They are 
social objects having names, public images, repu- 
tations, and statuses. They are publicly identified 
as authors of acts, and they are subject to rules. 
From the perspective of everyday life, collectivities, 
such as governments, corporations, fraternities, 
armies, labor unions, and churches, do things, and 
some of these things violate laws or other norma- 
tive rules. Little is known about the cultural un- 
derstandings on the basis of which acts (deviant 
and otherwise) are imputed to collectivities as dis- 
tinct from their members severally, because the 
matter has received practically no systematic study 
except in the field of corporation law. It is true that 
the status of an event as the act of a collectivity is 
a definition imposed upon the situation by some 
public and depends upon a set of culturally given 
criteria for attributing acts to authors. However, 
this is equally true of the attribution of acts to in- 
dividuals, and much of the law is concerned pre- 
cisely with specifying and making explicit the 
criteria for such attribution. 

All social acts are the outcomes of interaction 
processes. Whether they will be attributed to this 
concrete individual or that, or to a concrete indi- 
vidual or a collectivity, always depends on some 
culturally given schema through which action is 
viewed. Therefore, the neglect of deviant behavior 
of collectivities cannot be justified on sociological 
grounds. However, only in the area of “white-collar 
crime” (Sutherland 1949) has the subject even 
been approached. 

Theories of deviant behavior. We will make 
no attempt here to inventory the theories bearing 
upon one or another variety of deviance, but will 
limit ourselves to identifying the main features of 
the two traditions that most closely approach a 
generalized theory of deviance. The discussion will 
deal with contrasting emphases. It does not intend 
to offer a rounded picture of either tradition or to 
suggest that they are incompatible. 

The anomie tradition. The anomie tradition 
stems from the work of Durkheim (1897), espe- 
cially his analysis of suicide. Its emphasis is struc- 
tural and comparative, that is, it is concerned with 
explaining how variations in deviant behavior 
within and between societies depend on social 
structure. It is typically concerned with accounting 
for rates in contrast to individual differences. In 
Durkheim’s work the system properties that figured 
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most prominently were the degree of social inte- 
gration (variations in this respect accounting for 
suicide altruiste and suicide egoiste) and system 
changes that create discrepancies between men’s 
aspirations and the means for realizing them. The 
latter results in dérégulation, or anomie, that is, a 
breakdown in the power of social norms to regu- 
late and discipline men’s actions (variations in this 
respect accounting for suicide anomique). The 
elaboration of the anomie concept and the devel- 
opment of its implications constitute the anomie 
tradition. 

Merton ({1949] 1957, pp. 131-194), in his 
seminal paper, “Social Structure and Anomie,” 
made formal and explicit, and generalized to the 
field of deviant behavior, the model that was only 
partly explicit in Durkheim's analysis of suicide 
anomique. He emphasized the independent varia- 
bility of both the culture goals and the accessi- 
bility of institutionalized means (i.e., means that 
are compatible with the regulative norms). The 
disjunction between goals and means, leading to 
strain and to anomie, depends on the values of 
both these variables. Adaptations to such strain 
involve either accepting or rejecting the culture 
goals and either accepting or rejecting the institu- 
tionalized means. Each adaptation therefore in- 
volves two dichotomous choices; the logically pos- 
sible combinations of such choices yield a set of 
adaptations, one of which is conformity, and the 
others, varieties of deviance. This typology speci- 
fies the values of the dependent variable of the 
sociology of deviance-conformity. However, Mer- 
ton’s work is only a modest beginning toward 
specifying the conditions that determine choice 
among the logical possibilities. 

The Chicago tradition. The other tradition, 
which may fittingly be called the Chicago tradition, 
begins with the work of Thomas and Znaniecki, 
especially in The Polish Peasant (1920). This re- 
markable work is strikingly similar to Durkheim's 
writings in many respects, especially in its concern 
with the breakdown in the regulative power of 
social norms. As the tradition has developed, how- 
ever, it has taken on certain distinctive emphases. 
It has tended to focus not so much on deviance as 
an adaptation to strain as on deviance as cultur- 
ally patterned behavior in its own right. It has 
emphasized the social—psychological problem of 
the process of socialization into deviant cultural 
patterns. This approach has been most systemati- 
cally formulated by Clifford Shaw and Henry D. 
McKay (1942), by Edwin H. Sutherland in his 
theory of differential association (1942-1947), 
and most recently by Donald R. Cressey (1964). 
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Another development in the Chicago tradition 
stems from George Herbert Mead’s (1934) concep- 
tion of the self as an internalized object built up, 
in a process of communicative interaction, out of 
the social categories, or roles, available in the 
culture milieu. According to this conception, be- 
havior, deviant or otherwise, is supportive or ex- 
pressive of a social role. It is a way of validating 
one’s claim to such a role by behavior that is cul- 
turally significant of membership in such a role. 
This approach has been most developed by Erving 
Goffman (1956; 1963) and Howard Becker (1963). 
In general, the Chicago tradition emphasizes the 
learned nature of deviant behavior, the role of asso- 
ciation with others and of cultural models, the 
role of symbolism attaching to deviant behavior, 
and the gradual development of, and commitment 
to, deviant behavior in an extended interaction 
process (Short & Strodtbeck 1965). 

Development of comprehensive theories. The 
most comprehensive single formulation in current 
theory of deviant behavior is that of Talcott Par- 
sons (1951, chapter 7), which cannot be ade- 
quately subsumed under either tradition. It shares 
with the anomie tradition a stress on taxonomy, 
the concept “strain,” and the structural sources of 
deviance. It shares with the Chicago tradition a 
deep concern with interaction process and a con- 
ception of deviance and conformity as commit- 
ments that develop in the course of such inter- 
action. To a unique degree it integrates deviance 
theory with a more general theory of social systems. 

Two recent developments point toward a fusion 
of the Chicago and anomie traditions. Cohen 
(1955), starting with the conception of socially 
structured strain, has emphasized the role of inter- 
action process in the creation, as well as the trans- 
mission, of culturally supported deviant solutions 
or deviant subcultures. Cloward and Ohlin (1960), 
addressing themselves also to the determinants of 
choice among possible adaptations to strain, have 
emphasized the role of the availability, at the points 
of strain, of illegitimate or deviant opportunities, 
with special emphasis on the opportunity to learn 
and to perform deviant roles. However, the recon- 
ciliation or integration of the conception of deviant 
behavior as a way of dealing with a problem of 
ends and means, on the one hand, and as a way of 
communicating and validating a claim to a role, on 
the other, has not yet been achieved (Cohen 1965). 


Social disorganization 


When we say that a game, a plan, a committee, 
a family, an army, or a society has been “disor- 
ganized,” we mean a number of closely related 


things: that it has been interrupted; that its iden- 
tity is crumbling away; that its parts, although per- 
haps still recognizable, no longer hang together to 
constitute one thing; that it has disintegrated. In 
every case there is implied some criterion of same- 
ness, wholeness, continuity, or organization. This 
criterion is the correspondence of something “out 
there” to some pattern, model, or cognitive map in 
the mind of the observer. It defines the essential 
attributes or “boundary conditions” of a given type 
of object; the term “disorganization” refers to a 
break in the correspondence of what is “out there” 
to such a pattern. 

A social object, for example, a society or a fam- 
ily, is constructed of action. The pattern that de- 
fines such an object is a course of action or order 
of events. The same scene of action may be seen 
as containing several patterns: patterns that inter- 
sect and patterns within patterns. Whether a given 
object is disorganized depends on the pattern in 
terms of which it is defined. 

It will be useful to distinguish several special 
senses of the term “disorganization” that are com- 
patible with this more general definition. 

From the perspectives of everyday life. This 
discussion will focus on the meanings of social 
events to the people who participate in them; in- 
deed, this is the starting point of all sociological 
analysis. Crap games, corporations, political par- 
ties, and parades enter the sociologists lexicon 
because the conceptual schemes of everyday life 
make it possible for people to envisage them as 
possibilities, to recognize their existence and de- 
mise, to orient their action toward them, to take 
part in them, and to wreck them. 3 

One class of social objects may be called “activi- 
ties”: cleaning a rifle, preparing for battle, accom- 
plishing a mission, carrying out the Normandy 
campaign. Each of these is a sequence of acon 
that, from the standpoint of the actors, “hangs 
together” and constitutes “one thing.” Each, in 
turn, is part of a larger, more extended activity. 
The sameness or continuity of an activity ra 
depend, for the actor, on the correspondence o! 
the flow of events to some set of conventional rules 
(Cohen 1959). The model here is the “game ; its 
constitutive order is defined by the rules of the 

i i ber of ways 
game. There may be an infinite number 
of continuing a game without “breaching its ee 
daries”; however, the set of possible events ¢ a 
or “plays”) that will, at any juncture in the Pri 
continue the game, are given in the rules. Mai 
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of the nongame activities of everyday ite 
example, a party, a religious service, 4 eer al 


proceeding), or at least some of their e: 


components, are likewise defined by conventional 
rules. The sameness of the activity may also de- 
pend, for the actor, on the continued orientation 
of action toward some goal. Although the concrete 
action that goes into it may vary from moment to 
moment as the situation changes and although the 
act is literally built up out of bits of diverse action, 
it is seen and felt to be the same act so long as it 
is oriented to the same goal. Building a house 
would be an example. In either case, what has been 
going on, whether it is still going on, and the 
conditions that would constitute an interruption 
or disorganization of the activity depend on the 
pattern that defines that kind of activity for its 
participants. 

Another class of social objects may be called 
“collectivities.” Such are families, teams, corpora- 
tions, nations, gangs. A collectivity exists when 
both a common identity and a capacity for action 
are attributed to the incumbents of a set of roles. 
In other words, the pragmatic tests of a collectivity 
are whether it has a socially defined membership 
and whether it is socially defined as an actor. The 
collectivity ceases to be a “going concern” and is 
destroyed or “disorganized” when the common 
identity is extinguished and it is no longer treated 
as an actor. 

We have little systematic knowledge about the 
patterns to which structures of interaction must 
correspond in order to constitute collectivities in 
the world of everyday social life. However, if we 
are to talk about the “disorganization of collectivi- 
ties,” we must know what constitutes a breach of 
their boundaries; to do this we must determine 
what order of events defines a collectivity of a 
given sort for members of the society in question. 

From the social scientist’s perspective. Struc- 
tures of action may exist as objects for the social 
scientist that are not social objects from the per- 
spective of the “man in the street.” A “market 
structure,” a “substructure of goal attainment,” a 
“homeostatic process,” and an “ecological equilib- 
rium”—all are orderings of events in terms of some 
pattern that is part of the conceptual equipment 
of the social scientist. If these patterns are pre- 
cisely defined, they also imply a set of criteria for 
defining “disorganization” of the respective objects. 

Disorganization as the spread of deviance. The 
definition of disorganization in terms of the spread 
of deviance has a long history in sociology. Thomas 
and Znaniecki defined social disorganization as a 
“decrease of the influence of existing social rules 
of behavior upon individual members of the group” 
and went on to say that “this decrease may present 
innumerable degrees, ranging from a single break 
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of some particular rule by one individual up to a 
general decay of all the institutions of the group” 
(1920, vol. 4, p. 20). The principal subject matter 
of most textbooks entitled “social disorganization” 
is deviant behavior. According to this view, disor- 
ganization may be reduced and the integrity of the 
system restored either by strengthening social con- 
trols or by redefining norms so that behavior de- 
fined as deviant becomes normatively acceptable 
(Thomas & Znaniecki 1920, vol. 4, p. 4; Mills 
1959). 

Whatever the merits of this conception of disor- 
ganization, it cannot provide a general definition 
of disorganization. Certainly there are many struc- 
tures, visible from the specialized perspectives of 
the social scientist, that cannot be defined in terms 
of the conformity of action to normative rules. 
From the standpoint of the perspectives of every- 
day life as well, it seems important to distinguish 
between deviance and disorganization. “The rules 
of the game” define an activity or collectivity, that 
is, the pattern to which action must correspond if 
it is to constitute a certain sort of thing. The nor- 
mative rules, departure from which is deviance, 
specify how people ought to behave; they are cri- 
teria for judging the moral status of an act. For 
example, there are “dirty” ways of playing a game, 
unethical ways of carrying on a business, and 
“inhuman” ways of fighting a war. Still, they are 
unequivocally recognizable as integral to the 
respective activities, as “moves” in the respective 
games; and the rules of the game typically define 
the “next moves” that would constitute the con- 
tinuity of the game. 

Precisely because deviance is so intimately rela- 
ted to disorganization but is not in general identi- 
cal with it, it is necessary to distinguish them and 
treat the relationship between them as a problem 
for theory and empirical investigation. 

One of the conditions of the survival of any 
social activity or collectivity is that people be 
motivated to “play the game,” to take up their 
positions in the structure of interaction and con- 
tribute the moves that maintain the continuity of 
the structure in question. One of the general con- 
ditions of disorganization, then, is breakdown of 
motivation, and anything that undermines motiva- 
tion contributes to disorganization. It is elementary 
that conformity to normative rules—to some de- 
gree that cannot be stated in general terms—is 
fundamental to the maintenance of motivation. 
When people elect to participate in any social 
structure, they subject themselves to a certain dis- 
cipline; they commit resources; and they forgo 
alternatives. In other words, they pay a price. 
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Whatever the several reasons for which they join, 
their ends are attainable only if others “play the 
game,” and play it according to certain restrictions 
defined by the normative rules. Violations of the 
normative understandings tend to erode trust and 
undermine motivation. A certain amount of devi- 
ance is expected. Although disappointing, it is not 
surprising; it is allowed for in advance, and does 
not seriously impair motivation, However, at some 
point the spread of deviance and the consequent 
erosion of trust will destroy motivation and precip- 
itate disorganization. 

On the other hand, deviance may contribute to 
stability and preservation of the common enter- 
prise. Normative rules are usually adapted to typi- 
cal, recurring situations and tend to produce results 
that enhance the viability of the enterprise. How- 
ever, rules are categorical, and situations often 
arise in which conformity to the normative rules 
will thwart the attainment of the common objec- 
tive and weaken or destroy the structure. In short, 
there are times when someone must violate the 
normative rules if the enterprise is to succeed and 
thrive. Sometimes there are implicit rules (patterns 
of “institutionalized evasion”) that give flexibility 
to the normative rules so that the deviance is 
deviance only in an equivocal sense (Williams 
1951). But this is not always so, and therefore 
the relationship between deviance and disorgani- 
zation is rendered still more problematic. 

Finally, it is probably true that some kinds of 
deviance, even if not motivated by collectivity con- 
cerns, create the conditions necessary for the sta- 
bility of other substructures of the same system or 
of the system as a whole. Kingsley Davis (1937), 
for example, has made this argument relative to 
prostitution. 

Disorganization theory. There has been rela- 
tively little attention to the explicit development of 
disorganization theory, as compared with deviance 
theory. However, the beginnings of such theory are 
implied in more general theories and conceptions 
of social systems, such as general systems theory, 
interaction process analysis, structural—functional 
theory, and the input-output, homeostatic, equilib- 
rium, and cybernetic models. They tend to share 
the following ideas. 

A social system is, from one point of view, a 
mechanism that operates for its own perpetuation. 
It is what it is because the participants are moti- 
vated to behave in certain ways characteristic of 
the system and because the situation of action 
makes possible these ways and restricts the alter- 
natives. In order to preserve its structure (or so 
much of that structure as is constitutive of its 


identity) that motivation and situation must some. 
how be reconstituted, or other motivations and 
situations must be created that will generate be- 
havior coresponding to the same pattern. However, 
the system, as a product of its own functioning, 
tends to thwart the creation or re-creation of the 
conditions of its own survival. For example, it 
tends to transform the environment to which it 
has become adapted; to use up or to lose its own 
human and nonhuman resources; to generate dis- 
tance and distrust, resentment and alienation 
among its members; and to create new situations 
for which its culture provides no definitions or 
instructions. Various lists have been drawn up of 
conditions that must be met or tasks that must be 
performed if the system is not to fly apart in con- 
sequence of its own functioning (for example, see 
Aberle et al. 1950; Bales 1950; Parsons 1951, 
pp. 26-36). 

Most systems do manage to preserve their iden- 
tity. Therefore, the functioning of the system must 
also produce effects that correct or compensate for 
the centrifugal tendencies it produces. In particu- 
lar, the structure of such systems must include 
mechanisms for picking up information about 
threatening changes in the environment or in the 
system itself and for communicating that informa- 
tion to positions in the system that are capable of 
taking corrective action. Such action, in turn, may 
consist of responses tending to reduce or eliminate 
the change or further modifications elsewhere in 
the system, enabling the system as a whole to main- 
tain its boundaries or identity in the face of the 
change. 

Systems do not always succeed. Some are wholly 
extinguished; others suffer radical disorganization 
of various substructures but cling to those minimal 
attributes that define their identity. Some have 
standby mechanisms that can be activated in time 
to do the job of some injured organ or to get about 
the work of reconstruction before the damage 
proves lethal to the system; and others do not. 
These differences have been most systematically 
studied in connection with disasters (Baker & 
Chapman 1962). In general, it is the task of a 
theory of social disorganization to account for vari- 
ations in the ability of social structures to preserve 
their identity. 


ALBERT K. COHEN 


[Directly related are the entries CRIME; NORMS; goa 
PROBLEMS. Other relevant material may be fow a 
in Disasters; Drucs, article on DRUG ee 
SOCIAL ASPECTS; SOCIAL CONTROL; and in the 
raphy of SUTHERLAND.] 
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White Collar 


DEWEY, JOHN 


John Dewey (1859-1952), generally regarded 
as the most influential philosopher in American 
history, was born in Burlington, Vermont. After 
receiving his doctorate from Johns Hopkins Uni- 
versity in 1884, he taught at the University of 
Michigan (except for a year spent at the Univer- 
sity of Minnesota) until 1894, when he moved to 
the University of Chicago. 

Dewey was attracted to Chicago because peda- 
gogy there was included in one department with 
philosophy and psychology. At Chicago he estab- 
lished an experimental clementary school, wrote 
The School and Society (1899), and became in- 
volved with Jane Addams’ Hull House. At this time 
Dewey was developing his pragmatic approach to 
the theory of mind and his “instrumentalist” theory 
of logic: his essay “The Reflex Arc Concept in 
Psychology,” which had considerable influence on 
developments in psychology, appeared in 1896, 
and Studies in Logical Theory, a collection of es- 
says by Dewey and some of his colleagues and 
students, appeared in 1903. The book was greeted 
by William James as the signal of the birth of a 
“Chicago school” of pragmatic philosophy. 

In 1904 Dewey resigned his professorship at 
Chicago because he was displeased with the uni- 
versity administration’s actions toward the experi- 
mental school he headed. He moved to Columbia 
University as professor of philosophy, with addi- 
tional teaching responsibilities at Teachers College. 
Dewey taught at Columbia until his retirement in 
1929. During the 1930s he produced some of the 
most ambitious philosophical works of his career. 
He also continued to take part in a wide assort- 
ment of civic and political activities, the most 
dramatic of which was his service in 1937 as chair- 
man of the unofficial commission that held public 
hearings in Mexico and found Leon Trotsky in- 
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nocent of the charges made against him in the 
Moscow trials, 

Intellectual influences, Dewey's early intellectual 
attachments were to Hegel's philosophy. Dewey's 
New England upbringing had stressed the radical 
divisions that exist in man and the universe be- 
tween body and soul, nature and God, the world 
and the self. He found these beliefs “an inward 
laceration.” Hegel's philosophy, with its dialectical 
elimination of the presumed antitheses between 
matter and spirit, nature and the divine, and sub- 
ject and object, offered release from these oppres- 
sive dualisms. Hegel's influence on Dewey can 
be seen in Dewey's lifelong polemic against all 
forms of dualism in philosophy, in his concept of 
individuality as a social product, in his tendency 
to identify freedom with rational self-realization, 
and in his view that logical and moral principles 
are not fixed, external standards to be imposed on 
human inquiry and conduct but are instead evolu- 
tionary phenomena that emerge within the actual 
course of human thinking and acting. 

Over a period of fifteen years after leaving Johns 
Hopkins, however, Dewey drifted away from 
Hegel's philosophy and eventually renounced it al- 
together. The theme that came to seem increasingly 
important to him was what he called “the intel- 
lectual scandal” involved in the separation of sci- 
ence from morals. Hegel's abstract, metaphysical 
solution of this problem became increasingly un- 
congenial to Dewey, who desired to reformulate 
philosophical problems so that they implied clear 
alternative programs for social action. What Dewey 
described as his transition “from absolutism to ex- 
perimentalism” was further aided by the pioneer 
work of Charles Sanders Peirce (1839-1914) in the 
development of a pragmatic theory of logic and 
by William James's Psychology. Dewey’s own “prag- 
matism,” or “instrumentalism,” owes relatively little 
to James's pragmatism but much to James's bio- 
logical approach to the problems of mind. The work 
of Dewey's friend George Herbert Mead (1863— 
1931) also had considerable influence on Dewey's 
philosophy. Dewey's conceptions of the self, and of 
the genesis and function of such phenomena as 
“consciousness” and “conscience,” owe much to 
Mead’s work in social psychology and philosophy. 
Finally, Dewey’s thought is unintelligible except as 
a response to the Darwinian theory of evolution by 
natural selection. Dewey's theory of mind, his trans- 
lation of Hegel's categories into biological and cul- 
tural terms, his views on logic and morals, and his 
conception of the task of philosophy are all efforts 
to trace the implications of Darwin’s mode of 
thought. 


With respect to the question of intellectual in- 
fluences on him, Dewey's own judgment, however, 
should be emphasized. He believed that most of 
the significant influences on his intellectual de- 
velopment came not from books but from personal 
associations and practical experience, particularly 
in education. 

Instrumentalism. Throughout his career Dewey's 
central interest was to repair moral and social be- 
liefs and practices by encouraging the application 
of scientific methods and critical intelligence to 
them. Dewey believed that one of the principal ob- 
stacles to this was the traditional notion, embedded 
in common sense and defended and enshrined by 
philosophy, that “theory” is contemplative, pas- 
sive, and unmarked by practical concerns and that 
“practice” is by its very nature not susceptible 
to intellectual formulation or control. The develop- 
ment of a logic of human inquiry that would reveal 
the underlying unity of “theory” and “practice” was 
therefore Dewey's central intellectual enterprise. 

According to Dewey the advent of modern ex- 
perimental science had shown that theory and 
practice, far from being opposed, are in fact in- 
terdependent in successful scientific inquiry. Using 
such inquiry as a model, he formulated the view 
that general ideas are instruments for reconstruct- 
ing “problematic situations.” An idea, in other 
words, is what is sometimes known as a “leading 
principle” or an “inference ticket”—a rule directing 
and regulating the movement of an inquiry or argu- 
ment from one set of observations to another. The 
truth of an idea lies in its capacity to reorganize 
the materials of experience so that the problem 
that originally provoked reflection is resolved in ac 
cordance with the canons of disciplined inquiry: 
The power of an idea is measured by the novelty 
and significance for further inquiry of the questions 
it leads us to ask. Dewey thus rejected, or pany 
to reject, the traditional “correspondence theory o 
truth,” according to which the truth of an idea is 
simply a matter of its correspondence to the ex- 
ternal, independent reality to which it refers. 

Dewey’s “instrumentalism” appears to involve, 
the least, an overstatement, for if all ideas are E 
ply rules for making inferences, then we a 
forced to the paradoxical conclusion that there F 
no general statements in the sciences that pe i 
anything external to human habits of thought. z 
emphasis of Dewey's instrumentalism, nonethfk 
was extremely useful in enhancing critical a p 
standing of science. It presented inquiry as à pha 
in the continuing readaptation of a social ae 
to its environment and portrayed general ideas 
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as directives for action on the environment. It thus 
cast doubt on classic distinctions between theory 
and practice. 

Theory of moral judgment. To establish still 
further the continuity of science and morals Dewey 
also undertook to show that moral judgments are 
subject to the same essential logic of inquiry as 
that of the sciences. Dewey's argument, as de- 
veloped, for example, in Logical Conditions of a 
Scientific Treatment of Morality (1903b) and 
Theory of Valuation (1939a), is that moral ideals 
are properly interpreted as hypotheses proposing 
that certain courses of action will resolve specified 
sets of difficulties. Moral ideals, therefore, like the 
general ideas of the sciences, are instruments for 
the solution of problems, and their validity is to be 
determined by a matter-of-fact examination of the 
consequences of acting on them, analogous to the 
procedure by which general ideas in the sciences 
achieve acceptance. 

Dewey's position is frequently criticized on the 
ground that he erased the distinction between 
factual statements and value judgments, a distinc- 
tion on which the very conception of an objective 
science depends. The force of these accusations is 
weakened when it is seen that Dewey's argument 
entails the denial of the normally accepted view 
that there is a hard and fast distinction between 
means and ends. Critics of Dewey frequently ask 
how, in Dewey's terms, instrumental moral value 
can be ascribed to a course of action when he 
denied that there are in the last analysis any ends 
that have value for themselves. Dewey, however, 
rejected this question as irrelevant to the actual 
conditions under which moral choice takes place, 
for he believed that moral perplexities arise in 
specific contexts where certain established values 
are imperiled but where a host of other values, 
which might be questioned in other contexts, are 
not in fact in question. The problem of infinite 
regress is therefore not relevant to the practical 
contexts in which moral judgments are made; it 
implicitly introduces standards of demonstrative 
certainty that are not appropriate to this domain of 
thought. 

The analyses of moral judgments that Dewey 
offered in different works were, however, not en- 
tirely consistent, and in the eyes of many critics 
he never successfully refuted the charge that he 
confused descriptive and prescriptive statements. 

Critique of philosophy. Much of Dewey's work 
consisted of polemics against “the classic tradition” 
in philosophy. The major effort of classic philos- 
ophy, he argued in such books as Reconstruction in 
Philosophy (1920) and The Quest for Certainty 
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(1929), had been to give men a sense of surcease 
from surrounding perils they were helpless to over- 
come. Accordingly, the usual message of classic 
philosophy was that behind the everyday world of 
change and irrationality there is an unchanging 
and rational world, in the contemplation of which 
men may gain understanding and serenity. Thus, 
in Dewey's view, philosophical “dualism” was es- 
sentially an instrument of conservatism and retreat. 
It was also, he argued, an expression of such 
aristocratic social prejudices as the distaste for 
manual labor, Although relatively few scholars ac- 
cept Dewey's picture of the history of philosophy, 
his placing of that history within a social context 
has been the source of a considerable reappraisal of 
the Western philosophical inheritance. 

Philosophy of education. For Dewey the prin- 
cipal object of education was to instill in students 
the attitudes and habits conducive to the develop- 
ment of their capacity to solve problems. As he 
argued in Democracy and Education (1916a), this 
required that classroom emphasis be placed not on 
arrays of factual information or on inherited ideas 
presented as settled and accepted but on the intel- 
lectual methods by which such factual information 
or such ideas are discovered and reliably estab- 
lished. This view was further fortified by Dewey's 
belief that objective attitudes toward moral and 
social questions require the rejection of absolutes 
and the cultivation of flexibility and tolerance. 
Moreover, Dewey argued that a democratic culture 
requires from its members a capacity to adapt to 
diverse circumstances and to cooperate as equals 
with men and women of many different sorts. 
Schools responsive to these democratic imperatives 
would therefore aim at training students in habits 
of free and constant inquiry, in capacities to learn 
quickly, and in attitudes of social fellow feeling 
and cooperation. In spelling out this prescription, 
Dewey laid great stress on the atmosphere of the 
classroom: he opposed rote learning, stressing in- 
stead the pedagogical desirability of “learning by 
doing” and of connecting the materials of formal 
school instruction with the child’s experiences out- 
side the classroom. 

Dewey's theories of education came to be widely 
adopted, and they were given a variety of inter- 
pretations by ardent disciples. In his short book 
Experience and Education (1938a), written toward 
the end of his life, Dewey took account of some of 
the varieties of “progressive education” that had 
been associated with his name and expressed his 
serious misgivings about them. 

Conception of the social sciences. Dewey pro- 
vided intellectual support for the view that the 
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logic of inquiry in the natural sciences is applicable 
in its major features to the study of human affairs. 
However, although he argued that the social sci- 
ences could and should offer objective descriptions 
of facts, he also stressed that their progress as in- 
tellectual disciplines depended on the importance 
of the subjects they chose for study and on the 
refusal of social scientists to dodge controversial 
issues. He argued that adequate social inquiry, like 
physical inquiry, requires the experimental manip- 
ulation of existential conditions. Accordingly, the 
removal of taboos against social planning, he be- 
lieved, would greatly aid the progress of man's 
social knowledge. 

Social outlook. Dewey's social views, in general, 
may be characterized as reasonably typical of the 
so-called progressive era of American thought. His 
special contributions to progressive thought con- 
sisted in his critique of ivory-tower ideals of scholar- 
ship, his attacks on such intellectual absolutes as 
the doctrine of natural rights, and his enlargement 
of the concept of freedom to include the dimension 
of personal self-realization, beyond the mere ab- 
sence of external restraints. Closely connected with 
this was Dewey's restatement of the relation of the 
growth of individuality to environing cultural con- 
ditions, a view that led to the emphasis on the role 
of the school in social reconstruction, In such 
essays as “Logical Method and Law” (1924), 
Dewey also applied his instrumentalist approach to 
questions of jurisprudence, influencing the evolu- 
tion of American “legal realism”; in The Public 
and Its Problems (1927), he applied a similar ap- 
proach to problems of political science, contributing 
to the progress of “interest group” theory. 

The broad ideal behind his social outlook was 
articulated in several works, most notably, perhaps, 
in Art as Experience (1934). His critique of the 
industrial society of his day was based mainly on 
his conviction that this society reduced men to a 
state of passive acquiescence in external routines 
laid down for them. His image of a good society 
was one in which men are active agents, intel- 
ligently setting their own standards and partici- 
pating freely and equally in the making of their 
common destiny. 

Influence. Dewey's impact on American phi- 
losophy before World War 1 was massive, and his 
impact on educational theory and practice was 
even greater. His influence on psychology, juris- 
prudence, political science, and styles of thought 
in history and economics was also considerable. 
Even more important, perhaps, was his general 

influence on the atmosphere of American scholar- 
ship. He helped free that scholarship from sub- 


servience to genteel conventions and theological 
modes of thought, and he was one of those prin- 
cipally responsible for the acceptance of the view 
that the study of human affairs is properly a task 
of empirical science. By the example of his life, by 
his activities as a leader of such organizations as 
the American Association of University Professors, 
and most of all by his courageous articulation of 
his conception of philosophy, he contributed as 
much as any man to the spread of the idea in 
America that free scientific inquiry, recognizing no 
limits to the questions it might ask, is the linchpin 
of a sound society and of all responsible social 
action. 

CHARLES FRANKEL 


[For the historical context of Dewey's work, see the 
biographies of Darwin; HEGEL; James; MEAD; 
Peirce. For discussion of the subsequent develop- 
ment of his ideas, see EDUCATION; EDUCATIONAL 
PSYCHOLOGY; LEARNING; and the biographies of AN- 
GELL; BEARD; BENTLEY; MERRIAM; ROBINSON.] 
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DICEY, ALBERT VENN 


Albert Venn Dicey (1835-1922), British consti- 
tutional lawyer and political theorist, was given his 
middle name in honor of John Venn, leader of the 
Clapham evangelists. On his mother’s side he was 
related both to the Venns and to the Stephens— 
Sir Leslie Stephen, philosopher, literary critic, and 
author of The English Utilitarians, and Sir James 
Fitzjames Stephen, a judge and the author of a 
celebrated history of English criminal law. Dicey 
was educated at Balliol College, Oxford, where one 
of his friends and contemporaries was James (later 
Viscount) Bryce, author of The American Com- 
monwealth. Later Dicey became a fellow of Trinity 
and in 1882 was elected to the Vinerian professor- 
ship of English law at All Souls College, Oxford, 
the chair first occupied by Sir William Blackstone. 
Dicey’s two major works were his Introduction to 
the Study of the Law of the Constitution (1885) 
and Lectures on the Relation Between Law and 
Opinion in England, During the Nineteenth Cen- 
tury (1905). Both books are masterpieces of com- 
pression, and the Law of the Constitution still 
commands attention for its exposition of three 
major principles of the British system of govern- 
ment. These principles, as Dicey saw them, were 
(1) the close relationship between formal legal 
rules and informal conventions of constitutional 
behavior, (2) the legislative supremacy of Parlia- 
ment, and (3) the rule of law. 

Each of these principles has provoked criticism 
and debate in the twentieth century. Thus, Dicey 
insisted that conventions (for example, that gov- 
ernments resign if defeated on an issue of confi- 
dence in the House of Commons) were obeyed 
because of the existence of an ultimate legal sanc- 
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tion; a government that defied the convention 
about resignation, he argued, would find that the 
Commons would refuse to appropriate taxes or pass 
the Army Act, and administration could not there- 
after be carried on legally. The explanation is not 
convincing. It does not tell us why conventions 
are obeyed when governments have solid majorities 
in the House of Commons; or why the majority 
respects the rights of the opposition (and pays its 
leader a salary); or why the Queen assents auto- 
matically to legislation; or why the legal powers of 
the crown and of Parliament are not exercised over 
territories to which only de facto independence has 
been given. 

About the “sovereignty of Parliament” Dicey had 
much to say. It meant, he pointed out, the legal 
supremacy, or absolutism, of the “queen-in-Parlia- 
ment.” In the eyes of the English lawyer and the 
English courts, the queen-in-Parliament is capable 
of making or unmaking any law, and no body or 
person or court can set aside this sovereign and 
omnicompetent will. Fundamental laws and judi- 
cial review are unknown to the law of England. 
The only legal impossibility is that of binding or 
restricting the powers of future Parliaments; for 
what one Parliament can do, another can undo. 

The inconvenient practical implications of this 
common-law doctrine become evident when pro- 
posals are made, as they have been in Canada and 
other Commonwealth countries, to insert into the 
legal system a body of enforceable legal rights. If 
each successive English Parliament is sovereign in 
Dicey’s sense, such proposals cannot in legal prin- 
ciple be carried out. As a matter of political prac- 
tice, however, Britain's “sovereign” Parliament did 
bind itself irrevocably, and its successors as well, 
when in 1931 (and since then with increasing 
frequency) it relinquished the power to legislate 
for formerly dependent territories. And the United 
States is a monument to the futility of the English 
principle of legislative omnipotence, whether exer- 
cised by George m's Parliament or any other. 

From the jurisprudential point of view, there 
have developed in recent times some differences 
on a problem that Dicey did not foresee, namely, 
whether Parliament is prohibited by the sovereignty 
principle from altering the legislative process in 
such a way that it prevents future Parliaments 
from repealing or amending, by simple majority 
vote, laws deemed to be of constitutional impor- 
tance. There is disagreement about what the prob- 
able attitude of the courts would be if Parliament 
were to attempt to bind itself or its successors 
procedurally by requiring a special majority for 
the repeal of certain “entrenched” laws. This ques- 
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tion is of great potential importance in new consti- 
tutional systems deriving from the British parlia- 
mentary model, though it remains a theoretical 
issue in the United Kingdom. 

Dicey saw his third principle, the rule of law, 
as one that precludes not only arbitrary acts but 
even the existence of any wide discretion on the 
part of government officials. The rule of law makes 
possible the single jurisdiction, admired by Dicey, 
in which both citizens and state servants are sub- 
ject to the same set of courts. He preferred the rule 
of law (thus defined) to the Continental system, 
in which administrative courts for the adjudication 
of disputes between the citizen and the administra- 
tion exist alongside of the ordinary courts. Under 
the common-law principle, he believed, the state 
and the individual are treated as equals. 

Critics have suggested that Dicey’s view is politi- 
cally and administratively conservative and that 
it underrates the need for administrative discretion 
at the same time that it disregards substantial 
inequalities that may arise between citizens and 
officials as a result of executive privileges. A system 
of droit administratif may, by contrast, overcome 
these deficiencies, Dicey was not, however, in his 
later writing entirely insensitive either to the merits 
of the Continental system of control or to the need 
for discretionary powers on the part of government 
departments (Dicey 1915; also, Lawson 1959). 
Furthermore, Dicey’s strictures on official arbitrari- 
ness and discretion in matters of civil, as distinct 
from economic, liberties and in regard to the 
powers of policemen, if not those of civil servants, 
are today widely accepted. Indeed, Dicey’s views 
on free speech and the right of public assembly 
sound very modern, in contrast to his statements 
on economic welfare and state intervention. Since 
similar comments could be made about other con- 
temporaries of Dicey’s, notably John Stuart Mill, 
it would be worth investigating why the economic 
principles of Victorian England should have proved 
so ephemeral and the political ones so durable. 

Dicey’s social and economic views were those of 
a Whig and a free trader. On the Irish question he 
was a Unionist and supported his opinion with a 
vigorous tract, England's Case Against Home Rule 
(1886). Although his Law and Opinion is primarily 
historical and descriptive, his dislike of collectivist 
legislation can be divined in it without difficulty. 
In dealing with the relation between legislation and 
English thought and opinion, he refers to “opinion” 
primarily in the sense of organized juristic, politi- 
cal, and economic theory rather than in the sense 
of “public opinion.” In linking legal development 


to intellectual movements, he saw a period of “Old 
Toryism” linked to Blackstonian optimism, fol- 
lowed by a period of Benthamite reform linked 
with the utilitarian philosophy, and finally a period 
of collectivism, which he thought reflected “a senti- 
ment rather than a doctrine.” 

Collectivism, he believed, owes a direct debt to 
Benthamism, which by providing precedents for 
centralized governmental intervention of a limited 
kind, had forged the instruments for more exten- 
sive collectivist measures. The first form of inter- 
vention he styled “negative,” the latter “positive.” 
The contrast was an unclear one, since any “posi- 
tive” state purpose can be reformulated in terms 
of some social or economic impediment to freedom 
of access (to opportunity or happiness), which it 
is necessary to remove by collective restrictions. 
In 1860, Dicey thought, there was a widespread 
belief in Mill’s “simple principle” of individual 
liberty. But by 1914 there was collective provision 
for old age pensions, national health insurance, 
and school meals. Dicey doubted whether these 
measures could be justified on the principles set 
out in the essay On Liberty. He was surprised that 
the recipients of pensions from the state were al- 
lowed to continue to vote. 

One interesting question raised in Law and 
Opinion is why English society has absorbed major 
legislative changes without resistance or obstruc- 
tion from those whose interests are adversely af- 
fected. (The same question was posed by many 
after World War 11.) One of the answers suggested 
by Dicey is the misleading English belief that legis- 
lation changes little and in the end leaves most 
things in society much as they were. Dicey himself 
sometimes seemed to accept this view, which may 
explain his faith in social change through an en- 
lightened public opinion. 

GEOFFREY MARSHALL 


[For discussion of the history and subsequent develop- 
ment of Dicey's ideas, see ADMINISTRATIVE LAW; 
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DICTATORSHIP 


In contemporary usage “dictatorship” refers to 
the unrestricted domination of the state by an in- 
dividual, a clique, or a small group. Instances of 
dictatorial rule are found in all epochs and in all 
civilizations. The term “dictatorship” may signify 
not only the governing principle of a political sys- 
tem but also an ideology underlying a way of life 
and a normative expression of political behavior. 
Several expressions have been used to characterize 
the historical phenomenon of dictatorial rule: tyr- 
anny, despotism, autocracy, Caesarism, Fiihrerstaat, 
authoritarianism, totalitarianism. 

Apart from constitutional dictatorships estab- 
lished to deal with governmental emergencies, all 
forms of dictatorship share the following features: 

(1) Exclusivity and arbitrariness in the exercise 
of power. Dictatorships are characterized by the 
absence of a division of power, the suppression of 
competing, legitimate political and social groups 
and institutions, the concentration of political 
power in the hands of a dictator or of an autocrat- 
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ically governing group of leaders (elite), and the 
utilization of an autocratically guided and manipu- 
lated ruling apparatus to develop a monopoly of 
power. 

(2) Abolition or loosening of the juridical bonds 
of political power. The constitutional state is elim- 
inated, or a new revolutionary or counterrevolution- 
ary law created, merely as an instrument of rule. 
Related to this feature is the difficulty or impossi- 
bility of regulating the succession of the dictator in 
a lawful fashion. 

(3) Elimination or substantial restriction of 
civil liberties. Instead of voluntary cooperation of 
socially and politically autonomous groups and 
associations in the erection of the commonwealth, 
emphasis is placed on the citizens’ obligation to 
perform compulsory labor or collective services. 

(4) The predominantly aggressive, impulsive 
form of decision making. The domestic and for- 
eign policies followed by the dictator and/or the 
leading political elite are usually made impulsively 
and are inspired by a dynamic political activism, 
often based upon an ideological Messianism and 
aimed at transforming or disciplining the society. 

(5) Employment of despotic methods of politi- 
cal and social control. Such methods range from 
intimidation to propaganda, from imposing the 
duty of obedience to methods of terror. 

These characteristics of dictatorship are to be 
found in various combinations and modifications in 
different historical configurations: in the Greek 
and Sicilian tyrannies described by Plato and Aris- 
totle and in the postconstitutional Caesarist dicta- 
torship of the Roman Empire, from Sulla and 
Caesar to the imperial despots; in the many mani- 
festations of the tyranny of the noble families and 
the urban oligarchies in the early and late Renais- 
sance, whose problems were condensed in classic 
form by Machiavelli in The Prince; in the English 
absolute monarchy and the hesitant despotism of 
its revolutionary opponent, Oliver Cromwell; in the 
terroristic Jacobin dictatorship of the French Revo- 
lution and the social-revolutionary communist 
states; in the short-lived, counterrevolutionary 
fascist and National Socialist states; in the earlier 
forms of Latin American military dictatorships; 
and, since World War 1, in numerous completely 
or partially totalitarian structures patterned on the 
sociopolitical models of fascism or communism 
(e.g, Falangist Spain, Peronist Argentina, com- 
munist Cuba, and the states of such leaders as 
Nasser and Nkrumah). 

The concept of dictatorship, in its origin and 
evolution, may be understood both as a comple- 
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mentary and protective constitutional device and 
as a complete antithesis to the democratic constitu- 
tional state. Thus, Carl J. Friedrich (1937), in 
referring to the ancient Roman model, makes a 
distinction between constitutional and unrestricted 
dictatorship. Franz L. Neumann (1957, p. 248) 
comments that dictatorship may arise and function 
as “implementation of democracy,” “preparation for 
democracy,” or the “very negation of democracy.” 
Plato and Aristotle saw the origin of tyranny in the 
weaknesses and degeneration of democracy, and 
political theory has been based on the polarity of 
democracy and dictatorship ever since. However, 
the view that a revolutionary dictatorship neces- 
sarily presupposes the existence or the counterpart 
of a democratic constitution is disputed. Answers 
may be provided by the recent sociological and 
political research into the historical process of 
transition from a constitutional, restricted dictator- 
ship to an unrestricted, total dictatorship. It has 
also been held that the communist “totalitarianism 
of the left” has evolved from the rudiments of egal- 
itarian, messianic democracy of the French Revo- 
lution (Talmon 1952). 

The inability to function and the internal weak- 
ness of democracy are undoubtedly among the 
main causes of the establishment of dictatorial 
rule. The totalitarian communist system of the 
Soviet Union arose in consequence of the crumbling 
away of tsarist autocracy, hastened along by a 
mass movement. In general it can be shown that 
unresolved social tensions and economic crises, 
together with the undermining of constitutional 
order and the development of undemocratic power 
aggregates, are among the conditions that give rise 
to dictatorial regimes. 


Types of dictatorship 

Differences in origin, legitimation, organization 
of rule, and goals, as well as in political style, have 
led scholars to isolate types of dictatorship and to 
differentiate among them. 

Both Plato (Republic vim and 1x) and Aristotle 
(Politics book m1) dealt mainly with the structure 
and methods of tyranny and provided initial insight 
into the nature of dictatorial rule. Machiavelli was 
the first to distinguish between dictatorship as a 
constitutional institution of the republic and as a 
despotic form of government, which he recom- 
mended to the ruler as a means of restoring politi- 
cal order. Absolute monarchies are generally not 
regarded as dictatorships, since the exercise of 
power is clothed in traditional legitimacy. Yet 
whenever an absolute sove ign actually rules des- 
Potically, violating the customary standards of 


monarchical authority, his rule must be termed a 
dictatorship (e.g., Louis x1, Richard m1, Henry 
Philip 11). 

The well-known distinction between provisiona 
dictatorship (kommissarische Diktatur—the grant 
of special full powers in the event of state emer- 


revolutionary change of the entire political a 
social order) made by Carl Schmitt (1921) is 
hardly fruitful sociologically in view of the histori- 
cal and cultural variations in dictatorial rule. ~ 
The three ideal types developed by Franz L., Neu) 
mann (1957, p. 256 ff.), which use as a criterion 
the instruments of rule employed or required by 
dictators, are far better suited for classifying the 
various historical phenomena and systems of dic- 
tatorship. Neumann distinguishes “simple dictator- 
ship” (the ruler exercises absolute control of the 
traditional instruments of state power ), “Caesaristic 
dictatorship” (to gain power and to consolidate it, 
the ruler requires the support of broad masses of - 
the people and the execution of social-economic 
reforms), and “totalitarian dictatorship” (rule is” 
exercised through a differentiated power apparatus 
controlled by a governing party and a “social move- — 
ment”). 
Some authors assume that the process of trans- 
forming a constitutional dictatorship into a revolu- 
tionary dictatorship leads, in modern industrial 
societies, either to authoritarianism, a form of dic- 
tatorship based upon the prevailing values in the 
Society, or to totalitarianism, the form of dictator- 
ship that is able to force through a new system of 
values in society (cf. Drath 1958). This distincti 
is worthy of note because it makes allowance f ; 
the fundamental importance of the different socio- 
cultural presuppositions and sociopolitical goals of 
dictatorships. J 
Indeed, in any endeavor to set up a political- 
sociological typology of dictatorial systems hog 
sociocultural factors must be regarded as the pri- 
mary differentiating criteria, in addition to the — 
specific governmental structures and the means of © 
safeguarding the monopoly of power. onl 
The summary below of certain ideal types 0f 
dictatorship attempts to make allowance for “a k 
interaction of the cultural, social, political, a 
psychological factors, which is characteristic of a 
different historical manifestations of these systems | 
of rule. At the same time, these types show oe 
different are the intensity of this interaction angg 
the stability of the political-social relationships 
within the several systems. riled , 
Despotic one-man rule. Despotic one-man TUY 
fo hi, forms of 
is historically represented by the many 


tyranny in ancient Greece and Sicily and in Ren- 
aissance Italy, by some instances of Oriental mo- 
narchic despotism, and by certain cases of one-man 
rule in the developing countries. Political power is 
seized, usually by a palace revolution or coup 
détat, when a state or a society is in a critical 
situation; it is usually exercised only for a short 
time by a despot without moral scruples but capa- 
ble of bold decisions, This distinctly arbitrary rule 
is particularly unstable, because it is not supported 
by a strong organization; as a rule, it is backed 
only by some conspiratorial groups, small coteries, 
political bands or factions, camarillas, or military 
cliques. Usually the motive for the seizure of power 
is not concern for the commonweal, but personal 
gain, the suppression of opponents, or the con- 
quest of foreign territory. The wielders of power 
are aristocratic conspirators, plebeian demagogues 
and “tribunes of the people,” or war leaders (con- 
dottieri ), Sometimes these leaders have the political 
regime they usurped confirmed by plebiscites or 
try to consolidate it by victorious military cam- 
paigns. 

This type is related to Neumann's “simple dic- 
tatorship” but differs from it in that here the tradi- 
tional instruments of state power are played off 
against one another, rather than being meaning- 
fully coordinated and utilized to consolidate the 
ad hoc rule. The premises of such rule are the 
insecurity of social conditions, class conflicts, crises 
phenomena in existing democratic systems, mili- 
tary threats from foreign countries, as during the 
period of tyranny in ancient Sicily, and the meet- 
ing of different civilizations. In some instances 
(e.g., Cola de Rienzi) this form of rule has social- 
revolutionary traits, but it usually endeavors to 
consolidate ad hoc an existing or vanishing social 
status quo. Despotic one-man rule has an unmis- 
takable Machiavellian appearance: a pragmatic will 
for the exercise of raw power through the manipu- 
lation of all the available social and technological 
means. 

Elite-related rule. The most important feature 
of elite-related rule, whether by one man or by a 
group, is the development of a pyramid of power 
in an authoritarian state. The dictator controls the 
decisive key positions at the head of a combination 
of social elites and power aggregates consisting of 
such elements as the army, the police, the bureauc- 
racy, the nobility, the propertied class, and/or the 
dominant groups in a parliament. He endeavors to 
achieve a balance of power among these elites or 
else to range the groups that support him against 
others. Recognition of the fact that the more stable 
the foundations of this rule, the greater the likeli- 
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hood of continuous exercise of power, places limita- 
tions on the arbitrariness of the dictator. In many 
instances such rulers have attempted to provide 
constitutional guarantees for their regimes. This 
trend may be found in the various stages of Crom- 
well’s dictatorship as well as in the period of 
Jacobin rule. 

This form of rule is constantly subject to the 
hazard that its terroristic nature will be exacerbated 
by rivalries among the leading elites or by foreign 
military threats. In an elite-related dictatorship a 
division of political functions exists almost by defi- 
nition, The rulers surround themselves with rev- 
olutionary councils, advisory committees, and 
paramilitary organizations. Ambitious aides or com- 
petitors of the dictator demand their share of politi- 
cal power (e.g., Danton, Barras, Hébert, Fouché 
during the rule of the Jacobins; or Abbé Sieyés 
and Talleyrand under Napoleon 1). 

Governmental systems of this type may in cer- 
tain cases serve to maintain a sociopolitical status 
quo, to overcome a crisis, to prevent a revolution. 
Yet, as a rule, the measures taken by dictators and 
their advisory committees in a given socioeconomic 
situation are based on a particular sociopolitical 
concept of planning. The revolutionary or reform 
or restoration policies they advocate require popu- 
larity among the masses of the people. Conse- 
quently, this type often resembles Franz Neumann's 
definition of Caesarist rule. 

Historical examples of this type are: the later 
Roman dictatorships (Sulla, Caesar, Augustus) 
with their Senate cliques and civil war factions, 
their triumvirates, rival armies, and provincial bu- 
reaucracies; the regime of Cosimo and Lorenzo di 
Medici (urban oligarchy, Council of Seventy, or- 
ganization of the city's poor); Oliver Cromwell's 
rise from a radical speaker in Parliament and 
political leader of the Ironsides to Lord Protector 
(manipulation of Parliament's rule, supported by 
the army and by a newly established, rudimentary 
bureaucracy); the dictatorships of the Reign of 
Terror in the French Revolution (Robespierre, the 
club of the Jacobins, the Committee of Public 
Safety, and the Commune); the consulate and em- 
pire of Napoleon Bonaparte, founded on the pres- 
tige of his victorious army; the plebiscitarian, 
adventurous regime of Louis Bonaparte; the older 
South American dictatorships of the nineteenth and 
early twentieth centuries, established by corrupt 
generals through pronunciamento and based upon 
the power of the army; and finally, some of the 
more recent authoritarian forms of rule in the 
developing countries. 

These dictatorships exhibit the rudiments of ide- 
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ological justification of authoritarian rule (a “politi- 
cal formula” in Gaetano Mosca’s sense) and the 
beginnings of ideologically oriented social move- 
ments (in the English revolution the independ- 
entism of the parliamentary Left and of Cromwell's 
army; in the French Revolution the tendency toward 
an egalitarian socialist democracy). The means of 
social control employed are primarily aimed at the 
core of the power organization or at the elites 
emerging from the ruling class. The masses of the 
people are constrained to admire or revere the per- 
sonality of the leader. The social and political activ- 
ities of such dictatorships are partly revolutionary 
and partly restorative—especially when the powers 
of the different social classes are balanced. Thus, 
Sulla’s exercise of power, and that of many South 
American dictators, were counterrevolutionary or 
restorative, since they arrogated to themselves the 
function of “preserving law and order” in times of 
crisis and thereby prevented thoroughgoing social 
reforms. On the other hand, Cromwell's rule, like 
that of the Jacobins, bore revolutionary traits; both 
were pacemakers of that bourgeois revolution in 
whose name Napoleon 1 later waged his campaigns. 

Oriental despotism. The concept of an Oriental 
or Asian society, including primarily the ancient civ- 
ilizations of China, India, the Near East, and Tatar 
and tsarist Russia, but also the Eastern Roman and 
Byzantine empires, was known to the early political 
economists, as well as to Karl Marx and Max 
Weber. Recently K. A, Wittfogel (1957) has ex- 
plored anew the socioeconomic, cultural, and politi- 
cal factors that characterized it. He finds that 
Oriental despotism differs in essential features 
from the dictatorships of antiquity, the Middle 
Ages, and modern Europe, although it is akin in 
some ways to the elite-related as well as the totali- 
tarian types. 

A “hydraulic” society, based on extensive systems 
of waterworks, evolved a widespread bureaucratic 
network that directed the organization and plan- 
ning of corvée (forced labor) for irrigation proj- 
ects. According to Wittfogel, this brought forth 
an absolutist “managerial state.” The principal eco- 
nomic, administrative, and political functions lay 
in the hands of a ruling class consisting of bureau- 
cratic landowners and land managers, officers, and 
an influential priesthood. The person of the ruler 
enjoyed the highest secular and, in part, religious 
authority. His despotic regime was based on the 
state’s bureaucracy and on the army, but it was not 
totalitarian. Although the ruler did demand obedi- 
ence and complete submissiveness from his serv- 
ants, he respected the human rights of social 
groups in areas outside the purview of the state. 


Most political conflicts occurred within the ruling 
class; social conflicts and insubordination outside 
the ruling stratum were prevented by customary 
techniques of terror [see ASIAN SOCIETY, article on 
SOUTHEAST ASIA]. 

Totalitarian rule. Examples of totalitarian rule 
fall into two groups. First there are the sociopoliti- 
cal systems of fascist Italy, Nazi Germany, and 
the semifascist dictatorships of Peron in Argentina 
and Franco in Spain; second, there is the com- 
munist system in its various historical versions 
(above all, the Soviet Union and China) and sim- 
ilar political structures in the developing countries. 

Dictatorships of the Western fascist type arose 
as “crisis products” of the capitalist economic and 
social systems. Basically counterrevolutionary, they 
were characterized by an activist, militant social 
movement, which employed the Führer principle 
and methods of social discipline and control to 
mobilize and organize social and political forces, 
especially members of the middle classes who felt 
socially threatened. The Soviet communist system, 
on the other hand, was born of the class antago- 
nisms of bourgeois society, with the aid of an origi- 
nally democratic mass movement based on a revo- 
lutionary theory of society. Yet this movement is 
not itself a dictatorship, even in the sense of the 
concept “dictatorship of the proletariat.” (This 
term, seldom used today, implies that all political 
power stems from the organized working class; but 
the dictatorship of the proletariat has a utopian 
character and does not fall within the meaning of 
dictatorship as defined at the outset of this article; 
see Lenin 1917.) 

Notwithstanding resemblances and parallels in 
social structure, in the use of political ideology for 
legitimizing and maintaining the regime, and in 
the application of modern scientific and techno- 
logical means of organizing the economy and con- 
trolling men, there are striking differences between 
the communist and fascist systems. Fascism and 
National Socialism proceeded from counterrevolu- 
tionary concepts of society. Communism, however, 
has a revolutionary model of social development 
and evolved rational, bureaucratic forms of policy 
making that have maintained the system through 
many generations and have helped to one 
the political structure despite domestic and pani 5 
perils. Some of these differences involve the 7 
torical conditions under which the movements le- 
veloped and the structure and the special fn 
of the political ideologies and values that determi 
the actions of the leadership and mass behavior. : 

Every fully developed totalitarian rule inven 
not only the political structure, the position an 


function of the monopoly party and its organiza- 
tional satellites, and the relations among the state, 
the social movement, and the society. The concept 
of totalitarianism also includes the entire social 
structure and all the measures taken to transform 
it, a centrally directed economy, as well as the po- 
litical ideology and legal system developed to justify 
and maintain the regime. In sum, we can speak of 
totalitarian rule only where a centralistically ori- 
ented mass movement, led by a militant political 
minority in an authoritarian manner, relying on 
the monopoly of power, and with the aid of a dic- 
tatorially ruled state, builds an apparatus of power 
which bears upon all parts of the society. 

This multidimensionality, and the diversity of 
the various historical systems making up the phe- 
nomenon of totalitarianism, makes it difficult to 
elaborate a politically and sociologically fruitful 
concept of totalitarian rule, as is shown by the 
existing literature, in which the concept of totali- 
tarianism has become increasingly subject to scien- 
tific criticism. Some scholars maintain that there 
are no significant differences between the older 
dictatorships and modern mass despotisms except, 
say, in the art of mass domination (Hallgarten 
1954; 1957, p. 176 ff.). Others deny that totalitari- 
anism is merely a product of industrial society and 
point to totalitarian features that may be found, for 
example, in ancient Sparta and in the tyranny 
of Diocletian (Neumann 1957, p. 246). Several 
younger scholars believe that general concepts of 
ideal types and static, classificatory methods are of 
little use in the historical-empirical analysis of the 
various totalitarian power structures [see TOTALI- 
TARIANISM]. 

Constitutional dictatorship. Unlike the revolu- 
tionary and counterrevolutionary types of dictator- 
ship, in which the legality of the exercise of power 
is dubious and in many cases represents a break 
with the political evolution of the state, constitu- 
tional dictatorship respects the limits fixed by the 
constitution. Its function is to protect or restore the 
traditional, legal order in crisis situations or in 
domestic or foreign emergencies. Such an emer- 
gency (Staatsnotstand) can be defined as a serious 
disturbance or endangering of public safety and 
order, which cannot be overcome in normal, con- 
stitutional ways but can only be eliminated by the 
use of exceptional means (Hesse 1962). 

A constitutional dictatorship exists when martial 
law or a state of siege is proclaimed, and the execu- 
tive, specifically the military commanders, can 
limit civil rights and liberties. A tendency toward 
this form of dictatorship can also be seen, how- 
ever, in the so-called emergency decree legislation. 
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Some constitutions of democratic states (article 48 
of Germany's Weimar constitution, article 16 of 
the French constitution of 1958, article 77 of the 
Italian constitution of 1947) grant full powers to 
the executive to take temporary measures to restore 
law and order in the event of an emergency [see 
DELEGATION OF POWERS]. 

The problem of assuring respect for the limits 
of constitutional dictatorship is a very difficult one 
and has been exhaustively debated in politics and 
in scholarly literature. A limited emergency dic- 
tatorship may turn into a counterrevolutionary 
dictatorship whenever the conditions of political 
power are favorable, as shown by the exploitation 
of article 48 of the Weimar constitution by the 
National Socialists. A constitutional dictatorship 
has no revolutionary objectives with regard to socio- 
political change, although it may fulfill counter- 
revolutionary functions if class conflicts or disputes 
between social and political elites become wide- 
spread. The restoration of constitutional conditions, 
which is the objective of emergency legislation, 
often signifies the hardening of a socioeconomic 
status quo and may encourage revolutionary forces 
within a country to intensify their attacks upon the 
existing political and social order. 

The following historical instances of constitu- 
tional dictatorship are often cited. First, the clas- 
sical legal dictatorship of the early Roman Repub- 
lic in which one of the consuls appointed the 
dictator, upon the motion of the Senate, for a term 
of no more than six months and entrusted him 
with the task of defending or restoring constitu- 
tional order. He became, in effect, an extraordinary 
constitutional organ and remained bound by the 
laws. Second, the exercise of power by the medieval 
Italian commissioners, who were appointed by a 
sovereign prince in conformity with the constitu- 
tional order and were given extraordinary powers 
to act on his behalf, (For the first use of the term 
“commissionary dictatorship,” see Schmitt 1921, 
p. 6 ff.) Third, the “educational dictatorship” of 
Kemal Atatiirk in Turkey, in which the dictator 
himself established constitutional rules for his ex- 
ercise of power [see CRISIS GOVERNMENT]. 


Research interests and problems 

Legitimacy. The issue of the legitimacy of dic- 
tatorial systems has often been discussed in terms 
of the relation of the dictatorship to tradition, to 
law, and to the constitution [see LEGITIMACY]. 
Practically every historical dictatorship has tried to 
justify its existence, its methods, and its measures. 
In many cases where the dictatorship tried to se- 
cure legitimation by appealing to a law that it cre- 
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ated, the attempt failed, because the leadership, 
the state organs, and the administration always 
succumbed to the impulsive element in decision 
making. Whenever they have been unable to appeal 
to tradition, to natural law, to customary or exist- 
ing law, they have endeavored to legitimize the 
despotic exercise of power either, in an existential, 
Machiavellian vein, as the exalted art of building 
a state and directing society; or else, on a more 
pretentious ideological plane, as the expression of 
a communal order predestined by providence or by 
historical evolution. In so doing, they have ap- 
pealed to “national interest” and “reason of state,” 
to “the common welfare,” to “the welfare of the 
people,” to the interests and vital rights of a social 
class, as well as to the idea of a revolution and the 
laws of social development. 

The charismatic element has always been a 
major factor in efforts to legitimize dictatorship. 
Dictators and their aides have time and again man- 
aged to achieve an identification of broad strata of 
the population with the rulers, especially whenever 
they have manipulated the democratic means for 
the expression of public opinion (popular assem- 
blies, plebiscites, and parliamentary elections). In 
the process they have imposed extraordinary re- 
strictions on the freedoms of the citizens. Civil 
rights are occasionally set forth in the written con- 
stitution, but in everyday life they are constantly 
imperiled by the regime's claim to total control of 
the formation of political will and the conduct of 
social life. The development of “islands of free- 
dom” and the progress of “liberalization” in dic- 
tatorships are, therefore, always highly problematic. 

Social structure and mobility. The problems of 
the interdependence of the political and social 
orders found in the various dictatorships are an- 
other subject of lively contemporary discussion. 
Researchers are studying the social and economic 
prerequisites of such systems and the influence that 
such a regime and its policies have on a coun- 
try's social structure, economic system, and elite 
formation. 

It is widely recognized that dictatorships are able 
to change social structure, provided they have 
sociopolitical ambitions and approach the solution 
of social and economic problems with progressive 
goals and programs. This applies not only to the 
various systems of modern mass dictatorship in 
which, as under Soviet rule, some social classes 
(workers, peasants, and the working intellectuals) 
have an economically and politically dominant 
function, while others (the petty bourgeoisie) are 
assigned a tributary function, and still others dis- 
appear completely (the nobility, the upper class); 


it also applies to the systems of elite-dominated, 
authoritarian dictatorship. The very formation of 
a new political and economic ruling class from the 
leading elites, bureaucrats, and political function- 
aries that are needed to run the state apparatus can 
serve as an index of social-class formation in these 
systems of government. But little research has been 
done thus far on the specific forms of social mo- 
bility in societies that are dictatorially manipu- 
lated. The rise from the lower classes may be due 
to advancement within the political organization as 
well as to active participation in solving the eco- 
nomic problems of these achievement-oriented so- 
cieties. Many dictatorships evolve systems of social 
incentives and rewards for conformist behavior, 
thus manipulating the social prestige of the “fellow 
travelers,” “careerists,” and “parvenus” of a political 
upheaval, 

Political sociology is interested in ascertaining 
the social and political composition of the ruling 
class and of the larger groupings from which mem- 
bers of the elite are drawn; in determining how 
change in this composition is effected; and in dis- 
covering the causes of conflicts within the elite and 
how they are resolved. 

The conquest of a foreign country or its political 
penetration—a common occurrence of the last fifty 
years—may place large sections of the conquered 
population in the role of an oppressed class. The 
only way to maintain such a situation is by the 
use of adequate military, police, and administra- 
tive instruments of power. Otherwise, the social 
structure and the national consciousness of the con- 
quered country must be transformed by political- 
ideological infiltration in the image of the conquer- 
ing society. The establishment of satellite states 
and political satrapies and their relationship to the 
respective autocratic or totalitarian central state 15, 
therefore, an interesting problem for research. Su- 
pranational social movements and elites develop, 
whose revolutionary or counterrevolutionary or 
try to affect foreign societies (Lenin, Stalin, E 
Mao Tse-tung, on the one hand; Mussolini an 
Hitler, on the other). Rudiments of such develop- 
ments are to be found even in the imperial Roman 
dictatorship and in Oriental despotism. 

Political ideology. The ideological compon: 2 
of various forms of dictatorship are also an ne 
esting subject of study. Totalitarian regimes in E 
dustrial society are inconceivable without the pe 
of a political ideology that moves both the mas 
and the elites. Even in the older forms of aicea 
ship the “political formula” was a rather paar ee 
instrument of rule. Tyrants and mass leaders Ei 
always made use of myths, utopias, doctrines 


ents 


salvation, and, especially in modern times, social 
theories and Weltanschauungen in order to estab- 
lish the charisma of a leading personality, the his- 
torical mission of a ruling elite, or the necessity of 
a revolution. Alongside the despotic perfectionism 
of pure power politics can be found the cult of the 
leader, hero worship, and collective delusions, and 
also sociopolitical concepts with specific programs 
and claims upon the behavior of the citizens and 
the wielders of power. 

The structure and the sociopolitical functions of 
ideology differ widely under various dictatorships. 
Comparison of the ideologies of fascism, National 
Socialism, and Bolshevism, clearly shows the purely 
instrumental nature of the fascist and National 
Socialist ideologies, which are very vague and 
largely based on irrational, poorly formulated theo- 
retical premises. In contrast, Soviet communist 
ideology is nourished by the rational core of a dog- 
matized theory; it has been able to furnish the 
foundation for extensive social and economic plan- 
ning and has proved able to cope with changing 
conditions. 

Behavior and techniques of dictators. Various 
attempts have been made to study the psychological 
and social-psychological peculiarities of dictator- 
ship (e.g., Milosz 1953; Neumann 1957, p. 270; 
Lange 1954), Studies of the psychopathology of the 
personalities of dictators evoke much interest, as 
do the behavior, cooperation, and conflicts of mem- 
bers of the inner circle and of lesser aides and 
functionaries of the dictatorship. The clearer the 
sociopolitical ambitions of a dictatorship, the more 
emphasis is placed on methods of manipulating 
social and cultural activities, the more regard is 
paid to the desired image of man in the educational 
system, and the more attention is paid to the atti- 
tudes of the individual and the masses toward 
superiors, organizations, and the state. 

The extent to which anxieties and fears govern 
the behavior of men under tyrannies is in dispute. 
Readiness to subordinate oneself “voluntarily” and 
to cooperate in building up the state can often be 
found and examined in modern mass dictatorships, 
although in many cases mimicry and political 
“schizophrenia” are also prevalent. 

Plato and Aristotle long ago described the be- 
havior and methods of tyrants, and much of what 
they wrote applies to dictatorships that are more 
fully developed than the city-state despotisms of 
their day. The principal instruments employed by 
the tyrant are force, oppression, threats, and espio- 
nage, and, even today, the common weapons are 
terror, persecution, and intimidation. Yet it is often 
overlooked that modern mass dictatorships have 
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also been able to achieve successes by such methods 
as deception, corruption, and social rewards. Mas- 
sive propaganda and the whole educational system 
are geared to breeding a conforming man, toward 
whom a benevolent attitude is then displayed. 
Modern mass leaders demand admiration and devo- 
tion; propaganda sees to it that they obtain them. 
Wittfogel (1957, p. 181) refers to the “myth of a 
benevolent despotism.” In fact, however, the moti- 
vating principle of “the carrot and the stick” pre- 
vails in many dictatorships. Terror is aimed pri- 
marily at nonconformists of all kinds, although it 
may also be used against the supporters of the sys- 
tem when factional fights break out or when there 
is a change of leadership. 

In order to facilitate identification with the 
Fiihrer and the prevailing system, many dictator- 
ships also create an “adversary type” (the Jews for 
National Socialism, the “imperialists” for commu- 
nism), to which the blames for all problems can 
then be shifted. 

In modern dictatorships an important instru- 
ment of control is the compulsion to join state or- 
ganizations. Only such affiliation guarantees social 
status, social recognition, and job security, Even 
Aristotle pointed out that a sort of “occupational 
therapy” is advantageous to tyranny. And today we 
find that individuals, organizations, and the masses 
are kept in constant activity in dictatorial states. 

Dictatorship in new nations. In conclusion, a 
word should be said about the susceptibility of the 
developing countries to dictatorship. “Strong man” 
regimes are evolving in Asia and Africa, either in 
the form of military states or of presidential re- 
gimes. Whether these forms of authoritarian rule 
are primarily products of the colonial era or in- 
evitable types of transition to constitutional systems 
with political parties is currently being debated. 
This question is complicated by the fact that coun- 
tries with such different cultural, social, and po- 
litical backgrounds as Burma, Indonesia, Egypt, and 
Ghana have all proved susceptible to dictatorship. 
(It has been suggested that the difficulties of ana- 
lyzing such diverse sociopolitical systems may be 
surmounted by a comparative “developmental ap- 
proach” to political systems—Almond 1965, pp. 
183 ff.; Almond & Coleman 1960.) Effective de- 
mocracy is only possible in countries where there 
is a broad, stable middle class anchored in the 
population, a social and political ethos, and ade- 
quate two-way communication between the leader- 
ship and the masses. In most developing areas 
these prerequisites did not exist. Consequently the 
forms of parliamentary democracy imported from 
the colonizing powers could hardly take root; in- 
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stead they proved to be structurally alien to native 
cultures. Actually, in most of these countries the 
traditional feudal institutions were strengthened, 
and only a small minority of native intellectuals 
capable of political thought and action evolved. 
According to some scholars, the belief of these 
intellectuals in progress, especially their strong 
urge to accomplish rapid, radical industrialization 
and to establish the welfare state at once, can 
be realized only by an authoritarian government 
(Newman 1963, p. 16). Substitution of a one-party 
system, supported by the military and the bureauc- 
racy, for a multiparty system is the rule in many 
countries. Powerful military, monarchic, and auto- 
cratic state traditions facilitate authoritarian meas- 
ures, whether under the banner of military dictator- 
ship or of the one-party state with a charismatic 
leadership. The problems arising from the opposi- 
tion of the old elites to the new ones, the trans- 
formation of the new elites into oligarchies, and 
the distance between them and the masses—all 
play an important role in the growth of authori- 
tarian political systems in the developing countries. 
Behrendt (1965; see also Lewis 1965) warns 
against the notion that “developmental dictator- 
ships” are necessary to the socioeconomic develop- 
ment of these countries and concludes that such a 
dictatorship merely signifies a postponement of the 
process of social self-education but can never be a 
substitute for it. 
OTTO STAMMER 


[See also AUTocracy; CAUDILLISMO; Communism; DE- 
MOCRACY; Fascism; NATIONAL SOCIALISM; TOTALI- 
TARIANISM.,} 7 
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I 
CULTURAL DIFFUSION 


Diffusion means the spread of culture from one 
ethnic group or area to another. Various definitions 
have been proposed in order to distinguish between 
diffusion and acculturation; the difference is one 
of continuity and intensity of contact. Accultura- 
tion generally involves prolonged contact of whole 
culture complexes and may be unilateral or bi- 
lateral. 
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Diffusionism in ethnological theory. As early as 
the fifth century B.C., Herodotus noted examples of 
diffusion, but it was the discovery of America that 
stirred intellectual discussion and speculation. Had 
the cultures of the New World developed independ- 
ently, or had they been imported from the Old 
World? From the sixteenth to the eighteenth cen- 
tury numerous authors supported the diffusionist 
view, but their more or less naive writings, based 
on totally insufficient data, did not carry convic- 
tion. The prevailing trend in European science 
favored a theory of natural laws, which were 
thought to govern human progress and to produce 
similar results in the various regions of the earth. 
These ideas were clearly formulated in the eight- 
eenth century and culminated in the nineteenth 
century in the evolutionist school of anthropology. 
At the same time truly scientific treatises on the 
diffusion of culture traits began to appear. One of 
the first was Alexander von Humboldt'’s brilliant 
attempt to prove the Asiatic origin of the Meso- 
american calendric systems. Even the evolutionists 
did not deny that diffusion had played a certain 
role in the development of culture, although they 
minimized its importance. One of the most promi- 
nent, Edward Tylor, wrote several excellent papers 
in which he demonstrated the diffusion of various 
culture traits. 

Diffusion studies. Around 1890 a reaction ap- 
peared in Europe and America against the logical 
inconsistencies of the evolutionist theory. Franz 
Boas in the United States, Gabriel Tarde in France, 
and Friedrich Ratzel in Germany stood up as its 
critics, all of them stressing the importance of dif- 
fusion, Scandinavian archeologists showed that the 
development of the Bronze Age in Europe was 
influenced by cultures of the Near East and the 
Mediterranean regions. They thus started a trend 
which ever since has been important in prehistoric 
research, Fritz Graebner (1911) wrote the first 
systematic treatise on the theory of diffusion, and 
many important studies were made tracing the 
diffusion of single traits, such as the plow, whole 
complexes of metallurgical technique, and the 
spread of basic cosmological ideas and correspond- 
ing religious practices. The most important studies 
of the processes of diffusion, however, were made 
by American anthropologists. It is to them, even 
more than to Graebner, that we owe the theoretical 
foundations for the appraisal of diffusion in its 
various aspects. 

The processes of diffusion. Although whole cul- 
tures may spread as a result of the migration of 
their bearers, a culture is practically never adopted 
by other ethnic groups without considerable change 
and selection of traits. This applies also to diffusion 
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on a minor scale. Whether a particular trait is ac- 
cepted depends not only on its utility to the bor- 
rowers but even more on whether or not it can be 
integrated into the receiving culture. The borrow- 
ing of more elaborate utensils and techniques and 
of sophisticated ideas presupposes a complex socio- 
cultural level on the part of the borrowing people. 
Nomadic hunters may turn with relative ease to 
some kindof primitive shifting agriculture but will 
hardly be inclined to prepare permanent fields with 
all that this implies with regard to change in mode 
of life. The diffusion of elements of social structure 
encounters greater difficulties than that of material 
or religious traits. Among some peoples of south- 
east Asia and among the Tuareg of the Sahara, 
matrilineal forms of society, although unorthodox 
from the point of view of Islamic law, survived 
even conversion of these people to Islam. In gen- 
eral, however, the conversion to one of the higher 
religions is invariably accompanied by the accept- 
ance of numerous other culture traits—such as 
script, types of behavior, modes of dressing and 
housing, laws, and forms of government—and is 
one of the most potent factors in diffusion. Prestige, 
too, plays an important role. Traits have been bor- 
rowed merely for the sake of prestige, even though 
they are detrimental to the borrowers’ hygiene or 
economy. The adoption of European clothing by 
peoples in the tropics is a case in point. 
Negative selection, i.e., the rejection of certain 
cultural items, may be due to their incompatibility 
with firmly established customs, with the prevail- 
ing social structure, or with religious tenets and 
usages. Mere habit and the addiction to the tradi- 
tional way of doing things are no less powerful 
sources of resistance to innovations. Frequently, 
however, no particular reason can be detected that 
could explain why one foreign trait was accepted 
and another not. In many instances “traits spread 
erratically and unpredictably” ( Dixon 1928, p. 120). 
The more traits a people has already borrowed 
from another, the easier will be the acceptance of 
additional traits. Traits that at first were rejected 
may then be adopted since they no longer conflict 
with those of the receiving culture, which has al- 
ready been modified. Occasionally, the diffusion of 
even a single culture trait may have far-reaching 
consequences. The introduction of firearms, for 
instance, necessitated a change in the type of war- 
fare, and this, in turn, affected the social structure 
of Europe. When the Tanala in Madagascar bor- 
rowed from their neighbors the trait of planting 
rice on irrigated terraces and abandoned the slash. 
and-burn type of shifting agriculture, a complete 
transformation took place, not only of their rules 


of land tenure, but of their social and political or- 
ganization (Linton 1936, pp. 348-354), 

Borrowed elements of culture usually undergo 
some kind of change or adaptation. We need only 
think of the innumerable variations, from tribe to 
tribe, when techniques of pottery-making, metal- 
lurgy, and weaving have been diffused. Gothic art 
was not the same in Germany and Italy as in 
France, nor was the Hindu and Buddhist art of 
Java identical with that of India from which it was 
derived. When the Greeks adopted the old Semitic 
alphabet, they adapted it to their own language, 
and new changes took place when the alphabet 
was transferred to the various peoples of Europe. 
When a myth spreads from one area to another, 
details may be lost and new ones may be added to 
or substituted for the original ones. 

Stimulus diffusion. In many instances it is not 
the actual culture trait that is adopted by the 
borrowing people but merely the principle on which 
it is based. This process has been termed “stimulus 
diffusion.” It must have played an important role 
in the spread of agriculture and animal domestica- 
tion. The domestication of the reindeer by Siberian 
tribes, for instance, was almost certainly due to 
some aquaintance with the horse and cattle breed- 
ing of their southern neighbors, Members of illiter- 
ate tribes in Africa, Asia, and America who knew 
of the existence of the European or Arabic alpha- 
bets created new systems of writing for their Te 
spective languages. The development of Egyptian 
hieroglyphics probably resulted from some knowl- 
edge of the Babylonian script of the Jemdet Nasr 
period, even though the actual characters of each 
script are completely different. 

Agents of diffusion. The kind of traits trans- 
ferred from one people to another depends largely 
on the agents of diffusion—those giving and those 
taking. The traits will differ according to whether 
the agents are priests, medicine men, traders or 
artisans, men or women. Men’s societies and ee 
als, weapons, methods of hunting and warfare wi 
the concomitant religious and magical practices, 
such as scalping or head-hunting, can, of course, 
be disseminated only from men to men. Very ber 
the relative conservatism of women has contri 3 
uted considerably to cultural diffusion. Where a 
tergroup marriage is practiced, the women hold a 
to their native customs and will transmit at 
their children, This is bound to further the OF 
fusion of techniques of pottery and weaving, 0 
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ornamental designs, of myths, etc., pags "l 
even of languages. In some instances the ii ae 


ance of new culture traits has been due t 
initiative of one prestigious person. 


Diffusion may be the result of deliberate actions 
on the part of the donors. This applies to mission- 
aries of whatever religion and to the founders and 
promoters of nativistic cults and movements. Con- 
querors and colonial powers furthered, or even 
enforced, the adoption of their laws, customs, and 
techniques. However, even in such cases diffusion 
was frequently reciprocal. We need only cite the 
adoption of maize, tobacco, rubber, etc., by the 
Europeans in America or the flood of Oriental in- 
fluences that reached Europe as the result of the 
British occupation of India. If the conquerors were 
relatively few in number and the culture of the 
conquered superior to that of the conquerors, the 
rate of diffusion was occasionally reversed, with 
the result that the conquerors were more readily 
assimilated by the conquered, abandoning even 
their own language and eventually losing their 
ethnic identity. This happened, for instance, to the 
Langobards in Italy. 

The deliberate initiative of the borrowers may 
be no less important than that of the donors. In the 
mid-twentieth century Asians and Africans who 
study in Europe or America and take home their 
acquired knowledge are among the principal agents 
of diffusion. Similar processes have no doubt con- 
tributed to the spread of ancient civilizations, even 
though literary evidence of this is scarce. The visits 
of Romans to Greece in order to study Greek philos- 
ophy and rhetoric, and the pilgrimages of Chinese 
scholars to India to study the tenets and monastic 
rules of Buddhism, are cases in point. 

Diffusion versus independent invention. Wheth- 
er the occurrence of similar culture traits among 
different peoples is due to diffusion or to inde- 
pendent invention and parallel development has 
been a perennial subject of anthropological discus- 
sions. In the past, anthropologists who favored the 
argument for parallelism referred to the “psychic 
unity of mankind” and to those natural laws 
thought to produce similar effects. 

Where literate civilizations are concerned, the 
role of diffusion can usually be determined by his- 
torical research. There are still some moot cases— 
such as the question whether gunpowder and 
printing were invented independently in Europe 
or as the result of stimuli derived from China—but 
wherever we stand on firm ground it is apparent 
that the independent repetition of inventions has 
been exceedingly rare, particularly when compared 
to the enormous effects of diffusion. 

Where written sources are not available, the 
solution of problems of diffusion is frequently more 
difficult. Independent invention can be proved only 
rarely, and perhaps never with regard to immate- 


DIFFUSION: Cultural Diffusion 171 


rial culture traits or where perishable materials are 
concerned. We can be sure that the fine pressure- 
flaking of stone tools was invented independently 
in northwestern Australia and not derived from that 
of the Solutrean culture, since nothing of the kind 
has been found in the intermediate areas, Cases 
like this are exceptions. In general we can only 
try to demonstrate the probability of diffusion, and 
if this proves impossible, leave the question un- 
answered. The following criteria of diffusion, first 
formulated by Graebner (1911), are still applied, 
with only minor variations, by most anthropologists 
interested in the subject: the more complex a cul- 
ture trait is, the more secondary traits it contains 
that are not essential to its function (criterion of 
form), and the more similar the traits (criterion 
of quantity) shared by two areas, the more likely 
it is that the presence of these traits is due to 
diffusion. 

Loss of traits. In Graebner’s opinion these 
principles were valid also in the case of discon- 
tinuous distribution of traits. Boas and others 
were reluctant to admit diffusion except where 
contiguous areas were concerned. There are, how- 
ever, numerous explanations that can easily ac- 
count for gaps in the distribution of related culture 
traits, We need not even refer to those famous and 
frequently cited cases—the disappearance of the 
bow, of pottery, and of boatbuilding on several 
Oceanic islands. These were isolated incidents and 
of little importance. Of greater significance is the 
fact that any major culture change, whether due 
to internal development or to diffusion, is not only 
bound to add new culture traits but to result in 
the elimination of others. We need only think of all 
the appliances, customs, and types of behavior that 
within a lifetime are replaced by others and be- 
come obsolete, but which still survive in marginal 
areas. This process of elimination has been occur- 
ring since the Paleolithic. Above all, it was the 
spread of the higher civilizations, and of Christian- 
ity, Buddhism, and Islam, that ripped apart for- 
merly uninterrupted areas of archaic culture traits. 

Head-hunting may serve as an illustration. In 
the Old World during the nineteenth century head- 
hunting was largely restricted to southeast Asia 
and Oceania and to some tribes in Africa. Here 
was a case that clearly seemed to warrant the 
assumption of independent parallel development. 
The picture changes, however, when we consult 
the archeological record, or ancient literature. We 
shall find that head-hunting was practiced in vast 
regions of Europe, from the Iberian Peninsula to 
southern Russia. There are many indications of 
its having been practiced by tribes in central Asia, 
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and if we extend our research further back in 
time, we shall find traces of head-hunting in pre- 
dynastic Egypt and in the ancient Near East. Ob- 
viously, the two regions in which head-hunting 
was found in modern times, even though thou- 
sands of miles apart, were mere remnants of an 
area stretching from the Atlantic Ocean eastward 
to the Pacific where head-hunting was practiced. 

Closing the gaps in distribution. Occasionally, 
archeology helps to bridge gaps between widely 
separated areas, For example, a special type of 
harp, with a boat-shaped body, is found in west 
and central Africa, in Burma, and in Siberia. This 
puzzling distribution would seem to defy any at- 
tempt to explain it as the result of diffusion had 
not archeological discoveries shown that the same 
type existed in ancient Babylonia, Egypt, and India. 

In considering the possibility of diffusion the 
time factor should never be forgotten. Such a 
custom as that of perforating the lower lip and 
inserting a plug, found among some tribes in 
Africa and the Americas, may very well have 
spread from continent to continent and then dis- 
appeared from most of its former area with the 
passage of time. On the other hand, gaps in the 
distribution of traits may occasionally be due to 
rapid migrations. When, in the sixteenth century, 
a group of the Tupi-Guarani covered the thousands 
of miles from the Atlantic coast of Brazil to the 
Andes in ten years and, in the eighteenth century, 
a Kalmuk horde took nine months for its migration 
from the Volga to Mongolia, they cannot have left 
many traces on their way. Such rapid migrations 
have frequently been observed in historic times. 
There is no reason why this should not have oc- 
curred in prehistoric periods as well. Finally, we 
must not forget that wherever boatbuilding and 
navigation were sufficiently developed arts, peoples 
separated by the sea were in a certain respect 
neighbors. Given the necessary means of trans- 
portation, the same applies to peoples separated 
by deserts. 

Complex traits. In summary, we may conclude 
that the geographical distance between two similar 
culture traits, be it ever so great, does not in itself 
suffice to disprove diffusion, and that in such in- 
stances, too, we must rely primarily on the cri- 
terion of form. Very simple inventions may, of 
course, have been made more than once, However, 
inventions of weaving, of the various metallurgical 
processes, or of intricate calendrical systems were 
not single events but the results of whole series 
of innovations and improvements. The probability 
of such a series having been repeated in more or 


less the same order and with similar results is 
practically nil. 

Some reservations must be made with regard 
to social phenomena, such as sib and kinship sys- 
tems, marriage rules, totemism, chieftainship, 
priesthood, etc. There can be no doubt that in these 
respects, too, diffusion played an important role, 
but we do not yet know to what extent social 
trends and psychological factors may occasionally 
have produced similar, but unconnected, results, 
The subject has not yet been studied sufficiently 
on a world-wide basis, Therefore caution in assert- 
ing or denying the diffusion of specific social traits 
is advisable. 

Selected problems. The following examples may 
serve to illustrate some kinds of major problems 
concerning diffusion that are of anthropological 
interest. 

Megaliths. The widespread distribution of meg- 
alithic monuments in the Old World stimulated 
many fantastic and unsound speculations. In re- 
cent times, however, the relevant problems of dif- 
fusion have been carefully studied, particularly 
with regard to areas in Africa, Asia, and Oceania 
where such monuments are still raised and where 
their religious and social basis survives. It now 
appears certain that the custom of erecting meg- 
aliths belongs to a socioreligious complex charac- 
terized by emphasis on an ancestor cult and geneal- 
ogy, by wealth and fertility rites, animal sacrifice, 
and rank-conferring rituals that are thought to 
ensure for the souls of the performers or recipients 
a better lot in the hereafter. The original center 
from which this complex spread has not yet been 
determined. It lay probably in the Mediterranean 
area, possibly in Palestine and the surrounding 
regions, where megalithic monuments have been 
dated to the fourth millennium s.c. In some in- 
stances the megalithic complex spread as a result 
of migrations, in others by slow diffusion from 
tribe to tribe. In studying its various aspects, We 
are dealing with a prehistoric religious and social 
movement, which over thousands of years spread 
throughout an area reaching from the Atlantic 
coast of Europe to Oceania. 

Trans-Pacific diffusion. More than any other 
problem of diffusion, the question of trans-Pacific 
relations between the Old World and the Dp 
agitates anthropologists. It is crucial to eee 
anthropological thought. The proof of parallel i 
velopment could lie in determining whether or BS 
similar textile and metallurgical techniques, a 
endrical systems, art styles, etc., originated ee 
pendently in Asia and America. The inescapal 
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conclusion would be that what happened this one 
time could have taken place in other instances 
as well. The results of everything written since 
Ratzel and Boas in order to disprove independent 
parallel development would be jeopardized. We 
cannot base our theories on two contradictory sets 
of principles. 

In recent years the problem has been studied 
systematically by comparing whole archeological 
complexes, circumscribed by area and date, both 
in the New World and in the Old. It is now known 
that art styles appeared in America not only in the 
same sequence as comparable ones in Asia, but 
even at approximately the same dates, and that 
other correspondences, such as those in calendrical 
and cosmological systems or in metallurgy, fit into 
this chronological pattern. This cannot be acci- 
dental. There were no movements of whole popu- 
lations from Asia to America, nor colonial settle- 
ments like those of the Europeans after Columbus. 
Arguments in favor of the independent develop- 
ment of ancient American civilizations that were 
based on the absence of Old World crops and 
various aspects of technology are thus less com- 
pelling. Chinese records make it clear that at the 
periods in question ocean-going vessels were avail- 
able both in eastern and southern Asia, vessels 
that could not have encountered greater difficulties 
in crossing the North Pacific than the clumsy 
Manila galleons of the sixteenth and seventeenth 
centuries. In summary, we are justified in stating 
that as matters stand America offers no exception 
to the rule according to which close similarities 
between complex cultural phenomena are the re- 
sults of diffusion. 

The historic role of diffusion. Diffusion has al- 
ways had a catalytic function in sociocultural de- 
velopment. “The comparatively rapid growth of 
human culture as a whole has been due to the 
ability of all societies to borrow elements from 
other cultures and to incorporate them into their 
own” (Linton 1936, p. 324). Moreover, the neces- 
sity of integrating newly acquired elements into 
one’s cultural heritage creates new problems, which 
demand new solutions and thereby engender new 
ideas. It was the opportunity for relatively rapid 
interchange of inventions and ideas between a 
number of local cultures that made possible the 
birth of the oldest civilizations in the Near East. 
It was stimuli emanating from these oldest civiliza- 
tions which started the chain reaction that even- 
tually resulted in the emergence of one civilization 
after the other through the whole of world history 
(Heine-Geldern 1956). Obviously, the cultures of 
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marginal peoples least exposed to diffusion, such 
as the Australians, Tasmanians, and Fuegians, 
have remained the most primitive known in mod- 
ern times. 

ROBERT HEINE-GELDERN 


[Directly related are the entries ETHNOLOGY; HISTORY, 
article on CULTURE HISTORY. Other relevant material 
may be found in the biographies of GRAEBNER; 
ScHMIDT.] 
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Lii 
THE DIFFUSION OF INNOVATIONS 


Any society can be looked upon as forming an 
ordered system in which all individuals and all 
pieces of material equipment, including land, are 
component parts, linked together in a multitude of 
ways. Such a system may quite possibly remain in 
a stable state for a long period of time, in the sense 
that economic, social, and cultural activities are 
carried on over days, seasons, and years according 
to a fixed rhythmical pattern with an invariable 
structure and a stereotyped distribution of roles. 
Changes take place only as quantitative adjust- 
ments owing to changes in size of population. 

If in a subregion of the system a hitherto un- 
known element is introduced, say, for example, a 
new technical device, a new way of allotting social 
roles, or a new cultural manifestation, this event 
constitutes a perturbation that under certain con- 
ditions may be transmitted out into the surround- 
ing regions and propagate itself until eventually 
the whole system has become permeated and at the 
same time to some degree transformed. A permea- 
tion of this kind, either partial or total, is known as 
a diffusion of innovation. 

In some parts of the world and in certain pre- 
historic or historic periods, more sweeping in- 
stances of diffusion of innovation obviously have 
been rather unusual occurrences. On the other 
hand, in present-day Western societies, diffusion 
of innovation is in the regular course of things, 
and the whole social system is oriented toward 
constant change. 

Diffusion of innovation as a social phenomenon 
has been noted by scholars and other observers 
since antiquity, and a large amount of information 
on specific cases is available. More is, however, 
known of the results of diffusional processes than 
of the processes in action, for the obvious reason 
that processes of this kind are extremely hard to 
observe. Since the time of Friedrich Ratzel the tra- 
ditional way of studying diffusion, from the large- 
scale point of view, has been to compare the spatial 
distribution of cultural traits by the aid of maps. 
As a result of research work in this tradition, a set 
of national atlases of folk culture has been pro- 
duced in Europe, each containing a wealth of 
Suggestive information on preindustrial cultural 
relationships. 

In recent decades there has been a growing in- 
terest in the study of modern instances of diffusion, 
and scholars have increasingly taken advantage of 
the fuller supply of quantitative data and the pos- 


sibilities of immediate observation of ongoing 
processes. No doubt such studies can lead to results 
that also throw light on how diffusion took place 
earlier in history, One approach toward the dif- 
fusion process focuses on the characteristics of the 
acting individual and his reactions to his immedi- 
ate environment. Another approach is more con- 
cerned with the social system as a whole. In the 
tradition of cultural history and cultural geography 
various macroconcepts come into the foreground, 
Such as growth curves, centers of innovation and 
centers of spread, channels of diffusion, barriers to 
diffusion, cultural boundaries, and regional differ- 
ences in receptivity. The diffusion of innovations 
will be discussed in terms of these organizing 
concepts. 


Growth patterns 


Diffusion of innovation, as seen in its system- 
wide context, undoubtedly tends to show a series 
of recurring traits. The most easily observed pat- 
tern, when statistical information is at hand, is 
the curve of cumulative growth (Pemberton 1936; 
Sorokin 1937-1941). If the number of individual 
adopters of innovations—or in some cases such 
more relevant units as villages, cities, and firms— 
are measured over time, an S-shaped curve nor- 
mally appears. This curve shows a slow take-off 
Stage of varying length, an intermediate stage of 
more rapid development, and a final stage of de- 
clining growth, which seems to approach a ceiling 
asymptotically. Different innovations run through 
this process with very different speed, and various 
degrees of symmetry and irregularity are noted. In 
particular, the initial stage may vary considerably 
in length. 

The three stages of the growth curve can nor- 
mally be shown to have recurring counterparts in 
the way the spatial distribution of adopters de- 
velops (Hägerstrand 1965a). In the initial stage, 
adopters are usually concentrated in a small cluster 
or a small set of clusters, In the intermediate stage, 
expansion likewise takes place in a pattern that 
indicates that a new adoption is more likely to 
occur in the vicinity of existing adoptions than 
farther away; sometimes a jump of pet 
length comes about, and with it a new eer 
dispersal may begin to function (Kniffen 19514). 
The “neighborhood effect” creates an outw: 
movement along a more or less sharply pee 
frontier (as ripples on the water), while ee i 
same time the general density of adoption bel iat 
the frontier is continuously growing. A ar 
stage may be reached in the central area O 
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persal while the frontier is still advancing. Statis- 
tically, therefore, the total growth curve can be 
divided into a set of subcurves, showing time lags 
between them and also different rates of growth 
(Griliches 1957; Jones 1963). 

Urban-rural comparison. The neighborhood 
effect, together with the total pattern of spatial 
growth, shows itself on many different scales. It 
is perhaps most clearly seen when the spreading 
trait concerns the rural population (Hägerstrand 
1953; Griliches 1960). In such cases the diffusion 
tends to proceed in small steps, almost from farm 
to farm and from village to village. However, the 
extent of the neighborhood effect seems to be 
rather different in the United States in comparison 
with Europe, even for rural areas. 

If the innovation under observation belongs to 
the modern urban scene, the process has to be 
viewed on a nationwide or even international scale 
before the same fundamental patterns of change 
are evident. The most important cities are usually 
strongly reflected in the initial stages of urban or 
general innovations; this means that the national 
capitals are usually early adopters, followed by 
metropolitan centers next in rank. In the following 
stages the neighborhood effect seems to become 
gradually dominating over hierarchical rank (Bow- 
ers 1937; Pemberton 1938). It must be stressed 
that there are still very few thoroughly analyzed 
cases of urban innovations; the picture just given 
refers to ongoing investigations and may have to 
be revised as information is accumulated. 


Centers of innovation and diffusion 


Not much is known either about the factors at 
work behind the appearance and persistence of 
centers of innovation and centers of spread. It 
seems to hold true that certain areas or points may 
function as such centers over very long periods of 
time, emitting wave after wave in a manner that 
is repetitious in the main lines even if details dif- 
fer, As far as the “true” centers of innovation are 
concerned, it might be assumed that the cumu- 
lative principle of cultural development always is 
more easily at work in a few restricted areas than 
randomly throughout a population (Sauer 1952; 
Edmonson 1961). It might also be hypothesized 
that innovative individuals are continuously at- 
tracted to areas that seem to be able to make use 
of their talents. Paris as a center of fashion over a 
long period of time is an obvious example. 

The cumulative principle of development. An 
innovation most often consists of a new combina- 
tion of some pre-existing elements, and so it can- 
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not come into being until these elements are at 
hand either as ideas or as hardware. The cumu- 
lative principle is most easily appreciated in the 
realm of material culture, but undoubtedly it also 
exists in immaterial culture. The gradual develop- 
ment of scientific theory is a case in point, but so 
also is the growing richness through time in modes 
of expression in the visual arts and music. Be- 
cause of this principle, it did not help very much 
in practice that Leonardo da Vinci invented an 
aircraft or Charles Babbage a computer; the actual 
realization of these things as working innovations 
had to await the appearance of basic component 
parts before production and spread were possible. 
However, this combination of more basic elements 
into more advanced ones certainly does not take 
place everywhere with equal ease, and the incen- 
tive to make such combinations is also affected by 
spatial variables. 

Choice of solution. Although a certain spec- 
trum of simpler parts must exist and be known in 
order to allow a new step forward, it is by no 
means self-evident that more advanced tasks can 
be undertaken only in one way. Many solutions 
may be possible, and a selection may be made just 
randomly or because of certain conditions in a 
steering environment. As technological diversity 
increases, so also do the opportunities for choice 
between alternative technologies or between differ- 
ent permutations of technological elements. 

The existence of a wide range of alternative 
technologies asserts itself in the various cultures, 
within which approximately the same basic tasks 
have been solved differently and with varying de- 
grees of success. It is probably not correct to think 
of these alternative solutions as conscious selec- 
tions from a known full set of possibilities, but 
rather as an unsystematic trial-and-error use of 
elements at hand. As seen in this perspective, a 
center of innovation is, then, a place or area where 
an unusual amount of technological elements or 
“building blocks’—for the time and area—are 
available, as well as a felt need for new combina- 
tions. It would, however, be an exaggeration to 
claim that very much is known about these 
problems. 


Channels of spread 


Centers of spread represent a somewhat differ- 
ent concept. These may be original centers of inno- 
vation, but more often they are only points where 
already existing innovations coming from the out- 
side first begin to penetrate a new region. In urban 
or general innovational movements, the larger 
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cities tend to take on this function. Innovation in 
agriculture or industry shows various different sys- 
tems of centers. In general, centers of spread re- 
main stable over long periods of time, fulfilling a 
normative function for their surroundings. 

Centers of spread are parts of the channels of 
spread. The nature of these channels is relatively 
well known, both through historical studies and 
modern quantitative analysis. Even today, when 
the sheer quantity of information pouring forth 
from the mass media may seem to make every- 
thing known equally well everywhere without 
much delay, the fact is that personal communica- 
tion between pairs of individuals and direct ob- 
servation are still the basic instruments for the 
diffusion of innovation. On second thought this is 
not so surprising, if one realizes that new things 
and ideas often look complicated to the novice and 
that therefore people have to be able to ask ques- 
tions, take things in their hands, try them, and get 
a feeling for them in order to fully understand 
and adopt. 

The exchange of information between individu- 
als is a never-ending process; it is perhaps the most 
important ingredient in molding a society (or one 
might even say the whole human race) into one 
system of connected links, Everyone functions in 
that system as a sender, a receiver, and an ampli- 
fier. The influence that reaches an individual and 
the influence he can exert are both a function of 
his particular position in the network, a position 
that he is born into and subsequently can extend 
or change only to a certain degree, Most of the 
time the influences that travel through the links 
only reinforce the stable set of cultural traits that 
prevails in a particular subregion of the system; 
but occasionally—and today more and more often 
—an innovational flow will pass through and, if 
adopted, change the patterns of traits. 

Spatial range of contacts. For most individuals, 
it seems that the links of personal communication 
are heavily restricted by distance. Few actual in- 
vestigations of the spatial range of regular social 
contacts have been carried through. However, the 
few that exist indicate that among farming popu- 
lations in Europe and Asia the probabilities of con- 
tact decrease at a rate steeper than the square of 
the distance (Hägerstrand 1953; 1965a). A minor- 
ity of the population may be assumed to have wider 
contacts, but most of these are confined to the na- 
tional area; very few have links with the inter- 
national field. The reason for this is not so much 
the difficulty of movement as the existence of 
linguistic barriers. As the ability to understand and 
use different languages varies from country to 


country, so inevitably the flows of influences tend 
to show corresponding asymmetries. 

The fact that in general the links of communi- 
cation tend to be a decreasing function of distance 
accounts for the neighborhood effect noted in the 
diffusion of innovation. It also means that in any 
local area a growing pressure on nonadopters of 
an innovation builds up very quickly as the num- 
ber of adopters grows. This is a process that can 
be simulated quantitatively and to a certain extent 
predicted, if the parameters of the links of social 
communication can be estimated (Hägerstrand 
1965b). 

Changes in the network. The observation that 
the spread of innovations from certain centers 
tends to follow repeatedly the same spatial course 
seems to indicate that the communication network 
has a very stable configuration over time. The 
frequency of contacts between areas and places 
remains very much the same through time, even 
though the individuals involved change. The struc- 
ture of this network, which now can be seen only 
vaguely, deserves close study as to morphology and 
ways of change. That some change takes place is 
obvious, In the more developed parts of the world, 
the distances over which regular contacts occur is 
gradually increasing as the means of transporta- 
tion improve. This change makes the neighborhood 
effect more diffuse today, in the pure spatial sense, 
as compared to the pattern observed in earlier 
times, when an innovation sometimes moved along 
a sharper front. 

The observation that the speed of diffusion of 
innovation is accelerating should not be explained 
only in terms of spatially increasing links of com- 
munication. The growing diversification in the 
occupational composition of populations also has 
to be taken into account as an explanatory varia- 
ble (Edmonson 1961). Many innovations, although 
important for the whole society in which they be- 
come introduced, nevertheless are of immediate 
concern only for a fraction of the total population. 
This situation may mean a very fast diffusion over 
a wide geographical area among those profession: 
ally engaged in the application of the innovation. 
At the same time the spread, in terms of the num- 
bers of adopters, need not take place differently 
from a corresponding development in a larger 
population. 

Cultural boundaries. The notion of a rather 
stable network of communication links between 
individuals, and especially between populations in 
places and areas, makes it possible to explain the 
existence of cultural boundaries in at least two 
meaningful ways. The first and classic case is the 
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cultural boundary that follows a barrier in the com- 
munication system, caused, for example, by physi- 
ography or gaps in the continuity of settlement. 
The second and more interesting case is the fol- 
lowing. Influences that step by step move out from 
two different centers of spread located at a dis- 
tance from each other are bound to meet along a 
line or zone in between. If the elements are mutu- 
ally exclusive, as in religious affiliation or in the 
use of a certain dialect, then the flows will arrest 
each other. A cultural boundary will appear, re- 
main in a fixed position, and become reinforced 
as long as the two competing centers retain the 
same position and strength (Weiss 1952). 


Receptivity factors 

If influences channeled through the links of 
social communication were the only determinants 
of the course of diffusion and the distribution of 
culture elements, then the cultural map would not 
be as complicated:as it actually is. We also have 
to consider a quite different group of factors, 
which explain the differentials in resistance and 
receptivity toward innovational influences. 

An aggregate of complicated circumstances 
causes keenness, hesitation, delay, or refusal to 
adopt innovations suggested by signals through 
the communication network. Part of the picture is, 
of course, the personal characteristics of individ- 
uals. Another part is the necessity for various time- 
consuming adjustments and preparations before 
an adoption can be fitted into pre-existing proce- 
dures, habits, and value systems. Also, the cost 
involved in adoption of an innovation may slow 
down adoption, or the revenue expected from adop- 
tion may cause the opposite reaction (Griliches 
1957; 1960). 

Again, not much is known in any systematic 
way. One can observe, however, how resistance 
factors work when a spreading trait suddenly stops 
along a religious borderline or when the spread of 
a new crop dies off where soil is less suitable. A 
similar killing effect on the spread of new types of 
production has also resulted from the existence of 
a limitation on the market. When the market can- 
not absorb more, it becomes gradually more diffi- 
cult for new producers to enter the business and 
take up the innovation; several instances of out- 
ward growth that came to a halt because of market- 
ing difficulties can be cited for Europe. Cases in 
point that have been analyzed in some detail are 
the agglomeration of nursery gardens in Holstein, 
Germany (Briiggemann 1953) and of glass-making 
factories in Smaland, Sweden (Nordström 1962). 

If a diffusional process fails to break through 
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some area of resistance, this fact may sometimes 
create a shadow effect over a wider region, because 
the nonreceptive area does not send out a stimulus 
for further adoption in the way it could have if the 
innovation had moved straight through. 

Reducing resistance to change. Diffusion of in- 
novation is an all-important part of the economic 
growth in all countries but particularly so in those 
that need development. Thus a significant question 
is to find out how diffusion of innovation can be 
encouraged. Two methods of intervention present 
themselves; one refers to the pattern of communi- 
cation and the other to the distribution of resist- 
ance to change. 

It is probably not possible to influence the struc- 
ture of communication in a society through outside 
action, and therefore the channels of spread must 
be taken as given. But knowledge about the pre- 
existing matrix of links would be useful, because it 
would make it possible to choose points of intro- 
duction of desirable innovations at the most effi- 
cient centers of spread. One method for locating 
such centers is to try to follow the course of some 
earlier innovations. 

The receptivity factors are perhaps more open 
to moderately successful manipulation, at least as 
far as innovations in the economic field are con- 
cerned. It is well known that economic rewards 
can help to break down resistance to change. The 
notion of a stable and given network of social 
communication suggests that a response to sub- 
sidies or similar devices should be most powerful 
where the knowledge of the innovation is most 
widespread, that is, just outside the areas where it 


has already been adopted. 
TORSTEN HÄGERSTRAND 


[See also CULTURE, article on CULTURE CHANGE; Evo- 
LUTION, article on SOCIAL EVOLUTION; INNOVATION; 
and the biography of RATZEL.) 
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LLLI 
INTERPERSONAL INFLUENCE 


Interpersonal influence is a term used by sociolo- 
gists to refer to the role of intimate, interpersonal 
relations in the communication of information, in- 
fluence, and innovation. Students of interpersonal 
influence have concerned themselves primarily 
with describing the ways in which networks of 
interpersonal communication intercept messages 
that originate in the mass media or elsewhere and 


the consequent effect on the formation of public 
opinion, the diffusion of innovation, and the like. 

Some effort has been made to compare the 
workings of oral communication systems in tradi- 
tional societies with the flow of interpersonal 
influence in modern society. These investigations 
have emphasized, relative to modern society, the 
formal situations in which influence is transmitted; 
the elite status, often based on ascription, which 
gives communicators their authority; the multiple 
spheres over which the same communicators exert 
influence; and, correlatively, the hierarchical and 
asymmetrical flow of such influence (Lerner 1958; 
Eisenstadt 1955). It follows that the flow of inno- 
vation is not very rapid in such societies and that 
acceptors of change are often deviant or marginal 
members, although the successful conversion of a 
formal leader is often sufficient to accomplish the 
conversion of an entire group (Barnett 1953, chap- 
ter 14). The rate of change is also limited by inter- 
group relations, since acceptance of an innovation 
across group boundaries is dependent on the char- 
acter of social relations between the groups and on 
more mundane things, such as distance and physi- 
cal barriers. The success of such professional advo- 
cates of change as traders, missionaries, colonial 
administrators, and, more recently, agents of tech- 
nical assistance appears to be strongly determined 
by their personal relations with the target group, 
by their ability to understand and harness existing 
communications channels in an acceptable way, 
and by the compatibility of their message with 
extant values and vested interests. Alternatively, 
some professional agents of change have succeeded 
only by breaking apart the traditional structure oF 
capturing and caring for groups of deviant souls. 
Altogether, though, the role of informal communi- 
cation has probably been underemphasized in the 
study of traditional societies (Gluckman 1963). 

The rise of egalitarian structures of interpersonal 
influence in modern societies has interested stu- 
dents of public opinion. Tarde (1901) went so far 
as to argue that conversation itself is essentially a 
modern phenomenon, and he described the way 1n 
which the intellectual salons of his day were able 
to effect changes in the art, politics, and general 
culture of French society. Speier (1950) has shown 
that the middle-class coffeehouses of seventeenth- 
century England were similarly influential. Cop: 
temporary equivalents of these institutions yi 
readily suggest themselves; but the communicar ta 
processes that they embody have not been muc! 
studied by sociologists [see CREATIVITY, article on 
SOCIAL ASPECTS]. 


Interpersonal influence and mass communica- 
tion, Indeed, interpersonal influence seemed ir- 
relevant at first to students of the media of mass 
communication, At the time empirical research on 
the effects of radio was begun, in the 1930s, it was 
widely thought that the mass media would exert 
a powerful and direct influence on thought and 
practice, the more so because society was conceived 
as a mass of atomized individuals alienated both 
from traditional institutions, such as church and 
guild, and from intimate contact with other people. 
Yet, empirical research has revealed that, at least 
in a democratic society, the mass media are far less 
potent than had been thought and, in themselves, 
are quite unable to effect radical changes of opin- 
ions, attitudes, and actions on subjects that “matter” 
(Klapper 1960), At the same time, one of the major 
discoveries of research on mass communications 
(just as it is one of the major discoveries of re- 
search on mass production) is the enduring and 
powerful influence of interpersonal relations, even 
in “mass society.” However, the role of interper- 
sonal relations in transmitting (or retarding) the 
fiow of influence and innovation is not the same 
now as it was in the past. While there are some 
kinds of messages that apparently travel with start- 
ling speed through interpersonal networks alone 
(Opie & Opie 1960), the central problem of com- 
munication and diffusion in modern society has to 
do with the nature of the linkages between net- 
works of interpersonal relations, on the one hand, 
and the media of mass communication, on the 
other, 

The two-step flow of communication. The dis- 
covery that “people” play a part in the mass com- 
munications process is, in a sense, a by-product of 
a study of the decisions made by voters during the 
U.S, presidential election campaign of 1940 (Lazars- 
feld, Berelson, & Gaudet 1944). Employing the 
panel method of repeated interviews with the same 
people, the authors found that, despite the bar- 
rage of mass communications, there were remark- 
ably few voters who had changed their minds 
during the course of the campaign. Those who did 
change their vote jntentions, however, reported 
that other people, rather than the mass media, had 
influenced them. 

An attempt was then made to locate people who 
had been influential on others. Study of these 
“opinion leaders” revealed that they had few per- 
sonal or social traits that distinguished them from 
others. This suggested that such influence goes on 
in the course of everyday contacts among like- 
minded people who share similar characteristics. 
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The influentials were different, however, in their 
greater exposure to the mass media. A “two-step 
flow of communication” was therefore hypothe- 
sized; the media were held to reach widely dis- 
persed opinion leaders, who then passed on what 
they had learned to those for whom they were 
influential. 

This hypothesis inspired a series of studies of 
the influence of different channels of communica- 
tion, including interpersonal influence, on the pur- 
chase of consumer goods, the communication of 
political ideas, the adoption of a new drug by physi- 
cians, and the like. Taken together, these studies 
suggested that conversation with other people is 
generally more influential than the mass media in 
decisions to accept some new idea or practice; that 
people who influence others are typically close asso- 
ciates; that influentials “specialize,” for example, a 
woman who influences another in the decision to 
try a new food product will probably not be influ- 
ential in the field of fashion; and that both partners 
to an influence transaction have to have a mini- 
mum level of interest in the subject before any 
influence is exchanged, but that the influential 
person is probably somewhat more interested. It 
also appeared that influentials are more exposed 
to “outside” sources, generally, but particularly to 
those sources—mass media, or particular institu- 
tions, or other people—that have direct bearing on 
their spheres of influence; that, despite this greater 
exposure, it is unusual even for an influential to 
report that one of the media of mass communica- 
tion has been the major source of influence on his 
own decision; and that the function of interpersonal 
relations is not only to transmit information but to 
“legitimate” decisions (or to veto them), as well as 
to provide the kind of social support necessary for 
taking innovative risks (Katz 1957). 

Respecification of the two-step flow hypothesis. 
Thus the studies mentioned above contributed to 
the specification of the two-step flow hypothesis by, 
on the one hand, exposing it as overly simple, but, 
on the other hand, confirming the validity of ap- 
proaching the flow of influence and innovation via 
the interaction of mass media and interpersonal 
communication. As a direct result, two other things 
were also accomplished. First, there was effected a 
kind of convergence, or, at least, a growing mutual 
interest, among various traditions of study of the 
diffusion and acceptance of innovation that previ- 
ously had not recognized any kinship. In particular, 
in departments of sociology and, to a certain extent, 
in schools of journalism, students of mass com- 
munication discovered, to everybody's surprise, 
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that they shared a set of problems with rural soci- 
ologists studying the diffusion and acceptance of 
new farm practices, with anthropologists studying 
the spread of culture and technology, and with 
folklorists, researchers in education, archeologists, 
geographers, and others (Katz, Levin, & Hamilton 
1963). Second, this series of studies gradually led 
to the realization that the design of research on 
the diffusion of innovation needed revision. 


The design of diffusion research 


The 1940 voting study had departed from the 
traditional design of mass communications re- 
search in that it did not seek to locate the effect of 
a given message in a given population but, instead, 
sought to “reconstruct,” with the help of the re- 
spondent, the sources of information and influence 
that had gone into the making of a decision, It was 
this research procedure that made possible the dis- 
covery that interpersonal influence had a substan- 
tial part in the process of decision making. As the 
role of interpersonal influence became clearer, 
methods were developed for locating and studying 
influentials (Katz & Lazarsfeld 1955). And, grad- 
ually, the focus of research shifted from a concern 
with interpersonal influence as simply a channel 
of communication bearing on individual decisions, 
to a conception of interpersonal relations as net- 
works of communication through which influence 
and innovation spread through society. 

Diffusion, acceptance, and influence. With this 
conception in mind, a study was carried out of the 
diffusion of a new drug among physicians. This 
study employed the methods of sociometry to map 
the potential channels of interpersonal communi- 
cation; in this way, it was possible to explore the 
extent to which the channels were actually utilized 
in the transmission of an innovation over time and 
to locate those points at which they were linked to 
“outside” sources of influence (Coleman, Katz, & 
Menzel 1957). Note that this concern with the 
process of “diffusion” (as distinct from a concern 
with the effects of mass communications or with 
the interlocking roles of mass media and interper- 
sonal communication in the making of individual 
decisions) led also to the choice of a specific item 
for study (a new drug) and to a method for dating 
the acceptance of the item by each adopter [see 
SCIENCE, article on SCIENTIFIC COMMUNICATION]. 

The components of the process called diffusion 
thus came into better focus. Diffusion could then 
be defined as the acceptance of some specific item, 
over time, by adopting units—individuals, groups, 
communities—that are linked both to external 
channels of communication and to each other by 


means of both a structure of social relations and 
a system of values, or culture. Rural sociologists 
studying the diffusion and acceptance of new farm 
practices arrived at much the same conceptions 
(Lionberger 1960), and it is noteworthy that the 
design of a study of the diffusion of hybrid seed 
corn (Ryan & Gross 1943) parallels, in a number 
of important respects, the just described, much 
later study. 

This definition of the diffusion process makes it 
possible to point out some of the methodological 
problems in designing research on the role of inter- 
personal influence in diffusion and in discussing 
some of its findings. It is evident, first of all, that 
one must specify the kind of item one has selected 
for study. While the original formulation of the 
two-step flow hypothesis referred only vaguely to 
the flow of ideas, it is altogether clear, by now, that 
items of information must be distinguished from 
items that require changes of attitude (influence) 
or action (innovation). Various proposals have been 
made for classifying innovations in terms of such 
concepts as their communicability (how easily can 
their working be described), their pervasiveness 
(how many of the adopters’ roles are implicated), 
the amount of risk involved, and the like. The fa- 
miliar distinction between “material” and “nonma- 
terial” items may be redefined in these terms 
(Menzel 1960). The important point, in the present 
context, is that different kinds of items are trans- 
mitted over different kinds of channels. 

Second, the concept of “acceptance” must be 
given operational definition. Distinctions have been 
made between such concepts as “external con- 
formity” and “internalized acceptance,” or between 
“compartmentalized acceptance” and “full fusion 
with previous values.” There is a noteworthy con- 
vergence here between students of acculturation 
and social psychologists concerned with change of 
opinions and attitudes (Spicer 1954; Kelman 1958), 
A related emphasis is to be found in recent writings 
in the field of rural sociology, where the process of 
acceptance is divided into stages, beginning with 
the stage of “awareness” of the innovation, and 
continuing through “interest,” “evaluation,” “trial,” 
and finally “acceptance” (Lionberger 1960; Rogers 
1962). While there is debate over the names and 
the number of stages, whether the stages are se- 
quential in time and prerequisite to each other 
(thus constituting degrees, or levels, of acceptance), 
or whether they are simply different dimensions of 
decision making, which may be ordered differently 
or even concurrently (Mason 1964), it is now well 
established that the several media are differentially 
appropriate to different stages. 


After characterizing the item selected for study, 
and giving an operational definition to what con- 
stitutes its acceptance, there remains the task of 
determining whether or not influence has taken 
place, and, if so, by what means. Most diffusion 
studies have relied on the respondent's recall, aided 
by a detailed interview, of how (and perhaps when) 
he first heard of the item and what other sources 
he consulted prior to adopting it. In an attempt to 
obtain more valid data, other studies have preferred 
variations on this basic method. Thus, many studies 
have attempted to infer causal relationships from 
observed correlations between a respondent's (usu- 
ally self-reported) communications behavior and 
his innovative behavior. A typical finding is that 
farmers who have more frequent contact with the 
extension agent adopt new practices earlier (see, 
for instance, Emery & Oeser 1958). Another major 
set of studies is based on tracing the flow of 
“planted” messages, rather than relying on more 
nearly natural events (Dodd 1958). 

Particularly problematic is the methodology of 
taking account of interpersonal influence within 
the design of the sample survey. In small com- 
munities, such as those of physicians or farmers, 
complete enumeration is possible; and sociometric 
methods can be readily adapted. To trace the spread 
of some new fashion in a metropolis, on the other 
hand, networks of interacting individuals must 
somehow be sampled. One way to do this is to select 
the usual kind of representative sample and then to 
build up a social “molecule” around each respondent. 


Some functions of interpersonal influence 


As for the functions, or the impact, of interper- 
sonal communication, the first thing that must be 
said is that it is not a major source for the diffusion 
of noteworthy information. The function of the 
mass media is to provide a direct link between the 
public and the makers of news, the creators of 
fashion, or the producers of new products; inter- 
personal communication cannot compete with the 
mass media in this respect. The two-step flow 
hypothesis was understood as implying that opin- 
ion leaders relay information to others who are not 
exposed to the media, and, as a result, several 
studies were undertaken to prove that the vast 
majority of the American public learns about major 
news events directly from the mass media (Deutsch- 
mann & Danielson 1960). Interestingly, when an 
event is of such magnitude that early hearers run 
to tell others what they have heard, interpersonal 
channels do serve as a major source of informa- 
tion; the assassination of President Kennedy was 
such an event, and it is likely that upward of 50 
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per cent of the population first heard the news from 
other people (Greenberg 1964). Interpersonal com- 
munication may also play an informational role 
when an event is of importance to a small group 
of people, or where the mass media are under strict 
official control (Bauer & Gleicher 1953), or where 
the mass media are not yet fully accessible. Indeed, 
it appears that the two-step flow hypothesis, as 
applied to the diffusion of information, best fits 
developing areas, where some of those who are 
exposed to the mass media pass on what they have 
learned to “less active sections of the population,” 
who are otherwise unexposed and who do not relay 
the message further (Abu-Lughod 1963). 

What is true for information about news events 
is also true for first information about technical 
innovations. Rural sociologists and others have 
found that the channel that makes for awareness 
of an innovation is typically a formal one—gen- 
erally one of the mass media, or an institutionalized 
source such as a dealer or salesman, whose job it 
is to spread news of the innovation as quickly as 
possible to as many people as possible. But ex- 
tremely few people adopt an innovation on the basis 
of awareness alone. At the stages of interest and 
evaluation, more personal sources, both formal and 
informal, take over; and the impersonal media be- 
come quite unimportant. The mass media “inform” 
decisions; interpersonal contacts “support” and 
“legitimate” them. 

Effectiveness of interpersonal influence. Many 
writers have speculated about the reasons for the 
greater effectiveness of interpersonal influence. 
Interpersonal influence, it has been pointed out, is 
two-way communication; the audience can talk 
back and convince itself. It is flexible, in the sense 
that the communication can be adapted to the 
recipient’s particular situation. It is more trust- 
worthy, because it is assumed to be motivated by 
the recipient's interest rather than the personal or 
professional interest of the influential; one rarely 
gets advice not to do something from a professional 
agent of change, whereas friends and neighbors 
frequently give such advice. Interpersonal influ- 
ence is typically embedded in casual conversation 
and is often a by-product of such conversations 
(which are gratifying in themselves), compared 
with the more “purposive” communications of for- 
mal sources of influence. Interpersonal influence 
is generally “local” and thus is more likely to be 
both instrumentally and normatively appropriate, 
under local community conditions; and, what is 
more, a local influential can be held directly ac- 
countable for his advice. By the same token, a 
person seeking to influence another can provide 
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immediate reward or punishment for acceptance 
or the lack of it, whereas other communication 
media can rarely provide immediate reward or 
punishment. 

Some of these attributes also characterize more 
formal and purposive interpersonal communicators, 
such as agricultural extension agents, salesmen, or 
community health workers, Or, to put the matter 
another way, formal sources of influence (particu- 
larly when these are also personal) may be per- 
ceived by some people as sufficiently trustworthy, 
sufficiently accessible, sufficiently normative (al- 
though these norms may be at odds with those of 
the local community), or sufficiently gratifying in 
themselves to motivate the acceptance of influence. 
This would seem to be the case for early adopters 
of technical innovation, in particular, but also for 
marginal members of a community who embrace 
the agent of change as a welcome companion. For 
most people, however, the word of a trusted friend 
or a respected colleague is a prerequisite to action. 

Interpersonal influence and social change. It is 
incorrect, however, to infer from the role of inter- 
personal influence in the acceptance of social and 
technical change, whether in the field of diffusion 
or, say, in a field like group dynamics (Lewin 
1947), that interpersonal influence is a channel 
that functions primarily in the service of change. 
Indeed, rural studies have shown that, under cer- 
tain conditions, the more farmers one talks to the 
later one is likely to accept an innovation; and the 
same thing is true if one uses interpersonal chan- 
nels to learn about an innovation for the first time 
(Wilkening, Tully, & Presser 1962; Copp, Sill, & 
Brown 1958). Studies of political communication 
have shown that while the mass media may moti- 
vate someone to change his political intentions, it 
is conversation that prevents the change (Deutsch- 
mann 1962). In fact, the mass media may be said 
to be most effective when interpersonal communi- 
cation is inoperative for some reason, as when a 
topic is not sufficiently interesting or important to 
motivate discussion, when one finds oneself among 
others whom one prefers not to hear, when a topic 
is taboo, or when a totalitarian regime suppresses 
personal contacts (Klapper 1960). Otherwise, it 
may safely be said that the tendency of interper- 
sonal influence is a conservative one. Since a person 
who is motivated by the mass media on a given 
subject is also more likely to discuss it with others, 
and since these others typically share his prior 
sentiments, interpersonal communication probably 
reduces the pace of change. Only when a communi- 
cation or a proposed innovation is deemed com- 
patible with group norms, particularly if the group 


has an over-all norm of “progressiveness” or “open- 
ness to change,” or when one’s consultant has al- 
ready been won over, does interpersonal influence 
reinforce the message of change. In other words, 
whether interpersonal communication will result 
in a change of attitude or behavior depends partly 
on whom one talks with. But it should be borne in 
mind that the influential members of a group are 
also bound by group norms; while influentials in 
progressive groups have a higher rate of innovation, 
influentials in conservative groups lead in resisting 
change (Marsh & Coleman 1954). 


The flow of interpersonal influence 


In discussions of the flow of interpersonal influ- 
ence, the central figure appears to be the natural 
leader or the opinion leader, in other words, the 
individuals to whom others look for advice. It is 
evident, from many studies, that such people exist; 
and while sociometric techniques are probably the 
most appropriate means for locating them, there is 
also some reason to believe in the validity of a 
self-designating technique whereby respondents are 
asked (in effect) whether they have recently influ- 
enced another person in some matter, or whether 
anyone has come to them for advice. Experience 
has shown that it is rare for more than 30 per cent 
of a population to answer such questions in the 
affirmative. 

Characteristics of influentials. Whether influ- 
ence is accepted from an influential who is pre- 
cisely like the seeker, or from one who is different 
in some way, apparently depends on a number of 
factors. In modern societies, at any rate, influen- 
tials are more similar to those whom they influence 
and more specialized in the areas over which they 
are influential; influentiality is also less likely to 
be synonymous with the holding of formal office. 
The early studies of decision making in the realms 
of marketing, fashion, politics, and the like em- 
phasized the fact that influentials were hardly dis- 
tinguishable from those whom they influenced, at 
least as far as attributes of social status were con- 
cerned. Studies in the field of rural sociology have 
emphasized that farmers who are actively seeking 
information reach out beyond their own neighbor- 
hood and status levels to consult other farmers who 
are more competent than they, even though their 
access to the farmers they consider “best” may be 
limited by status disparities and clique boundaries. 
This behavior is characteristic of “progressive 
communities, and particularly communities of pro- 
fessionals. Where the need for information and 
advice is less keenly felt, the source is likely to be 
found much closer to home (Blanckenburg 1964). 


Competence and accessibility are not the only 
attributes of influentials; strategic location, with 
respect to some relevant outside source, is another, 
as is the case with drivers of buses and trucks who 
bring news of the outside world to outlying vil- 
lages in underdeveloped areas, or even up-to-the- 
minute market news in modern agricultural 
settings. This, of course, is simply a version of the 
two-step flow of communication that in more sophis- 
ticated guise takes the form of propositions such 
as the one that finds rates of diffusion in different 
communities related to the extent to which farmers 
in each community talk about farming with re- 
spected others who have more extensive contacts 
with the communications media than themselves 
(Coughenour 1964). In other words, communities 
in which interpersonal and mass media networks 
are more closely integrated are characterized by a 
more rapid rate of diffusion. In such communities, 
interpersonal networks are organized hierarchically 
in terms of competence and range of outside con- 
nections, although probably less so in terms of 
social class. The diffusion of a new fashion, to take 
a different example, probably proceeds through 
even more homogeneous channels, although this 
does not preclude the possibility that the chains of 
interpersonal contact link up, at key points, with 
those of a higher social class. The fact that most 
individuals follow the lead of their closest associ- 
ates does not necessarily contradict the hypothesis 
(Tarde 1890) that fashion “trickles down” through 
society. 

Chains of communication. It is important to 
emphasize that influentials and influences are not 
usually very different from each other, nor do they 
constitute isolated dyads. Instead, they are better 
conceived as part of a long chain of interpersonal 
connections that embraces large segments of a 
community, or even a nation. In fact, it is very 
likely that the concepts of “opinion leader” and 
“influential” have been overstressed—the more so 
because it is almost impossible to single them out 
as a distinguishable target audience. The more 
basic stress should be placed on intimate inter- 
personal relationships and their function as chan- 
nels of communication and as anchorage points for 
group norms, 

For instance, given an undifferentiated popula- 
tion, communicating freely and continually with 
one another, it is possible to specify mathemati- 
cally the properties of the growth curve that would 
describe the flow of innovation. Deviations from 
the expected curve may result from differentiation, 
and thus from the restricted access of certain peo- 
ple to each other; or they may reflect external 
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sources of influence (such as*the mass media) 
impinging on the network/at.more than one. point. 
Such mathematical models’ have long been of in- 
terest to diffusion researchers (Tarde 1890; Chapin 
1928; Dodd 1958; Colemian| 1964; Hägerstrand 
1965). It has even been suggested that the neo- 
lithic era must have been characterized by this kind 
of homogeneous population and, consequently, that 
there was an essentially uninterrupted, world-wide 
spread of innovations moving at a constant rate 
(Edmonson 1961). 

Differential activation of communication net- 
works. It should also be noted that a social struc- 
ture of interacting individuals typically includes 
many different networks of communication that are 
differentially activated by different topics. For ex- 
ample, the drug study referred to earlier found that 
the network of professional relations among phy- 
sicians was particularly active early in the history 
of the new drug, whereas the network of friendship 
relations among physicians began to transmit in- 
formation and influence only after the drug was 
better established (Coleman, Katz, & Menzel 1957). 
An attempt to replicate the same study in a large 
city suggested that only the professional network, 
but not the friendship network, was active; this 
implies that the size of a community may affect the 
degree of overlap and specialization, as well as the 
different functions, of different networks of inter- 
personal relations. A parallel example may be found 
in the fact that different social classes employ dif- 
ferent kinds of networks for transmitting similar 
communication content; this is the case with lay 
networks of referral and advice giving in the seek- 
ing of medical help (Freidson 1961). By the same 
token, the culture of a community defines the dif- 
ferential appropriateness of different interpersonal 
channels for the different sorts of communication; 
in a traditional society, for example, channels 
reserved for sacred communications are violated 
by the transmission of secular communications 
(Eisenstadt 1955). 

Creation of new networks, Sometimes people 
go outside their interpersonal networks to send or 
receive influence. One way of looking at revolution- 
ary change, for example, as compared with changes 
in fashion or technology, is to examine the extent 
to which new networks are created for the trans- 
mission of influence. Each successive fashion tends 
to travel through the same networks, simply suc- 
ceeding the fashion that came before. Revolution- 
ary change, on the other hand, may often be char- 
acterized as both a disruption of the traditional 
networks in which individuals have been embedded 
and as the creation of new channels of communi- 
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cation. Another occasion on which individuals may 
go beyond their traditional networks of communi- 
cation is during disasters or other types of major 
catastrophe. The death of President Kennedy, for 
example, found people telling the news to total 
strangers on the street—and discussing it with 
them (Greenberg 1964; Greenberg & Parker 1965). 
interpersonal communication to think of how much 
It is a good illustration of the latent functions of 
more than mere information was being conveyed 
in these conversations. 

ELIHU KATZ 


[See also COMMUNICATION, MASS; DISASTERS; MAss 
SOCIETY; SOCIOMETRY.] 
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DILTHEY, WILHELM 


Wilhelm Dilthey (1833-1911), German philos- 
opher, historian, literary critic, and biographer, was 
professor of philosophy at Basel in 1867, at Kiel 
from 1868 to 1870, at Breslau from 1871 to 1881, 
and at Berlin, where he succeeded Hermann Lotze 
in 1882. He had studied philosophy at Heidelberg 
under Kuno Fischer and at Berlin under Friedrich 
Trendelenburg; while at Berlin he had also been 
deeply influenced by such leading historians as 
August Béckh, Jacob Grimm, and, above all, Leo- 
pold von Ranke. His chief interest as a philosopher 
was in the logic and methodology of the historical 
and social studies (Geisteswissenschaften). His 
conclusions are set forth in the Einleitung in die 
Geisteswissenschaften (see Gesammelte Schriften, 
vol. 1) and in the unfinished “Entwiirfe zur Kritik 
der historischen Vernunft” (ibid., vol. 7, pp. 189- 
291). 


The “Geisteswissenschaften” 


Dilthey agreed with Kant and the positivists in 
their rejection of metaphysics but differed from the 
positivists in that he did not accept natural science 
as a model that the Geisteswissenschaften should 
follow. Natural science can do no more than ex- 
plain (erklären) observed events by relating them 
to other events in accordance with natural laws. 
These laws tell us nothing of the inner nature of 
the things and processes that we study. But with 
human beings there is a sense in which it is pos- 
sible to go behind observable actions to something 
internal: we may understand (verstehen) their 
actions in terms of their thoughts, feelings, and 
desires. We can know not merely what a man does 
but the experiences (Erlebnisse), the thoughts, 
memories, value judgments, and purposes that 
have led him to do it. 

Knowledge in this field is not, as in natural 
science, merely phenomenal and external. We have 
direct insight into the transitions whereby percep- 
tions lead to thoughts, these to feelings, and these 
again to desires and acts of will. Such connections 
constitute the “structure” of the individual person- 
ality, and the understanding of them is also the key 
to a wider understanding of historical processes. 
Because men can communicate with one another, 
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one man’s experiences can arouse thoughts and 
feelings and lead to actions on the part of other 
men as well as himself, and thus the individual 
“structural” pattern ramifies and becomes the life 
pattern of social groups, of nations and civiliza- 
tions. The historical life of mankind is a continual 
process of interactions of this kind, and to under- 
stand a particular event or action or utterance, we 
must see it in this kind of context. 

Dilthey’s Geisteswissenschaften are a somewhat 
heterogeneous group of subjects. They include an 
experimental and generalizing science (psychol- 
ogy), a study of individual persons and societies 
in the concrete particularity of their lives and ac- 
tions (history, biography, autobiography ), and nor- 
mative and valuational studies (jurisprudence, 
moral theory, political theory, literary criticism, 
etc.). What all these have in common, according 
to Dilthey, is that they are all aspects of the study 
of human life and experience and that that study 
is not complete unless they are all brought in. 
Taken together, the Geisteswissenschaften show 
that men do live under conditions that can to some 
extent be formulated in general laws, whether of 
the individual psyche or of social groupings. Men 
are intelligible to us as individuals and interesting 
to us precisely because of their individuality and 
uniqueness. And all human experience and activity 
are shot through with choices, preferences, value 
judgments. Because human life as known to us is 
in itself more complex and many-sided than the 
phenomena of nature, the Geisteswissenschaften 
must also be a more various and many-sided body 
of disciplines, and no one method or principle can 
govern them all. They are all, however, dependent 
on our ability to understand the “structural” pat- 
tern of experience and thereby to see human be- 
havior from within. 

In the Einleitung in die Geisteswissenschaften 
Dilthey also argued that psychology has a funda- 
mental place among these studies. He was not 
thinking of the experimental science of psychology 
as we know it today but of a descriptive and com- 
parative kind of psychology that would culminate 
in a theory of personality types. Such a theory 
would be a useful tool in all the Geisteswissen- 
schaften. While the Einleitung endorsed psychol- 
ogy, it was critical of sociology, which Dilthey con- 
sidered to be a pseudo science. He was thinking 
of sociology in the grand manner, as conceived by 
Auguste Comte and Herbert Spencer, a study em- 
bracing al] forms of cultural as well as social life 
and comparable in the vastness of its range to a 
philosophy of history such as Hegel's. Such a grand 
synthesis, Dilthey held, would not give unity to the 
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Geisteswissenschaften; they would achieve this 
unity only if world history were written in a way 
that made use of all the detailed insights that the 
Geisteswissenschaften can offer. Years later, in 
some notes made with a view to a revision of the 
Einleitung, Dilthey made it clear that he had no 
objection to sociology if it meant merely a com- 
parative study of different forms of social group- 
ings and stratifications. 


Historiography 

In the “Entwiirfe zur Kritik der historischen Ver- 
nunft” and elsewhere in his later writings, Dilthey 
laid less emphasis on the role of psychology and 
turned his attention to a philosophical analysis of 
the process by which one mind becomes aware of 
what goes on in another. The process may be 
summed up in the words experience, expression, 
understanding (Erlebnis, Ausdruck, Verstehen). 
We understand an expression by re-experiencing 
(nacherleben) in our own consciousness the expe- 
rience from which the expression arose. This re- 
experiencing is, of course, not a perfect reproduc- 
tion of the original experience; it is schematic, 
telescoped, incomplete, fallible. Dilthey distin- 
guished different types of expression and different 
degrees of accuracy and confidence with which they 
can be interpreted. His particular approach to the 
problem of understanding led him to an interest in 
hermeneutics, that is, in the possibility of laying 
down principles and working rules for the guidance 
of those whose work is the interpretation of written 
texts. He showed how a theory of hermeneutics 
arose in patristic times out of the needs of scrip- 
tural exegesis, how it was developed under the 
influence of Reformation controversies and the 
beginnings of Biblical criticism, and how it was 
generalized and made into a philosophical disci- 
pline in the nineteenth century by Friedrich Schlei- 
ermacher. And taking the art of understanding 
expressions as the underlying factor common to all 
the Geisteswissenschaften, he showed that there is 
an easy transition from personal experience to auto- 
biography, thence to biographical and historical 
writings, thence to the more abstract and gener- 
alizing studies and the sectional disciplines, and 
finally to the grand synthesis in world history. 

Dilthey’s doctrine of understanding as re-expe- 
riencing is open to question. Some may feel that 
it is too intense, too intimate and personal, and 
that he expects of the historian and social scientist 
too much of the poet's or novelist’s gift. But by 
raising the question in the way he did, Dilthey 
touched off a lively and fruitful discussion, both 


among philosophers interested in the theory of 
knowledge and among those historians and social 
scientists who are interested in the aims and 
methods of their disciplines but are not satisfied 
with a statistical and behavioristic approach. 

Dilthey’s interest in historical method was shared 
by Wilhelm Windelband and Heinrich Rickert; but 
these, while better equipped than Dilthey in the 
logical techniques of philosophy, had no compa- 
rable experience of the actual work of historical 
writing. Dilthey was himself a biographer (Das 
Leben Schleiermachers 1870; “Die Jugendgeschichte 
Hegels,” Gesammelte Schriften, vol. 4, pp. 1-187); 
a historian of ideas (“Auffassung und Analyse des 
Menschen im 15. und 16. Jahrhundert,” ibid., vol. 
2, pp. 1-89); and a literary historian and critic 
(Das Erlebnis und die Dichtung . . . 1905). 


Relativism 


Dilthey was one of the proponents of the doc- 
trine known as historicism, which insists that all 
human customs, institutions, and ideas are condi- 
tioned by the historical circumstances in which 
they arise and flourish and that although every 
society and every individual thinker professes to be 
in possession of objective truth, an outside ob- 
server can always see how this “truth” is condi- 
tioned by social and historical factors. Applied un- 
critically in the theory of knowledge, this view can 
lead to a historical relativism, that is, to the doc- 
trine that all “truth” is relative to time and place 
and that objective knowledge is impossible. Dil- 
they’s view can also lead to a psychological rela- 
tivism. He believed that a man’s Weltanschauung, 
the complex of his beliefs and judgments concern- 
ing ultimate questions, is determined as much by 
his psychological structure and basic attitudes as 
by valid reasoning from sound premises. He de- 
veloped a typology of Weltanschauungen; the basic 
types are naturalism, the idealism of freedom, and 
objective idealism. Naturalism means that one 18 
impressed chiefly by the impersonal order of na- 
ture; idealism of freedom, that one gives priority 
to the unique status of man as a free agent; and 
objective idealism, that one conceives of the uni- 
verse as an organic whole. Schools of art, and re- 
ligious and philosophical systems, can be classified 
by their conformity to and expression of one of the 
three main types of attitude or, as may happen, of 
any combination of these. 

Views such as these seem to verge on an ultimate 
skepticism. Controversy has arisen both about the 
merits of Dilthey’s argument in itself and about 
the degree to which he personally drew skeptical 


conclusions. He was in fact no skeptic and did not 
believe that his principles must lead to skepticism; 
he believed rather that in those spheres where 
empirical methods can be applied, which include 
some sections of the Geisteswissenschaften as well 
as the natural sciences, real discoveries and real 
progress can be made, and there is objective knowl- 
edge. It is in the realm of value judgments and life 
attitudes that he felt that relativity is inescapable, 
but also that proper acceptance of it can lead to an 
enrichment of life rather than to frustration (Hodges 
1952, pp. 310-314), Karl Jaspers, for instance, 
who in his early work was influenced by Dilthey’s 
typology of Weltanschauungen (see 1931), has 
moved on to a form of existentialist philosophy, 
and this is another possible outcome of Dilthey’s 
teaching. 

H. A. Hopces 


[See also History, article on THE PHILOSOPHY OF HIS- 
TORY; KNOWLEDGE, SOCIOLOGY OF; and the biog- 
raphies of MANNHEIM; MEINECKE; ORTEGA Y 
GASSET.] 
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DIPLOMACY 


There are at least two senses in which the term 
“diplomacy” is used: the first and more narrowly 
defined refers to the process by which governments, 
acting through official agents, communicate with 
one another; the second, of broader scope, refers to 
modes or techniques of foreign policy affecting the 
international system (Nicolson [1939] 1964, pp. 
13-14), 

In the past it was believed that the narrower 
notion of diplomacy embraced all official contacts 
and connections of a peaceful nature between state 
units (Satow [1917] 1962, p. 1). It is now clear 
that it does not do so: governments have means of 
communicating officially that could scarcely be 
called diplomatic. In 1962, during the Cuban mis- 
sile crisis, official messages were broadcast to save 
time and the protagonists negotiated in a number 
of other unorthodox ways (Schlesinger 1965). More 
generally, public statements of policy, speeches by 
influential leaders, and revelations to the press have 
served as means of direct contact with foreign 
states. In its restricted definition, then, diplomacy 
refers specifically to the use of accredited officials 
for intergovernmental communication, not simply 
to communications links between states. 

Origin and development of diplomacy. Employ- 
ment of diplomatic envoys is as ancient as polities 
themselves (C. D. Burns 1931), but not until the 
fifteenth century were the first permanent legations 
established. The Italian states inaugurated the am- 
bassadorial system, which rapidly spread to the rest 
of Europe. Until the beginning of the nineteenth 
century two classes of diplomatic representatives 
were utilized: ambassadors, who were obliged to 
vie for precedence in the capital to which they were 
assigned; and semiofficial agents, who, though less 
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involved in court functions, did not have access to 
fully authoritative sources of information. At the 
Congress of Vienna in 1815, four categories of rep- 
resentatives were established: (1) ambassadors, 
papal legates, and nuncios; (2) envoys extraordi- 
nary and ministers plenipotentiary; (3) ministers 
resident; and (4) chargés d’affaires. Precedence 
was to be based on the rank of the appointment 
conferred by the home government and on seniority 
of service in the particular capital. Thus were 
enunciated the basic diplomatic conventions as we 
know them today (Nicolson [1939] 1964, pp. 28, 
31-33). 

Until the twentieth century, members of the 
diplomatic corps were recruited from the wealthy 
classes (C. D. Burns 1931). Those selected were 
generally amateurs, whose rank and social position 
entitled them to consideration for diplomatic ap- 
pointment. Examinations, when required, placed 
inordinate emphasis upon linguistic competence, 
and a degree of financial independence was a pre- 
requisite. By World War 11 an appreciable democra- 
tization and professionalization of foreign services 
had occurred (Ilchman 1961). Competitive sub- 
stantive examinations requiring high educational 
attainment had been instituted in most major coun- 
tries (Nicolson [1939] 1964, pp. 208-218). Inde- 
pendent means was no longer a requirement for 
entry into the diplomatic corps, and women be- 
came eligible for appointment. 

Impact of technology on diplomacy. With the 
technological revolution of the twentieth century, 
the role of the diplomatist has changed appreciably. 
In the 1920s diplomatic officials were still given a 
certain latitude by their controlling agencies on 
questions of secondary importance; only in a crisis 
did they act merely as messengers (C. D. Burns 
1931). Today the reporting function has become 
virtually all-encompassing; negotiations are largely 
conducted by foreign offices, and diplomatic repre- 
sentatives are utilized only for transmission of re- 
quests and responses. The daily traffic of United 
States messages alone totals more than 400,000 
words (U.S. Congress . . . [1964] 1965, p. 584). 
Rewards in the American Foreign Service are dis- 
tributed partly on the basis of the variety and 
quality of cables sent to Washington. We are now 
far removed from Jefferson’s complaint in 1791 to 
William Carmichael, American chargé in Spain, 
that he had received only one dispatch from him 
in 2% years (ibid., p. 584). This is not to say that 
ambassadors exercising their independent judg- 
ment do not occasionally make determinations of 
very great importance (Rusk [1964] 1965, p. 582; 
Merchant 1964, pp. 123-124). At the time of the 


Dominican crisis in 1965, Ambassador Bennett 
made assessments of crucial significance concern- 
ing the need for United States military intervention; 
these were heeded and acted upon in Washington. 
Once information and interpretation have been pro- 
vided, however, decisions are taken in national 
capitals. Even on fairly minor matters the local 
ambassador possesses little power (Kennan [1964] 
1965, pp. 589-590). 

Diplomatic proliferation. The decline in de- 
cision-making authority of the individual diploma- 
tist is partly correlated with the vast expansion of 
diplomatic missions since World War n. In 1930 
the London diplomatic corps totalled 56 embassies 
and legations, the largest including only 17 staff 
members (C. D. Burns 1931). In 1964 London had 
96 foreign missions, with an average of 13.6 diplo- 
mats each. The United States housed diplomatic 
establishments from 107 countries. If each of the 
almost three thousand American diplomats abroad 
in 1964 had been allotted independent negotiating 
tasks, the foreign relations of the United States 
would soon have been in disorder (Brams 1966, 
pp. 39, 42). Even communications with overseas 
missions posed great difficulties. Of the 1,300 in- 
coming daily cables, the secretary of state saw only 
twenty to thirty; of the one thousand outgoing 
cables, he read about six. Increasingly, lesser of- 
ficials and desk officers have become responsible for 
the information on which policy determinations are 
made (Rusk [1964] 1965, p. 578). 

Another important feature of present-day diplo- 
macy is its multilateral character. In 1963 the 
United States belonged to intergovernmental organ- 
izations having a gross membership of 1,141; 103 
states were represented in this total. France held 
an even larger number of common memberships 
(Brams 1966, p. 49). In the same year the United 
States attended over 400 international conferences 
and participated in more than 10,000 votes (Mer- 
chant 1964, p. 128). As a consequence, much of 
the organization of foreign ministries has been de- 
termined by the need to prepare for and develop 
positions in multilateral bodies (Beloff 1961). The 
result has probably been to influence in some meas- 
ure the content of foreign policy (E. Haas 1958; 
1964). 

Changing diplomatic techniques. The second 
and more broadly defined meaning of “diplomacy 
applies to modes or techniques of foreign policy 
affecting the international system. In this extended 
sense diplomatic techniques have undergone a con- 
siderable metamorphosis since the eighteenth cen- 
tury (Rosecrance 1963). The extensive use of 
propaganda, subversion on a wide scale, and the 


manipulation of national economic instruments for 
foreign policy purposes have greatly enlarged the 
range of multilateral dealings on the world scene. 
Now even cultural and educational exchange may 
be seen as a tool in the cold war (Coombs 1964). 
To be sure, a plentiful armory of diplomatic weap- 
ons has been the stock in trade of state action 
since the time of Machiavelli. The most vigorous 
use of these implements, however, awaited the 
French Revolution and the fruition of nationalism 
(Rosecrance 1963). And it was not until the 1930s 
and World War u that the “old diplomacy” was 
transformed (Craig 1961, pp. 23-25). 

The evolution of techniques was, at the same 
time, an alteration of the boundary between domes- 
tic and external politics. While domestic affairs 
during the eighteenth century had remained largely 
unaffected by wars, in the twentieth violence came 
to be expressed internally, as well as externally 
(Scott 1964), Subversion, propaganda manipu- 
lation, and economic pressure combined to sap the 
sinews of domestic strength. 

Of equivalent importance, new systems of action 
appeared in international diplomacy. The totali- 
tarians inaugurated a new form of diplomacy, con- 
sisting of bluff, bluster, and intemperate attack. 
They did not hesitate to sign agreements with 
democratic powers, knowing that they could vio- 
late them later with impunity. 

After World War 1 the emergence of a large 
number of new nations added a new dimension to 
the practice of diplomacy. In 1914 approximately 
twenty states had abiding interests in foreign rela- 
tions, and of these all but two or three were Euro- 
pean (Craig 1961). A generation after World 
War u there were more than 115 states engaged in 
world politics and more than half were Afro—Asian. 
These new nations operated on behalf of divergent 
objectives, and the techniques they employed were 
often distinct (Binder 1958). The hypothesis could 
be entertained that the new nations represented a 
separate subsystem of international relations. In 
the twenty years following World War 1 an essen- 
tially European system of international relations 
had been transformed into a world system. The 
world system was not fundamentally European: it 
consisted of a congeries of separate regional sub- 
systems, unified only at the topmost level of inter- 
action. Under the circumstances it was not sur- 
prising that nations should follow different “rules 
of the game.” Perhaps the most characteristic dif- 
ference between European and Afro—Asian systems 
was illustrated in attitudes toward the balance of 
power. Even minor shifts in Europe brought forth 
a countervailing powerful coalition to prevent fur- 
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ther imbalance; in Asia even fundamental altera- 
tions in the power balance did not stimulate forma- 
tion of an opposing bloc (Rosecrance 1956). 

Diplomatic patterns. Diplomatic techniques are, 
of course, related to diplomatic patterns. Twenty 
years after the close of World War 11 it was still 
difficult to discern the direction in which interna- 
tional constellations were moving. In the immedi- 
ate postwar era, bipolarity was an accepted general 
description of the state of the international system. 
As a range of new nations attained independence 
and refused to align with either of the two major 
blocs, however, multipolarity was hailed as the 
emergent system of the future (Rostow 1960). Some 
authors found a mixed system most realistic, with 
bipolarity characteristic of military matters and 
multipolarity of political matters (Liska 1963). 
Still others declared the two major alliance systems 
to be incidents of the balance of power process; 
alliance patterns would be reshaped as new threats 
emerged, and the world would revert to the com- 
binations and procedures of the nineteenth cen- 
tury (Bull 1964). 

Part of the debate revolved around the nature 
of dynamic processes: international politics could 
be seen as a sociological process, with political out- 
comes dependent upon the balance and intensity 
of communications flows (Deutsch 1953); it could 
be seen as a power process, with political outcomes 
dependent upon the external impact of other actors 
(Hinsley 1963). If diplomacy depended partly upon 
cultural communications and political, economic, 
and historical ties, some degree of North Atlantic 
cohesion might survive after cooperation was no 
longer strictly necessary on grounds of threats to 
the peace. If diplomacy was a product of military— 
political factors only, the quiescence of Russia and 
the resurgence of China should produce entirely 
new constellations of force. 

Material considerations were also relevant. In 
economic and technological terms the Soviet Union 
and the United States were likely to remain the 
dominant international powers until at least the 
beginning of the 21st century (Waltz 1964). Their 
rate of industrial development, as compared to that 
of the new nations, was likely to increase their pre- 
eminence. The challenges to technological bipo- 
larity were to be found within or on the fringes of 
major-power alliance systems. Japan, western Eu- 
rope, and China would be independent power cen- 
ters, in economic-technological terms, at some 
point in the future. These powers, if they pursued 
separate international courses, could create an 
oligopolistic international order, partially akin to 
the five-power confraternity of the eighteenth and 
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nineteenth centuries. For a considerable period of 
time, however, western Europe seemed destined to 
remain disunited: the relaxation of cold war hos- 
tilities, if it diminished the need for Atlantic co- 
hesion, also reduced the momentum behind Euro- 
pean integration. And as long as western Europe 
was divided, it would remain a tempting prize in 
the cold war and a reason for continued, if limited, 
hostility between the Western and the Soviet camps. 
In the circumstances, it was interesting that the 
divergence of interests between the haves and have- 
nots led to so little conflict along North-South lines. 
East-West disputes continued to be controlling 
(Russett 1965; Alker 1964). Such patterns, if per- 
petuated, suggested the projection of a modified 
bipolarity well into the future. 

Determinants of diplomacy. Future patterns of 
diplomatic action depend, of course, on the same 
basic factors that underlie international order. 
Some writers have viewed these as primarily domes- 
tic and sociopsychological, arguing that wars are 
caused by internal tensions and that domestic con- 
flicts, explicit or latent, spill over into the interna- 
tional arena (Freud 1930; M. Haas 1964; and 
Rosecrance 1963). There has been some historical 
evidence for this proposition in the period since 
the French Revolution: the working out of domes- 
tic revolutionary issues often had consequences for 
the international system; patterns of external peace 
often were correlated with periods of internal sta- 
bility (Rosecrance 1963). And yet this was not 
always the case: military power could be either a 
facilitative or a restraining influence (Hinsley 
1963). 

It was possible even to advance contrary hypothe- 
ses; that domestic change might be caused by inter- 
national factors (Liska 1963) or, at the least, that 
domestic conflict was not clearly related to foreign 
conflict in certain recent periods (Rummel 1963). 
Such approaches led directly to claims that the form 
of the international system—specifically the lack of 
a strong supranational framework, capable of regu- 
lating international conflict—ensured the continu- 
ance of war (Waltz 1959), Whatever the validity 
of such contentions (and some evidence pointed in 
another direction: Deutsch et al. 1957), they could 
not be fully tested in the absence of international 
government. World government, moreover, was, as 
nearly everyone admitted, unobtainable, and theo- 
rists of diplomacy were forced back on less funda- 
mental remedies for international strife. At this 
level no particular relationship was found between 
specific alliance patterns and the existence of war 
(Singer & Small 1965a, p. 35). 

Some writers, however, observed a connection 


between the levels of international threat and a 
conflict spiral ending in war. The argument was 
that if threats (or the perception of injury ) reach a 
certain magnitude, a state is tempted to go to war 
regardless of the military consequences. This theory 
was intended as an explicit refutation of the deter- 
rent hypothesis, i.e., that states never attack when 
they believe their opponent is stronger at the time 
they must make the decision. Its advocates claimed 
that political threats overbore military threats 
(Zinnes et al. 1961). 

This argument is based upon a partial mis- 
understanding of deterrent mechanisms that was 
developed by military writers (Ellsberg 1960). De- 
terrence is not simply a matter of respective mili- 
tary proportions; it is inextricably connected with 
the probability that military forces might be used. 
If a state was sure that its opponent was going to 
attack it anyway, that state might decide to attack 
first, despite an unfavorable balance of military 
force, in order to reap the advantages of the initia- 
tive. If the status quo had worsened decisively from 
the point of view of one state, it might wish to 
attack (even though the probability of its winning 
was very small) in order to avoid the disastrous 
consequences of the situation. In the theory of 
deterrence, the risks of not striking had to be 
assessed alongside the risks that striking entailed 
(Wohlstetter 1959). Deterrence calculations, then, 
did not rest solely or even primarily on the balance 
of military postures. Nor did it follow that military 
deterrence analyses prescribed in crisis situations 
an increased military threat against an opponent in 
order to forestall his possible decision for war. 

Because of the threat posed by nuclear weapons, 
it was generally believed that their acquisition by 
additional countries would raise the risks of war. 
Such an impact could not be discounted (Beaton 
& Maddox 1962); neither, however, on the basis of 
available evidence, could it be confirmed. There re- 
mained the possibility that nuclear weapons might 
turn out to be an expensive detour, useful for pres- 
tige purposes but irrelevant for war. As smaller 
states opted for nuclear capabilities, larger ones 
were concentrating on conventional and counter- 
insurgency postures. The very success of deterrence 
in preventing nuclear war raised the stakes and 
possible rewards of internal violence and warfare 
(Huntington 1962). In the longer run deterrent 
stability seemed likely to focus attention on con- 
ventional arenas, in which middle powers might 
contend much more equally with the great states 
for position and influence than they would be able 
to do in a nuclear arms race. Under such conditions 
the spread of nuclear weapons would merely divert 


resources from those types of conflicts in which 
major political and military gains would be likely 
and enhanced bipolarity could actually result [see 
DETERRENCE]. 

The prospects for diplomacy. Different inter- 
national constellations would require different diplo- 
matic methods to attain desired outcomes. A bipolar 
order places major emphasis upon military compe- 
tition between two great blocs. Intrabloc diplomacy 
becomes coalition management, in the face of ex- 
terior threat, As long as the danger of war between 
blocs is not negligible, individual interests are sub- 
merged in a common opposition to the external foe. 
In such circumstances diplomacy proceeds by mili- 
tary reassurance of allies and military deterrence of 
enemies. Integrated or quasi-supranational military— 
political mechanisms may be set up within blocs. 
Between blocs only the most tenuous diplomatic 
connections need be maintained. In other words, a 
bipolar world largely substitutes military for diplo- 
matic skills and operates within clearly distin- 
guished arenas of common and opposed objectives. 

In a multipolar order, on the other hand, ene- 
mies and allies are no longer clearly differentiated 
and different standards of diplomatic competence 
and practice are therefore required. International 
outcomes will be determined as much by diplo- 
matic skill as by military force. As long as no single, 
salient threat to the international system emerges, 
alliances and antagonisms will be tentative and 
short-lived. Since the major powers will by and 
large be able to fend for themselves, military ar- 
rangements with other states will be less necessary. 
Increments to national position will accrue more 
from diplomatic than from military feats. A truly 
multipolar order will require a vastly increased 
diplomatic corps and enhanced diplomatic pre- 
science. Since political combinations will be hard to 
predict and the number of significant actors very 
large, diplomatic intelligence and research will be 
all-important. 

In the intermediate case, that of an oligopolar 
order, military arts would continue to exert an im- 
portant influence, while the number of crucial na- 
tional actors would rise to five or six. Major mili- 
tary capacities would be confined to the international 
oligopoly; other states would not play a significant 
military role. Relations between the large powers 
would require a delicate balancing of military and 
diplomatic techniques. Changes in combinations 
among the ruling powers might have a decisive im- 
pact upon the security of one of their number. 
Diplomatic virtuosity akin to that practiced in the 
eighteenth century would be a necessary supple- 
ment to nuclear deterrence. A united oligarchy 
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would have theoretical supremacy over the rest of 
the world; its internal divisions, however, would be 
likely to sanction diplomatic contests for the alle- 
giance of smaller states. Again military and diplo- 
matic variables would contend for influence. 

A tentative assessment of these possibilities 
would stress the growing influence of diplomacy. 
If hostilities are focused and raised to peak in- 
tensity, military factors are dominant; if they are 
diffused and lessened, diplomacy becomes pre- 
eminent. Even in a bipolar order, diplomacy has 
played an important part: nations might have 
wished for war or an overturn of past patterns of 
international outcomes; if they used coercion to 
achieve their objectives, however, its expression 
had to be disciplined and restrained. Negotiation 
was as essential an ingredient as proportionate 
force. In the 1960s it remains possible that there 
will be in the future a revival of historic diplo- 
macy: if unlimited violence cannot be tolerated, 
diplomacy may flourish anew. 


R. N. RoSECRANCE 


[See also ALLIANCES; COMMUNISM, article on THE 1N- 
TERNATIONAL SYSTEM; FOREIGN POLICY; INTERNA- 
TIONAL POLITICS; NEGOTIATION; SYSTEMS ANALYSIS, 
article on INTERNATIONAL SYSTEMS. Guides to other 
relevant material may be found under INTERNA- 
TIONAL RELATIONS and WAR.] 
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DISARMAMENT 


From the time that men first organized them- 
selves into exclusive social groupings, they have 
relied upon the threat or use of physical violence 
as a major means of influencing the behavior of 
neighboring groups. As the size of the group con- 
trolling the tools of violence increased, so did the 
destructiveness of these tools; thus, by the mid- 
twentieth century, that group was the national 
State, and its tools included nuclear warheads, 
intercontinental missiles, and biochemical weap- 
ons. However, the acquisition and maintenance of 
national armaments is never a wholly attractive 
enterprise, and some incentives for the absence, 
reduction, or elimination of such capabilities al- 
ways exist. Since the considerations affecting de- 
cisions to arm and disarm are highly interdepend- 
ent, it might be useful to examine them in sequence 
at the outset. 

Incentives to arm. First, the traditions of the 
international community make it clear that armed 
forces are the most tangible evidence of a nation’s 
independence and sovereignty. Second, armed 
forces at the disposal of the governing elites can 
be, and often are, used to impose and maintain 
domestic order. Third, their establishment provides 
jobs (and training) for otherwise underemployed 
people and legitimate economic activity for the 
nation’s industry and technology (Benoit & Bould- 
ing 1963). 

In addition to these symbolic and domestic con- 
siderations, political elites may arm their nations 


in order to secure and enhance the nation’s position 
in the still ungoverned international community. 
Legal norms, moral restraints, and political institu- 
tions at the international level may all impinge on 
such efforts, but a nation’s power remains the most 
effective basis for pursuing its self-defined interests. 
And though there is increasing evidence that other 
components are beginning to replace military ca- 
pability in a nation’s power equation, arms are still 
the primary component. Thus, national leaders will 
seek to develop and maintain military capability 
not only in order to use it but to threaten to use it. 
The actual employment of such capability could 
be for the purpose of weakening or destroying an- 
other nation, or of defending or counterattacking 
if such action were undertaken in the first place. 
More often, however, military capability exists in 
order to be brandished or alluded to in the normal 
processes of diplomatic confrontation and bargain- 
ing; the objective is to back up demands and re- 
quests or to resist those made by others. Finally, 
armed forces may be used or kept in readiness for 
such disparate purposes as carrying out enforce- 
ment obligations under international organizations, 
such as the League of Nations or the United 
Nations, or for occupying the territory of a defeated 
nation, 

Incentives to disarm. Normally, there is a sharp 
discrepancy between the number of weapons and 
men that nations retain in peacetime and the num- 
bers deployed in wartime. Since the diplomatic and 
military effectiveness of a country’s armed forces 
is a function of the size and quality of one’s allies 
and one’s adversaries, there is little incentive for a 
nation that supports the status quo to arm up to 
full capacity unless the potential adversaries are 
doing so or are expected to do so. This tendency 
toward modest levels of military preparedness is 
reinforced by two essentially domestic considera- 
tions. First, voluntary enlistments in most societies 
are relatively low, and conscription is usually un- 
popular in peacetime. Second, the designing, test- 
ing, building, and maintaining of military hardware 
are quite costly, and unless it is acquired from out- 
side, under a military aid program, the money must 
be raised by some form of public or corporate tax- 
ation. Lower taxes are always preferable to the 
public, and if taxation is relatively high, the gen- 
eral preference is for its application to services of 
a more socially useful nature. Finally, in many 
societies there is a deep suspicion of, and hos- 
tility to, the military establishment on ideological 
grounds, 

Given this combination of incentives and disin- 
centives, it becomes clear that the size and qual- 
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ity of a nation’s armed force represent a com- 
promise between the two sets of pressures, Only 
within the context of these two types of pressure 
can a nation’s tendency to arm or disarm be under- 
stood. Even in periods of extreme diplomatic tran- 
quillity, there will be serious pressures toward an 
increase in capability even in nations having no ag- 
gressive designs; likewise, pressures for arms re- 
duction often exist even in nations at, or over, the 
brink of war. It is in response to these contrasting 
external and internal forces that nations get caught 
up in both armament and disarmament “races.” 

Arms races. The cost and danger of an arms 
race, or the actual consequences of one, usually 
produce the incentives for disarmament. If we bear 
in mind the contrasting economic, political, and 
military considerations at work, it becomes evident 
that an arms race is a highly reciprocal social 
process, involving interaction not only between the 
governments of the nations involved but also con- 
flicting factions within and across the national 
boundaries. In the absence of effective interna- 
tional government, nations have no real source of 
security other than their own power. In seeking to 
maximize such power vis-a-vis others that might 
threaten their security, nations compete with one 
another for prestige, markets, raw materials, water- 
ways, territory, allies, and spheres of influence. 
The pursuit of such goals by one nation is often 
detrimental to the interests of another, which can 
resist by using various diplomatic, economic, and 
psychological techniques. Most often, these clashes 
of interest are temporarily resolved by tacit or nego- 
tiated compromise, but occasionally both parties 
commit themselves to goals that are clearly incom- 
patible and not susceptible to such settlement. The 
goal seems important enough to one to justify an 
abnormal allocation of its resources to military 
capabilities, and thwarting that effort seems equally 
important to the other. At this point, the normal 
level of internation competition is exceeded, and 
one or both parties try to improve their bargaining 
position by seeking temporary allies, by increasing 
their individual capabilities, or by some combina- 
tion of the two. Once either party does this, the 
other must either make serious concessions or try 
to counteract by increasing its own bargaining 
power. As each invests further in its own strength, 
its decision makers feel justified in increasing 
their demands, making the cost of capitulation 
that much higher for the other. If that cost then 
seems prohibitive, its decision makers must im- 
prove their capacity to resist or to win. 

Whether increases in power are sought in arms 
or in allies, the domestic sector must be increas- 
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ingly mobilized. In order to produce the manpower, 
money, and material for such bargaining, the popu- 
lation must be persuaded that its vital interests 
or its very survival are at stake. Gradually, this 
propaganda program succeeds in associating com- 
promise with treason and conciliation with ca- 
pitulation. 

Thus, even while there is still time to turn back, 
the policy makers seldom can afford to do so; the 
public—or at least the political opposition—would 
raise the cry of treason. At this point, the process 
is largely out of the hands of those who initiated 
the arms race in the first place, and serious diplo- 
matic bargaining becomes increasingly difficult. 
While the conflict may terminate in a gradual de- 
celeration of weapons procurement, eventual pub- 
lic disenchantment, the appearance of new and 
more pressing conflicts, or negotiated settlement, 
the odds increasingly favor war as the likely 
outcome. 

The above is a highly simplified and abstract 
treatment of what seems to have been the pattern 
of arms races over the past two centuries or so. 
Unfortunately, no scholar has yet attempted a 
comprehensive and comparative study of arms 
races in the modern international system. But some 
useful studies do exist and could provide a solid 
basis from which to proceed. A Study of War 
(Wright 1942, appendix 22) provides data for the 
major powers from 1820 to 1937, and “A Mathe- 
matical Study of the Present Arms Race” (Smoker 
1963) covers the 1949-1960 period. A less quanti- 
tative but more theoretical effort is “Arms Races: 
Prerequisites and Results” (Huntington 1958), 
which offers a penetrating verbal analysis from the 
Franco-British naval race of 1840-1846 to the post- 
World War 11 race between the United States and 
the Soviet Union. In Arms and Insecurity (Richard- 
son 1960) we find the nearest approximation to a 
comparative, quantitative analysis, but it covers 
only the races preceding both world wars [see 
RIcHarRpDson]. Finally, there are studies of indi- 
vidual arms races (Hirst 1937; Sloutzki 1941). 
Thus the interpretation offered here must be recog- 
nized as tentative and prescientific, resting as it 
largely does on incomplete data, anecdotal recol- 
lection, and theoretical deduction. 


Types of disarmament 


No formal definition of disarmament has been 
offered here since the phenomenon may take many 
forms. In the light of historical experience as well 
as logical possibilities, it is useful to classify the 
types of disarmament along two particular dimen- 
sions. One such dimension is the degree of arms 


reduction, and the other may be called the reci- 
procity dimension. The first refers to the quanti- 
tative and qualitative extent of a nation’s military 
reductions, the second to the extent to which its 
reductions are matched, or in some fashion recipro- 
cated, by other nations. 

Degree of arms reduction. The “degree of dis- 
armament” continuum is not, in fact, limited to 
disarmament measures in the literal sense. In 
accord with diplomatic custom and scholarly usage, 
we include arms control as well as a number of 
intermediate restraints that fall well short of 
reduction. 

Total disarmament. At the upper end of the 
continuum is total or complete disarmament, in- 
volving the elimination of all military capability 
beyond that defined as necessary for the main- 
tenance of domestic order. That minimal level may 
include local and national police, border guards, 
or perhaps even a modest paramilitary force for 
antiriot types of duty, depending on the treaty pro- 
visions, the requirements imposed by other nations, 
or the unilateral decision of the disarming nation 
itself. Complete disarmament has been proposed 
frequently (most recently in the partially com- 
patible Soviet and American draft treaties on gen- 
eral and complete disarmament of March 22 and 
April 18, 1962, respectively) but it has never been 
achieved by formal negotiations among the major 
powers. It has, however, been imposed on defeated 
nations by victorious ones at the close of war, but 
seldom with lasting effect. As commitments shift 
according to the vagaries of international align- 
ments, the victors either acquiesce in gradual re- 
armament, as after World War 1, or even encour- 
age it, as did the United States and the Soviet 
Union soon after World War 11. 

Arms control. At the opposite and lowest end 
of the degree continuum are arms control meas- 
ures. These are provisions that may not call for 
reduction or prohibition of any weapons, yet have 
the effect of inhibiting a nation’s full development 
of a given weapon category. Arms control measures 
do not directly prohibit the production or possession 
of that weapon type but rather seek to work in- 
directly by limitations or prohibitions on the test- 
ing, deployment, or use of it. Leaving out the last 
(once weapons are made and deployed, any com- 
mitment not to use them is unlikely to survive even 
the mildest temptation after hostilities begin), We 
note that limits or prohibitions on testing may well 
prevent, retard, or diminish the quality of the weap” 
on’s production. Or, for a variety of political, mili- 
tary, or technological reasons, it may be desirable 
to permit production (and thereafter possession ) 
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but to restrict or ban their deployment within a 
given geographical region. One of the rare suc- 
cesses in negotiating a reciprocal ban on testing 
was the August 5, 1963, Treaty of Moscow, which 
prohibited the signatories from experimental det- 
onations of nuclear weapons in outer space, in the 
atmosphere, or under water. Although a number 
of demilitarized frontiers have been arranged in 
the past, today’s major powers have been unable to 
negotiate any regional deployment prohibitions. 
Arms control also may be applied to certain less 
obvious measures having no effect at all on the 
quantity or quality of a nation’s armed forces. 
Normally, they deal with information about mili- 
tary capability rather than the capability itself and 
generally require more information than is avail- 
able through normal military intelligence channels. 
In the past, much emphasis of this sort was on 
publishing military budgets, war college curricula, 
manning and organization tables, results of weap- 
on experiments, inventories, and war plans; often 
observers were invited to attend maneuvers or to 
verify published information. As the advance of 
military technology enhanced the offense at the 
expense of the defense, however, surprise attack be- 
came an increasingly serious concern for the de- 
fenders of the status quo. As a consequence, much 
of the information exchanged (or proposed for 
exchange) in earlier days has become a potential 
aid to the aggressor; the types of information ex- 
change proposed in the post-1945 era are intended 
to redress that imbalance and to aid the defender 
(more accurately, the retaliator, since effective 
strategic defense has become nearly impossible). 
For example, it has been proposed that information 
regarding the location of land-based or sea-borne 
retaliatory weapons be kept secret and that the 
potential adversary commit himself not to seek out 
such information. In a similar vein, but tending 
toward more information, foreign observation posts 
and limited aerial inspection within a nation’s 
boundaries have been discussed as a means of im- 
proving the early-warning system of its neighbors. 
Closely related to such techniques for reducing 
the advantage of the attacker are those intended 
for crisis control. In 1963 the Soviet Union and the 
United States installed a direct communication 
line between their two capitals, so that prompt and 
reliable information might be sought and trans- 
mitted during crises; the objective here was to 
mitigate the ever-increasing probability of acci- 
dental war because of erroneous reading of intel- 
ligence reports or radar and sonar displays. As in 
prior periods, the intent is to reduce the incentive 
to attack by reducing the fear of being attacked, 
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and the objective remains one of preserving the 
relative military status quo. 

A final form of arms control (liberally defined) 
is neutralization. In such a case, the neutralized 
nation or region is not necessarily prohibited from 
acquiring or maintaining armed forces but is pro- 
hibited from joining a military alliance. Generally, 
the neutralized region is either outside the sphere 
of influence of two potential enemy coalitions or 
in an area in which they overlap; hence such an 
arrangement usually requires formal or tacit nego- 
tiation between major powers. Although the en- 
forcement of neutralization may be assigned to an 
international organization or an ad koc multina- 
tional commission, its continuance depends on the 
major powers’ willingness to merely deny the 
region to the other rather than acquire it for 
themselves [see DISENGAGEMENT]. 

Partial disarmament. In the middle of the con- 
tinuum are those measures known as partial dis- 
armament. They may cover (a) incomplete reduc- 
tions in all weapon categories or (b) complete 
reductions in some categories, or (c) some com- 
bination of the two. In the first case, the tendency 
has been to seek across-the-board cuts or limits 
based on a budgetary or a manpower ceiling, the 
idea being that adherence to such a ceiling will 
compel nations to keep their armaments down. 
Normally, the plan is to permit each nation to allo- 
cate its military resources, within the budgetary 
restriction, to whichever weapon types it sees fit. 
The monetary maximum for any given year or 
period may, in turn, be based on average expendi- 
tures during a prior period, on the highest expendi- 
ture during a given prior period, or on some ratios 
among the signatories, to preserve their relative 
power positions. If a mobilized manpower ceiling is 
contemplated, similar baselines or else a percent- 
age of each nations’ total of male population or of 
males in a given age bracket, may be used. Normal- 
ly, a manpower ceiling prescribes numerical and 
age limits not only upon active duty personnel but 
on reserves of different categories, and it will also 
set minimum periods of active duty in order to pre- 
vent the rapid training of many reserves by high 
turnover in a relatively small active force. 

Another version of the across-the-board, but 
partial, measure is that based on real or hypo- 
thetical war-making units. A German periodical in 
1909 proposed, for example, that each nation be 
allotted one unit for each 700 in its population and 
that 10 army men or 50 tons of ship be thought of 
as one unit. Thus, under this scheme the 63 mil- 
lion Germans would have been entitled to 90,000 
units and the 45 million British to 64,300, to be 
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allocated among ships and men as the decision 
makers of each country thought best. 

When the second type of partial reduction— 
complete prohibitions in a few categories—is at- 
tempted, the prospective signatories are left free to 
arm fully in the nonproscribed categories; such 
partial provisions are also referred to as qualita- 
tive disarmament. This type may be negotiated, 
imposed, or unilaterally undertaken in order to 
(1) avoid the expenditure for more elaborate or 
costly weapons, (2) eliminate the more inhuman 
ones if war occurs, (3) make the soldier’s burden 
lighter in weight, (4) strengthen the defense vis-a- 
vis the offense in order to make aggression less 
attractive, or (5) compensate for the asymmetries 
arising out of the different geographical or tech- 
nological security needs of particular nations. All 
of the above considerations have been explicitly 
noted either in formal proposals or in actual agree- 
ments. Qualitative measures might embrace the 
prohibition, elimination, or a ceiling upon mobile 
artillery, rifles above a certain caliber, bullets under 
a certain weight, railroads within a certain distance 
of a border, ships above a certain length or ton- 
nage, ships capable of submerging, armor plates 
of a given thickness or hardness, aircraft capable 
of carrying bombs, chemical weapons, nuclear 
weapons on board space satellites, delivery vehicles 
of given range, and so forth; they may also em- 
brace certain classes of trained personnel. The 
incentives may not be completely pacifistic, and 
the results may actually be to increase a nation’s 
over-all military capability, but these partial meas- 
ures clearly belong under the rubric of disarma- 
ment. 

Reciprocity of disarmament measures. Turning 
to the second dimension—reciprocity—the major 
question is whether a given nation disarms uni- 
laterally and unconditionally or receives (or ex- 
pects) reciprocal behavior from one or more other 
nations as a condition of its own limitations or 
reductions. There are many possible positions on 
this continuum, five of which will serve to explore 
and define it. 

Unilateral disarmament. At one end is a na- 
tion's elimination of its armaments and demobiliza- 
tion of its military personnel without any quid pro 
quo at all. Such a unilateral move might be 
prompted by the decision makers’ conviction that: 
(1) the maintenance, threat, or use of armed force 
is morally wrong; (2) the domestic economy can- 
not afford the cost of the military establishment; 
(3) public opinion strongly favors such a move; 
(4) war is extremely unlikely in the relevant fu- 
ture; (5) diplomacy does not require the threat of 


force behind it; (6) alternative sources of influ- 
ence, such as wealth, skill, or prestige, are more 
effective; (7) other relevant nations are, or have 
finished, disarming unilaterally; (8) others will 
probably follow this example; (9) others will not 
exploit a disarming or disarmed nation; (10) third 
parties will intervene to prevent any attempted 
diplomatic exploitation of the disarmed nation; 
(11) others will defend it in case of attack; or 
(12) alternative responses to invasion, such as 
nonviolent resistance or noncooperation, will be 
more effective than military resistance. Although 
not exhaustive, this list suggests the possible at- 
tractions of unilateral disarmament, even in a 
largely ungoverned world. When and if the po- 
litical and military efficacy of global organiza- 
tion increases, such incentives would tend to be 
magnified. 

Multilateral disarmament. At the opposite end 
of the reciprocity continuum would be the disarma- 
ment agreement embodied in a bilateral or multi- 
lateral treaty, following formal negotiations. In 
such a situation, every reduction undertaken by 
one nation is conditional upon the reductions ac- 
cepted by others, The primary objective here is to 
assure the preservation (or, if possible, the im- 
provement) of one’s relative military power posi- 
tion vis-a-vis the other signatories, both during and 
at the end of the disarmament process. Normally, 
such negotiations involve nations of approximate 
military parity which are already in partial conflict 
with one another, and although some or all of the 
unilateral incentives may be motivating the negoti- 
ators, the inevitable lack of trust between some of 
the parties creates serious domestic and interna- 
tional inhibitions to success. The propensity to 
compromise is generally low, the bargaining is 
hard, and the treaty (if ever concluded) tends to 
be full of detailed procedures, contingencies, and 
exceptions. 

Somewhere between the pure unilateral and the 
pure reciprocal type of disarmament is imposed 
disarmament. Although usually the consequence of 
formal negotiation, it is negotiation between na- 
tions of extremely disparate power, however im- 
permanent that disparity may be. And although it 
may be nearly unilateral in its effects, the domi- 
nant bargainer normally accepts certain limited re- 
strictions on its own armed forces. The extreme 
form of imposed disarmament is the one accepted 
by the defeated nation following a war whose con- 
clusion is close to unconditional surrender, such 
as the disarmament of the Axis powers after 
World War 1m. In its milder version, it is a form 
of demilitarization that is demanded by one oF 


more major powers and acquiesced in by minor 
ones. 

At an intermediate point between the unilateral 
and the imposed forms is a semivoluntary process, 
undertaken on the premise that failure to do so 
voluntarily might require the nation to disarm 
later under duress. This usually occurs in minor 
powers which are clearly within a major power's 
sphere of influence. At the other intermediate 
position, between the multilateral and the imposed 
forms, is the tacitly negotiated arrangement, in 
which the political realities make it mutually evi- 
dent that the nations are overarmed for their 
security purposes. As the perceived need for armed 
forces diminishes, one nation may express that 
perception by some minor reductions, and others 
may respond in kind, thus setting in motion a 
reciprocal process of apparently voluntary disarma- 
ment, However, if one party levels off at a certain 
state of preparedness or reverses the process and 
begins to rearm the others are likely to follow suit. 


The negotiating process 


Of the five degrees of reciprocity outlined above, 
the one that is most frequently attempted, most 
frequently unsuccessful, and yet most relevant to 
the search for international stability and peace is 
the formally negotiated, highly reciprocal agree- 
ment; hence this article’s emphasis on it at the ex- 
pense of the other forms. Before turning to certain 
other considerations in, and obstacles to, negoti- 
ated disarmament, let us examine the effects of 
the forum or setting within which these negotia- 
tions might take place. Two dimensions are most 
relevant here: (a) which nations participate and 
(b) under whose auspices? 

Since reference here is not to an imposed ar- 
rangement, it is understood that at least two of the 
negotiating nations are powerful enough to resist 
dictation by one another but are not sufficiently in 
agreement to dictate to third parties. Consequently, 
the locale and composition of the negotiations is 
itself a subject for bargaining. The party whose 
position is most likely to be politically attractive to 
third parties or who wants to bring extra pressure 
to bear on the negotiating partner will tend to ad- 
vocate a larger number of participants and may 
urge that the sessions be held at the seat of, or 
under the auspices of, the relevant international 
organization, such as the League of Nations or the 
United Nations. On the other hand, preference for 
smaller, isolated sessions may reflect either a na- 
tion’s unwillingness to bargain seriously or a con- 
viction that the technicalities and sensitivities of 
disarmament require private negotiation among 
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the major powers only. During the post-World 
War 1 period, the Soviet Union and many of the 
nonaligned nations have favored the large and 
open conference, while the United States and its 
allies have tended to prefer private negotiations 
among a small number of major powers. As a con- 
sequence, both approaches have been used, but 
with equally limited success. 

Despite the advantages of quiet and privacy that 
the smaller session affords, the larger forum is gen- 
erally to be preferred. First, when the key nations 
are ready to parley in earnest, their representatives 
can easily meet in private. Second, the sharp bi- 
lateral cleavages become blurred by the involve- 
ment of third and other parties, with the key pro- 
tagonists under pressure to satisfy their demands. 
Third, while some of these pressures will be purely 
nationalistic ones, many will create new incentives 
for compromise on the part of the major protago- 
nists. Fourth, relations between the representative 
and his home government become more compli- 
cated by the injection of unexpected people, and 
he may be given more bargaining latitude. Finally, 
even if one side is more interested in propaganda 
than diplomacy, the other side has an opportunity 
for face-to-face education of those third parties 
who might otherwise be easily misled at a distance. 

A final point in regard to the negotiation of dis- 
armament concerns the composition and latitude 
of the separate delegations. Generally speaking, if 
they arrive with rigid instructions and are required 
to clear all departures from them with the foreign 
office or defense ministry, the prospects for success 
will not be good. It is equally unsatisfactory for a 
negotiator to be sent under quite flexible conditions 
only to discover that the bargains he has struck 
will not be honored by his government, owing to 
his lack of effective support at home. In this same 
vein, a government can easily paralyze negotiations 
by sending delegates who may have considerable 
bargaining freedom but whose personal interests 
and associates are opposed to a treaty. At the first 
Hague conference, for example, the United States 
was represented by Admiral Mahan, the British by 
Admiral Fisher, Germany by Colonel von Schwarz- 
hoff, and so on; such men could not be expected 
to bargain away the basis of their careers and 
prestige, and they were quite adept at discovering 
and articulating technical and tactical reasons for 
rejecting the tsar’s proposals, It is one thing to have 
military experts on hand and quite another to give 
them a dominant role in the negotiations. More 
recently, the difficulty has been compounded by 
assigning engineers and physical scientists to the 
delegations (or to the groups responsible for in- 
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structing the delegations)—the careers and pres- 
tige of these men are often dependent upon the 
very weapon programs whose termination the dele- 
gates are allegedly trying to negotiate. 

Studies of almost all conferences and negotia- 
tions have been made by historians, political scien- 
tists, and lawyers. Perhaps the two most useful 
general analyses of the several interwar negotia- 
tions are Madariaga (1929) and Tate (1942). 
Among those concentrating on the post-World 
War 1 efforts are Bechhoefer (1961), Spanier and 
Nogee (1962), and Jacobson and Stein (1966). 
[See NEGOTIATION. ] 


Direction and phasing 


Up to this point we have made two simplifying 
assumptions. One is that there is no essential dif- 
ference between reducing armament levels down 
to some specified level and permitting increases up 
to such a level. The fact that in one case a nation 
begins with more than the agreed capability and 
must therefore come down to it and that in the 
other case the nation begins with less than the 
ceiling and is free to go up to it must obviously 
have some important implications, The other sim- 
plification is that the rate at which a nation moves 
up or down to a given threshold in a given weapon 
category need not be a source of concern and that 
the nature of the limit or prohibition is of major, 
if not sole, concern, It is equally evident that phas- 
ing can be profoundly impertant to both the dis- 
armament process and its outcome. 

With regard to the matter of direction, experi- 
ence and logic both point to the advantage of 
negotiating or deciding upon a ceiling or prohi- 
bition before it is breached, rather than after. First, 
for every weapon developed and produced, national 
decision makers have had to invest a certain 
amount of money, national prestige, and their own 
personal reputations. Thus, these weapons have a 
high symbolic as well as security value and they 
will not be given up without a high price. Second, 
the technicians who developed the weapons, the 
manufacturers and laborers who produce them, 
and the military personnel who maintain them 
have profited financially and psychologically in the 
process; they are not only interested in continuing 
such profits, but they are likely to have more po- 
litical influence after performing these roles than 
they had before. Third, due to the inspection prob- 
lem, there is less certainty that the potential ad- 
versary is reducing his weapons than not adding 
to them, creating serious risks in the security 
realm. Fourth, the decision makers will be uncer- 


tain how many of a given weapon type the others 
have and will therefore be reluctant to engage in 
reductions that possibly increase the others’ nu- 
merical advantage. 

Conversely, negotiations are likely to be more 
successful when material and psychic investments 
are low, the antireduction lobby has not yet been 
created or strengthened, the military risks are 
lower, and the inspection requirements are mini- 
mal. To put it another way, it is less difficult not 
to arm than to disarm. 

Closely related to the direction of arms limita- 
tion is its phasing, or the relative speed with which 
all or certain classes of weapons are to be limited, 
reduced, or eliminated. The critical nature of this 
problem is manifested when we recall that no two 
nations have precisely the same security require- 
ments, even if neither is interested in aggression. 
From the first Hague conference of 1898 through 
the pre-World War 1 negotiations, for example, 
Britain’s heavy reliance on sea power posed certain 
difficulties quite distinct from those facing France, 
Germany, Russia, and other land-oriented powers. 
Likewise, in the Soviet-American negotiations of 
the post-1945 period, the discrepancies in the two 
powers’ needs and capabilities made successful 
bargaining especially difficult. There is not only the 
hoary problem of quantitatively equating battalions 
and battleships, tanks and submarines, bombers 
and missiles, but the fact that even successful 
equating of them can be dangerous to one party or 
the other. Thus, if one side is superior in long-range 
strategic forces and the other in localized conven- 
tional forces because of differing location, tech- 
nology, demography, and military doctrine, it may 
feel justified in demanding the elimination of all 
or most of the other's forces before proceeding to 
any reduction of its own. Or one may require over: 
seas bases more than the other, making negotiation 
about bases nearly impossible. f 

But even if these were not such complicating 
qualitative differences, the phasing problem would 
pose awkward obstacles to negotiated disarma- 
ment. On the one hand is the necessity to reduce 
personnel, budgetary, or weapon levels fast enough 
to (a) make it clear that there is a disarmament 
process under way; (b) escape from the instability 
of a heavily armed world; and (c) take advantage 
of and perpetuate the temporary period of mutual 
confidence that would probably accompany the 
signing and ratification of the treaty or convention. 
On the other hand are the fears of reducing ones 
capabilities before (a) there is assurance that 
others are doing likewise; (b) there is evidence 


that the others have given up their “aggressive” 
designs; and (c) alternate modes of national pro- 
tection have been developed. Further complicating 
the timing and phasing problem is the likelihood 
that some signatories, seeking revision of the global 
distribution of influence, will have developed and 
are prepared to use nonmilitary or other indirect 
techniques of waging international conflict; the 
nations that support the status quo may, therefore, 
insist on a very gradual and prolonged disarma- 
ment schedule. Finally, all parties will have some 
fear that whatever inspection and control agencies 
are created will be unduly influenced or dominated 
by the others. 

It is probably safe to say, on the basis of the few 
available analyses of disarmament negotiations, 
that the issue of phasing has been as much a his- 
torical obstacle to success as has the question of 
which weapons should be reduced or limited to 
what levels. 


Inspection 

Given the likelihood that disarmament negotia- 
tions will generally take place, if at all, within an 
atmosphere of moderate to high distrust and fear, 
it is reasonable to expect suspicion regarding com- 
pliance of whatever agreements might be reached. 
To weaken one’s major source of security without 
some assurance that the others are doing likewise 
is so unattractive a prospect that inspection is 
generally considered an essential adjunct of arms 
reduction or limitation. 

Such inspection may be carried out by other 
signatories, by ad hoc international commissions, 
or by international organizations, and the objects 
of such inspection may be budgets, war plans, 
training facilities, weapon installations, factories, 
transport junctions, or entire regions, depending 
upon the activities proscribed by the agreement. 
Just as in the reduction schedule itself, the prob- 
lem of phasing in inspection is often 3 major 
source of failure: how much inspection is neces- 
sary and safe at which stages of reduction? In 
order of increasing intrusiveness, inspection may 
focus upon (a) the destruction or dismantling 
process to ensure that the reduction schedule is be- 
ing followed; (b) test or production facilities to 
inhibit or prevent the acquisition of a weapon e 
or (c) present inventories (or a sample thereof) 
to ensure that a given ceiling has not been exceeded. 

Often the negotiations reveal a marked differ- 
ence in attitude toward the rate at which inspec- 
tion should be introduced, and its intensiveness or 
extensiveness, once established. The dilemma is 
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dramatized when we consider that a nation may 
refuse to consider any reductions or limitations 
until it knows the size and quality of the other’s 
inventories, while inspection for such purposes 
would be the most far-ranging, onerous, and costly. 
Yet the demand for even a modest form of pre- 
reduction surveillance, before any arms reduction, 
may well be interpreted by another nation as mo- 
tivated by espionage intentions only. These basic 
difficulties can be mitigated if the negotiations and 
early reductions occur in an environment of moder- 
ate trust, permitting each side to tolerate more 
inspection and demand less. Or high inspection 
requirements can be mitigated by remote surveil- 
lance techniques, made increasingly possible by 
such advanced technology as intercontinental radar 
and orbiting camera satellites, or modern computer- 
based analysis of full production, inventory, and 
cost records. 

Although the above principles of inspection may 
continue to hold, the rapidity of technological in- 
novation produces constant change in both the 
problems encountered and the techniques avail- 
able. There is already a rich literature covering 
inspection and its technological aspects (Melman 
1958), legal aspects (Henkin 1958), and psycho- 
logical aspects (Milburn et al. 1961); many an- 
thologies deal with the interplay of politics, 
strategy, and technology (Brennan 1961; Frisch 
1961; The American Assembly 1961; and Singer 
1963). The many hearings and reports of the 
United States Senate Subcommittee on Disarma- 
ment, the Congressional Joint Committee on Atomic 
Energy, and reports of the various United Nations 
conferences are highly informative and detailed. 


Enforcement and protection 


Whether a disarmament agreement provides for 
on-site or remote, human or machine, or adversary 
or international inspection, one problem always 
remains: how to respond to suspected or verified 
evasions? 

At one extreme is the emphasis on unilateral 
abrogation, and at the other is the demand for pun- 
ishment by some international or supranational 
agency. Neither seems feasible, In all past, present, 
and future processes of arms reduction or limita- 
tion, many evasions can be expected; some will be 
intentional, some will be accidental, and many 
more will merely be suspicions of an evasion, If the 
purpose of inspection is to generate mutual con- 
fidence, responses must be graduated and appro- 
priate. Thus, suspected violations should produce 
increased inspection, not a renewed arms race or 
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premature punishment; extreme sanctions would 
be relevant only as evasions become patently inten- 
tional and serious. When such a situation develops, 
of course, the signatories are at a critical turning 
point, the outcome of which will depend on the 
intentions and fears of the respective decision 
makers, the distance already traveled toward ful- 
fillment of the agreement, and the strength of the 
international institutions established in conjunc- 
tion with the arms arrangements. 

Considerations of enforcement, of course, go to 
the very heart of the disarmament problem. Re- 
calling the earlier discussion of the incentives 
affecting a nation’s armament policy, we note that 
the major source of national security has been and 
largely remains that of military capability relative 
to potential enemies. If the decision is to limit or 
reduce such capability, it is usually on the assump- 
tion of reciprocal measures by other nations. But 
even if reciprocal reductions were to carry most of 
the world’s nations down to complete disarmament 
levels without any upset by violation—and this has 
so far never occurred—there would still remain 
not only the problem of inspection and enforce- 
ment in a disarming world, but also that of con- 
flict control and protection in a disarming and a 
disarmed world. Put another way, nations require 
supranational agencies both to help in carrying 
through any disarmament process and to protect 
them at the end of the process. Incompatibilities 
among nations might well be as serious and dan- 
gerous after disarmament as during it, and con- 
flicts all along the way are inevitable, Without 
supranational institutions, no disarmament process 
of any significance is likely to be completed, and 
without them in a world of already disarmed na- 
tions, the rearmament process would be nearly 
inevitable. 

If and when nations disarm, therefore, they 
need to arrange, in a cautious and gradual man- 
ner, both the diminution of their own capabilities 
and the accretion of the international organiza- 
tion’s capabilities. These shifts of relative capa- 
bility must, of course, embrace both the military 
and the political realm, requiring the gradual 
build-up of supranational peace-keeping facilities. 
Implicit in such shifts is an inevitable modification, 
however slow and imperceptible, in the structure 
and the culture of the international system, and 
although all the specifics need not be negotiated in 
advance, the major powers and many of the minor 
ones must be quite aware of the transition they are 
setting in motion. Therefore, there must be agree- 
ment on the rules by which the rules will be made 


and modified, even though the concrete future is 
only dimly perceived. 

There may be less radical paths to disarmament, 
but they are unlikely to be irreversible ones. His- 
torically, nothing approximating global disarma- 
ment has been successfully negotiated, and the 
more restricted attempts have tended to be short- 
lived, With no alternative means of self-protection 
and conflict resolution, the nations have consist- 
ently returned to armament, regardless of their 
peaceful intentions. 


Research and policy 


Although much of the analysis presented here 
may convey an impression of certainty, such an 
impression would be misleading; the above discus- 
sion is based almost entirely on “trained intuition” 
and impression. The same must be said of almost 
all the writing on the subject: whether the study 
is by university scholars, governmental or inter- 
governmental agencies, journalists, or private re- 
search organizations, the results fall wel] short of 
the most modest scientific criteria. Where hard 
comparative evidence, gathered by visible and 
replicable observational procedures, is called for, 
we find haphazard impressions or highly selective 
recollections. Where an explicit and disciplined 
theoretical framework is called for, we find incon- 
sistent and incompatible arguments based all too 
often on the immediate demands of a highly ephem- 
eral political “reality.” Because few of those who 
conduct research bearing on disarmament and na- 
tional security are trained in or committed to social 
science methods, most of the writing on the subject 
cannot be taken as a reliable guide to policy. In no 
nation of the world today (as far as can be ascer- 
tained) is any significant amount of rigorous Te- 
search on the political, social, or psychological 
aspects of disarmament under way. As indicated 
above, however, some high quality work on eco- 
nomic obstacles to, and consequences of, disarma- 
ment, has been completed, and on the technological 
side some excellent research has been and con- 
tinues to be undertaken. Under the U.S. Arms Con- 
trol and Disarmament Agency—which in general 
has shown little scientific competence or political 
courage in the social science research it has Cons 
ducted or supported—several dozen studies of high 
quality on inspection and verification have been 
completed to date. 

Part of the difficulty lies in the fact that most of 
the political and strategic problems fall in the 
domain of political scientists and lawyers, and 
neither field has a strong tradition of scientific 


method. And when scholars from the more rigor- 
ous social sciences (such as psychology and soci- 
ology) or from the physical sciences and engineer- 
ing move into this research realm, they tend to 
leave their methodological norms behind, while re- 
sponding more to their ethical values or the 
demands of role and career. 

Admittedly the research problems are difficult 
and complex, given the lack of historical prece- 
dents and parallels to the contemporary situation, 
but several options are nevertheless available. One 
would be the systematic examination of those 
aspects of diplomatic and military history which 
are comparable to the present, and there are many 
such partial parallels waiting to be identified and 
described. Another would be the search for basic 
principles in governmental decision making, public 
opinion formation and its change, internation 
bargaining, etc. A third might be the systematic 
examination of analogous situations in such other 
empirical domains as intranational politics, labor- 
management relations, racial conflict, small group 
experiments, and the like. Finally, there is the pos- 
sibility of rigorous simulation of real world situa- 
tions using role-playing human subjects, more 
formal computer simulations, or some combination 
of men and machines for the replication and anal- 
ysis of present and projected situations. Some 
progress in the application of this latter strategy 
has been made, but the validity of such simulations 
is highly dependent upon inputs derived from the 
other three approaches, and they have not yet been 
provided in the quantity and quality that is neces- 
sary. [See SIMULATION, article on POLITICAL 
PROCESSES. ] 

The prospects for disarmament are not promis- 
ing. The combination of domestic and foreign in- 
centives and constraints tends to inhibit political 
elites that might want to negotiate arms reduction 
agreements. National citizens, propagandized as 
they are by the elites, lack the information and 
motivation that might permit them to modify the 
domestic incentives of the elites, And even if these 
obstacles could be overcome, the policy makers 
themselves have so weak a research base upon 
which to make their political judgments that they 
are unlikely to make the correct decisions even if 
they want to. An effective combination of good 
will, courage, knowledge, and luck is required if 
the nations are to move toward meaningful dis- 
armament, and at this writing we seem to be con- 
siderably short of achieving that combination. 

J. Davip SINGER 
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[See also DipLomacy; PEACE; War. Other relevant 
material may be found in INTERNATIONAL RELA- 
TIONS; MILITARY.] 
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DISASTERS 


The word “disaster” evokes many different men- 
tal images and is used in diverse ways. In popular 
speech, the term is often used loosely to refer to 
any sudden, unexpected, or extraordinary misfor- 
tune, regardless of whether it occurs to an individ- 
ual, a family or other small group, a community, 
a region, a nation, or the entire world. 

Social scientific interest tends to center on rela- 
tively large-scale community or societal disasters— 
the sudden or rapidly developing events that dis- 
rupt the prevailing order of life and produce dan- 
ger, injury, illness, death, loss of property, or other 
severe privations to large numbers of people resid- 
ing within a common geographic area. Such events 
are produced by a variety of natural and man-made 
destructive agents, including earthquakes, epidem- 
ics, floods, hurricanes, tidal waves, tornadoes, vol- 
canic eruptions, explosions, fires, and wartime 
bombing attacks. 

In formal sociological terms, a disaster may be 
defined as “an event, concentrated in time and 
space, in which a society, or a relatively self- 
sufficient subdivision of a society, undergoes severe 
danger and incurs such losses to its members and 
physical appurtenances that the social structure is 
disrupted and the fulfillment of all or some of the 
essential functions of the society is prevented” 
(Fritz 1961, p. 655). Viewed in this way, a disaster 
is an event that disturbs the vital functioning of 
a society. It affects the system of biological survival 
(subsistence, shelter, health, reproduction), the 
system of order (division of labor, authority pat- 
terns, cultural norms, social roles), the system of 
meaning (values, shared definitions of reality, 
communication mechanisms), and the motivation 
of the actors within all of these systems. 

Disasters differ in various ways: in the nature 
of the precipitating agent (earthquake, explosion, 
etc.); in their source of origin (natural forces or 
human actions); in their degree of predictability, 
probability, and controllability; in their speed of 

onset (instantaneous, progressive); in their scope 


(focalized, diffused); and in their destructive ef- 
fects on people, physical objects, and the natural 
environment. Human behavior differs somewhat in 
relation to each of these features of disaster, and 
also within a given disaster by spatial zones, time 
periods, type of involvement, nature and degree of 
prior preparation and training, frequency and 
recentness of previous disaster experience, and by 
differences in culture and personality. The behav- 
ioral responses of disaster survivors also vary in 
relation to the type of protective actions taken by 
responsible political authorities in the predisaster 
and postdisaster periods, and in accordance with 
the human management techniques used by the 
organized disaster warning, control, relief, and 
rehabilitation agencies. Detailed and specific com- 
parisons of disaster behavior must take these and 
other distinctions into account. This article reviews 
the typical forms of behavior that have been dis- 
covered in a wide variety of disaster research 
studies. 

Significance of disaster studies. Disasters pro- 
vide a realistic laboratory for testing the integra- 
tion, stamina, and recuperative power of large-scale 
social systems. They provide the social scientist 
with advantages that cannot be matched in the 
study of human behavior under more normal or 
stable conditions. Disasters compress vital activities 
into a brief time span and bring normally private 
behavior under public observation; social processes 
and linkages between social and personal charac- 
teristics thus become much more visible. Cycles 
and patterns of human behavior that usually span 
many years are enacted in a matter of hours, days, 
or months. Because disasters disrupt and under- 
mine social distinctions and force people to make 
critical choices under similar conditions, they also 
provide the social scientist with a unique oppor- 
tunity to study human nature and the basic proc- 
esses of social interaction. ; 

The findings of disaster research have obvious 
practical applications. They provide foreknowledge 
of the social and psychological conditions brought 
about by disaster: in other words they enable us 
to know in advance what the survivors would know 
afterward. Thus they provide a basis for develop- 
ing more realistic and effective preparations for 
disaster warning, control, relief, and recovery. 

Historical review of disaster research. Although 
there is nothing novel or modern about man’s 
desire to know what happens in disasters, the s0- 
cial scientific study of human and organizational 
behavior in disaster is relatively new. The first 
attempt to apply systematic social science concepts 
to the study of disaster was Samuel H. Princes 


investigation of the munitions ship explosion in 
the harbor of Halifax, Nova Scotia, in 1917. His 
pioneering study of the social effects of disaster 
has provided a major source of stimulating ideas 
and hypotheses for subsequent investigators 
(Prince 1920). During the period between this 
study and World War 11, empirical studies of dis- 
aster were conducted sporadically, primarily by 
single investigators using personal observation 
techniques or small interview and questionnaire 
samples. Representative examples of the research 
conducted during this period are Prasad’s study 
(1935) of rumor in the Indian earthquake of 1934, 
Kutak’s study (1938) of the social effects of the 
flood of 1937 in Louisville, Kentucky, and Mira’s 
psychiatric observations (1939) of the effects of 
bombing attacks on civilians during the Spanish 
Civil War. 

World War II and postwar research. The World 
War 1 bombing of British cities stimulated a large 
number of useful observations and individual stud- 
ies by psychologists and psychiatrists. Many of 
these studies, together with government-sponsored 
research on the problems encountered in evacuat- 
ing civilians from London during the blitz, have 
been reviewed by Titmuss (1950). 

Immediately following World War 11, the United 
States conducted large-scale sample surveys of 
German and Japanese cities that had been sub- 
jected to bombing attacks (U.S. Strategic Bombing 
Survey 1947a; 1947b). These retrospective studies 
were concerned primarily with indicators of morale 
rather than disaster behavior. Janis later reana- 
lyzed these data, summarized the findings of other 
wartime studies, and developed a more systematic 
psychological interpretation of disaster behavior 
(1951). 

In the 1950s, a planned program of disaster 
research studies emerged, stimulated and sup- 
ported by various government agencies charged 
with responsibility for handling the hazards in- 
volved in the new arsenal of nuclear, chemical, 
and biological weapons. The devastating power of 
these new weapons dramatized, as never before, 
the need for developing more adequate, systematic 
knowledge of the social and psychological con- 
sequences of widespread disasters. In Canada, 
Tyhurst (1951) conducted immediate postimpact 
studies of individual reactions to several disasters. 
In 1950, the National Opinion Research Center 
began a four-year program of disaster field studies 
in various disaster-struck communities throughout 
the United States (Marks & Fritz 1954). Similar 
field study programs were undertaken by the Uni- 
versity of Oklahoma Research Institute (Logan & 
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Killian 1952), by the University of Maryland 
Psychiatric Institute (Powell 1954), and by the 
University of Texas (Moore 1958). 

In 1952, the National Academy of Sciences— 
National Research Council appointed the Commit- 
tee on Disaster Studies (reorganized in 1957 as 
the Disaster Research Group), This group sup- 
ported a wide-ranging program of disaster studies 
in the United States and several other countries. 
Its 19 “Disaster Study Series” publications (1956— 
1963), together with numerous unpublished mate- 
rials, constitute a major contribution to the social 
scientific literature on disasters. 

The Disaster Research Group was dissolved in 
1962, but its extensive bibliography and library 
were transferred in 1963 to the Disaster Research 
Center at Ohio State University, which serves as 
a centralized repository for disaster research infor- 
mation. This center has conducted numerous field 
studies of disasters throughout the world, including 
the Vaiont Dam flood of 1963 in northern Italy, 
the Alaskan and Japanese earthquakes of 1964, 
and the Chilean and San Salvador earthquakes of 
1965. Particular attention has been given in these 
studies to organizational responses to disaster. 

The volume of social scientific research studies 
on disaster has thus grown to large proportions. 
In 1961, an inventory of disaster field studies re- 
vealed a total of 114 studies of 103 different events 
(National Research Council 1961); these studies, 
together with the U.S. bombing surveys, produced 
nearly 22,000 interviews and questionnaires — 
about 15,000 obtained from individuals exposed 
to peacetime disasters and about 7,000 from Ger- 
mans and Japanese affected by World War 1 
bombings. By 1965, these totals had been aug- 
mented by over 25 additional field studies and by 
several thousand interviews. 

Common misconceptions of disaster behavior. 
One of the first tasks of systematic disaster re- 
search involved an act of intellectual debris clear- 
ance: the accumulated myths and distorted ideas 
of many centuries had to be cleared from the con- 
ceptual attic before more fundamental studies of 
human and organizational behavior in disasters 
could be made. 

The traditional picture of disaster behavior con- 
sists of lurid scenes of society and human nature 
in the process of disintegration. Most of these 
simple stereotypes have not stood the test of criti- 
cal research scrutiny. Contrary to the commonly 
held notions, disaster field studies have shown that 
panic is a comparatively rare form of behavior in 
disasters. Stricken populations do not, on the whole, 
become hysterical but help themselves and per- 
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form most of their own rescue and relief tasks. 
Only a few cases of looting have been reported. 
The major problem of crowd control is not so 
much the flight of victims from the disaster scene 
as the convergence of vast numbers of people from 
outside the disaster area. There are only isolated 
examples of the breakdown of moral codes. Emo- 
tional aftereffects are widespread but relatively 
transitory. Morale and optimism are quickly re- 
stored; indeed, the morale of the survivors may be 
abnormally high in some respects, Abatement of 
the usual social conflicts greatly strengthens social 
integration, and most of the social and personal 
behavioral pathologies of everyday life either fail 
to increase significantly or actually decline under 
disaster conditions, 

Disasters do indeed pose many serious problems 
of human planning and management, but the ac- 
tual problems discovered by research rarely coin- 
cide with those envisaged by the usual notions of 
disaster behavior. Moreover, the traditional stereo- 
types have obscured the fact that disasters produce 
not only disruptive and disorganizing effects but 
also reconstructive and regenerative human re- 
sponses. Both of these perspectives must be kept 
in mind in any balanced treatment of the social 
aspects of disaster. 

Disaster preparations. The minimization of so- 
cial disruption requires highly organized prepara- 
tions for disasters before they occur. It is difficult, 
however, to establish and maintain an adequate 
state of preparation under normal conditions, espe- 
cially if there has been no recent disaster experi- 
ence. Programs designed to prepare people for an 
uncertain future threat must compete with imme- 
diate and pressing human concerns: the day-to-day 
problems of earning a livelihood, protecting oneself 
and family from the daily dangers to life and 
health, and securing recognition, response, and 
Status in relations with members of one’s personal 
community. Most people are not vitally concerned 
with problems outside the orbit of their immediate 
personal lives, even when there is threat of war 
or other disaster. 

Disaster preparedness programs relying on indi- 
vidual initiative for developing protective measures 
have always failed because they ignore this over- 
whelming preoccupation with daily concerns and 
the universal human tendency to err on the side 
of normalcy by assuming that everything is all 
right until events clearly prove otherwise. As a 
consequence, the recognition of danger and prepa- 
ration for disaster are usually postponed until it 
is too late to prepare well-organized precautionary 
and protective measures. 


Warning and threat. The common tendency to 
deny that danger is at hand is illustrated in studies 
of disaster warning. Difficulties in public warning 
often start with the persons or agencies responsible 
for detecting the danger and issuing the warnings, 
They are usually reluctant to issue a specific warn- 
ing until they are reasonably certain that danger 
will actually materialize. Waiting for this degree 
of certainty has sometimes delayed the dissemina- 
tion of the warning until it is too late. 

Even where the existence, nature, and time of 
the danger can be adequately forecast, it is difficult 
to secure public acceptance of warning messages. 
People tend to seize on any vagueness, ambiguity, 
or incompatibility in the warning message that 
enables them to interpret the situation optimisti- 
cally, They search for more information that will 
confirm, deny, or clarify the warning message, and 
often they continue to interpret signs of danger as 
familiar, normal events until it is too late to take 
effective precautions. 

Disaster impact. When danger is recognized as 
imminent and personal, people seek safety or es- 
cape, and their behavior is generally adaptive. 
Rather than engaging in irrational acts, which 
increase the danger, they usually take action to 
protect themselves and their associates. The suc- 
cess of their actions depends to a large extent upon 
the possibilities available in the situation and the 
adequacy and accuracy of information that they 
possess. : 

Flight is one means of escaping danger, but it 
should not be equated with panic. It is more often 
orderly and goal-directed than disorderly or panic- 
stricken, and it is often the only rational choice if 
one is to survive. 

Even during violent impacts, most people con- 
tinue to be concerned about the safety of other 
people in their immediate surroundings and the 
welfare of their family members. The family group 
is a central object of orientation during the impact 
period and throughout all phases of the emergency: 

During and immediately after impact, people 
tend to define the situation in terms of what is hap- 
pening to themselves and their immediate sur- 
roundings and to think of it as something happen- 
ing only there. This tendency to underestimate the 
scope and destructiveness of the disaster makes for 
considerable variability in the initial behavior of 
disaster-struck populations; and it helps to account 
for the seemingly chaotic, confused, and conflict- 
ing behavioral patterns that the outside observer 
witnesses in the immediate postimpact period. 

Postimpact emergency. The most significant 
problems of behavior in the immediate postimpact 


period derive from social disorganization and unco- 
ordinated action rather than from irrational indi- 
vidual behavior or helplessness of the survivors. 
Many, if not most, of the immediate tasks of rescue 
and relief are handled informally by the disaster 
survivors before the arrival of outside aid. How- 
ever, much of this informal activity is sporadic 
and unsystematic. The separate and independent 
actions of many thousands of people and small 
groups, each trying to cope with the disaster in 
its own way, often produces duplication of effort 
or conflicting patterns of action. They constitute 
a “mass assault” on the problems of disaster that 
frequently accomplishes the essential tasks but 
with considerable inefficiency of effort. 

A persistent source of uncoordinated activity 
found in all disasters arises from anxiety over 
missing family members and friends, Within 
moments after impact, survivors who are separated 
from family members or other intimates begin a 
desperate search for them. These independent and 
separate search activities by large numbers of peo- 
ple produce conflicting patterns of action and often 
interfere with the performance of community- 
oriented tasks. 

Separation anxiety is sometimes an important 
problem for personnel who have important official 
disaster responsibilities. As Killian (1952) has 
noted, people are confronted with “role conflicts” 
in choosing between family duties and their more 
formal social roles. Most people quickly resolve the 
conflict in favor of loyalty first to intimates, and 
only then turn their attention to larger social- 
group loyalties. For people with clearly defined 
disaster jobs, however, the conflict may persist and 
cause considerable personal stress unless special 
provisions are made to assure them of the location 
and condition of their families. 

Convergence behavior. A central problem of 
coordination and control in disasters derives not 
from the victim population but from the informal, 
spontaneous “convergence behavior” (Fritz & 
Mathewson 1957) of persons living outside the 
disaster area, Shortly after impact, thousands of 
persons begin to converge on the disaster area and 
on first-aid stations, hospitals, relief centers, and 
communication centers near the area. Along with 
this movement of persons, incoming messages of 
anxious inquiry and offers of help from all parts 
of the nation and from foreign countries overload 
communication facilities; and tons of unsolicited 
equipment and supplies of clothing, food, bedding, 
and other materials begin to arrive. 

This convergence action, which 
days and weeks following a disaster, 


continues for 
seriously con- 
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gests transportation and communication facilities 
and hampers the administration of organized 
rescue, medical, emergency relief, and rehabilita- 
tion programs. It graphically demonstrates that the 
people affected by disaster are not confined to the 
immediate geographical area of destruction, death, 
or injury. All persons who are related to, or who 
identify themselves with, persons and organiza- 
tions in the stricken area are also affected, Thus 
the effective unit of disaster management is usu- 
ally national in scope, even when the primary 
sufferers are in a single community. 

Problems of disaster agencies. Various prob- 
lems of coordination and control stem not from 
public responses to disaster but from inadequacies 
in the plans, organization, and operations of formal 
disaster agencies. Disaster plans are often too lim- 
ited in scope, both functionally and geographically, 
to provide the requisite coordination and integra- 
tion of organizational actions. Conflicts in authority 
sometimes arise because there is an absence of an 
agreed-upon, understood division of labor among 
different groups and agencies concerning which 
official or agency has the authority to make deci- 
sions. Duplicating or conflicting agency actions 
frequently occur because the disaster management 
authorities have failed to establish centralized 
mechanisms for coordinating requests for equip- 
ment and supplies; distributing patients among 
hospitals; pooling casualty and survivor informa- 
tion; or for screening, organizing, and using large 
numbers of untrained volunteers. 

Much of the confusion and lack of coordination 
in the rescue, relief, and rehabilitation efforts re- 
sults from the lack of systematic procedures for 
maintaining a central strategic overview of the 
disaster, so that leadership, controls, and resources 
can be applied in the places where they are most 
critically needed. Lack of coordination also stems 
from the natural human tendency to feel a great 
sense of urgency to get something done to help 
the victims, which makes taking time to communi- 
cate and coordinate decisions seem a luxury. 
Under stress it is also difficult to exercise the more 
complex intellectual processes such as looking 
ahead and thinking about the indirect consequences 
of a decision. 

Communication is often inadequate, partly be- 
cause of the destruction of communication facili- 
ties but more generally because of improper use 
of these facilities. Finally, many of the problems 
of disaster management result from insensitivity 
to the changes in the structure, values, and pre- 
vailing climate of opinion among the victims dur- 
ing the various postdisaster phases, resulting in a 
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lack of fit between the rather different conceptions 
of need held by relief organizations and by those 
whom they are trying to help. 


Social adaptations to disaster 


The great significance of behavioral problems in 
disaster has, understandably, caused many observ- 
ers to overlook the fact that disasters produce 
many therapeutic effects on social systems. The 
sharing of a common threat to survival and the 
widespread suffering produced by disaster usually 
result in a dramatic increase in social solidarity 
and a temporary breakdown of pre-existing social 
and economic distinctions. This newly developed 
solidarity is of major significance in facilitating 
both social and personal recuperation. It resolves 
pre-existing personal and social conflicts; prevents 
or ameliorates the usual disorganized responses to 
trauma, loss, and privation; reduces the amount of 
self-aggressive and antisocial behavior; and moti- 
vates people to devote their energies to socially 
regenerative tasks. 

Evidence supporting the existence of these thera- 
peutic results during the integrative phase of dis- 
aster may be found in many different accounts 
and reports, both past and present, wartime and 
peacetime. In general, the findings can be sum- 
marized as follows: (1) injured disaster victims 
and their families are unusually calm, quiet, unde- 
manding, and thoughtful of other people; (2) eth- 
nic and minority group differences tend to disap- 
pear; (3) families tend to develop greater internal 
solidarity; (4) social relationships tend to be 
strengthened, not only with kin and extended 
family but with all survivors; (5) disaster sufferers 
report a relatively low sense of deprivation when 
comparing their losses with others; (6) there are 
frequent cases of remission of pre-existing neurotic 
and psychosomatic symptoms; (7) there is a de- 
crease in admissions to mental hospitals and to 
psychological outpatient clinics; (8) the suicide 
rate declines; (9) homicides and crimes against 
the person show a downward trend; and (10) the 
verified cases of theft, looting, profiteering, and 
other forms of exploitation and antisocial behavior 
are quantitatively insignificant when compared 
with actions aimed at mutual aid, restoration, and 
reintegration. 

These integrative effects of disaster apparently 
provide a major stimulus for rapid restoration and 
social reconstruction. Virtually all modern disaster- 
struck communities and nations have not only been 
quickly restored, but in many cases they have 
experienced an “amplified rebound,” in which the 
society is carried far beyond its predisaster levels 


of solidarity, productivity, and capacity for growth, 
Likewise, historical analyses of wars and other 
large disasters usually have emphasized their leg- 
acy of positive social change. 

It is obvious that most disasters produce some 
types of relatively persistent social change. How- 
ever, the exact nature of these long-run changes, 
and the process by which they occur in various 
societies (both developed and underdeveloped), 
under varying disaster conditions, is still imper- 
fectly understood. The study of the long-term ef- 
fects of disaster is a major area for future syste- 
matic research. 

General implications of research findings. Dis- 
asters produce a new and different referential 
framework within which people perceive and judge 
their experiences. The recurrent crises and acci- 
dents of everyday life tend to be isolated, random 
events that produce private human troubles and 
suffering but do not induce changes in the organi- 
zation of society. In contrast, disasters are suffi- 
ciently concentrated in time and place to pose a 
clear, easily perceivable threat to social survival. 
They affect all persons indiscriminately, and there- 
by produce a temporary breakdown in hierarchical 
status distinctions. The reference changes from 
“Only I have suffered” to “All of us have suffered; 
we are all in it together.” Danger, loss, and suffer- 
ing become public, rather than private, phenom- 
ena, Disasters thus lead to social remedy and social 
change, rather than requiring the individual or 
small group to bear the burden of readjustment to 
an intact, unchanged society. 

The social disorganization that occurs in disaster 
is essentially a disruption of the complex structure 
of social differentiations and culturally defined 
communication networks of secondary group life. 
Except momentarily, disaster does not disorganize 
primary group life. On the contrary, this is strength- 
ened, becomes more pristine, and is more widely 
based than in ordinary social life. The quality of 
interaction throughout the entire community of 
sufferers approximates more closely the character- 
istics of intimate, informal interaction set forth in 
the concept of the primary group. 

This capacity of human societies under severe 
stress to contract from a highly elaborated set of 
secondary group organizations to a kind of uni 
versal primary group existence is probably their 
central, built-in protective mechanism. It helps to 
account for the fact that nations and communities 
typically demonstrate amazing toughness and m 
siliency in coping with the destructive effects O 
disaster and unusual speed in restoring and regen” 
erating more complex forms of social life. 


If disaster studies have demonstrated nothing 
else, they have shown that man is a highly adap- 
tive social animal when he is directly confronted 
with clear challenges to his continued existence. 
He has survived every conceivable form of disaster 
in the past, and, short of total annihilation, he is 
likely to do so in the future. 

CHARLES E, Fritz 


[See also COLLECTIVE BEHAVIOR; MASS PHENOMENA.] 
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DISCOVERY 

See CREATIVITY; INNOVATION; RESEARCH AND 
DEVELOPMENT; TECHNOLOGY; and the biography of 
OGBURN. 


DISCRETE DISTRIBUTIONS 
See under DISTRIBUTIONS, STATISTICAL, 


DISCRIMINANT FUNCTIONS 
See MULTIVARIATE ANALYSIS, article on CLASSI- 
FICATION AND DISCRIMINATION. 


DISCRIMINATION, ECONOMIC 


As a general and somewhat loose statement, eco- 
nomic discrimination can be said to occur against 
members of a group whenever their earnings fall 
short of the amount “warranted” by their abilities. 
Public interest has usually been centered on groups 
that are either numerical minorities (such as 
Negroes in the United States, Untouchables in 
India, Jews in the Soviet Union, Laplanders in 
Sweden, and Chinese in Indonesia) or are political, 
social, or economic minorities (such as the Blacks 
in South Africa, immigrants from Africa in Israel, 
and women in most countries), Until recent years, 
economic discrimination was generally seen as evi- 
dence of exploitation of minorities by majorities. 
This interpretation is clearest in writings with 
Marxist sympathies (Aptheker 1946), although it 
is also found in weaker form in much of the liter- 
ature (Allport 1954, p. 210). 

In the past, economists tended to neglect the 
study of discrimination against minorities, primar- 
ily because they were reluctant to interpret any 
appreciable economic phenomena in terms of “ex- 
ploitation,” which stems from what is technically 
called monopsony power [see MonoroLY]. The 
growing interest of economists in discrimination 
during the last decade has been stimulated by an 
approach that is not based as much on exploitation 
as on considerations that may involve even a 
sharper break with traditional economic theory. In 
this approach, members of a group earn less than 
their abilities warrant if other persons are willing 
to “pay” (that is, give up resources) in order to 
avoid employing, working with, lending to, train- 
ing, or educating these members. 

Discrimination in the market place. Further 
discussion is facilitated by distinguishing discrimi- 
nation affecting the talents that are brought to the 
market place from discrimination in the market 


place itself. Since the latter is taken to mean that 
less is earned than is warranted by productivity, 
such discrimination would not exist if all persons 
maximized money incomes and if all markets were 
competitive. For, under these circumstances, the 
increased money incomes of employers, employees, 
or consumers, resulting from associating with mem- 
bers of a group receiving less than their produc- 
tivity, would stimulate a demand for such members 
that would continue until their earnings rose suffi- 
ciently to cover their productivity. 

Accordingly, discrimination in competitive mar- 
kets is said to occur because some participants 
have tastes for discrimination, more loosely called 
“prejudice.” Because of these tastes, they are will- 
ing to forfeit money income or other resources in 
order to avoid employing, working with, or buying 
from members of a particular group. Indeed, the 
intensity of their prejudice is measured by how 
much they are willing to forfeit (Becker 1957, 
chapter 1). The measuring rod of money provides 
an operational and quantifiable concept of preju- 
dice that is consistent with the economist’s concept 
of tastes and the statistician’s concept of subjective 
probabilities (Savage 1954). Objections to such a 
behavioristic concept of prejudice and hatred might 
be tempered upon reflecting that a hatred cannot 
be very strong, in any meaningful sense, if there 
is a reluctance to satisfy it by parting with some 
resources. This approach to discrimination not 
only incorporates common notions about prejudice 
much more fully than does one based on exploita- 
tion, but also is more consistent with the general 
prevalence of workably competitive markets. 

Although the extent of market discrimination 
depends, of course, on the intensity of the average 
taste for discrimination, it by no means depends 
on this alone, Also very relevant are any differences 
among participants in the desire to discriminate, 
the relative number and occupational distribution 
of minority members, the extent of competition in 
product markets, and the degree of substitution 
between different groups (Becker 1957). For ex- 
ample, consider discrimination by employers against 
a group working in a competitive industry. Even if 
the employers’ average taste for discrimination 
were large, market discrimination could be negli- 
gible if the group were a small fraction of the total 
labor force in the industry and if some employers 
tastes for discrimination were weak (or even neg- 
ative). For then, members of the group could be 
fully employed by these employers, An increase in 
the size of the group or a decline in the fraction 
of employers with weak tastes for discrimination 


would tend to increase market discrimination, per- 
haps substantially, because members could no 
longer be fully employed by these employers, and 
some members would have to find employment 
among those employers with stronger tastes for 
discrimination. However, the latter would employ 
these members only if the cost of discriminating 
were sufficiently large—that is, the earnings of 
members sufficiently low—to offset their tastes. 

Even the market form in which the discrimina- 
tion manifests itself can be very important. If the 
result were simply lower hourly earnings, dis- 
criminators would have to balance the gain from 
discriminating—satisfying their prejudice—against 
the cost of discriminating—their loss in income 
from not associating with persons receiving less 
than their productivity. If, however, because of 
market imperfections, trade union behavior, or 
minimum wage or “equal pay” legislation (Alchian 
& Kessel 1962), hourly earnings of minorities were 
prevented from falling, the cost of discrimination 
would be eliminated because no resources would 
be forfeited by not hiring minorities; and discrimi- 
nators would be encouraged to discriminate still 
more. The result would be greater unemployment 
of minorities, and discrimination would take the 
market form of, say, lower annual earnings rather 
than lower hourly earnings. 

Quantitative evidence on the extent of market 
discrimination against different minorities is sur- 
prisingly limited; the most extensive evidence re- 
lates to Negroes in the United States. Significant 
market discrimination against them is suggested 
by the much lower earnings of urban male Negroes 
than of urban male whites of the same age, years 
of schooling, and region, and the somewhat greater 
unemployment of Negro males, even when occu- 
pation is held constant, The absolute occupational 
position of Negroes has risen substantially in both 
the North and South during the last hundred years. 
Since, however, the position of whites has risen at 
about the same rate, the relative position of Negroes 
has changed surprisingly little, the greatest change 
(a rise) coming after 1940. One of the most strik- 
ing findings is that market discrimination is appar- 
ently relatively slight against young new entrants 
into the labor force, but it increases significantly 
with age. This finding is presumably explained by 
a combination of inadequate preparation of Negroes 
for occupational advance and greater market prej- 
udice against more advanced Negroes. Similar find- 
ings would probably apply to other minority groups 
in the world. (For evidence on the economic posi- 
tion of Negroes, see Becker 1957, chapters 7-10; 
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Dewey 1952; Gilman 1963; and Ginzberg et al. 
1956.) 

Deficiencies brought to the market place. Per- 
haps more than half the total difference between 
the earnings and abilities of Negroes in the United 
States results from deficiencies brought to the mar- 
ket place, rather than from market discrimination 
itself; again, a similar relation would probably be 
found for other minorities, These deficiencies in- 
clude poor health, low morale and motivation, and 
limited information about opportunities, The most 
pervasive force, however, is insufficient and in- 
ferior education and training. For example, in 
1960, Negro males aged 25-64 averaged less than 
8 years of schooling, while whites averaged more 
than 10.5 years; and Negro schooling has clearly 
been inferior in quality. 

An important cause of deficiencies in education 
and training are the limitations imposed by poverty 
itself, especially when combined with the usual 
difficulties in financing large expenditures. Of 
course, poverty may in part result from either past 
discrimination, as exemplified by Negro slavery, or 
current discrimination, thus leading to the so-called 
vicious circle of discrimination (Myrdal 1944), Dis- 
crimination by governments and private institu- 
tions in charge of education and training facilities 
has been common, as illustrated by the government 
restrictions on the education of Blacks in South 
Africa and (at least in the past) Jews in eastern 
Europe, the quotas applied by certain private uni- 
versities in the United States, or the racial (and 
even religious ) restrictions on admission to appren- 
ticeship and other training programs administered 
by trade unions in many countries, 

Role of government. Undoubtedly, some of the 
worst economic discrimination, as well as other 
kinds, is directly traceable to government action, 
One need only note the restrictions placed on the 
economic advance of Blacks in South Africa, the 
harassment of the Chinese in Indonesia and other 
parts of Asia, or the virtual confiscation of some 
of the property of Japanese Americans in the 
United States during World War 1. On the other 
hand, government action is often a force in widen- 
ing opportunities and thus breaking down economic 
discrimination: of greatest influence is the enforce- 
ment of equality before the law, although the pro- 
vision of free or subsidized education has also been 
important. Legislation of the “fair employment” 
kind, prohibiting racial, ethnic, religious, and other 
forms of discrimination in employment, can have 
some effect too. Very little scholarly analysis has 
been made of the economic effects resulting from 
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this kind of legislation, In the United States, the 
evidence provided by the states with such legisla- 
tion suggests that it somewhat increases both the 
earnings and unemployment of minorities (Landes 
1966), 

A few of the more extreme nineteenth-century 
advocates of a competitive market economy be- 
lieved that eventually its extension and develop- 
ment would eliminate most economic discrimina- 
tion and hatred. Unfortunately, this has not yet 
taken place; discrimination exists, and at times 
even flourishes, in competitive economies, the posi- 
tion of Negroes in the United States being a clear 
example. At the same time, one must realize that 
the pressures of competition, the emphasis on ma- 
terial goods, the impersonality of and the cost ac- 
counting in a market economy often help mitigate, 
and sometimes even eliminate, the effects of preju- 
dice. Indeed, in desperation, market participants in 
the United States, South Africa, Great Britain, In- 
donesia, and elsewhere have (successfully) ap- 
pealed to their governments for protection from the 
“unfair” competition of various minorities, 

In spite of the widespread public interest in eco- 
nomic discrimination throughout the world, rela- 
tively little is known about its quantitative im- 
portance and socioeconomic determinants, Much 
greater understanding of the formation of prejudice 
is necessary, although economists are unlikely to 
be of great value here. Their comparative advan- 
tage lies in analyzing how prejudices combine with 
various institutional arrangements to produce 
actual discrimination, It has already been men- 
tioned that the degree of competition, the size of 
minority groups, and many other factors as yet in- 
sufficiently studied also affect the amount of mar- 
ket discrimination, 

Likewise, the effect of prejudice, by the elector- 
ate, trade union members, or directors of educa- 
tional institutions, on the extent of nonmarket 
discrimination is determined by whether different 
political issues are effectively tied together, whether 
the costs of discriminating or the gains from not 
discriminating are hidden, and on many other in- 
stitutional arrangements (Becker 1957, chapter 5; 
1959). Consider, for example, the election by ma- 
jority rule of representatives to decide on two 
issues, one dealing with discrimination against a 
minority, Assume that the minority is more con- 
cerned about discrimination while the majority, 
although more concerned about the other issue, 
does not agree on how to resolve it. A successful 
campaign could be conducted by promising to satis- 
fy the minority on discrimination and a segment 
of the majority on the other issue. The decision on 


discrimination could, therefore, be more favorable 
to the minority than if each issue were decided 
separately (by, say, referendum), or if the majority 
were less split on the other issue, or if the majority 
attached greater importance to discrimination. Con- 
siderably more study of institutional arrangements 
is required in order to know more about the factors 
determining the talents that minorities are per- 
mitted to bring to the market place. 

Gary S, BECKER 
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DISENGAGEMENT 


Disengagement became part of the vocabulary 
of international polities after World War n. It was 
offered as an alternative policy prescription to the 
perilous face-to-face confrontation, especially in 
central Europe, of opposing political systems and 
armed forces equipped with nuclear weapons capa- 
ble of total destruction, The essence of disengage 
ment is the separation of the contending forces and 
the creation, in the in territory, of a nom 
aligned state (or states) with political, social, and 
economic systems equally acceptable (or unaccept- 
able) to both adjacent powers, including a level of 
military forces that by itself does not constitute a 
significant threat to either side, 

A second, but less common, definition of disen» 
gagement pertains to areas where a variety of 
powers contend for dominant influence and where, 
as in the Middle East during the post-1045 epoch, 
there is great political instability but not a direct 
military confrontation of major powers, Here dis- 
engagement describes a national policy character- 
ized by aloofness and watehful nonintervention, 
rather than the withdrawal of two forces pitted 
face to face, 


This article will deal with disengagement pri- 


to keep Germany and Russia apart by creating be 
tween them a belt of nations consisting of the 
Baltic states, Poland, and Czechoslovakia, whose 
sovereignty was to be guaranteed by the major 
Western 
Buffer states are weak states existing by leave 
of their preponderant neighbor, whose military, 
political, and economic Interests they serve. An 
Belgium, which owed its independent 
existence, from 1831 to World War n, to its chief 


creign territory of a neighboring state. Pomerania 


gave 

access to the Baltic Sea, thereby separating Bast 
Prussia from the rest of Germany and creating the 
called corridor across Poland, which became a 


powers, 
sanitaire, which was employed to keep potential 
are withdrawn—or disengaged—would not favor 
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Through 1964 more than a hundred different 
plans for the disengagement of East and West 
military forces in central Europe had been put for- 
ward by national governments, individual states- 
men, prominent ex-leaders, and private individuals 
and groups. (For a listing, see Hinterhoff 1959, 
pp. 414-442.) A distinguishing feature of these 
plans is that all concern the future of Germany. 
Solutions for the divided state of Germany range 
from U.S. Secretary of State James F. Byrnes’s 
1946 proposal for a 25-year four-power treaty guar- 
antee against further aggression from a united, dis- 
armed Germany to Soviet Premier Nikita Khrush- 
chev’s 1959 plan for withdrawal of foreign troops 
from all of Europe, creation of an atom-free zone 
in central Europe, and retention of the status quo 
of a divided Germany. Over the years, the concept 
of disengagement evolved from taking measures 
to deal with renewed threats of German aggression 
to creating broad neutral zones in Europe. These 
zones would require dismantling of nonindigenous 
military installations and either retaining the po- 
litical status quo or providing for the reunification 
of Germany under changed circumstances. This 
evolution of plans, none of which received support 
from both East and West, was marked by specific 
proposals of the Polish foreign minister, Adam 
Rapacki; the leader of the British Labour party, 
Hugh Gaitskell; and the chairman of the U.S. State 
Department's Policy Planning Staff, George F. Ken- 
nan. Each of these statesmen proposed major 
changes in the stance and policies of the major 
powers in Europe; the essential common denomi- 
nator of the three proposals was the disengagement 
of American and Russian forces, 

The first version of the Rapacki plan, in October 
1957, provided for an atom-free zone covering 
Poland, Czechoslovakia, East Germany, and West 
Germany, with a four-power guarantee but no with- 
drawal of foreign forces or changes in the alliances 
of the powers. The second version, in November 
1958, and a later revision added provisions for re- 
ducing and eventually withdrawing foreign forces 
from the zone, with appropriate control measures. 
The Gaitskell plan was publicized in lectures at 
Harvard University in January 1957 and differed 
from the Rapacki plans in that it provided for the 
reunification of East and West Germany in the con- 
text of a restricted armaments zone from which 
foreign troops were to be gradually withdrawn, the 
zone to include, in addition to Germany, Poland, 
Czechoslovakia, Hungary, and possibly Rumania. 
This arrangement was to be guaranteed by a Euro- 
pean security pact, which both the Soviet Union 


and the United States would sign. The Rapacki 
plans, which appeared to have support from the 
Soviet government, were rejected by the major 
NATO powers. The Gaitskell plan was unaccept- 
able to the Soviet bloc and was also rejected by the 
West German government of Konrad Adenauer. 

Ambassador Kennan’s disengagement policy pro- 
posals were in some ways the best thought out, and 
their elaboration in the Reith lectures over the BBC 
in the autumn of 1957 (later published as Russia, 
the Atom and the West, 1958) provoked wide- 
spread public discussion and controversy involving 
high political leaders of the major powers. Kennan 
had gained an international reputation for framing 
and defining the West's policy of containment 
(1947). For Kennan, disengagement was a logical 
successor policy to containment, which by the mid- 
1950s had pitted effectively equal powers hard 
against one another, creating an inflexible situa- 
tion in which either an incident or a misinterpre- 
tation of intentions could unleash world-wide nu- 
clear destruction. He held that this danger could 
only be reduced through the separation of military 
forces and the reuniting of Europe through a ne- 
gotiated settlement of the German problem. Spe- 
cifically, Kennan proposed reunifying Germany and 
joining it with Poland, Czechoslovakia, and Hun- 
gary in a zone from which NATO and Soviet troops 
would be withdrawn. The balance of power in the 
zone would be maintained by national armed forces 
and militia. Germany's freedom to associate with 
NATO or the East would be restricted, while Ger- 
many and the other states constituting the zone 
would receive guarantees from NATO and the War- 
saw Pact powers who would join in a European 
security pact. 

Criticisms of disengagement. In the heated de- 
bate stirred up by Kennan’s proposals, the entire 
concept of disengagement was sharply criticized. 
Some critics took issue with the concept because it 
was diametrically opposed to the idea of forcing a 
unilateral withdrawal of the Russian army and liber- 
ating eastern Europe from communist domination. 
Other critics, including some of Kennan’s former 
colleagues, such as Dean Acheson (1958), op- 
posed and ridiculed disengagement on a variety of 
other grounds. Foremost among these was the 
charge that the idea of disengagement rested on à 
fundamental misunderstanding of national and 
international power. According to this view, a 
classic imperialist power such as the U.S.S.R. 
could not realistically be expected to keep from dis- 
rupting the precarious balance in a relatively weak 
territory from which forces of the major powers 


had been withdrawn. A parallel criticism was that 
a disengagement in Europe would be inherently 
unequal. Soviet Russia's forces would withdraw 
several hundred miles, whereas U.S. forces would 
be pulled back thousands of miles across the At- 
lantic. Another major criticism was that disengage- 
ment could not achieve any positive result because 
the essential danger is in the confrontation of mis- 
siles poised thousands of miles apart. Indeed, some 
argued that disengagement would create a vacuum 
in central Europe and increase the danger that the 
missiles would be used because they would be the 
only resort in the event of an incident (Acheson 
1958). 

These criticisms were eloquently refuted by Ken- 
nan in an article, “Disengagement Revisited” 
(1959), in which he accused opponents of disen- 
gagement of seeking the ideal military posture, 
which “is simply the enemy of every political 
détente or compromise.” This article and other 
writings by proponents of disengagement focused 
on the problems of Germany's partition which, 
they argued, could not continue indefinitely. Un- 
less reunification were achieved through East-West 
negotiations resulting in disengagement, direct 
West German dealings with Moscow would become 
a constant threat that could disrupt NATO, while 
eastern Europe remained under communist domi- 
nation. 

The future of disengagement. Research and 
writing on disengagement falls into the general 
field of international political research, chiefly per- 
taining to Europe. Theoretically the concept could 
be applied to situations in other parts of the world; 
for example, to the confrontation between Arab 
and Israeli forces. The term has in fact been ap- 
plied to United States policy of the 1960s in the 
Middle East, adopting the second and less common 
definition of disengagement, described above, as a 
policy of watchful nonintervention (Nolte 1964). 
But it is understandable that the focus of disen- 
gagement proposals and research has been on the 
critical European situation, where the principal 
nuclear powers confront one another. Recent schol- 
arly efforts to deal with this situation fall into two 
categories: (1) proposals, plans, and formulas for 
the achievement of disengagement; and (2) dis- 
cussions of its feasibility in the context of evolving 
East and West strategies and goals. Apart from 
Hinterhoff’s book, only Michael Howard's Disen- 
gagement in Europe (1958) concerns itself exclu- 
sively and explicitly with evaluating the prospects 
of these policy prescriptions. In Howard's opinion, 
the adoption of any disengagement proposal is un- 
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likely as long as German reunification on terms 
acceptable to a West German government is to be 
a primary result. This result remains inimical to 
the stated aims of the Soviet government. In addi- 
tion, West Germany's territory and its armed forces 
play a central role in NATO’s defense against pos- 
sible aggression; a scheme in which West Germany 
is excluded from NATO is, therefore, likely to re- 
quire too great a sacrifice in terms of present West- 
ern strategy. Beyond these considerations, it needs 
to be noted that most proposals for disengagement 
are characteristically all-inclusive formulas pro- 
viding for instant resolution of a complex of prob- 
lems. Although the problems of Berlin, Germany, 
eastern Europe, nuclear strategy, and disarmament 
may be inextricably interrelated, any sharp or 
wholesale change in this delicate structure poses 
even greater dangers than its continuance; formu- 
las for radical change are, therefore, likely to re- 
main unacceptable to both sides. Possibly the only 
practical disengagement proposals that can be im- 
plemented are those of George Kennan and the few 
others who do not prescribe formulas for salvation 
but rather encourage the adoption of new attitudes 
and aims. The West will have to recognize the 
existence of Soviet Russia’s influence in central 
Europe for the foreseeable future, while Russia 
will have to realize that hegemony in Europe can 
be achieved only at a catastrophic price. These new 
attitudes may then lead to compromises by both 
sides and to the gradual unraveling of the network 


of conflicts. 
GERALD FREUND 


[See also CONTAINMENT; INTERNATIONAL POLITICS,] 


BIBLIOGRAPHY 

AcHEson, DEAN 1958 The Illusion of Disengagement. 
Foreign Affairs 36:371-382. 

GAITSKELL, HucH 1957 The Challenge of Coexistence. 
Cambridge, Mass.: Harvard Univ. Press. 

HINTERHOFF, EUGENE 1959 Disengagement. With a fore- 
word by John Slessor. London: Stevens. 

Howarp, MicHaEL 1958 Disengagement in Europe. Har- 
mondsworth, Middlesex (England) and Baltimore: 
Penguin. 

Kennan, GEORGE F. 1947 The Sources of Soviet Con- 
duct. Foreign Affairs 25:566-582. > This article was 
written under the pseudonym X. 

Kennan, GEORGE F. 1958 Russia, the Atom and the 
West. New York: Harper. 

KENNAN, GEORGE F. 1959 Disengagement Revisited. For- 
eign Affairs 37:187-210. 

Notte, RicHarp H. 1964 United States Policy and the 
Middle East. Pages 148-182 in American Assembly, 
The United States and the Middle East. Edited by 
Georgiana G. Stevens. Englewood Cliffs, N.J.: Prentice- 
Hall. 


214 DISHONESTY 


DISHONESTY 
See MORAL DEVELOPMENT. 
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DISTRIBUTED LAGS 


In theory, distributed lags arise when any eco- 
nomic cause, such as a price change or an income 
change, produces its effect (for example, on the 
quantity demanded) only after some lag in time, 
so that the effect is not felt all at once at a single 
point in time but is distributed over a period of 
time. Thus, when we say that the quantity of ciga- 
rettes demanded is a function of the price of 
cigarettes taken with a distributed lag, we mean 
essentially that the full effects of a change in the 
price of cigarettes is not felt immediately, and only 
after some passage of time does the quantity of 
cigarettes demanded show the full effect of the 
change in the price of cigarettes. 

To consider the matter more concretely, let q; be 
the quantity of cigarettes, say, demanded per unit 
time, and let p, be the price per unit of cigarettes 
during time period t. Other things remaining con- 
stant, such as income, population, and the price 


of cough drops, we may express q, as a function of 
current and past prices: 


(1) qi =F (Pt, Pias Pr- ***)s 


In particular, let us assume the demand function 
f to be linear with constant coefficients a, b,, by, --- 


(2) qi =a + bpi + b Pir + bP t e. 


Suppose the price of cigarettes has been constant 
for a long time at a level p, so that p = p: = Pei = 
Pi- =+. Then the quantity of cigarettes demand- 
ed per unit time will also be a constant: 


(3) q=a+bp+bpt+bpt+--=at+pyb,. 


i=0 


Now let the price of cigarettes change from p to 
p + Ap in period t + 1 and remain at this new level 
indefinitely. The effect of the change in period 
t + 1 will be to change the quantity demanded from 
q to q + b,Ap. But the effects of the price change 
do not stop here; in the next period, t+ 2, the 
quantity demanded is further altered to q + b,Ap + 
b,Ap. In general, after @ periods, the change in q 
will be Ap Ef.» b;. Thus, the effect of the change in 
price on demand is distributed over time: b,Ap the 
first period, b,Ap + b,Ap the second, and so on. 

The example used to illustrate the concept of 
a distributed lag is taken from demand analysis, 
but the use of distributed lags is not restricted to 
analysis of problems of consumer demand. The 
wide application of distributed lags in econometrics 
may be indicated by a few examples: import de- 
mand (Tinbergen 1949); hyperinflation (Cagan 
1956); investment (Koyck 1954; Jorgenson 1963; 
Eisner 1960); demand for chemical fertilizers in 
the United States (Griliches 1959); advertising of 
oranges by the two major U.S. orange grower co- 
operatives (Nerlove & Waugh 1961). 

The causes of distributed lags in economic rela- 
tionships are as varied as the variables entering 
such relationships. To illustrate, the cigarette ex- 
ample may be used again. When the price is in- 
creased, we expect, other things being equal and if 
cigarettes are not an inferior good, that the demand 
will fall off. How it falls off, that is, the time path 
that demand follows, is not discussed in the ordi- 
nary static theory of demand. At first, a price rise 
may induce many people to stop smoking, but the 
tenacity of the habit being what it is, some will 
return to it. Others may either temporarily or per: 
manently reduce consumption. It is also conceiv- 
able that those who do not regard the price increase 
as permanent, and who do not react at first, may 
come eventually to believe in the permanence of 
the price change as it persists and be willing to 80 
through the painful process of adjustment. Finally, 
some who are nonsmokers, but who might have 
begun smoking when prices were lower (for exam- 
ple, adolescents), may now never begin. Thus, we 
see that there are three general types of factors 
that cause the effects of the price change to be 
distributed over time: (1) habit persistence, (2) 
expectational rigidities, and (3) a semitechnologi- 
cal factor related to the age distribution of the 
population and the conventional time at which a 
person begins to smoke. 

We could just as well deal with an income 
change or with the effects of a change in the price 
of a factor of production upon its employment in 


a certain branch of manufacturing. The exact 
causes and pattern of the distribution of lag would, 
of course, depend on the particular circumstances 
studied. The pattern (and causes) could be dif- 
ferent for changes in one direction than for those 
in another. Indeed, as F. M. Fisher has pointed out 
(1962, p. 29), there is no need in theory even to 
assume, as we implicitly have, “that when a de- 
cision is made at given time, t, the events of a 
certain fixed period previous are given a certain 
weight, regardless of what those events were or of 
what happened before or since their occurrence.” 
One can imagine the effect if a government report 
condemning smoking were issued just after the 
price of cigarettes declined. Nonetheless, for most 
practical purposes of econometrics, drastic simplifi- 
cations must be made and the results can be re- 
garded only as approximate. 

History and forms. Irving Fisher (1925) was 
the first to use and discuss the concept of a dis- 
tributed lag. In a later paper (1937, p. 323), he 
stated that the basic problem in applying the theory 
of distributed lags “is to find the ‘best’ distribution 
of lag, by which is meant the distribution such 


that . . . the total combined effect [of the lagged 
values of the variables taken with a distributed lag 
has] . . . the highest possible correlation with the 


actual statistical series . . . with which we wish to 
compare it.” Thus, we wish to find the distribution 
of lag that maximizes the explanation of “effect” 
by “cause” in a statistical sense. There are unso- 
phisticated, almost mechanical, approaches to this 
problem; and there are more sophisticated ap- 
proaches involving considerable use of economic 
theory to develop underlying models of dynamic 
adjustment that in turn generate the distributed lags 
observed. There are also both more sophisticated 
and less sophisticated approaches to estimation. 
No assumption may be made about the form of 
the distribution of lag, i.e., the relationships among 
the b’s in an equation such as (2). This is the 
approach adopted by Tinbergen (1949) and Alt 
(1942). The procedure is then to estimate an equa- 
tion such as (2), for example, by least squares. 
Since the number of observations is limited, only 
a finite number of lag terms may be included. 
Indeed, the coefficients of the lagged values may 
quickly become quite erratic because of the pres- 
ence of strong serial correlation in most economic 
time series. Such difficulties Jed Alt, Irving Fisher 
(1925; 1937), and others to suggest that the par- 
ameters of a relationship such as (2) beycon- 
strained by specifying the form of the Bet 
of lag. Fisher (1937) suggests, for example, tha 
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all b’s, after a certain point ð, are proportional to 
an arithmetical progression up to a certain maxi- 
mal lag, 6’. Thus, 


(4) b=b(1- =) 


o-1 

Fisher (1925) earlier had suggested that the 
weights of the distribution might plausibly be 
assumed to follow a logarithmic normal probability 
density function (with appropriate modification for 
the measurement of time in discrete units). Alt 
recommends a number of exponential forms. Koyck 
(1954) suggests weights that decline geometrically. 
Theil and Stern (1960) consider weights propor- 
tional to what are essentially approximations to 
the densities given by a gamma distribution with 
mean 2. Solow (1960) presents perhaps the ulti- 
mate flexibility and sophistication obtainable by a 
purely formal approach in suggesting the two- 
parameter family of unimodal lag distributions gen- 
erated by the Pascal distribution. Thus, the weights 
are 


(5) bwsh thot) a-w, 


This family provides the discrete analogue of the 
exponential and gamma distributions. It represents 
a natural generalization of the geometrically de- 
clining weights of Koyck and may be thought of as 
r simple Koyck lags cascaded in series. The case 
r = 1 is the original Koyck lag. The distribution of 
lag is skewed; the larger the value of à and the 
smaller the value of r, the greater is the degree of 
skewness. Both the center of gravity and the spread 
of the distribution increase with \ and r. This dis- 
tribution offers a considerable gain in flexibility as 
compared with earlier suggestions. 

A more complete review of the literature up to 
1958 and some additional suggestions are con- 
tained in Nerlove (1958a, pp. 1-25). 

Rather than directly specify the form of the lag, 
an alternative approach is to develop a dynamic 
model that leads to a distributed lag or lags in the 
observed relationships. This may be done at vary- 
ing levels of generality, either by dealing with 
broad classes of causes in an attempt to derive 
models of wide applicability or by attempting to 
isolate dynamic features of a particular problem 
and show that these lead to certain forms of lag 
structure. Brown (1952) and Friedman (1957) 
develop different dynamic models designed to ex- 
plain the behavior of total consumption expendi- 
tures. As Nerlove (1958a; 1958c) shows, both 
models lead essentially to a Koyck distribution of 


216 DISTRIBUTED LAGS 


lag in the relation between income and consump- 
tion. The two models are examples of the two gen- 
eral classes of models leading to distributed lags 
extensively discussed by Nerlove (19584; 1958b; 
1958c): expectational models and dynamic adjust- 
ment models. Both classes in their simplest forms 
lead to Koyck distributions of lag but have quite 
different implications for estimation and are not 
at all the same in more complicated cases, Fried- 
man’s model is a member of the class of expecta- 
tional models; and Brown’s model is a member of 
the class of dynamic adjustment. The two models, 
however, were designed to be quite general and to 
apply to a variety of problems. Examples taken 
from the area of agricultural supply analysis are 
used here to illustrate each model (Nerlove 1958b, 
pp. 25-26). 

An expectational model of supply response. 
Suppose that the quantity supplied in year t, x, , is 
a linear function not of the current price p; but of 
the price expected for year t in year t — 1, p} (for 
example, many agricultural crops are planted or 
planned far ahead): 


(6) 


where u, is a stochastic residual. Suppose further 
that price forecasts are corrected each period by 
farmers in proportion to the error made: 


(7) Pi — Pir = BCP Pi); 0<f<1, 


where £ is called the coefficient of expectations. 
A similar model is used explicitly by Cagan (1956) 
to generate expectations of changes in the general 
price level during hyperinflations. After some 
manipulation, it may be shown that 


x, = aß + bBp,, 
+ (1 = B)Xi tue — (1-8). 

A dynamic adjustment model of supply response. 
In contrast, suppose that the desired or equilibrium 
quantity supplied, x}, is linearly related to the price 
at the time of decision, p,-:: 


(9) 


However, for a variety of technological and eco- 
nomic reasons (including plain habit), only a frac- 
tion, y, of adjustment occurs each period: 


x,=a+bp;+u,, 


(8) 


x}=a+t bp,,+u,. 


(10) Xi — Xea = Y(X; —%,,), 0<y<1, 
hence 
(11) Xi = ay + bypia + (1 — y)Xi + u. 


Apart from the residual terms, which differ in 


the two relations, both equations imply: 
x, =a + ba [Pi + (1 — a)pı- 

+ (L-a) Prst > lE 
where a equals £ for the expectational model, and 


(12) 


@ equals y for the adjustment model. Equation (12) 


represents supply as a function of price taken with 
a Koyck distribution lag. In multiple-equation sys- 
tems, however, the expectational and dynamic 
adjustment models do not lead to such similar 
results; in particular, in expectational models one 
can make use of the fact that the distribution of 
lag for the same variable in different equations 
should be the same (Nerlove 1958a, pp. 31-39). 

Examples of further extension toward developing 
distributed lag relationships incidental to a more 
fundamental dynamic model of behavior are given 
by Jorgenson (1963) and Muth (1961). Jorgenson 
gives a model of investment behavior based on a 
theory of the demand for capital goods over time 
and on a theory of the relation between such 
demand and its translation into realized invest- 
ment, which in turn rests on a distribution of times- 
to-completion of new investment projects. This 
model results in a case of a very general distribu- 
tion of lag. Jorgenson calls the general case the 
“rational power series distribution”; the Pascal dis- 
tribution discussed above is a member of this class. 
Jorgenson’s distribution of lag, although general 
in form, results in fact from a highly particularized 
model of dynamic adjustment in the investment 
decision-and-realization process. 

Muth elaborates on a model of expectation for- 
mation. In the highly simplified case of a single 
market with a nonstochastic demand function in 
which the quantity supplied is a linear function of 
expected price plus a residual generated by shocks 
of a permanent and transitory nature, the “rational 
expectations” that Muth develops can be shown to 
satisfy (7). Hence, the model leads to a distributed 
lag of the Koyck form. However, in more realistic 
models the rational expectations are no longer SO 
simple. 

Estimation problems. If one examines equa- 
tions (8) or (11)—those that one might attempt 
to estimate by least-squares methods if one sought 
to determine both the distribution of lag and the 
long-run supply response—it can be seen that the 
presence of serial correlation in the residual terms 
will cause serious trouble. For example, least 
squares estimates of autoregressions with serially 
correlated residual terms are known to be statisti- 
cally inconsistent. Indeed, from (8) it appears that 
serially correlated residuals will be the rule. 


p 
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is a subject that is too technical and complex for 
discussion here. The reader is referred to Koyck 
(1954, pp. 32-39); Griliches (1961); Klein 
(1958); Liviatan (1963); Malinvaud (1964); 
Phillips (1956); and especially Hannan (1964), 
who gives the most complete and fundamental dis- 
cussion of this problem and its solution. While 
Hannan’s results refer primarily to Koyck lags, 
they can be generalized. 

Marc NERLOVE 
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1 
SPECIAL DISCRETE DISTRIBUTIONS 


The continuous distributions mentioned in this 
article, together with others, are described in detail 
in the companion article on SPECIAL CONTINUOUS 
DISTRIBUTIONS. 


A discrete probability distribution gives the prob- 
ability of every possible outcome of a discrete ex- 
periment, trial, or observation. By “discrete” it is 
meant that the possible outcomes are finite (or 
countably infinite) in number. As an illustration of 
the ideas involved, consider the act of throwing 
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three coins onto a table. Before trying to assess the 
probability of various outcomes, it is necessary first 
to decide (as part of the definition of the experi- 
ment) what outcomes are possible. From the point 
of view of the dynamics of the falling coins, no two 
throws are exactly identical, and there are thus 
infinitely many possible outcomes; but if the ex- 
perimenter is interested only in the heads and tails 
shown, he may wish to reduce the number of 
outcomes to the eight shown in Table 1. 


Table 1 — Possible outcomes of the experiment of tossing 
three coins 


Coin 1 Coin 2 Coin 3 
Head Head Head 
Head Head Tail 


Head Tail Head 
Head Tail Tail 
Toil Head Head 
Tail Head Tail 
Tail Tail Head 
Tail Tail Tail 


If the coins are fair, that is, equally likely to show 
a head or tail, and if they behave independently, 
the eight possibilities are also all equally probable, 
and each would have assigned probability §. The 
list of eight }’s assigned to the eight outcomes is 
a discrete probability distribution that is exactly 
equivalent to the probabilistic content of the verbal 
description of the defining experiment. 

The same physical activity (throwing three coins 
on a table) can be used to define another experi- 
ment. If the experimenter is interested only in the 
number of heads showing and not in which par- 
ticular coins show the heads, he may define the 
possible outcomes to be only four in number: 0, 1, 
2, 3. Since of the eight equally probable outcomes 
of the first experiment, one yields zero heads and 
one yields three heads while three yield one head 
and three yield two heads, the discrete probability 
distribution defined by the second experiment is 


probability of no heads 
probability of one head 
probability of two heads = 
probability of three heads = }. 


Iu 


as 
$: 


It is frequently convenient to write such a list in 
more compact form by using mathematical sym- 
bols. If p, means the probability of n heads, the 
second experiment leads to the formula 


aa è 
a) m=5 la) n=0,1,2,3, 
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where (¥), the binomial coefficient, is the mathe- 
matical symbol for the number of combinations of 
N things taken n at a time, 


NN i. N! 
(x) ~ nN —n)l’ 
n! (which is read “n-factorial”) being equal to 
n(n —1)(n—2)---3-2-1. Note that in (1) the 
form is given for the probabilities and then the 
domain of definition is stated. 

A domain of definition of a probability distribu- 
tion is merely a list of the possible outcomes of the 
experiment. In nearly every case, the experimenter 
finds it convenient to define the experiment so that 
the domain is numerical, as in the second experi- 
ment above. However, there is no reason why the 
domain should not be nonnumerical. In the first 
experiment above, the domain could be written 
(HHH), (HHT), (HTH), (HTT), (THH), (THT), 
(TTH), (TTT), where T means tails and H means 
heads. 

An expression for p, like the above needs only two 
properties to describe a valid discrete probability 
distribution. It must be nonnegative and the sum 
over all possible outcomes must be unity: Pn > 0; 
Xp» = 1. It is therefore possible to invent any num- 
ber of discrete probability distributions simply by 
choosing some function p, satisfying these simple 
conditions. In both theory and applications, it is 
more important to study certain probability dis- 
tributions that correspond to basic probability ex- 
periments than to attempt a systematic classifica- 
tion of all functions satisfying these conditions. In 
the following list of important discrete distribu- 
tions, the corresponding experiment is stated in 
very general terms, accompanied in many cases by 
concrete interpretations. 


Discrete distributions 

Binomial distribution. Suppose an experiment 
can have only two possible outcomes, convention- 
ally called success and failure, and suppose the 
probability of success is p. If the experiment is Te- 
peated independently N times under exactly the 
same conditions, the number of successes, n, can 
be 0, 1, ---, N. A composite experiment consisting 
of observing the number of successes in the N 
performances of the individual experiment can 
therefore have one of the following outcomes: 
0, 1, 2, +--+, N. The probability of observing exactly 
n successes forms the binomial distribution: 


(2) m= (era Apya n=0,1, °°, N- 
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The second experiment mentioned in the introduc- 
tion is of this form, where N = 3 and p=}. 

The mean value of the binomial distribution is 
Np and igs variance is Np(1 — p). The tails of the 
binomial distribution (and of the negative binomial 
discussed below) may be expressed as incomplete 
beta functions. 

The following list of interpretations for the ab- 
stract words success and failure will indicate the 
breadth of application of the binomial distribu- 
tion: male-female; infected—not infected; working— 
broken; pass—fail; alive-dead; head-tail; right- 
wrong. If a common test is known to give the right 
answer (using the last pair as an example) nine 
times out of ten on the average, and if it is per- 
formed ten times, what is the probability that it 
will give exactly nine right answers and one wrong 
one? 


Po = Ge (.9)9(,1)! = 38742. 


In using the binomial distribution for this pur- 
pose, it is important to remember that the proba- 
bility of a success must be constant. For example, 
if each of 30 students took an examination com- 
posed of 3 equally difficult parts and each separate 
part was graded pass or fail, the number of fails 
out of the 90 grades would not have a binomial 
distribution since the probability of failure varies 
from student to student. 

If, for the binomial distribution, N is large and 
p is small, then the simpler Poisson distribution 
(see next section) often forms a good approxima- 
tion. The sense of this approximation is a limiting 
one, with N > œ and Np having a finite limit. 

The normal distribution is often a good ap- 
proximation to the binomial, especially when N is 
large and p not too close to 0 or 1. The limiting 
sense of the normal approximation is in terms of 
the standardized variable, n less its mean and di- 
vided by its standard deviation. 

Poisson distribution. Imagine a machine turn- 
ing out a continuous roll of wire or thread. Minor 
defects occur from time to time; their frequency 
and their statistical behavior are of interest. If the 
defects occur independently and if the risk of a 
defect is constant (that is, if the probability of a 
defect in a very small interval of the wire is a con- 
stant, A, times the length of the interval), then the 
average number of defects per unit length is à and 
the probability that a unit length contains exactly 
n defects is 
d"e-® 

n! 


WEDL 2st: 


> 


(3) Pn = 
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This distribution, named for the French mathema- 
tician Poisson, is an important example of a dis- 
tribution defined over an infinite set of integers. 
Any value of n is possible, although large values 
(compared with à) are unlikely. The mean and 
variance of the Poisson distribution are both À. If 
the segment of unit length is replaced by a segment 
of length L, this is only a change in scale and one 
need only substitute LA for à in (3) and in the 
moments, The tails of the Poisson distribution are 
incomplete gamma functions, 

The Poisson distribution has a very wide range 
of application, not only because of its mathematical 
simplicity but because it represents one important 
concept of true randomness. In addition, the Pois- 
son distribution, as has been noted, is useful as an 
approximation. 

The most important interpretations of the Pois- 
son distribution occur when the line is considered 
to be the axis of real time and the points to be 
times at which certain events take place. The fol- 
lowing is a partial list of events that have been 
compared with the Poisson distribution: deaths by 
horse kick, radioactive particles entering a Geiger- 
Miiller counter, arrival of patients into a doctor's 
waiting room, declarations of war, strikes of light- 
ning, registration of vehicles over a road tape, de- 
mands for service at a telephone switchboard, 
demands for parking space, occurrence of acci- 
dents, demands for electric power, instants of 
nerve excitation, occurrence of suicide, etc. [for a 
discussion of some of these applications, see 
QUEUES]. 

On the other hand, the interpretation of the 
Poisson distribution as a criterion of perfect dis- 
order remains valid in space of any number of 
dimensions, provided only that L is interpreted as 
length, area, or volume as the case may require. 
Space examples are perhaps not as numerous, but 
the following can be noted: one dimension, mis- 
prints per line, cars per mile; two dimensions, 
bomb bursts per square mile, weeds per square 
yard, stars per square unit of photographic plate; 
three dimensions, raisins per loaf of raisin bread, 
bacteria per cubic centimeter of fluid. 

Morse distribution. The Poisson distribution is 
an example of a counting distribution. In a single 
dimension (whether time or space) it is character- 
ized by the fact that the continuous density func- 
tion (the gap distribution) of distance between 
consecutive points is negative exponential (with 
X= 1/6). If the gap distribution is more general 
(for example, the Pearson Type 11, with À = 1/9), 
the counting distribution may be very complicated 
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indeed. With Pearson Type mr gaps, the counting 
probabilities are 


Pn = (142) Dorie — Tne] + (1 — 2) [Par — Tarr] 
+ (AL/1) [ = Larra + 22 ura — Paral, 


where 


y= fone a 
(n= 1)!Jyu 


This distribution takes its name from Philip Morse 
(1958), and reduces to the Poisson for r= 1. In 
data that fitted such a function, one would have a 
mixture of randomness and regularity, suggesting 
two factors at work. 

Negative binomial distribution. There are two 
traditional important probability models leading to 
the negative binomial distribution, although others 
have been suggested. (i) If each member of a 
population experiences Poisson events (for exam- 
ple, accidents) but with the mean value à varying 
statistically from member to member according to 
the Pearson Type mı distribution, then the proba- 
bility that n events occur in time T among all mem- 
bers of the population is 


Po = (Eii D) pa-p, 


where p = 0/(T + 0) andr and @ are the parameters 
of the Pearson Type mı distribution, (ii) If ex- 
periments are performed in sequence with a fixed 
probability p of success (as for the binomial dis- 
tribution), then the negative binomial distribution 
gives the probability that exactly n failures will 
precede the rth success. This is particularly useful 
for r= 1, since in many applications one calls a 
halt after the first “success,” for example, in repeat- 
ed dialings of a telephone that may be busy. The 
negative binomial distribution takes its name from 
its formal similarity to the binomial distribution. 

The mean of the negative binomial distribution 
is (1 — p)r/p and its variance is (1 — p)r/p?. It is 
an example of a mixed distribution, obtained by 
assuming a parameter in one distribution to be it- 
self subject to statistical fluctuation. [Many other 
such distributions, including those called “con- 
tagious,” are discussed in DISTRIBUTIONS, STATIS- 
TICAL, article on MIXTURES OF DISTRIBUTIONS. ] 

Geometric distribution. In certain types of 
waiting lines, the probability of n persons in the 
queue is 


m= (1-36) 


where À is the mean arrival rate and p is the mean 
service rate and à < u. The mean value of the geo- 


n=0,1, 5%, 


n=0,1,2,---, 


metric distribution is à/ (u — à), and the variance 
is Au/(u~—)?. The geometric distribution is a 
special case of the negative binomial distribution 
with r=1 and p= (p/d) — 1. [These waiting line 
problems are discussed in QUEUES.] 

Hypergeometric distribution. Suppose a collec- 
tion of N objects contains k of one kind and N:— k 
of another kind—for example, any of the dichot- 
omies of the binomial distribution model. If ex- 
actly r objects are taken at random from the collec- 
tion, the probability that n of them will be of the 
first kind is 


Gr =a) 

(4) m= MOF, n= 
(G) 

The mean of this distribution is kr/N and the 

variance is kr(N — k)(N —r)/N?. 

The binomial distribution and the hypergeo- 
metric distribution can be regarded as arising from 
analogous experiments, the former by sampling 
“with replacement” and the latter by sampling 
“without replacement.” This can be seen from 
either of two facts: (i) if, in the probability experi- 
ment described for the hypergeometric distribution, 
each of the r objects is put back before the next one 
is chosen, the probability model leading to the 
binomial distribution would result; (ii) if N> % 
in equation (4), with (N —k)/N replaced by P, 
the resulting limit is equation (2). Thus, if the 
total number of objects is very large and the pro- 
portions of the two types remain fixed, the hyper- 
geometric model approaches the binomial model. 
If the number of objects of the first kind is very 
small in comparison with the total number of ob- 
jects, the hypergeometric distribution will ap- 
proach the Poisson. The hypergeometric distribu- 
tion has important statistical application in Fisher's 
so-called exact test in a 2 x 2 table [see COUNTED 
DATA]. 

It is possible to obtain many other distributions, 
both discrete and continuous, as special or limiting 
cases of the hypergeometric. Karl Pearson used the 
probability model leading to the hypergeometric 
distribution as the basis for his analysis of density 
functions and obtained 12 separate types as special 
limiting cases. Of these, only the Pearson Type 11 is 
still frequently called by his designation, although 
many of the most common continuous densities 
(including the normal) fit into his classification 
system [see DISTRIBUTIONS, STATISTICAL, article on 
APPROXIMATIONS TO DISTRIBUTIONS]. 

Negative hypergeometric distribution. If the 
parameter p in a binomial distribution is itself beta 


0, 1, +++, min (r, k). 
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distributed, then the unconditional distribution of 
the discrete variate is 
_fN\B(n+p,N+q—n) 
Pa =() Pete eee, n=0,1,°:*,N, 
where p and q are the parameters of the beta dis- 
tribution and 


B(p, q) = f x — x)" dx, 


This model is the discrete time analog of the first 
interpretation of the negative binomial distribution; 
the events (such as accidents) would be counted 
in short time periods so that only a success or fail- 
ure would be recorded. 

Occupancy distribution. If each of k objects is 
thrown at random into one of N boxes, the proba- 
bility that a given box contains n objects is 


_ /k\ (N= 1) 
p= (5) Noo’ 


This distribution has mean k/N and variance 
(k/N)(1 —1/N). Problems of this type are called 
occupancy problems and are important in statis- 
tical mechanics. 

Busy period distributions. A box contains 7 
objects, and additional objects are thrown into the 
box at random instants, that is, in accordance with 
a Poisson distribution (mean A). If one object is 
removed in every time interval of length 1/, then 
the probability that exactly n objects will pass 
through the box before it first becomes empty is 


n=0,1,-+-,k. 


Pa = Tr yi nerd edna (A/S 


(n=r 
n=r,r+1l, r+2 


This is the Borel-Tanner distribution, with mean 
7/(1—d/p) and variance (àr/p)/(1—A/p)*. IE 
the objects are removed also in accordance with a 
Poisson distribution (mean p), then the proba- 
bility that n will pass through before the box first 
becomes empty is 


_r/{m-r- N) _ a", 
r= n-1 laa 


pan EI 


which is called the Narayana distribution. This 
distribution has mean r/(1— A/p) and variance 
[Or/p) (1 = 7p) = Au). 

Such distributions are important in the theory 


of queues, where the box represents the collection 


of people waiting. Then the number n, which has 
probability p, , is the length of a busy period for the 
service mechanism, beginning with r in the system 


[see Queues]. 
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Uniform distribution. An experiment with N 
equally likely outcomes corresponds to the discrete 
uniform (or discrete rectangular) distribution: 


Pn = xj" n=1,2,--+,N. 


z= 


For a single throw of a die, N = 6. This distribution 
has mean 3(N + 1) and variance (N? — 1). 

Yule distribution. Some experimental data sug- 
gest that in a long list of names (for example, a 
telephone book) the probability that a randomly 
chosen name occurs n times is 


_k(n+k\" 
man) 


Fisher distribution, In the distribution of fre- 
. quency of plant species, the distribution 


We'd, Doves 


pa = Lap", n=1,2,°", 
occurs, where 1/a = —log, (1 ~ p). The Fisher dis- 
tribution has mean ap/(1—p) and variance 
ap(1—ap)/(1—p)*. This distribution has also 
been called the logarithmic distribution because of 
its close relationship to the Taylor series for 
—log. (1 — p). Fisher's distribution can be obtained 
from the negative binomial by truncating the zero 
category and letting r approach zero. 


Formation of distributions from others 


The most important families of secondary dis- 
tributions are the so-called mixed distributions, in 
which the parameter in a given distribution is itself 
subject to statistical fluctuation [see DISTRIBU- 
TIONS, STATISTICAL, article on MIXTURES OF DIS- 
TRIBUTIONS]. Two examples above (the first case 
of the negative binomial and the negative hyper- 
geometric) are also mixed, In addition, the follow- 
ing paragraphs illustrate other methods of obtaining 
secondary distributions, 

Truncated distributions. It may happen that 
the domain of definition of a distribution does 
not exactly agree with some data, either for theo- 
retical reasons or because part of the data is un- 
obtainable, although the model is in other respects 
quite satisfactory. The most famous example of 
this is concerned with albinism. The Poisson dis- 
tribution gives a satisfactory fit to the number of 
albino children born to parents genetically capable 
of producing albinos, except for the value of p,, the 
probability of no albino children. This frequency 
cannot be observed, since such parents with no 
albino children are ordinarily indistinguishable 
from normal parents. Therefore, a new distribution 
is formed from the Poisson by removing the zero 
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category, and dividing each probability by 1 — p. 
so that the total sum remains one: 
“hn 
Oar ay n=1,2,3, °°. 

In certain applications, it has been useful to trun- 
cate similarly the binomial distribution. [The prac- 
tice, which has also been applied to continuous 
density functions, is discussed in STATISTICAL 
ANALYSIS, SPECIAL PROBLEMS OF, article on TRUN- 
CATION AND CENSORSHIP. | 

Bissinger’s system. In connection with an inven- 
tory problem, the distribution qa, formed from a 
given distribution p, by the transformation 


m=] =) 


has been useful. A more general transformation of 
this type is 


n=0,1,°*:, 


n=0,1,2,-:-, 


which reduces to the simpler form for k = 1. This 
transformation is discussed by Bissinger in Patil 
(1965). 

Joint occupancy. Let p, be the probability of an 
object of type 1 in a box, and let 8 be the probability 
that an object of type 1 brings an object of type 11 
with it into the box, Then the probability q, of a 
total of n objects in the box is 


lin} P 
Ga= 2E a y Pai (1 — B)" Bi, 
= 094525 =i, 


where [k] represents the integer part of k. This 
form of distribution has been applied to the num- 
ber of persons in an automobile and to the number 
of persons in a group buying a railway ticket. The 
two types of objects might be male-female or adult- 
child. 

Analogy with density functions. Itis only neces- 
sary to adjust a constant in order to convert a con- 
tinuous distribution into a discrete one. For ex- 
ample, the negative exponential density de, 
usually defined over 0 <x < =, can be applied to 
the domain n = 0, 1, 2, ---. However, since 
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the discrete probability distribution will be 


Pe= (1—e*} es; n=0,1, =. 


Several such discrete analogs, including the nor- 
mal, have appeared in the literature. 


Alternative descriptions of distributions 

A probability distribution p, can be transformed 
in many ways, and, provided the transformation is 
one-to-one, the result will characterize the distribu- 
tion equally well. Some of the principal auxiliary 
functions are 


P,= 5 Pi, the cumulative distribution, or left 

< tail; 

Q =E pi = 1 — Pp, the right tail of the dis- 

pe tribution; 

o(s) = p:s‘, the generating function; 

a(s) =$ pis'/i!, thë exponential generating 
Le function; 

the moment generating func- 

n tion; 

o(s) =Ņ(1 + s)ipi, the factorial moment gen- 
Sia erating function. 


ACs) = Že"pi, 


The last two functions are so named because in 
the corresponding power series expansion the co- 
efficients involve, respectively, the central mo- 
ments and the factorial moments, 

For certain mathematical operations on discrete 
probability distributions, such as the calculation of 
moments, generating functions are extremely use- 
ful. In many cases the best way to obtain p, from 
a defining experiment is to calculate $(s) first. 
Many examples of this type of argument can be 
found in Riordan (1958). 


Multivariate distributions 


There have been a few discrete multivariate dis- 
tributions proposed, but only one, the multinomial, 
has been very widely applied. 

Multinomial distribution. If an experiment can 
have r possible outcomes, with probabilities pı, P, 
+++, p, and if it is repeated under the same condi- 
tions N times, the probability that the jth outcome 
will occur n; times, j = 1, 2, «++, 7, is 

N! Mr see Or 
iaoe eaa er? 
which reduces to the binomial for r = 2. 

Negative multinomial distribution. If, in the 
scheme given above, trials stop after the rth out- 
come has been observed exactly k times, the joint 
probability that the jth outcome will be observed 7) 
times (j = 1, ---, r— 1) is called the negative mul- 
tinomial distribution. 
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Bivariate Poisson distribution. The joint prob- 
ability 


Pring = exXp[— (À + u + p): 


min (ty,mg) * 
j 
Pp 
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has been proposed as a generalization of the Pois- 
son; when p=0, n, and n, are independently 
Poisson. 

Frank A. HAIGHT 


[Other relevant material may be found in PROBABILITY 
and in the biographies of FisHer, R. A.; PEARSON; 
Potsson.] 
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I 
SPECIAL CONTINUOUS DISTRIBUTIONS 


This article describes, and gives the more im- 
portant properties of, the major continuous distri- 
butions that arise in statistics. It is intended as both 
an overview for the reader generally interested in 
distributions and as a reference for a reader seek- 
ing the form of a particular distribution. 

Technical terms, such as “density function,” “cu- 
mulative distribution function,” etc., are explained 
elsewhere [see PROBABILITY]. 

The present article is, with a few exceptions, re- 
stricted to univariate distributions. Further specific 
references to numerical tabulations of distributions 
are not generally given here; most of the distribu- 
tions are tabulated in Owen (1962) or in Pearson 
and Hartley (1954), and full references to other 
tabulations are given in Greenwood and Hartley 
(1962). Tables of many functions discussed here 
and an extensive reference list to tables are given 
in Zelen and Severo (1964). An index to properties 
of distributions is given in Haight (1961). 

Normal distributions. The most important fam- 
ily of continuous distributions is that of the normal 
(or Gaussian) probability distributions. [See PROB- 
ABILITY, article on FORMAL PROBABILITY, for more 
of the many properties of the normal distributions. ] 

The normal probability density function is 


aE] ro 


where p is the mean (and also here the median and 
the mode) and ø is the standard deviation. Figure 1 
shows the shape of this density. Note that it is sym- 
metric about p, that is, f(u +x) = f(u = x), and 
that the density is essentially zero for x > p + 30 
and x < p — 30. The normal distribution is some- 
times said to be “bell-shaped,” but note that there 
are many nonnormal bell-shaped distributions. The 
normal distribution is “standardized” or “normal- 
ized” by the transformation z= (x — #)/o, which 
gives the standard—normal density, 


geriz). 
A standardized normal random variable is also re- 
ferred to as a “unit” normal since the mean of the 
standardized form is zero and the variance is one. 
The cumulative distribution function for the nor- 
mal distribution in standardized form is 


1 2 k: 
l exp(— 44°)dy. 
Thus, to find the probability that a normal random 
variable with mean p and standard deviation o 1s 
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Student t-distribution 
A y f=3=--- 


Normal distribution 
p=0,0=1 


Figure 1 — Standard normal distribution and Student 
t-distribution showing density on the ver- 
tical scale corresponding to argument on 
the horizontal scale 


less than x, first compute the number of standard 
deviations x is away from p, that is, let z= 
(x = 4.)/o. Probabilities associated with x then may 
be read from tables of the standardized distribu- 
tion. Care must be exercised to determine what is 
tabulated in any particular table. Various tables 
give the following: the cumulative probability; the 
probability in the right-hand tail only; the sum of 
the probabilities in the two tails; the central prob- 
ability, that is, the probability that the absolute 
value of the random variable is less than the argu- 
ment; and others. 

A great many probability distributions are de- 
rived from the normal distribution; see Eisenhart 
and Zelen (1958); Kendall and Stuart ([{1943- 
1946] 1958-1961); Korn and Korn (1961); Zelen 
and Severo (1964), These references also list addi- 
tional continuous distributions not covered here. 

Normal distributions are sometimes called Gauss- 
ian, sometimes Laplace-Gauss distributions, and 
sometimes distributions following Laplace’s second 
law. This terminology reflects a tangled and often 
misstated history [see Walker 1929, chapter 2; see 
also Gauss; LAPLACE]. 

Chi-square distributions. If X is unit normally 
distributed, X? has a chi-square distribution with 
one degree of freedom. If X, , X}, --+ , X; are all unit 
normally distributed and independent, then the 
sum of squares X? + X? + --- + X? has a chi-square 

distribution with f degrees of freedom. 


The probability density function for a chi-square 
random variable is 


1 ci (f-2), z 
Dra te 
This is also a probability density function when f 
is positive, but not integral, and a chi-square dis- 
tribution with fractional degrees of freedom is 
simply defined via the density function. 

The mean of a chi-square distribution is f, and 
its variance is 2f. For f > 2, the chi-square distribu- 
tions have their mode at f — 2. For 0 < f < 2, the 
densities are J-shaped and are maximum at the 
origin. Figure 2 shows the shape of the chi-square 
distributions. 

If Y, , Y, are independent and chi-square distrib- 
uted with f,, f+ degrees of freedom, then Y, + Y, 
is chi-square with f, + f: degrees of freedom. This 
additivity property extends to any finite number of 
independent, chi-square summands. 

If Y has a chi-square distribution with f degrees 
of freedom, then Y/f has a mean-square distribu- 
tion, or a chi-square divided by degrees of freedom 
distribution. 

The cumulative distribution function for the chi- 
square random variable with an even number of 
degrees of freedom (equal to 2a) is related to the 
Poisson cumulative distribution function with mean 
à as follows: 


O<x<o, 


ern 1 fe adagi dx 
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where f = 2a. 
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Figure 2 — Chi-square distributions showing density 
on the vertical scale corresponding to 
argument on the horizontal scale 
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Gamma or Pearson Type m distributions. The 
gamma distribution, a generalization of the chi- 
square distribution, has probability density function 


wore) was): 


where 0<0<%,0<r<%, and0<x<~», Note 
that r does not have to be an integer. The mean of 
this distribution is rð, and the variance is r@*. A 
simple modification permits shifting the left end- 
point from zero to any other value in this distribu- 
tion and in several others discussed below. If r= 
4f and @=2, the gamma distribution reduces to a 
chi-square distribution. If X has a chi-square dis- 
tribution with f degrees of freedom, then Y = 30X 
has a gamma distribution with parameters 0, 3f. 
If Y has a gamma distribution with parameters 90, 
r, then 204Y has a chi-square distribution with 2r 
degrees of freedom. 

Negative exponential distributions. Negative ex- 
ponential distributions are special cases of gamma 
distributions with r = 1. The probability density is 
(1/0) exp (—x/0) for 0 <x < œ, 0 <90 < %; the 
mean is @ and the variance is 6°. If @ = 2, the nega- 
tive exponential distribution reduces to a chi-square 
distribution with 2 degrees of freedom. 

The cumulative distribution is 


1 — exp (—x/0), O<x<%,0<0<%, 


This distribution has been widely used to represent 
the distribution of lives of certain manufactured 
goods, for example, light bulbs, radio tubes, etc. 
[see QUALITY CONTROL, STATISTICAL article on RE- 
LIABILITY AND LIFE TESTING]. 

Suppose that the probability that an item will 
function over the time period t to t+ At is inde- 
pendent of t, given that the item is functioning at 
time t. In other words, suppose that the age of an 
item does not affect the probability that it con- 
tinues to function over any specified length of fu- 
ture time provided the item is operating at present. 
In still other terms, if X is a random variable de- 
noting length of life for an item, suppose that 

Prix 2x | X 2 é} = Pr{(X >x— €} 
for all € and x > €. This “constant risk property” 
obtains if, and only if, X has a negative exponential 
distribution, The underlying temporal process is 
called a stationary Poisson process; and, for a 
Poisson process, the number of failures (or deaths) 
in any given interval of time has a Poisson distri- 
bution [see QuEvEs and the biography of Potsson]. 

Noncentral chi-square distributions. If Xi, X:, 
-++, X, are all normally distributed and independ- 
ent, and if X; has mean pi and variance one, then 
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the sum X? + X? + +++ +X? has a noncentral chi- 
square distribution with f degrees of freedom and 
noncentrality parameter A=p?+pi+<-*+ pi. 
This family of distributions can be extended to 
nonintegral values of f by noting that the density 
function (not given here) obtained for integral 
values of f is still a density function for the non- 
integral values of f. The mean of the noncentral 
chi-square distribution is f + and the variance 
is 2(f + 2d). 

Perhaps the main statistical use of the noncen- 
tral chi-square distribution is in connection with 
the power of standard tests for counted data [see 
COUNTED DATA]. 

The distribution also arises in bombing studies. 
For example, the proportion of a circular target 
destroyed by a bomb with a circular effects region 
may be obtained from the noncentral chi-square 
distribution with two degrees of freedom if the 
aiming errors follow a circular normal distribution. 
(A circular normal distribution is a bivariate nor- 
mal distribution, discussed below, with p = 0 and 
ox =oy.) 

Noncentral chi-square distributions have an addi- 
tivity property similar to that of (central) chi- 
square distributions. 

Weibull distributions. A random variable X has 
a Weibull distribution if its probability density func- 
tion is of the form 


2 (5) en (- ay, 
0\0 PATO, 


where 0 < x < %, 0 < 8 < % and r > 1. This means 
that random variables with Weibull distributions 
can be obtained by starting with negative exponen- 
tial random variables and raising them to powers 
> 1. If in particular r = 1, the Weibull distribution 
reduces to a negative exponential distribution, The 
Weibull distributions are widely used to represent 
the distribution of lives of various manufactured 
products. The mean of X is or[(r+ 1)/r], and 
62{T[(r + 2)/r] —I°[(r + 1)/r]} is the variance of X, 

Student (or t-) distributions. If X has a unit 
normal distribution and if Y is distributed inde- 
pendently of X according to a chi-square distribu- 
tion with f degrees of freedom, then X/ VY/f has 
a Student (or t-) distribution with f degrees of free- 
dom. Note that f need not be an integer since the 
degrees of freedom for chi-square need not be in- 
tegral. The only restriction is 0 < f < %. The den- 
sity for a random variable having the Student dis- 


tribution is 


T(4f +4) ee =f) ; 
VTO (1+ ) 
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where —% < x < +%, A graph of the density func- 
tions for f = 1 and f = 3 is shown in Figure 1. Note 
that the density is symmetric about zero. As f ap- 
proaches œ, the Student density approaches the 
unit normal density. 

The rth moment of the Student distribution exists 
if and only if r< f. Thus for f>2, the mean is 
zero; and for f > 3, the variance is f/(f — 2). The 
median and the mode are zero for all f. 

The Student distribution is named after W. S. 
Gosset, who wrote under the pseudonym Student. 
Gosset’s development of the t-distribution, as it 
arises in dealing with normal means, is often con- 
sidered to be the start of modern “small sample” 
mathematical statistics [see GOSSET]. 

The sample correlation coefficient, r, based on 
n pairs of observations from a bivariate normal 
population, may be reduced to a Student t-statistic 
with n — 2 degrees of freedom, when the population 
correlation coefficient is zero, by the transformation 
t=(rV¥n—2)/V1—F [see MULTIVARIATE ANAL- 
YsIs, articles on CORRELATION]. 

Cauchy distributions. The Cauchy distribution 
is an example of a distribution for which no mo- 
ments exist. The probability density function is 


SES eee: Devers 
mB 1+[(x— )/BP’ 


where —» <x < +, 8 > 0, and —% < À < +, The 
cumulative probability distribution is 


Pr{X <x} = Larcn (25 à) +4. 
The median and the mode of this distribution are 
at x= À, For 8 = 1 and à = 0, the Cauchy distribu- 
tion is also a Student t-distribution with f = 1 degree 
of freedom. 

Noncentral t-distributions. If X is a unit normal 
random variable and if Y is distributed independ- 
ently of X according to a chi-square distribution 
with f degrees of freedom, then (X + 8)/VY7f has 
a noncentral t-distribution with f degrees of free- 
dom and noncentrality parameter ô where 0 < f<% 
and —% <8 < %, The mean for f> 2 of this dis- 
tribution is c,,8 and the variance for f>3 is c..5? 
+ Cz, Where ca = Vf/2T (if —4)/T(3f), ca = 
[f/(f—2)]—¢2,, and c,,=f/(f—2). If 8=0, 
the noncentral t-distribution reduces to the Stu- 
dent t-distribution. Note that the moments do not 
exist for f= 1; despite this similarity, the non- 
central t-distribution with #0 is not a Cauchy 
distribution, The noncentral t-distribution arises 
when considering one-sided tolerance limits on a 
normal distribution and in power computations for 
the Student t-test. 


F-distributions. If Y, has a chi-square distribu- 
tion with f, degrees of freedom and Y, has a chi- 
square distribution with f. degrees of freedom, and 
Y, and Y, are independent, then (Y\/f,)/(¥2/ fs) 
has an F-distribution with f, degrees of freedom 
for the numerator and f: degrees of freedom for 
the denominator. The F-distributions are also known 
as Snedecor’s F-distributions and variance ratio 
distributions. They arise as the distributions of the 
ratios of many of the mean squares in the analysis 
of variance [see LINEAR HYPOTHESES, article on 
ANALYSIS OF VARIANCE]. 

The density of F is 


TGA + af AA amo 
PORMA) (+ hx 


where 0 < x < %, 0 < f, < ~ and 0 < f, < ~. Figure 3 
shows a plot of this density for four cases: fı = 
fi=1, fi=f:=2, fp=3, fo=5, and f,=f,=10. 
Of these the cases of f, = 3, fs = 5, and f, = fa = 10 
are most typical of F-distributions. 

The mean of this distribution is f:/(f:— 2) for 
f,>2; for f.>4 the variance is given by 
FCF, + fa — 2)/IF,(F, — 2)*(F, — 4)]; and the mode 
is f.(f: — 2)/[f, (fe + 2)] for fı > 2. The F-distribu- 
tion is J-shaped if f, < 2. When f, and f; are greater 
than 2, the F-distribution has its mode below x = 1 
and its mean above x = 1 and, hence, is positively 
skew. 

Let Fy, represent a random variable having an 
F-distribution with f, degrees of freedom for the 


Mn 


Figure 3 — F-distributions showing density on the ver- 
tical scale corresponding to argument on 
the horizontal scale 


DISTRIBUTIONS, STATISTICAL: Continuous 


numerator and f, degrees of freedom for the denom- 
inator. Then Pr{F; s Sc} =1— Pr{Fy,.4, < 1/0}. 
Hence the F-distribution is usually tabulated for 
one tail (usually the upper tail), as the other tail 
is easily obtained from the one tabulated. 

The chi-square and t-distributions are related to 
the F-distributions as follows: f:/F..;, has a chi- 
square distribution with f, degrees of freedom; 
f\F),.2 has a chi-square distribution with f, degrees 
of freedom; F,,;, is distributed as the square of a 
Student-distributed random variable with f+ degrees 
of freedom; 1/F,,,.. is distributed as the square of a 
Student-distributed random variable with f, degrees 
of freedom. 

Let E(n, r, p) =S2,,(1)p'(1 — p)"+, that is, let 
E(n, r, p) be the probability of r or more successes 
for a binomial probability distribution where p is 
the probability of success of a single trial [see Dis- 
TRIBUTIONS, STATISTICAL, article on SPECIAL DIS- 
CRETE DISTRIBUTIONS], Let C».s,f, be defined by 
Pr{Fy,s, S Crs} = V. Then the following relation- 
ship exists between the binomial and F-distributions. 
If E(n, r, p) =v then 

= r . 

P= FF n- r E)n 
A slight variation on the F-distribution obtained by 
the change of variable Y = [f1/f:]X is often referred 
to as the inverted beta distribution, the beta distri- 
bution of the second kind, or the beta prime dis- 
tribution. These names are also occasionally ap- 
plied to the F-distribution itself. Another variation 
on the F-distribution is Fishers z-distribution, 
which is that of 41n F. 

Beta distributions. A random variable, X, has 
a beta distribution with parameters p and q if its 
density is of the form 


T(p +4) A — x02 
Tew. vt ca 
where 0<x <1; pq>0. If the transformation 
X = fiFy1,/(fs + fiFr.4) is made, then X has a 
beta distribution with p = 4f; and q = 3f2- 
The cumulative beta distribution is known as 
the incomplete beta function, 
TOA) f wa- u) du. 
Lipa) = E, O 
The incomplete beta function has been tabulated 
by Karl Pearson (see Pearson 1934). The rela- 
tionship L(p,q) =1—h-2(4.P) is often useful. 
The mean of the beta distribution is p/(p + q), 
and the variance is pq/[(p + 9)*(P +4 + 1)]. 
The mode for p > 1 and q > 1, but p and q not 
both equal to 1, is (p — 1)/(p+q-2). For p<1 
and q > 1, the density is in the shape of a reversed 
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J; for p > 1 and q < 1 the density is J-shaped; and 
for p<1 and q <1, the density is U-shaped. For 
both p and q = 1, the density takes the form of the 
density of the rectangular distributions with a = 0 
and b= 1. The beta distribution is also known as 
the beta distribution of the first kind or the incom- 
plete beta distribution. 

Let E(n, r, x) =E r, (4)x'(1 — x)", that is, let 
E(n, r, x) be the probability of r or more successes 
in n trials for a binomial probability distribution 
where x is the probability of success of a single 
trial. Then I.(p,q) = E(p+q-1,p,x). In other 
words, partial binomial sums are expressible directly 
in terms of the incomplete beta function. To solve 
L.(p,q) = y for x, with y, p, and q fixed, find x from 
x= p/[p + qCr-y,24,»] Where Cys, is defined by 
Pr{Fy,9, S Cyt} = Y, that is, cy,y,,s, is a percentage 
point of the F-distribution. 

If Y, and Y, are independent random variables 
having gamma distributions with equal values of 0, 
r=n, for Y,, and r= n; for Y,, then the variable 
X= Y,/(Y, + Y.) has a beta distribution with p = n, 
and q = m. 

Bivariate beta distributions. The random vari- 
ables X and Y are said to have a joint bivariate beta 
distribution if the joint probability density function 
for X and Y is given by 


NEES eLearn SA 
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for xy>0 and x+y< 1,0 < fı < %,0 < f: < %, 
and 0 < fs < %, This distribution is also known as 
the bivariate Dirichlet distribution. The mean of X 
is given by f,/f,, and the mean of Y by f:/f., 
where f, represents (fı + f:+ fs). The variance 
of X is f,(f,+f,)/f(F, +1); the variance of Y 
is f,(f, + f,)/PCf, +1); the correlation between 
X and Y is —fyfe/ Vfifo( fo + fs) (hs + fa). The condi- 
tional distribution of X/(1 — Y), given Y, is a beta 
distribution with p = fı and q = fs. The sum X + Y 
has a beta distribution with p = fı + f: and q = fs. 
Noncentral F-distributions. If Y, has a non- 
central chi-square distribution with fı degrees of 
freedom and noncentrality parameter À, and if Y, 
has a (central) chi-square distribution with f, de- 
grees of freedom and is independent of Y,, then 
(Y./fx)/(Yo/f2) has a noncentral F-distribution 
with f, degrees of freedom for the numerator and 
f: degrees of freedom for the denominator and non- 
centrality parameter A. The cumulative distribution 
function of noncentral F may be closely approxi- 
mated by a central F cumulative distribution func- 


tion as follows: 
fe l 
Pr{FY aS c} = Pr {Fes < FAN 
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where FY , , has a noncentral F-distribution and 
Frey, hasa central F-distribution and f* denotes 
(fı +A)?/ (fı + 2). The mean of the noncentral 
F-distribution is f.(f; + A)/[(f: —2)f:] for fs > 2, 
and the variance is 


2fil(fr + A)? + (fı + 2A) (fe - 2)] 
(f. — 2)°F CFs — 4) 
for f. > 4. The means do not exist if f < 2 and the 
variances do not exist if f+ < 4. 

The ratio of two noncentral chi-square random 
variables also arises occasionally, This distribution 
has not yet been given a specific name. The non- 
central F-distribution is, of course, a special case 
of the distribution of this ratio. The noncentral 
F-distribution arises in considering the power of 
analysis of variance tests. 

Bivariate normal distributions. To say that X 
and Y have a joint (nonsingular) normal distribu- 
tion with means uy and p,., variances o and o}, 
and correlation p is to say that the joint probability 
density function for X and Y is 


1 
Qroyoy V1 — P 


xp \? X—px \/ Y- y—my \* 
oa ox )-29( ox X oy )+( or ) | 
—2(1 — p*) 

The cumulative distribution function occurs in 
many problems; it is a special case (two-dimen- 
sional) of the multivariate normal distribution. A 
fundamental fact is that X and Y are jointly normal 
if and only if aX + bY is normal for every a and b 
[see MULTIVARIATE ANALYSIS], 

Distributions of the sum of normal yariables. 
Let X,,X,, +++, X, be jointly normally distributed 
random variables so that X; has mean ķi and vari- 
ance oj and the correlation between X; and X; is 
pi. Then the distribution of the weighted sum, 
aX, + a.X, +-+- +a,X,, where a, as, +++, a, are 
any real constants (positive, negative, or zero), is 
normal with mean ay, + ap: +- +a, and 
variance 


n n on 
LEHIN È aiajoiospi;. 
isi isi jaisi 
If the normality assumption is dropped, then the 
means and variances remain as stated, but the 
form of the distribution of the sum a,X, + aX, + 
+++ +@,X, is often different from the distribution 
of the X’s, If a linear function, SGX: , a: + 0, of 
a finite number of independent random variables 
is normally distributed, then each of the random 
variables X,, X.,-+-, X, is also normally distributed. 


Note that in this instance independence of the 
random variables is required. 

In particular, if X, and X, are jointly normally 
distributed, the sum X, + X, is normally distributed 
with mean p, + p, and variance o? + o? + 2p,,0,0,. 
The difference X, — X, is normally distributed with 
mean p,— p, and variance o? + o? — 2p,.0,0;. If 
X,,X., +++, X, are jointly normally distributed with 
common mean u and variance o* and are inde- 
pendent, then the mean of the X’s, that is, R= 
(1/n)=}., X,, is normally distributed with mean p 
variance o*/n. 

Rectangular distributions. The rectangular (or 
uniform) distribution has the following density: 
it is zero for x < a; it is 1/(b — a) for a < x < b; 
and it is zero for x> b, where a and b are real 
constants, In other words, it has a graph that is a 
rectangle with base of length b — a and height of 
1/(b— a). The cumulative distribution function is 
zero for x < a; it is (x — a)/(b — a) for a < x S b; 
and it is one for x > b. The mean and the median 
of this distribution are both ¿(a+ b), and the 
variance is (b — a)?/12. 

One of the principal applications of the rectan- 
gular distribution occurs in conjunction with the 
probability integral transformation, This is the 
transformation Y = F(X), where F(X) is the cumu- 
lative distribution function for a continuous ran- 
dom variable X. Then Y is rectangularly distributed 
with a= 0 and b = 1. Many distribution-free tests 
of fit have been derived starting with this transfor- 
mation [see GOODNESS OF FIT; NONPARAMETRIC 
STATISTICS]. 

If Y has the rectangular distribution with a = 0, 
b=1, then —2InY has the chi-square distribu- 
tion with 2 degrees of freedom. It follows, from 
the additivity of chi-square distributions, that if 
Y,,Y.,-+++,Y¥, are jointly and independently dis- 
tributed according to rectangular distributions with 
a=0 and b = 1, then the sum, —2 55", In Y,, has 
a chi-square distribution with 2n degrees of free- 
dom. 

There is also a discrete form of the rectangular 
distribution. 

Pareto distributions. The density functions for 
Pareto distributions take the form: zero for x < b; 
(a/b)(b/x)** for b <x <=, where a and b are 
positive real constants (not zero). The cumulative 
distribution function is zero for x < b and is equal 
to 1 — (b/x)* for b<x<~, For a> 1, the mean 
of the Pareto distribution is ab/(a—1), and for 
a> 2 the variance is (ab*)/[(a — 1)2(a — 2)]. The 
median for a> 0 is at x = 2™"b, and the mode is 
at x=b. The Pareto distribution is related to the 
negative exponential distribution by the transfor- 
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mation Y = @a In (X/b) where Y has the negative 
exponential distribution and X has the Pareto dis- 
tribution. 

Pareto distributions have been employed in the 
study of income distribution [see INCOME DISTRI- 
BUTION]. 

Laplace distributions. A random variable X has 
the Laplace distribution if its probability density 

=À 


function takes the form 
J ea 
20 P Gigh We 


where —% <A} < +%,0 < 0< ~and—» <x< +4, 
The mean of this distribution is \, and the variance 
is 26%. This distribution is also known as the double 
exponential, since the graph has the shape of an 
exponential function for x > A and it is a reflection 
(about the line x=) of the same exponential 
function for x< AÀ. The Laplace distribution is 
sometimes called Laplace’s first law of error, the 
second being the normal distribution. 

Lognormal distributions. A random variable X 
is said to have a logarithmic normal distribution 
(or lognormal distribution) if the logarithm of the 
variate is normally distributed. Let Y=InX be 
normally distributed with mean p and variance o’. 
The mean of X is exp (p + 40°) and the variance 
of X is (expo? — 1)exp (2p + 0°). Note that 0 < 
X <æ, while -o<Y<+. The base of the 
logarithm may be any number greater than one 
(or between zero and one), and the bases 2 and 
10 are often used. If the base a is used, then the 
mean of X is amO», and the variance of X is 
(ating — 1 Jam Unao, 

Logistic distributions. The logistic curve y= 
A/[1 + ye] is used frequently to represent the 
growth of populations. It may also be used as a 
cumulative probability distribution function. A 
random variable, X, is said to have the logistic 
probability distribution if the density of X is given 


by 
T CA kui 

a] 

40y3 o ]2y3 
where —o <æ < +o; pis the mean of X and o 
is the variance of X. As with the normal distribu- 
tion, the transformation z= (x = p)/o gives a 
standardized form to the distribution. The cumula- 
tive distribution function for the standardized 
variable is 


1/{1 + exp (—72/V3)], 


e shape of this cumulative 
mbles the normal distri- 
normal and logistic dis- 


where —% < z < +. Thi 
distribution so nearly rese 
bution that samples from 
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tributions are difficult to distinguish from one 
another. 

The exponential family of distributions. The 
one-parameter, single-variate, exponential density 
functions are those of the form 


c(0) exp [0A(x) + B(x)], 


where c, A, and B are functions usually taken to 
satisfy regularity conditions. If there are several 
parameters, 0, , --:, 0,, the exponential form is 


++, 0,) exp [ÈE 0;A;(x) + B(x)]. 


Analogous forms may be considered for the 
multivariate case and for discrete distributions. 
Most of the standard distributions (normal, bino- 
mial, and so on) are exponential, but reparameter- 
ization may be required to express them in the 
above form. 

The exponential distributions are important in 
theoretical statistics, and they arise naturally in 
discussions of sufficiency. Under rather stringent 
regularity conditions, an interesting sufficient sta- 
tistic exists if, and only if, sampling is from an 
exponential distribution; this relationship was first 
explored by Koopman (1936), Darmois (1935), 
and Pitman (1936), so that the exponential dis- 
tributions are sometimes eponymously called the 
Koopman—Darmois or Koopman-Pitman distribu- 
tions [see SUFFICIENCY]. 

A discussion of the exponential family of distri- 
butions, and its relation to hypothesis testing, is 
given by Lehmann (1959, especially pp. 50-54). 

It is important to distinguish between the family 
of exponential distributions and that of negative 
exponential distributions. The latter is a very spe- 
cial, although important, subfamily of the former. 


Dona.p B. OWEN 
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m 
APPROXIMATIONS TO DISTRIBUTIONS 


The term approximation refers, in general, to 
the representation of “something” by “something 
else” that is expected to be a useful replacement 
for the “something.” Approximations are sometimes 
needed because it is not possible to obtain an exact 
representation of the “something”; even when an 
exact representation is possible, approximations 
may simplify analytical treatment. 

In scientific work, approximations are in con- 
stant use. For example, much scientific argument, 
and nearly all statistical analysis, is based on 
mathematical models that are essentially approxi- 
mations. This article, however, is restricted to ap- 
proximations to distributions of empirical data and 
to theoretical probability distributions. 


When approximating empirically observed dis- 
tributions—for example, a histogram of frequen- 
cies of different words in a sample of speech—the 
primary objectives are those of compact descrip- 
tion and smoothing. These are also the primary 
objectives of much approximation in demographic 
and actuarial work [see LIFE TABLES]. 

On the other hand, approximation to a theo- 
retical distribution is often needed when exact 
treatment is too complicated to be practicable. For 
example, an econometrician who has developed a 
new estimator of price elasticity may well find the 
exact distribution of his estimator quite intractable; 
he will probably resort to large-sample (asymptotic) 
methods to find an approximate distribution. 

It may also happen that a distribution arising 
not from statistical considerations, but from an- 
other kind of mathematical model, requires approx- 
imation to improve understanding. For example, a 
psychologist may use a probabilistic model of the 
learning process, a model that leads to a theoretical 
distribution for the number of trials needed to 
reach a specified level of performance. This dis- 
tribution may be so complicated that an approx- 
imate form will markedly increase appreciation of 
its meaning. 

The final section of this article discusses a gen- 
eral requirement, the measurement of the goodness 
of any particular approximation; this is especially 
important in the comparison of different ap- 
proximations. 

Approximation to empirical distributions. This 
section deals with approximations to distributions 
of numerical data representing measurements on 
each of a group of individuals, (Usually the group 
is a sample of some kind.) Among important tech- 
niques not discussed here are those that are purely 
mathematical (such as numerical quadrature and 
iterative solutions of equations) and those associ- 
ated with the analysis of time series (such as trend 
fitting and periodogram and correlogram analysis) 
[see TIME SERIES]. , 

As a specific example, data on the distributions 
of diseases by frequency of diagnosis (for males) 
in the teaching hospitals of England and Wales for 
the year 1949 (based on Herdan 1957) are pre- 
sented in Table 1. The figures mean, for example, 
that out of 718 different diseases of males reported 
during the year, 120 occurred between one and 
five times each, while two were reported between 
4,000 and 5,000 times each. 

The figures shown in the table have already been 
grouped but present a rather irregular appearance: 
it would be useful to summarize the data in a more 
readily comprehensible form. A quick way to do 
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Table 1 — Diseases by frequency of diagnosis in the 
teaching hospitals of England and Wales 


for 1949 
FREQUENCY OF NUMBER OF DISEASES 
DIAGNOSIS Observed Fitted 
1-5 120 103 
6-10 55 74 
11-15 44 54 
16-20 31 43 
21-25 28 34 
26-30 32 28 
31—40 45 45 
41-50 37 35 
51—60 33 28 
61-70 22 23 
71-80 23 19 
81-90 22 17 
91-100 15 14 
101-119 21 22 
120-139 18 19 
140-159 19 15 
160-179 19 13 
180-199 12 n 
200-239 19 17 
240-279 24 13 
280-319 13 n 
320-359 4 8.5 
360-399 9 7.0 
400-499 15 13.1 
500-599 13 91 
600-699 5 67 
700-799 6 5.0 
800-899 2 4.0 
900-999 2 3.1 
1,000-1,999 6 14.0 
2,000-2,999 1 41 
3,000-3,999 1 19 
4,000-4,999 2 1.0 
> 5,000 - 2.25 
Total 718 717.75 


Source: Herdan 1957. 


this is to group further and to form a histogram 
(as in Figure 1). [Further information on this 
method is presented in GRAPHIC PRESENTATION; 
STATISTICAL ANALYSIS, SPECIAL PROBLEMS OF, 
article on GROUPED OBSERVATIONS.] y 
Grouping is in itself a kind of approximation, 
since it does not reproduce all features of the origi- 
nal data. For concise description, however, repre- 
sentation by a formula can be more useful. This is 
effected by fitting a frequency curve. If the fitted 
curve is simple enough and the effectiveness of 
approximation ( “goodness of fit”) is adequate, con- 
siderable benefit can be derived by replacing a large 
accumulation of data, with its inevitable irregu- 
larities, with a simple formula that can be handled 
with some facility and is a conveniently brief way 
of summarizing the data. 
The present-day decline 
fitting observed frequency 
be only a temporary phenomenon. In 


in the importance of 
distributions may well 
the years 
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1890 to 1915 (roughly) there was a need to dem- 
onstrate that statistical methods did apply to real 
physical situations. The x° test developed by Karl 
Pearson (1900) demonstrated clearly that the 
normal distribution, which had previously been 
assumed to be of rather general application, was 
not applicable to much observed data, It was de- 
sirable, therefore, to show, if possible, that some 
reasonably simple mathematical formula could give 
an adequate fit to the data. 

Subsequent development of the theory of mathe- 
matical statistics has, on the one hand, been very 
much concerned with clarification of the logical 
principles underlying statistical method (assuming 
that there is some fairly well-established mathe- 
matical representation of the distributions in- 
volved); and on the other hand it has produced 
“distribution free” procedures, particularly signifi- 
cance tests, that eliminate the need for considering 
the actual form of distribution (to any but the 
broadest detail) [see NONPARAMETRIC STATISTICS]. 

Both these lines of work tend to reduce interest 
in description of actual distributions in as precise 
detail as the data allow, However, from the more 
general viewpoint of scientific inquiry, the neglect 
of systematic study of distributional form in favor 
of application of formal techniques to arbitrarily 
hypothetical situations can represent a wasteful 
use of the data. 

In the data of the example, the frequency of 
occurrences is naturally discrete—it takes only 
integer values—and, therefore, prime consideration 
should be given to formulas appropriate to discrete 
variables. In view of the wide range of variation of 
frequency, however, formulas appropriate to con- 
tinuous variables may also give good approxima- 
tions, and they are worth considering if they offer 
substantially simpler results. 

A number of families of frequency curves have 
been found effective in approximating observed 
sets of data. Provided that a suitable form of curve 
has been chosen, fitting a maximum of four param- 
eters gives a reasonably effective approximation. 
Fitting the curve is equivalent to estimating these 
parameters; this may be effected in various ways, 
among which are (i) the method of maximum 
likelihood; (ii) the method of moments, in which 
certain lower moments of the fitted curve are made 
to agree with the corresponding values calculated 
from the observed data; and (iii) the method of 
percentile points, in which the fitted curve is made 
to give cumulative proportions agreeing with those 
for the observed data at certain points (7% , 25%, 
75%, and 93% are often recommended) [see 
EsTIMATION, article on POINT ESTIMATION]. 
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In cases where a clear probabilistic model can 
be established, there is usually good reason to pre- 
fer the method of maximum likelihood. When it is 
not possible to establish such a model, another 
method is required. 

Fitting is sometimes facilitated by using appro- 
priately designed “probability paper,” with the ordi- 
nate and abscissa scales such that if cumulative 
frequency (ordinate) is plotted against variable 
values (abscissa), a straight line relationship 
should be obtained [see GRAPHIC PRESENTATION]. 

Among the more commonly used families of fre- 
quency curves are those representing the normal, 
exponential and gamma distributions, These specify 
the proportion less than, or equal to, a fixed num- 
ber x (the cumulative distribution function, often 
denoted by the symbol F(x)) as a function depend- 
ing on two or three parameters (and also, of course, 
on x) [see DISTRIBUTIONS, STATISTICAL, article on 
SPECIAL CONTINUOUS DISTRIBUTIONS]. The best- 
known system of curves is the Pearson system 
(Elderton 1906), which is derived from solutions 
of the differential equation 


æF al c(x + a) i 


dé” ak |X +ant+a 


Originally (Pearson 1895) this equation was ar- 
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Figure 1 — Diseases by frequency of diagnosis in the teaching 
hospitals of England and Wales for 1949 


125 150 175 200 225 250 


Source: Herdan 1957, 


rived at by (i) considering limiting cases of sam- 
pling from a finite population and (ii) requiring 
the curve y = dF/dx to satisfy certain natural con- 
ditions of unimodality, smoothness, etc. (Note that 
there are four parameters, C, a, @:, a2, in this 
equation. 

Jone ae have been worked out (Elderton 1906) 
for estimating the values of these parameters from 
the first four moments of the distributions. In cer- 
tain cases, the procedures can be much facilitated 
by using Table 42 of the Biometrika tables (Pearson 
& Hartley 1954) or the considerably extended 
version of these tables in Johnson et al. (1963). 
Entering these tables with the values of the mo- 
ment ratios (VB; and £:) it is possible to read di- 
rectly standardized percentage points (X;) for a 
number of values of P. The percentage points X 
(such that F(X,)=P) are then calculated as 
Mean + X;, x (Standard Deviation). h 

Other ehai are based on “transformation R 
normality” with z = a, + a.g[c(x + a)l, where ed 
anormal variable with zero mean and unit stand 
deviation, and g[] is a fairly simple explicit we 
tion. Here, c, a), a,, and a, are again parameters. 
gly] = logy, one gets a lognormal curve. Such a 
curve has been fitted to the distribution of frequency 
of diagnosis in Figure 1 (for clarity, up to fre- 
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quency 100 only). Since z= a, + a:log[e(x + a)] 
can be written as z=a'+a,log (x+ a), where 
now a; = a, + a, log c, there are in fact only three 
separate parameters in this case. In fitting the 
curve in Figure 1, a, has been taken equal to —0.5, 
leaving only a, and a. to be estimated. This has 
been done by the method of percentile points (mak- 
ing the fitted frequencies less than 11 and less than 
200 agree with the observed frequencies—175 and 
596 respectively). The fitted formula is (rounded) 


Number less than, or equal to, x (an integer) 
= 718 &(.55 In x — 1.95), 


where ® is the cumulative distribution function of 
the unit-normal distribution [see DISTRIBUTIONS, 
STATISTICAL, article on SPECIAL CONTINUOUS DIS- 
‘RIBUTIONS]. Numerical values of the fitted fre- 
quencies are shown in the last column of Table 1. 

While a reasonable fit is obtained in the center 
of the distribution, the fit is poor for the larger 
frequencies of diagnosis. As has been noted, al- 
though the data are discrete, a continuous curve 
has been fitted; however, this is not in itself reason 
for obtaining a poor fit. Indeed many standard 
approximations—such as the approximation to a 
binomial distribution by a normal distribution—are 
of this kind. 

Another system of curves is obtained by taking 
the approximate value of F(x) to be 


P(x) -$ a an(Z=4), 


where ® is the jth derivative of ®, and where the 
parameters are @, @,°**, Qk, é, and o > 0. The 
number of parameters depends on the value of k, 
which is usually not taken to be greater than 3 or 4. 
This system of curves is known as the Gram- 
Charlier system; a modified form is known as the 
Edgeworth system (Kendall & Stuart 1943-1946, 
chapter 6). If all a's after a, are zero (and a, is 
equal to one) then F(x) is simply the normal cumu- 
lative distribution. The terms after the first can be 
regarded as “corrections” to the simple normal 
approximation. It should not automatically be 
assumed, however, that the more corrections added 
the better. If a high value of k is used, the curve 
may present a wavy appearance; even with a small 
value of k it is possible to obtain negative values 
of fitted frequencies (Barton & Dennis 1952). , 
Similar expansions can be constructed replacing 
(x) by other standard cumulative distribution 
(see Cramér 1945, sec. 20.6). If the ®”’s are re- 
placed by cumulative distribution functions ®;(x), 
(with, of course, a + 4 +a, +: +a = 1), then 
F(x) is represented by a “mixture” of these distribu- 
tions [see DISTRIBUTIONS, STATISTICAL, article on 
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MIXTURES OF DISTRIBUTIONS]. When joint distribu- 
tions of two or more variates are to be fitted, the 
variety of possible functional forms can be an 
embarrassment. A convenient mode of attack is to 
search for an effective normalizing transformation 
for each variate separately. If such a transforma- 
tion can be found for each variate, then a joint 
multinormal distribution can be fitted to the set of 
transformed variates, using the means, variances, 
and correlations of the transformed variates. A dis- 
cussion of some possibilities, in the case of two 
variates, will be found in Johnson (1949). 

Approximation to one theoretical distribution by 
another. Approximation is also useful even when 
it is not necessary to deal with observed data. An 
important field of application is the replacement of 
complicated formulas for the theoretical distribu- 
tions of statistics by simpler (or more thoroughly 
investigated) distributions. Just as when approxi- 
mating to observed data, it is essential that the 
approximation be sufficiently effective to give useful 
results. In this case, however, the problem is more 
definitely expressible in purely mathematical terms; 
there are a number of results in the mathematical 
theory of probability that are often used in con- 
structing approximations of this kind, 

Among these results, the “Central Limit” group 
of theorems has the broadest range of applicability. 
The theorems in this group state that, under ap- 
propriate conditions, the limiting distribution of 
certain statistics, Ta, based on a number, n, of 
random variables, as n tends to infinity, is a normal 
distribution. [See PROBABILITY for a discussion of 
the Central Limit theorems.] 

If the conditions of a Central Limit theorem 
are satisfied, then the distribution of the statistic 
T= T, Z E(T, Meo(T,)}"; where E(T,) is the 
mean and ø (T,) the standard deviation of T, , may 
be approximated by a “unit normal distribution.” 
Then the probability Pr[T„ <7] may be approximated 
by using tables of the normal integral with argu- 
ment [7 — E(T,,)]/o(T,)- 

There are many different Central Limit theo- 
rems; the most generally useful of these relate to 
the special case when the X’s are mutually inde- 
pendent and T, = X, + X: ++" + X,. The simplest 
set of conditions is that each X should have the 
same distribution (with finite expected value and 
standard deviation), but weaker sets of conditions 
can replace the requirement that the distributions 
be identical. These conditions, roughly speaking, 
ensure that none of the variables X; have such 
large standard deviations that they dominate the 
distribution of Ty. ` 

Central Limit results are used to approximate 
distributions of test criteria calculated from a 
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“large” number of sample values. The meaning of 
“large” depends on the way in which the effective- 
ness of the approximation is measured and the 
accuracy of representation required. In turn, these 
factors depend on the use that is to be made of the 


results. Some of the problems arising in the meas- — 


urement of effectiveness of approximation will be 
described in the next section. 

Very often there is a choice of quantities to 
which Central Limit type results can be applied. 
For example, any power of a chi-square random 
variable tends to normality as the number of degrees 
of freedom increases. For many purposes, the 
one-third power (cube root) gives the most rapid 
convergence to normality (see Moore 1957), al- 
though it may not always be the best power to use. 
While many useful approximations can be sug- 
gested by skillful use of probability theory, it should 
always be remembered that sufficient numerical 
accuracy is an essential requirement. In some cases 
(for example, Mallows 1956; Gnedenko et al. 1961; 
Wallace 1959), theoretical considerations may pro- 
vide satisfactory evidence of accuracy; in other 
cases, ad hoc investigations, possibly using Monte 
Carlo methods, may be desirable. Interesting exam- 
ples of these kinds of investigation are described 
by Goodman and Kruskal (1963) and Pearson 
(1963). [A discussion of Monte Carlo methods may 
be found in RANDOM NUMBERS.] 

Another type of result (Cramér 1945, sec. 20.6) 
useful in approximating theoretical distributions 
and, in particular, in extending the field of Cen- 
tral Limit results, may be stated formally: “if 
liM, Pr[X, < €] = F(€) and Y, tends to y in prob- 
ability, then lim,.,.. Pr[X, + Y, < £] = F(€—),” or 
less formally, “the limiting distribution of X, + Y, 
is the same as that of X, + 7.” Similar statements 
hold for multiplication, and this kind of ‘useful re- 
sult may be considerably generalized. A fuller dis- 
cussion is given in the Appendix of Goodman and 
Kruskal (1963). 

A classic and important example of approxima- 
tion of one theoretical distribution by another is 
that of approximating a binomial distribution by a 
normal one. If the random variable X has a bino- 
mial distribution with parameters p, n, then the 
simplest form of the approximation is to take 
Pr{X < c} as about equal to 


c— np 
mpa ory" 
where c= 0,1,2, ::-,n. One partial justification 
for this approximation is that the mean and stand- 
ard deviation of X are, respectively, np and 
[np(1—p)]}; the asymptotic validity of the ap- 


proximation follows from the simplest Central 
Limit theorem. 

The approximation may be generally improved 
by replacing c — np, in the numerator of the argu- 
ment of ®, with c—np+}. This modification, 
which has no asymptotic effect, is called the “con- 
tinuity correction.” Together with similar modifica- 
tions, continuity corrections may often be used 
effectively to improve approximations at little extra 
computational cost. 

The methods described in the section on approxi- 
mating numerical data can also be used, with ap- 
propriate modification, in approximating theoretical 
distributions. Different considerations may arise, 
however, in deciding on the adequacy of an approx- 
imation in the two different situations. 

Measuring the effectiveness of an approximation. 
The effectiveness of an approximation is its suit- 
ability for the purpose for which it is used, With 
sufficiently exhaustive knowledge of the properties 
of each of a number of approximations, it would 
be possible to choose the best one to use in any 
given situation, with little risk of making a bad 
choice. However, in the majority of cases, the 
attainment of such knowledge (even if possible) 
would entail such excessively time-consuming labor 
that a main purpose for the use of approximations 
—the saving of effort—would be defeated. Con- 
siderable insight into the properties of approxima- 
tions can, however, be gained by the careful use of 
certain representative figures, or indexes, for their 
effectiveness. Such an index summarizes one aspect 
of the accuracy of an approximation. 

Since a single form of approximation may be 
used in many different ways, it is likely that differ- 
ent indexes will be needed for different types of 
application. For example, the approximation ob- 
tained by representing the distribution of a statistic 
by a normal distribution may be used, inter alia, 
(a) for calculating approximate significance levels 
for the statistic, (b) for designing an experiment 
to have specified sensitivity, and (c) for combining 
the results of several experiments. In case (a), 
effectiveness will be related particularly to accuracy 
of representation of probabilities in the tail(s) of 
the distribution, but this will not be so clearly the 
case in (b) and (c). It is important to bear in 
mind the necessity for care in choosing an appro- 
priate index for a particular application. 7 

This article has been concerned with approxima- 
tions to theoretical or empirical distribution func: 
tions. The primary purpose of such approximations 
is to obtain a useful representation, say F(x), of the 
actual cumulative distribution function F(x). It is 
natural to base an index of accuracy on some func- 
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tion of the difference F(x) — F(x). Here are some 
examples of indexes that might be used: 


(1) max |F(x) -— F(x)| 
(2) max |{F(x,) — P(x.)} — (F(x) — F(x:))| 


(3) max |P(x) = F(x)| 

x F(x){1—F(x)} 
max LEC) =POu)} = (Fn) F) 
sues (F(x) —F(%2)}[1— (F(a) —F(x2)}] 
The third and fourth of these indexes are modifi- 
cations of the first and second indexes respectively. 
A further pair of definitions could be obtained by 
replacing the intervals (x,, x.) in (2) and (4) by 
sets, w, of possible values, replacing |F(x,) — F(x:)| 
and |F(x,) — F(x.)| by the approximate and actual 
values of Pr[X in œ], and replacing max,,.,, by MaXw. 
Indexes of type (3) and (4) are based on the 
proportional error in the approximation, while in- 
dexes of type (1) and (2) are based on absolute 
error in the approximation. While the former are 
the more generally useful, it should be remembered 
that they may take infinite values. Very often, in 
such cases, the difficulty may be removed by choos- 
ing w to exclude extreme values of x. 

In particular instances w may be quite severely 
restricted. For example, the 5% and 1% levels may 
be regarded as of paramount importance. In such 
a case, comparison of F(x) and F(x) in the neigh- 
borhood of these values may be all that is needed. 
It has been suggested in such cases that actual 
values between 0.04 and 0.06, corresponding to a 
nominal 0.05, and between 0.007 and 0,013, cor- 
responding to a nominal 0.01, can be regarded as 
satisfactory for practical purposes. 

In measuring the accuracy of approximation to 
empirical distributions, it is quite common to calcu- 
late x2. This is, of course, a function of the differ- 
ences F(x) — F(x). Even when circumstances are 
such that no probabilistic interpretation of the 
statistic is possible, indexes of relative accuracy of 
approximation have been based on the magnitude 
of x’. A similar index for measuring accuracy of 
approximations to theoretical distributions can be 
constructed, Such indexes are based on more or 
less elaborate forms of average (as opposed to 
maximum) size of error in approximation. 

N. L. JOHNSON 


(4) 


[See also GoopNESS OF FIT.] 
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Iv 
MIXTURES OF DISTRIBUTIONS 


A mixture of distributions is a weighted average 
of probability distributions with positive weights 
that sum to one. The distributions thus mixed are 
called the components of the mixture. The weights 
themselves comprise a probability distribution 
called the mixing distribution. Because of this prop- 
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erty of the weights, a mixture is, in particular, 
again a probability distribution. 

As an example, suppose that the probability 
distribution of heights of 30-year-old men in New 
York is approximately a normal distribution, while 
that of 30-year-old women in New York is another 
approximately normal distribution. Then the prob- 
ability distribution of heights of 30-year-old people 
in New York will be, to the same degree of approxi- 
mation, a mixture of the two normal distributions. 
The two separate normal distributions are the com- 
ponents, and the mixing distribution is the simple 
one on the dichotomy male-female, with the 
weights given by the relative frequencies of men 
and women in New York. 

Probability distributions of this type arise when 
observed phenomena can be the consequence of 
two or more related, but usually unobserved, phe- 
nomena, each of which leads to a different proba- 
bility distribution. Mixtures and related structures 
often arise in the construction of probabilistic 
models; for example, models for factor analysis. 
Mixtures also arise in a number of statistical con- 
texts. A general problem is that of “decomposing” 
a mixture on the basis of a sample, that is, of 
estimating the parameters of the mixing distribu- 
tion and those of the components. 

Mixtures occur most commonly when the param- 
eter, 0, of a family of distributions, given, say, by 
the density or frequency functions f (x; @), is itself 
subject to chance variation. The mixing distribu- 
tion, say g(@), is then a probability distribution on 
the parameter of the distributions f(x; 0). 

The components of a mixture may be discrete, 
continuous, or some of each type. Mixtures are 
classified, in accordance with the number of their 
components, as finite, countable, or noncowntably 
infinite. 

The generic formula for the most common form 
of finite mixture is 


o È f 000); 


the infinite analogue (in which g is a density func- 
tion) is 


(2) [ Fox 6)9() ao. 


As an illustration of the above ideas, consider 
the following simple example. Two machines pro- 
duce pennies, which are fed into a common bin. 
The pennies are identical except that those pro- 
duced by machine 1 have probability 0, of showing 
a head when tossed, while those produced by ma- 
chine 2 have probability 6. # 0, of showing a head. 
Let a be the proportion of coins produced by ma- 


chine 1, and 1 — æ the proportion produced by ma- 
chine 2. 

A coin chosen at random from the bin is tossed 
n times. By the basic rules of probability theory, 
the probability of observing x heads is 


Pr{x heads out of n tosses) 
= Pr{penny from machine 1} Pr{x heads 
given machine 1} 
(3) + Pr(penny from machine 2} Pr{x heads 
given machine 2} 


=a (z)sa —6,)™* 
+ (1—a)(™) aa —0,)m. 


This is a mixture of two binomial distributions (the 
components) with mixing distribution g(6,) =a, 
g(0:) =1—a. [See Propaniuiry, article on FOR- 
MAL PROBABILITY, for a discussion of the rules 
giving rise to (3).] 

By contrast, if 0, = 6, = 0, that is to say if the 
two machines produce exactly identical coins, then 


Pr{x heads out of n tosses} = (7) o(1—6)"*, 


which is the simple binomial. Similarly if the ex- 
perimenter selected a coin from a particular ma- 
chine, say machine 2, rather than choosing a coin 
at random, the distribution of outcomes would be 
a binomial with parameter 0.. A simple binomial 
also results if n distinct coins are chosen and each 
is tossed a single time. In this case, however, the 
binomial parameter is a0, + (1 — a)0:. 

A generalization of the above example is a mix- 
ture of two trinomial distributions. It is applicable 
to description of the sex distribution of twins (cf. 
Goodman & Kruskal 1959, p. 134; Strandskov & 
Edelen 1946). Twin pairs fall into three classes: 
MM, MF, and FF, where M denotes male and F 
female. This leads to the trinomial distribution. 
Since, in addition, twins may be dizygotic or mono- 
zygotic, a mixture of trinomials results. Because the 
sexes of individual dizygotic twins are independent, 
the corresponding trinomial has parameters p°, 
2p(1 — p), and (1 — p)?, where p is the probability | 
of a male. Monozygotic twins, however, are ge- 
netically identical, so that the corresponding tri- 
nomial has parameters p, 0, and q. The mixing 
distribution is determined by the relative fre- 
quencies of monozygotic and dizygotic twin births. | 

Some properties of mixtures. Mixtures are them- | 


selves probability distributions, and hence their 
density or frequency functions are nonnegative and 
sum or integrate to unity. a 

While the definitions (1) and (2) are given m | 
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terms of the density or frequency functions, pre- 
cisely the same relationships hold with regard to 
the corresponding cumulative distribution func- 
tions. Similarly, the moments (about zero) of a 
mixture are weighted averages (that is, mixtures) 
of the moments of the component distributions. 
Thus, for a finite mixture, if /(@,) is the rth 
moment about zero of the ith component, then the 
rth moment about zero of the mixture is 


2 LOAI 


The same is true of the factorial moments. A 
similar, although slightly more complicated, rela- 
tionship holds for moments about the mean. 

An important property of many common mix- 
tures of discrete distributions is the relationship 
imposed upon the generating functions: if ¢,(s) 
and @,(s) are the generating functions of the mix- 
ing distribution and the component distributions, 
respectively, then $,(.(s)) is the generating func- 
tion of the mixture. This greatly simplifies certain 
calculations. [See DISTRIBUTIONS, STATISTICAL, 
article on SPECIAL DISCRETE DISTRIBUTIONS, for a 
discussion of generating functions.] 

Mixtures of standard distributions may possess 
interesting geometric properties. They may be 
bimodal or multimodal and are ordinarily more 
dispersed than are the components. Properties such 
as these account, at least in part, for the fact that 
mixtures frequently fit data more satisfactorily 
than do standard distributions. 

Conditionality. The concept of conditionality 
underlies the definitions (1) and (2), for the func- 
tion f(x;@) is the conditional probability dis- 
tribution given the value of 9, while the product 
of f(x; 0) and g(@) is the joint probability distribu- 
tion of x and 0, and the sum (or integral) is the 
unconditional distribution of x. 

The importance of conditionality in applications 
can be seen in the coin example: the machine on 
which the randomly chosen coin was produced may 
be thought of as an auxiliary chance variable (tak- 
ing on the values 1 and 2 with respective proba- 
bilities œ and 1 — a); conditional on the value of 
the auxiliary chance variable, the distribution of 
heads is a simple binomial. 

The auxiliary chance variable is usually unob- 
servable and hence cannot play a direct role in 
data analysis. The merit of introducing the auxil- 
iary chance variable in this context is, therefore, 
not that it results in simpler data analyses but that 
it yields a proper understanding of the underlying 
mathematical structure and simplifies the deriva- 
tion of the probability distribution. 
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This situation is typical; the conditional distribu- 
tion of the chance variable under investigation, 
given the value of a related unobservable chance 
variable, is either known exactly or is of a rela- 
tively simple form. Because of the inability to ob- 
serve the related chance variable, however, the 
unconditional distribution is a mixture and is 
usually much more complex. 

Some applications. Because of the immense 
complexity of living organisms, variables that the 
investigator can neither control nor observe fre- 
quently arise in investigations of natural phenom- 
ena. Some examples are attitude, emotional sta- 
bility, skills, genotype, resistance to disease, etc. 
Since mixtures of distributions result when such 
variables are related to the variable under investi- 
gation, mixtures have many applications in the 
social and biological sciences. 

Mixtures have played an important role in the 
construction of so-called contagious distributions. 
These distributions are deduced from models in 
which the occurrence of an event has the effect of 
changing the probability of additional such occur- 
rences, An early example of a contagious distribu- 
tion was given by Greenwood and Yule (1920) in 
an analysis of accident data. Greenwood and Yule 
derived the classical negative binomial distribution 
as a mixture of distributions. Other contagious dis- 
tributions that are mixtures are given by Neyman 
(1939, the Neyman types A, B, and C) and Gur- 
land (1958). 

Mixtures of distributions also arise in dealing 
with unusual observations or outliers. One ap- 
proach to the problem of outliers is predicated 
upon writing the underlying probability distribu- 
tion as a mixture of distributions (see Dixon 
1953). [For further discussion, see STATISTICAL 
ANALYSIS, SPECIAL PROBLEMS OF, article on OUT- 
LIERS.] 

Mixtures of distributions also arise in the Bayes- 
ian approach to statistical analysis. Bayes’ Pro- 
cedures are constructed under the assumption that 
the parameters of the underlying probability dis- 
tribution are themselves chance variables. In this 
context, the mixing distribution is called the a priori 
distribution [see BAYESIAN INFERENCE]. 

The following are a few additional applications 
of mixtures: 

(1) Life testing of equipment subject to sudden 
and delayed failures (Kao 1959). 

(2) Acceptance testing, when the proportion of 
defectives varies from batch to batch (Vagholkar 
1959). 

(3) Latent stucture analysis based on mixtures 
of multivariate distributions with independence be- 
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Figure 1 — A mixture of two normal distributions 
with u, =0, p,=4, oF =1, 02 =9, and 
&, = 0.4, showing f(x) on the vertical axis 
corresponding to x on the horizontal axis 


tween the variates within each component (Lazars- 
feld 1950; Green 1952). Factor analysis also comes 
under this general description. [See FACTOR ANALY- 
SIS; LATENT STRUCTURE. ] 

(4) Construction of a learning model as a 
weighted average of two simpler such models 
(Sternberg 1959). 

(5) A model for spatial configurations in map 
analysis (Dacey 1964). 

Special mixed distributions. Using the defini- 
tions (1) and (2), it is easy to generate large 
numbers of probability distributions. Furthermore, 
the process of mixing of distributions may be re- 
peated; that is, new mixtures, in which the com- 
ponents are themselves mixtures of distributions, 
may be formed by repeated application of ( 1) and 
(2). Thus, it is possible to form an unlimited num- 
ber of probability distributions by this relatively 
simple process. In addition, many classical proba- 
bility distributions can be represented nontrivially 
as mixtures. Teicher (1960) gives two representa- 
tions of the normal distribution as a mixture of 
distributions. [See DISTRIBUTIONS, STATISTICAL, QT- 
ticle on SPECIAL DISCRETE DISTRIBUTIONS, for both 
classical and mixture representations of the nega- 
tive binomial distribution.] 

It is customary, when both component and mix- 
ing distribution are well known, to call the mixture 
by the names of the distributions involved. 

The following are a few specific examples of 
mixtures: 

Mixture of two normal distributions. The den- 
sity function of a mixture of two normal distribu- 


tions is 
fe) = Fe[S ew eer 
Eaa 


where —2 < x <œ, 0 < ar< 1, 0 < <1, % F 
a:=1, and pı, p, and oł, o? are the respective 
means and variances of the component normal dis- 
tributions. This example is due to Pearson (1894). 
The distribution has mean aœ,4, + œx, and variance 
aT} + a,r: + a,a,( H, — p)? An example of this 
mixture with p, =0, w,=4, of=1, of =9, and 
a, = 0.4 is given in Figure 1. 

Mixture of two Poisson distributions. The fre- 
quency function of a mixture of two Poisson dis- 
tributions is 


P(x) = i [ares + arse |, x=0, 1,2, 
: + 
where a, and a; are as-before and A, # A, are the 
parameters of the components. This distribution 
has mean a,A, + aA, and variance œ), + aA, + 
a,a.(A; — A.)*. An example of this discrete mixed 
distribution with A,=1, A.=8, and a,=0.5 is 
given in Figure 2. 
Poisson-binomial distribution. The frequency 
function of the Poisson—binomial distribution is 


pose So e(Z)ea-om 
= x=0,1,2,:, 


where 0<0<1, 0<A<~, and n is an integer. 
The components are binomial distributions; the 
mixing distribution is Poisson. The probabilities 
can also be obtained by successive differentiation 
of the generating function 
$(s) =exp {A[(1 — 0 + 8s)" — 1]), 

or from the recursion formulas 

p(0) = exp {A[(1 — 0)” — 11}, 


pe + 1)= wa Ayh» S e) 2. 


“+1 T=? 
The mean is n0); the variance is n@A[1 + (n — 1)0]. 
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Figure 2 — A mixture of two Poisson distributions 
with A, = 1, A, = 8, and a, = 0.5, show- 
ing p(x) on the vertical axis corresponding 
to x on the horizontal axis 
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Poisson—negative binomial distribution. The fre- 
quency function of the Poisson—negative binomial 
distribution is 


EN en 
p(x) =e pee i) (1-0), 
x=0,1,2, =, 


where 0<0<1,0<A\< %, and0 <n < œ. This is 
a Poisson mixture of negative binomial distributions. 
Here 


T0 


o(s) = exp {A[0"(1 — s + s0)” — 1]}; 
the recursion formulas are 


p(0) = exp [~ (1 — 6")], 


p(x + D=- S 
(x+1—i)er(1 -07-0 |; 


The mean and variance are nà(1-—0)/0 and 
nd(1 — 0)(n — nO + 1)/0°, respectively. 

Neyman Type A contagious distribution. The 
frequency function of the Neyman Type A distribu- 
tion is 


a 
KENVER 
p(x) = GAT ye il ey, 


i=0 
where 0<d, <% and 0<dA.<~. The Neyman 
Type A is a Poisson—Poisson. The generating func- 
tion is 
o(s) = exp (Ailes? — 1]}s 
the recursion formulas are 


p(0) = exp [N (e™ — 1)], 


a aN 7 
pati Dien ge 
i=0 


The mean is A,Az, and A,A.(1 — à) is the variance. 

Point probability—negative exponential. An ex- 
ample of a mixture of a discrete and a continuous 
distribution is written in terms of its cumulative 
distribution function, 


F(x) =0 ifx<0 
=a ifx=0 
=a, +a,(1—e*/*) if x > 0, 


where @>0 and a, and a, are as in previous ex- 
amples. An application in economics is given by 
Aitchison (1955). The distribution has mean a6 


and variance a,(1 + a2). See 
Identifiability. The subject of identifiability, 


that is, of unique characterization, is of concern on 
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two levels. On both levels, identifiability, or lack 
thereof, has very important practical implications. 

The first level is in model construction. Here the 
question revolves around the existence of a one-to- 
one correspondence between phenomena observed 
in nature and their corresponding mathematical 
models. That such one-to-one correspondences need 
not exist is apparent in the application of con- 
tagious distributions to accident data. Greenwood 
and Yule (1920) devised two models, proneness and 
apparent proneness, for accidents. Both lead to the 
negative binomial distribution. As noted by Feller 
(1943), even complete knowledge of the under- 
lying negative binomial distribution therefore would 
not enable the experimenter to distinguish between 
proneness and apparent proneness as the cause of 
accidents. 

The question of identifiability is an important 
consideration in mathematical modeling generally. 
It is particularly crucial in attempting to distinguish 
between competing theories. Such distinctions can 
be made through properly designed experiments. 
For example, proneness and apparent proneness 
can be distinguished in follow-up studies or by 
sampling in several time periods. 

The question of identifiability also arises in a 
purely mathematical context. A mixture is called 
jdentifiable if there exists a one-to-one correspond- 
ence between the mixing distribution and the re- 
sulting mixture. Mixtures that are not identifiable 
cannot be expressed uniquely as functions of com- 
ponent and mixing distributions. This is true, for 
example, of a mixture of two binomials when n = 2. 
For example, both a, = 4, p=.3, Pr=6 and 
a, = 6, pı = .36, pP: = .66 result in identical mixed 
distributions. 

The derivation of conditions under which mix- 
tures are identifiable in this sense is difficult. 
Teicher (1963) derives some such conditions and 
discusses identifiability for many common mixtures. 
Identifiability in this sense has important statistical 
implications in that it is not possible to estimate 
or test hypotheses about the parameters of uniden- 
tifiable mixtures. [For further discussion, see STA- 
TISTICAL IDENTIFIABILITY.] 

All of the mixtures listed in the section on 
“Special mixed distributions” are identifiable. The 
mixture of two binomial distributions given in 
equation (3) is identifiable if, and only if, n > 3. 

Estimation. The construction of estimates for 
the parameters of mixtures of distributions is diffi- 
cult. Procedures such as maximum likelihood and 
minimum x, which are known to be optimal, at 
least for large sample sizes, require the solution 
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of systems of equations that are typically intrac- 
table for mixtures. The problem has been somewhat 
alleviated with the advent of high-speed electronic 
computers, but it is by no means resolved. In any 
case the distributions of the estimators are likely 
to be difficult to work with. [For further discussion, 
see ESTIMATION. ] 

Because of this complexity, it is not uncommon 
to choose estimation procedures almost solely on 
the basis of computational simplicity. Moment esti- 
mators are often used even though they may be 
inefficient in that they require larger sample sizes 
to attain a given degree of accuracy than do maxi- 
mum likelihood estimators. Moment estimators are 
constructed by equating sample moments to popu- 
lation moments, the latter being written in terms 
of the parameters of the underlying distribution, 
then solving the resulting system of equations for 
the parameters. For example, for the Poisson— 
binomial distribution the moment estimators based 
on the sample mean, %, and variance, sè, are 


p= MDF 6= s?—z 
nS =) ~ (n-1)š 


Even moment estimators can become quite com- 
plex. Moment estimates for mixtures of binomial 
distributions can require formidable calculations 
(see Blischke 1964). Pearson’s (1894) solution for 
a mixture of two normal distributions requires ex- 
traction of the roots of a ninth-degree equation. 
In the normal case, the problem is greatly simplified 
if the two normal components are assumed to have 
the same variance (see Rao 1952, section 8b.6). 

Moment estimators and/or maximum likelihood 
estimators for the distributions mentioned above 
are given in the references. For a summarization 
of these results and some additional comments on 
the estimation problem, see Blischke (1963). 

Another aspect of the estimation problem that 
can be troublesome in practice occurs if two com- 
ponents of a finite mixture are nearly identical. 
If this is the case, extremely large sample sizes are 
required to estimate the parameters of the mixture 
with any degree of accuracy. Such mixtures are, for 
all practical purposes, unidentifiable (cf. Chiang 
1951). 

Nomenclature. It is important to distinguish 
between the concepts of mixing of distributions and 
the distribution of a sum of random variables. The 
latter distribution is called a convolution and, ex- 
cept for special cases or particular notational con- 
ventions, is not a mixture of distributions. 

There are, however, several additional terms 
sometimes used as synonyms for “mixture of dis- 
tributions.” These include “compound distribution,” 


“mixed distribution,” “probability mixture,” “super- 
position,” “composite distribution,” and “sum of dis- 
tributions.” The most common terms are “mixture,” 
“mixed distribution,” and “compound distribution,” 
In addition, certain “generalized distributions” are 
mixtures of some specific structure (cf. Feller 
1943). 

The terms “dissection” and “decomposition” are 
sometimes used in connection with estimation for 
finite mixtures. These terms are descriptive since 
the estimates give information about the com- 
ponents from observations on the composite. 


WALLACE R. BLISCHKE 
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DIVISIA, FRANCOIS 


Francois Divisia (1889-1964), was born at Tizi- 
Ouzou, the principal town in the Great Kabylia 
region of northern Algeria. His family had been 
established for three generations on both the French 
and Algerian shores of the Mediterranean. He was 
awarded baccalaureates in mathematics and phi- 
losophy at the Algiers lycée. In 1910 he won ad- 
mission to both the Ecole Normale Supérieure and 
the Ecole Polytechnique and chose to attend the 
latter. After two years at the Ecole Polytechnique, 
he entered the government department of civil en- 
gineering (Ponts et Chaussées). In 1913 he be- 
came a graduate student at the Ecole Nationale 
des Ponts et Chaussées, and at the outbreak of 
World War 1 he was mobilized as a lieutenant in 
the engineering corps. Promoted to the rank of 
captain in 1916 and wounded in 1917, he was 
commended in dispatches and named chevalier of 
the Légion d'Honneur. Having completed his engi- 
neering studies in 1919, he pursued his career as 
a government engineer for about ten years and 
was then transferred to the Ministry of National 
Education to devote himself entirely to research 
and teaching in economics, a subject that had 
previously absorbed his spare time. It was the 
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influence of Clément Colson that was decisive in 
turning Divisia toward economics; eventually he 
was to succeed Colson as professor of applied eco- 
nomics and finance at the Ecole Nationale des 
Ponts et Chaussées. (This was in 1932, after he 
had also taught for brief periods at the Ecole Libre 
des Sciences Politiques, the Conservatoire National 
des Arts et Métiers, and the Ecole Polytechnique. ) 

Divisia was to remain strongly influenced by the 
training, both practical and scientific, that he had 
received as an engineer. It played an important 
role in his contributions to mathematical econom- 
ics, where statistical data are constantly used, and 
it led him quite naturally to take an active part 
first in the creation and then in the development 
of econometrics. A founding member and vice- 
president of the Société d’Econométrie in 1931, he 
was elected president of the society in 1935. In 
the meantime he was also elected to the Institut 
International de Statistique in 1933, and he be- 
came president of the Société de Statistique de 
Paris in 1939. 

At the outbreak of World War 11, he was asked 
to administer the seaports for the Ministry of 
Public Works. There he was one of the first to 
appreciate the problem of determining optimum 
transportation routes. Confronted by the mathe- 
matical difficulties that this problem involved, he 
called upon the mathematician Maurice Fréchet. 
Fréchet turned the problem over to two of his 
students, one of whom, Nachman Aronszajn, 
submitted a solution, published only much later. 
In this way, Divisia contributed to the genesis of 
linear programming, which was soon to be one 
of the most productive branches of operations 
research [see OPERATIONS RESEARCH; PROGRAM- 
minc]. In his work Technique et statistique (1941), 
he was also one of the first, perhaps the very first, 
in France to envisage and advocate the application 
of statistics to technical problems. 

Divisia resumed his many teaching activities 
during the occupation and in the years following 
France's liberation. He participated in an increas- 
ing number of scientific organizations, particularly 
on an international level. In 1951 he became a 
foreign member of the Accademia Nazionale dei 
Lincei. In 1950 he resigned from his professorship 
at the Ecole Nationale des Ponts et Chaussées and 
was then able to give more of his time to work in 
the laboratories he had set up at the Conservatoire 
National des Arts et Métiers in 1943 and at the 
Ecole Polytechnique in 1950. The laboratory at the 
latter institution concentrated its research on 
the relationships between the French economy and 
national defense; that at the Conservatoire focused 
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on the collection and interpretation of statistics 
relevant to the state of the economy: rates of re- 
turn, price indexes, and characteristics of indus- 
trial, commercial, and banking activity, as they are 
related to international commerce. Divisia valued 
this kind of research very highly and devoted con- 
siderable time to it. 

Major contributions. The earliest of Divisia’s 
important contributions to economics is contained 
in his book L'indice monétaire et la théorie de la 
monnaie (1926). The monetary index, commonly 
known as the “Divisia index,” is related to a general 
equation of exchange, known mainly through the 
work of Irving Fisher but independently developed 
by Divisia [see Money, article on QUANTITY THE- 
ory]. Using this equation, Divisia, by differentia- 
tion, distinguished two indexes whose product is 
always proportional to the total payments, ex- 
pressed in money, of the period to which the equa- 
tion applies. These two indexes are a monetary or 
price index, expressing the general level of prices 
for the totality of monetary transactions, and an 
index of exchange activity (a quantity index), 
measuring the volume of transactions for the 
period involved. 

The first of these is a weighted price index, with 
weights that vary from period to period, each 
period being treated as sufficiently short for the 
weights to be regarded as constant within the 
period. Price level comparisons between periods 
further separated in time are effected by the chain 
index method, which in contrast to a fixed-base 
method compounds the price index changes of 
successive (intermediate) periods, each with its 
own set of weights. Consequently, it is not possible 
to compare the price levels of nonadjacent time 
periods that link together the extremities of the 
chain of prices. The Divisia monetary index there- 
fore involves more information about prices and 
quantities than do fixed-base indexes. 

Divisia's Economique rationnelle (1928a) is 
without doubt his masterpiece: witness its favor- 
able reception by mathematical economists as well 
as by economists of the traditional school, Essen- 
tially devoted to the classical interrelationships, it 
is a didactic account, both synthetic and concise, 
and its conclusions are drawn from concrete and 
objective observations. The work also advances an 
original view of the role of money in the theory of 
value and of economic equilibrium. Divisia’s repu- 
tation in the scientific world was enlarged by this 
book; in France the work was awarded prizes by 
the Académie des Sciences and by the Académie 
des Sciences Morales et Politiques. 

The questions that Divisia raised in his L’épargne 
et la richesse collective (1928b) in many instances 


anticipated aggregative problem areas that have 
since become very important in the writings of 
economists. 

Since Divisia addressed himself in his teach- 
ing to practitioners in business and in industry 
and to students of engineering with a scientific 
background, he was able, while at the Ecole Na- 
tionale des Ponts et Chaussées, to maintain close 
contact with the problems faced by nonacademic 
economists, especially problems related to techno- 
logical change. In his courses he discussed, with 
great care and originality, problems concerning 
money, credit, the establishment and management 
of public works, and the operation of transporta- 
tion facilities. 

Divisia’s last important work, Traitement éco- 
nométrique de la monnaie, l'intérêt, l'emploi 
(1962), is a kind of summing up of his opinions 
on some of the most important problems in eco- 
nomics, It is necessary to remember that Divisia 
was moved to write it primarily because of his dis- 
satisfaction with the lack of precision in Keynes's 
general theory. To highlight Keynes's deficiencies, 
Divisia used mathematics for logical sequence and 
Statistics for handling evidence. 

Reacting against Keynes's methods and faithful, 
instead, to classical traditions, Divisia stressed the 
danger of macroeconomic studies. He based his 
own model on the analysis of individual behavior, 
i.e., on microeconomics. In his opinion this is the 
only approach that offers sufficient reliability for 
the analysis of the permanent motives in decision. 

The most original characteristic of his model is 
its distinction between three kinds of flow: (a) the 
technical flow of the circulation of goods, meaning 
physical units; (b) the monetary flow, expressed. 
in nominal value and fed by payments correlative 
to the transactions involving the goods of the tech- 
nical flow; and (c) the flow of credits and claims 
arising from these transactions and expressed in 
monetary terms. These three circuits do not over- 
lap. They are interrelated, however, in that claims 
originating on the technical level are established 
on the contractual level and are liquidated by the 
circulation of money. 

This analysis of economic behavior is highly 
aggregative, since it introduces only four types of 
economic agents: the household, the business en- 
terprise, the state, and the bank. Even with this 
limitation, the model generates a great many equa- 
tions dealing with the categories that are usually 
considered in presentations of this kind. 

Divisia then attempted to discover the “degree 
of freedom” possessed by the economy, given a 
system of relationships that contains seven mathe- 
matical unknowns. He subtracted what are now 
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called “policy variables,” that is, the behavioral 
data reflecting the policies of the economic agents. 
(After extensive research he had decided that there 
are five such variables, corresponding to policies 
with respect to the floating of shares, loans, self- 
financing, production, and the issuing of cur- 
rency.) The number of unknowns is thus reduced 
to two, and these must be identified with the rates 
of flow in the technical and monetary circuits, i.e., 
the total flow of goods and the total flow of money 
—in other words, the real and nominal manifes- 
tations of national income. In terms of equations, 
the two unknowns are the flow of the labor force 
employed and the general price level governed by 
the monetary unit chosen. 

This leads to conclusions sharply opposed to the 
theories of Keynes. Here is how Divisia put it: 
“Full employment is possible provided that we 
accept the real wages that follow from it; or, if 
real wages are fixed, in any way, at a higher level, 
the necessary outcome is a certain amount of un- 
employment” (1962, p. 179). The model on which 
Divisia essentially based his critique of Keynes is 
a revised version of Walras’ concept of equilibrium. 
To be sure, it differs considerably from the latter, 
for it is in no way static or timeless; it can be 
applied to a trend and also to cyclical fluctuations. 
It is well suited to economic calculations because 
of its high degree of aggregation. 

Divisia therefore confidently expressed his judg- 
ment of Keynesian theories: “. . . both the concept 
of a general, long-term equilibrium that establishes 
endemic underemployment without any reaction 
from the economic agents, and a theory of interest 
rates that is based solely on a preference for 
liquidity and excludes the productivity of capital, 
seem indefensible” (1962, p. 195). At best, he felt 
Keynes's propositions may be used to explain rela- 
tive changes over short periods of time. Construed 
in this way, they do not even differ fundamentally 
from other explanations of the particular relation- 
ships that constitute the framework of the general 
equilibrium. A necessary further step is to account 
for all the factors involved and for their various 


interrelationships. 
RENE Roy 


[For the historical context of Divisia’s work, see the 
biography of KEYNES, JOHN Maynarp. For discus- 
sion of the subsequent development of Divisia’s 
ideas, see ECONOMETRIC MODELS, AGGREGATE; Eco- 
NOMIC EQUILIBRIUM; INDEX NUMBERS; PROGRAM- 
MING.] 
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1926 L'indice monétaire et la théorie de la monnaie. 
Paris: Sirey. 
1928a Économique rationnelle. Paris: Doin. 
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2: Assise pour les études et techniques monétaires. 
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L’épargne et la richesse collective. Paris: 


DIVISION OF LABOR 

For the origins of the concept of division of 
labor, see the biographies of PLato and ARISTOTLE; 
for the development of the modern tradition of 
thought into which the concept enters, see the 
biographies of BÜCHER; DURKHEIM; Marx; SMITH, 
ADAM; WEBER, Max. The economic concept is dis- 
cussed in SPECIALIZATION AND EXCHANGE, Some 
sociological aspects of the division of labor are 
discussed in ECONOMY AND SOCIETY; MEDICAL 
PERSONNEL; OCCUPATIONS AND CAREERS; PROFES- 
sions, For anthropological thought, see SOCIAL 
STRUCTURE, article on THE HISTORY OF THE CON- 
cept. Material on the effects of technological 
change on the division of labor will be found in 
AGRICULTURE, article on SOCIAL ORGANIZATION; 
AUTOMATION; MODERNIZATION; TECHNOLOGY. 


DIVISION OF POWERS 
See CONSTITUTIONAL LAW, article on DISTRIBU- 
TION OF POWERS; DELEGATION OF POWERS; FED- 


ERALISM. 


DIVORCE 
See FAMILY, article on DISORGANIZATION AND 


DISSOLUTION. 


DIXON, ROLAND B. 


Roland Burrage Dixon was born in Worcester, 
Massachusetts, in 1875 and died at his bachelor 
home in Harvard, Massachusetts, in 1934. He was 
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appointed assistant in anthropology in the Peabody 
Museum at Harvard upon his graduation from that 
institution in 1897, thereupon beginning a career 
of anthropological teaching and research at his 
alma mater that continued until his death. While 
a graduate student there he engaged in archeo- 
logical field work in Ohio and later, as a member 
of the Jesup North Pacific Expedition of the Ameri- 
can Museum of Natural History, in ethnological 
field work among the Indians of British Columbia. 
In 1899 he spent the first of six field seasons 
among the California Indians. The following year, 
upon completion of his thesis dealing with the 
language of the Maidu Indians, he was awarded the 
PH.D. degree. Thus, Dixon became one of the first 
Americans to receive a doctors degree in the 
emerging discipline of anthropology. Interestingly 
enough, Harvard’s other doctorate in anthropology 
in 1900 went to John R. Swanton, a man whose 
anthropological orientation was very similar to 
Dixon's, Following a year spent in research and 
travel in Germany, Siberia, and Mongolia, Dixon 
returned to Harvard in 1901 as an instructor in 
anthropology. He advanced through the various 
academic grades to full professor and was for a 
long period chairman of the division of anthro- 
pology. His Harvard service was briefly interrupted 
during World War 1, when he did special work in 
Washington, D.C.; he was sent to Paris with the 
American Commission to Negotiate the Peace. 

Dixon brought to anthropology a marked natu- 
ral-history orientation. He enjoyed outdoor life, and 
his early interest in geography was deepened by 
intensive reading combined with travel and re- 
search in New Zealand, Australia, Tasmania, the 
Himalayas, Burma, Malaysia, Indonesia, China, 
Japan, and Central America. No student ever took 
a course with Dixon without becoming aware of 
the interrelationship between culture and the 
natural environment. This geographic interest was 
combined with a historical approach. Dixon was 
well-read in French, Italian, German, and Rus- 
sian and in the Scandinavian languages. The 
range and depth of his anthropological knowledge 
were reflected in his writings and his classroom 
lectures. 

Dixon was a generalist at a time when such 
virtuosity was still possible in anthropology. Fol- 
lowing his brief introduction to mound archeology, 
he concentrated for some years on California 
ethnology, and his many publications on this cul- 
ture area cover the fields of descriptive ethnogra- 
phy, folklore, and linguistics. Although he was not 
a trained linguist, his work on California languages 
provided the guidelines for future work in this field. 


Some of his California publications were done 
jointly with Alfred L. Kroeber. The linking of the 
names of these two men is appropriate, since they 
not only were contemporaries (Kroeber received 
his PH.D. from Columbia in 1901), but both also 
did extensive ethnographic work in California and 
took a basically historical approach to anthro- 
pology. Dixon lacked Kroeber’s philosophical over- 
view and theoretical interests, but the two were 
similar in the range and depth of their scholarship. 
They held each other in mutual esteem, and 
Kroeber's obituary of Dixon is unusually percep- 
tive (Tozzer & Kroeber 1936). 

After completing his California work, Dixon 
turned from ethnography to a wide range of other 
problems. In connection with the thirteenth census 
of the United States he compiled the most complete 
and accurate enumeration of the Indian population 
by stocks and tribes, and after tabulating the 
sterility and fecundity of pure Indian marriages in 
comparison with mixed marriages, he concluded 
that the latter produced both more and healthier 
offspring (U.S. Bureau of the Census 1915). 
Some years later he published his most ambitious 
work, The Racial History of Man (1923). Utilizing 
three basic indices, Dixon analyzed all the anthro- 
pometric data then available and classified men 
into racial types. Although this methodology was 
criticized by physical anthropologists, the book did 
demonstrate the morphological diversity of many 
populations, particularly the American Indians and 
Polynesians. In The Building of Cultures (1928) 
Dixon attempted to deal with theory and with cul- 
tural dynamics. The real contribution of the book, 
however, lay in its meticulous discussion of the 
diffusion process. 

In the final analysis, Dixon was not so much a 
theoretician as a culture historian who combined 
vigorous scholarship with geographic understand- 
ing. His study of the early migrations of the New 
England Indians (1914), for instance, illustrates 
his interest in movements of peoples, reinforced by 
an understanding of ecological factors and a mas- 
tery of ethnohistorical sources—all features that 
came increasingly to characterize his later writings. 
Insofar as it was possible for one man to do 50, 
Dixon mastered the ethnography of the world, par- 
ticularly in the field of material culture. He was 
less interested in details of social organization. 

During most of his years at Harvard, Dixon 
served as librarian of the Peabody Museum. He 
brought to this task the same bibliographic ex- 
pertise that characterized his scholarship, with the 
result that the library catalogue eventually became 
an index of all anthropological literature, listed not 
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only by author but also by geographical area and 
by topic. This great research tool, invaluable to 
generations of Harvard graduate students, has 
since been updated and made available to institu- 
tions all over the world (1963). 

Dixon’s greatest contribution to American an- 
thropology may well have been the building up of 
an outstanding program at Harvard—broad in 
scope and designed for both the undergraduate and 
the graduate student, Dixon's work at Harvard is 
in some ways comparable to that of Boas at Co- 
lumbia and Kroeber at California, the three institu- 
tions that trained the bulk of American anthro- 
pologists during the first third of the twentieth 
century. Dixon founded no “school” and left no 
disciples; but his students gained from him a tre- 
mendous body of carefully organized ethnographic 
information and a training in rigorous scholarship. 
Many of these students in turn became profes- 
sional anthropologists, including such early Viking 
Medalists as A. V. Kidder, J. O. Brew, Hallam 
Movius, Ralph Linton, I. H. H. Roberts, Carleton 
Coon, W. W. Howells, and S. K. Lothrop, together 
with many other eminent teachers and researchers 
active in anthropology throughout the world. 


ROBERT A, MCKENNAN 


[For the historical context of Dixon's work, see the biog- 
raphy of KROEBER. For discussion of the subsequent 
development of his ideas, see Race and the biog- 
raphy of KIDDER.] 
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DOMESTICATION 


1, THE FooD-PRODUCING 
REVOLUTION 


I. ANIMAL DOMESTICATION 


Robert J. Braidwood 
Robert H. Dyson, Jr. 


I 
THE FOOD-PRODUCING REVOLUTION 


The domestication of plants and animals marked 
the beginnings of effective food production by man, 
a stage in human sociocultural development that 
has frequently been called the Neolithic. 

Usages of the word “Neolithic.” There have 
been a number of confusing usages of the word 
“Neolithic” (a term this author does not normally 
employ in his own writing), and it is clear that the 
confusion is not yet resolved. 

Earliest usages. Neolithic has prehistoric arche- 
ological connotations: the word is a Neo-Grecism 
for “New Stone Age,” coined a century ago by Sir 
John Lubbock, along with “Paleolithic,” or “Old 
Stone Age.” The contrast between the two ages was 
seen to depend upon the fact that various inven- 
tories of archeological materials from western 
European ice-age contexts yielded stone tools with 
forms and cutting edges prepared only by chipping, 
whereas certain archeological inventories from 
early postglacial contexts also included stone tools 
whose outlines or cutting edges were prepared by 
grinding or polishing. The end of the “Neolithic” 
was seen as marked by the appearance of archeo- 
logical inventories with at least some metal tools, 
usually of copper or bronze; hence, the “Bronze 
Age” followed the “Neolithic.” Moreover, it was soon 
sensed that “traits” other than ground-stone tools 
might be used to characterize the general range of 
“neolithic” inventories or assemblages. Traces of 
crude pottery vessels, weaving, architecture, and 
the bow, as well as traces of ground-stone artifacts 
other than cutting tools, came to be included as 
characteristic “traits” of the “Neolithic,” provided 
that items of metal or of other, more sophisticated 
substances or manufacturing procedures did not 
appear in the same assemblage. 

Both the nomenclature (i.e., “Paleolithic,” “Neo- 
lithic,” “Bronze Age,” and “Iron Age”) and its ration- 
ale reflect an early period of evidence collection and 
of scholarly thought about European prehistory, 
during which a kind of world-wide evolutionary 
chronological and cultural uniformitarianism was 
assumed to have been the ordering principle of 
things. During this early period in the development 
of prehistoric scholarship it was thought, for the 
most part, that at a given time in the past, gen- 
erally similar cultures were producing generally 
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similar types of artifacts. At first, the consequences 
of this assumption were not too serious, but as the 
twentieth century opened and more came to be 
learned of prehistory beyond the classic western 
European area and of ethnology in general, it be- 
came increasingly clear to perceptive prehistorians 
that the uniformitarian image was a vastly over- 
simplified one. 

In the meantime, however, the word “Neolithic” 
has come to be used as an adjective or as a noun 
and may denote either (1) the chronological age, 
(2) the developmental level or stage, or (3) the 
typological appearance or description of an archeo- 
logical site or of an artifact or assemblage of arti- 
facts. Thus, in a given piece of writing, the author 
may hop at will between the chronological, devel- 
opmental, and typological connotations, his lack of 
precision in meaning neatly camouflaged behind 
the learned-looking Neo-Grecism. 

Techno-economic usage. V. Gordon Childe’s 
presidential address to the Prehistoric Society in 
1935 marked the crystallization of a new meaning 
for “Neolithic” as “a self-sufficing food-producing 
economy” (Childe 1953, p. 193). It should be noted 
that Childe credited Elliot Smith, Harold J. E. 
Peake, and others with the original notion of “. . . the 
revolutionary contrast between food-gatherers and 
food-producers.” The importance of the achieve- 
ment of food production was hardly a new idea, 
even to archeologists (e.g., Mortillet 1882, p. 576). 
What interests us here is Childe’s selection of a 
developmental factor—the techno-economic factor 
—as the most meaningful sense for “Neolithic,” if 
this was to remain a useful term for prehistorians 
and culture historians. Owing to the deservedly 
wide popularity of Childe’s general writings, his 
viewpoint that the “Neolithic” began with the ap- 
pearance of effective (“self-sufficing” ) food produc- 
tion is now usually followed, at least in western 
European prehistoric thought. This means that 
“neolithicness” commenced wherever and when- 
ever effective food production was assured, regard- 
less of the typological descriptions of the associated 
artifacts. In fact, Childe’s Huxley lecture (1944) 
shows his own firm grasp of the principle of homo- 
taxis—that the same techno-economic stage may 
appear in the record of different regions at quite 
different times in a real chronological sense. 

There is still no very new or clear thought on 
what marks the end of the “Neolithic” stage or era. 

Usage relating to regions outside Europe. Prob- 
ably the greatest confusion with the usage “Neo- 
lithic” has occurred in writings concerning the pre- 
history of regions other than those in Europe. 


Fortunately, the usage of “Neolithic” has not ap- 
pealed to Americanist archeologists (Ehrich 1963), 
although effective food production certainly did 
appear in the prehistoric New World. It seems 
probable that, in the Old World, confusion in- 
creases as prehistorians deal with paleoenviron- 
ments different from those of Europe, especially if 
the prehistorians have had European training and 
if they retain a European image of the way things 
ought to have worked. However, we need not expect 
that the three factors in the important equation— 
chronology, developmental level, and typological 
description—were always the same everywhere in 
the world. 

Thus, the exact timing of the sequences in extra- 
European regions may be quite different from those 
of Europe (factor of chronology). Food production 
appears to have reached Europe with the diffusion 
of a southwestern Asiatic complex of domesticated 
food-plants and animals and to have become effec- 
tive in Europe with the acceptance of this complex 
by indigenous European peoples already at an in- 
tensified or “mesolithic” level of food collection. It 
is certainly not clear that what happened in Eu- 
rope was a universal pattern for the first appear- 
ance of food production (factor of developmental 
level). When we view artifacts and characteristic 
assemblages of artifacts as a manifestation of 
man’s cultural means of adaptation to environ- 
ments and their natural resources, it is not sur- 
prising that both the artifacts themselves and also 
significant constellations of artifacts should look, 
and be, quite different in non-European paleo- 
environmental situations (factor of typological de- 
scription ). 

Since most prehistorians’ impressions of the 
meaning of “Neolithic” seem to have been cast in 
a European image, one of two things appear to 
follow the farther we move from Europe. Either 
the word “Neolithic” is qualified with a bewildering 
variety of prefixes (e.g., “pre-Neolithic,” “proto: 
Neolithic,” “preceramic-Neolithic”) which have little 
or no bearing on the Childean usage (i.e., effective 
food production), or the Childean usage is im- 
plicitly (Arkell 1959, p. 237) or explicitly (Oklad- 
nikov 1962, p. 273) denied. In fact, Okladnikov 
believes that Childe’s “. . . so-called ‘neolithic revo- 
lution’ in the Near East should be assigned to the 
mesolithic” and, on the basis of his own Siberian 
materials, holds “. . . to the older view that neo- 
lithic culture, in contradistinction to the meso- 
lithic and paleolithic, is founded on the appearance 
of ground axes and pottery as well as of the bow 
and arrow tipped with a bifacially worked point. 
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Following Okladnikov, then, food production did 
not appear in Siberia until long after the beginning 
of the “Neolithic.” 

In the opinion of the writer, the word “Neo- 
lithic” has completely outlived its usefulness. What 
follows is a brief consideration of present knowl- 
edge about the earliest levels of the stage of effec- 
tive food production. 


The earliest levels of the food-producing stage 


The appearance of effective food production. 
Three trends in the over-all pattern of human bio- 
logical and cultural evolution have been particu- 
larly notable: increasing group size, increasing 
sociocultural complexity, and increasing perma- 
nency of settlement. Little reflection is needed to 
realize the direct bearing that the availability of a 
stable food supply and the artifactual means of 
assuring it have on these trends. Except under 
highly specialized environmental circumstances, 
moreover, man has seldom been a very efficient 
food gatherer or even intensive collector. Also, ex- 
cept under such highly specialized (and hence, 
relatively rare) environmental circumstances, food 
gatherers or collectors have not achieved any de- 
gree of what are normally referred to as “higher” 
sociocultural forms. With the appearance of a pro- 
duced food supply, based upon effective domestica- 
tion of plants and sometimes of animals, the 
situation may be otherwise. 

In fact, the record of man’s past is sufficiently 
clear to allow the generalization that human evo- 
lution (at least on its cultural side) may be di- 
vided into at least three grand techno-economic 
stages: a very long food-gathering and collecting 
stage, a relatively recent food-producing stage, and 
the present-day industrial stage. (There is also 
some tendency to consider a dawning stage of 
nuclear energy.) Passage from the first to the sec- 
ond stage was achieved independently more than 
once in different parts of the world, but in neither 
of the two instances for which we now have any 
degree of understanding did the event take place 
before approximately 10,000 B.C. ; 

The two instances of which we have some fair 
notion were centered in southwestern Asia and in 
Mesoamerica, and the independent appearances of 
food production in these two foci depended on the 
domestication of two quite peculiar and separate 
complexes of (1) plant-and-animal elements Or 
(2) primarily plant elements, and upon the devel- 
opment of artifactual adaptations to a life based 
on these complexes. It is also completely evident 
that both the southwest Asian and the Mesoameri- 
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can foci of appearance of food production immedi- 
ately served as centers of diffusion, from which 
the primary plant-and-animal or plant elements, 
and the respective artifactual responses to them, 
moved outward into regions still inhabited by food 
collectors. The exact cultural mechanics of these 
“movements” are not yet understood in either case, 
nor are the mechanics by which indigenous out- 
lying food collectors accepted or rejected portions 
of the new way of life. 

Without doubt, there was at least one or more 
other independent or semi-independent appear- 
ances of food production. Sub-Saharan Africa, 
southern or southeastern Asia, China, the Andean 
slopes, and Amazonia have all been named (cf. 
Braidwood & Willey 1962) as likely areas, but 
there is as yet far less surety of evidence or agree- 
ment among authorities in these cases. 

It is also quite clear that there has been no 
achievement of an urban and civilized way of life, 
in any meaningful sense of these words, save upon 
the basis of an effective level of food production. 
The appearances of the known ancient urban 
civilizations each followed several thousand years 
after the earliest appearances of effective food pro- 
duction. In fact, it is quite impossible to conceive 
of the sociocultural dimensions of urban civilization 
based solely upon food-gathering or collection. 

The background for food production. We are 
not concerned here with the over-all length of 
Pleistocene time and of man’s early appearance 
toward the beginning of this time. We need only 
note that by about 40,000 years ago, if not earlier, 
our fully modern single species, Homo sapiens, was 
spreading into all parts of the world where human 
inhabitation was possible. This spread depended 
largely upon man's increasing capacity for culture, 
including the acceleration of a long trend toward 
increasing dependence upon artifactual (i.e., cul- 
tural) rather than immediately biological means of 
adapting to ever greater varieties of environmental 
circumstances. A probable concomitant of this 
trend seems often to have been an increasing tend- 
ency toward sedentism, or “in-settling,” and more 
experimentation with an increasing number of 
elements in the total resource spectrum of any 
given environmental niche (cf. Braidwood 1960a; 
1960b). 

Following about 15,000 B.C., the most spectacu- 
lar known responses to this general trend—still 
within the general stage of food-gathering, but at 
a rather highly intensified level of specialized hunt- 
ing and collecting—are those of the French Dor- 
dogne and of Franco-Cantabria. In these regions 
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appeared the brilliant “upper paleolithic” cave art, 
one cultural manifestation of peoples whose ways 
of life were based upon their highly successful pro- 
cedures for hunting the great gregarious animals 
of that particular paleoenvironment. Another type 
of adaptation had appeared at least as early in 
central eastern Europe and beyond, and involved 
open-air “in-settling” based on mammoth hunting 
(cf. Klima 1962). 

On the same time horizon, however, we may se- 
lect for contrast the far less spectacular assem- 
blages from the Zagros mountain flanks in south- 
western Asia, The animals hunted here were far 
smaller and more elusive than those pictorialized 
in the Franco-Cantabrian art: they included wild 
sheep, goat, and pig, all species that were poten- 
tially domesticable. Although the animals were 
undoubtedly still wild following about 15,000 B.c. 
and were clearly the objects of hunting pursuits 
alone, the archeological sites indicate a consider- 
able increase in the proportions of the bones of 
these particular animals (in contrast to those of 
deer, bear, onager, and other available, but per- 
sistently wild, forms). Moreover, from what the 
immediately following phases show in several sub- 
regions of southwestern Asia, it would appear that 
certain wild cereal grains must also have been 
attracting men’s (or women’s?) attentions at this 
time. 

Natural habitat zones. We appear to be dealing 
with the following situation, Certain regions of the 
world seem to have served as “natural habitat 
zones” for certain clusters of potentially domesti- 
cable plant-and-animal or primarily plant forms 
while these were still in their wild states. It is cer- 
tainly possible that more such “natural habitat 
zones” existed than man took advantage of. To be- 
come an effective “natural habitat zone” for our 
purposes, the “potential domesticates” of a region 
would have had to be domesticated. This did hap- 
pen in southwestern Asia and quite separately in 
Mesoamerica and, as noted above, may also have 
happened independently or semi-independently in 
several other regions. 

It has always been exasperatingly difficult, ar- 
cheologically, to point to the very beginnings of 
food production. The moment of beginning—of the 
first manipulation of a plant or animal form in a 
direction that would lead specifically to domestica- 
tion—would hardly be apparent to us either in the 
morphologies of the plant or animal forms (if by 
great fortune traces of these should be recovered), 
or by an artifactual response to domestication. 
Also, unfortunately, beginning artifactual “re- 
sponses” are seldom exclusive or specific; a hoe- 


like tool was easy to devise, but might it not have 
been devised for collecting edible roots rather than 
for conscious planting? A very efficient flint blade 
for a sickle was easily knapped, but was it neces- 
sarily used on plants that were consciously planted 
and cultivated, or simply on wild ones? 

The level of incipience. Following an explicit 
suggestion made by Julian Steward (1949), the 
notion of a level of incipient cultivation and domes- 
tication has come into some use and is seen as the 
first level or phase of the food-producing stage 
(Braidwood 1960a). It is not supposed that this 
level resulted in “self-sufficiency” based on pro- 
duced food; in fact, the great bulk of the diet of 
people living at this level must have resulted from 
intensified hunting and collecting. The idea of a 
level of incipient cultivation and domestication has 
been applied in considerations of both the south- 
west Asian and Mesoamerican evidence, as well as 
of the bits of evidence that are available from other 
natural habitat zones. It is maintained (Braidwood 
& Willey 1962, especially pp. 342 ff.) that the 
presently available evidence suggests that: 

(1) The important natural habitat zones seem 
to have been in semiarid regions (of temperate to 
tropical latitudes) of some subregional diversity 
and without an overabundance of collectible foods. 
However, the possibility that there were independ- 
ent or semi-independent beginnings of tropical 
forest “vegeculture” is not ruled out. 

(2) There is no binding reason to see the in- 
cipience of food production as the result of marked 
degrees of climatic or environmental change. 

(3) The incipient level of food production was 
probably bound to the environment of its particular 
natural habitat zone, with neither its domesticates 
nor the preliminary artifactual responses to them 
seen as viable elements for extrazonal diffusion. 

(4) Incipience seems to have appeared within 2 
context of some degree of pre-existing “in-settling. 

(5) The trend from generalized food-collecting 
to incipient cultivation and domestication appears 
to have been a slow process. 

(6) Incipience does not seem to have been at- 
tended by sudden or marked changes in culture. 

Diversity within the natural habitat zones. As 
field work progresses, it is probable that the 
matter of subregional diversity within a given 
natural habitat zone will be seen as increasing in 
importance. Braidwood’s (1952) older notion of 
the “hilly flanks of the Crescent,” as a unitary 
locale of the natural habitat zone in southwestern 
Asia, has needed revision and expansion (Braid- 
wood 1962), and the desiccation and oasis—riverine 
propinquity theory of the origins of food production 
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(cf. Childe 1936, p. 86; Toynbee [1934-1939] 
1947, p. 69) is now suspect. The sheep-goat-pig, 
emmer-einkorn wheat, two-row barley complex in 
the Zagros flanks subregion was certainly present 
long before approximately 7000 B.C., when we 
first know it from archeological context. There is a 
suggestion that sheep may have been domesticated 
two thousand years earlier, although claims for 
cultivated cereals on the basis of a restricted pollen 
count (Solecki 1963) are not so widely accepted. 
The Natufian culture of Palestine, which on arti- 
factual grounds “looks as if” it ought to have been 
based upon incipient food production, has as yet 
yielded no primary evidence of domesticated plant 
or animal elements, and Perrot rejects his earlier 
claims (1962) that they should have been there. 
The domesticated goat is only questionably evi- 
denced in the so-called “Pre-Pottery Neolithic B” of 
Tell es Sultan (Jericho), but despite the size and 
artifactual complexity of this site, there is an 
opinion (Zeuner 1958) that cultivation did not yet 
obtain in its earliest levels. 

It is likewise becoming clear that consideration 
will have to be given to subregional diversities in 
Mesoamerica. Willey's (1962, p. 88) useful résumé 
of the long development of incipient cultivation 
following about 7000 B.c. in Tamaulipas must now 
be supplemented with MacNeish’s report of work 
in the Tehuacán valley (Phillips Academy 1962). 
In Tamaulipas plant-food collection was gradually 
replaced by increasing dependence on domesticated 
pumpkin, pepper, and beans, with a small type of 
maize (similar to that from Bat Cave, New Mex- 
ico), appearing about 2000 B.C. In Tehuacan, a 
more primitive variety of maize seems to have ap- 
peared almost two thousand years earlier in a com- 
plex that already included squash and pepper, 
gourds, amaranth, and possibly beans. MacNeish 
is of the opinion that full-time agriculture with 
semipermanent village settlements was present in 
Tehuacán after 3000 B.C. 

The primary village-farming community level. 
Unfortunately, it is not yet possible to point to a 
clear and uninterrupted sequence, on a single site, 
in which a progression from the developing phases 
of incipience into those of “self-sufficing” food pro- 
duction may be seen. In this sense, Mesoamerica 
(on present archeological evidence) appears to 
have proceeded somewhat more slowly and evenly 
than did southwestern Asia. In both regions, how- 
ever, with a modest degree of typological extrapo- 
lation from one generalized assemblage to a next 
succeeding one, we arrive at a broad picture of a 
second level of early food production. Direct and 
ample evidence of the plant-and-animal or pri- 
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marily plant domesticates is associated with an 
‘increasingly impressive inventory of artifacts, 


which reasonably imply means and procedures 
adapted to producing food. 

It would be within the development of successive 
phases of this primary village-farming community 
level that we sense quite new sociocultural dimen- 
sions. With a more stable food supply, population 
growth is indicated by architectural evidence of 
further increase in size and number of sedentary 
communities. Specialized artifacts, indicating ef- 
forts at carpentry, boatbuilding, housebuilding, 
weaving, and the pyrotechnics of pottery produc- 
tion, hint at (1) the gradual rise of specialized 
crafts, (2) some increasing degree of division of 
labor, and (3) freedom of some of the community 
from the immediate activities of the quest for food. 
A gradual shift in religious focus is implied by 
items reasonably interpreted as having some role in 
assuring, among other things, the increase of fer- 
tility. It would be impossible, however, in the 
present state of our knowledge, to generalize on the 
details of the expanding sociocultural complexity 
of the level, but it is also quite apparent that a 
marked expansion must have obtained. 

The spread of primary food production. There 
is ample evidence to indicate that the advancing 
phases of the primary food-production level served 
as viable planes from which extraregional diffusion 
might take place. It may be that a critical factor 
here was the appearance of mutant forms of the 
plant-and-animal or primarily plant domesticates 
that would tolerate new environmental circum- 
stances. As noted above, the exact mechanics of 
such diffusion of either the primary plant and/or 
animal domesticates, and of the artifacts and pro- 
cedures necessary to make them effective, are not 
yet fully understood. Childe’s (1953) account of 
the spread of food production into Europe may be 
supplemented by the accounts given by several 
authors in the Courses Toward Urban Life sym- 
posium (Braidwood & Willey 1962), The latter also 
contains résumés of present knowledge regarding 
domestication in sub-Saharan Africa, India, China, 
and the New World. 

It would probably be fair to say that by about 
4000 B.c. on the alluvial plain of classic southern 
Mesopotamia, and perhaps by about 1000 B.C. in 
Mesoamerica, the primary level of food production 
was superseded by a further level of at least incipi- 
ent urbanization. On the other hand, a few remote 
groups of even present-day peoples in their pristine 
conditions, such as the Eskimo or the Kalahari 
Bushmen, have not yet entered the level of primary 
food production. It is anticipated that future re- 
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search on the origins and early development of 
food production will expand its concerns to include 
the other possibilities of independent or semi- 
independent achievement of the stage. Further, 
what were the mechanics of cultural diffusion, mix- 
ing, and change, and of acceptance and rejection, 
as the stage proceeded? Only through archeological 
means can problems bearing on long-range socio- 
cultural development and change be approached. 


ROBERT J, BRAIDWOOD 
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n 
ANIMAL DOMESTICATION 


The term “domestication” as applied to the ani- 
mal kingdom is commonly defined as that condition 
in which animals of one species continue to breed 
freely under conditions of captivity by another. 
Domestication involves the establishment of a 
symbiotic relationship from which both species de- 
rive some benefit. As practiced by man domestica- 
tion profoundly affects the ecology and social habits 
of the animal species as well as man’s own cultural 
patterns. 

From a zoological viewpoint, domestication by 
Homo sapiens is viewed as “that condition wherein 
the breeding, care and feeding of animals are, to 
some degree, subject to continuous control by 
man” (Hale 1962, p. 21). Such control, whether 
loosely or intensively exercised, induces changes in 
the morphology, physiology, and behavior of the 
animals concerned. The nature of these changes 
has been the subject of study since the time of 
Darwin in the middle nineteenth century, with an 
emphasis first on skeletal morphology and, more 
recently, on behavior (Hafez 1962). Such studies 
make it clear that the cumulative changes brought 
about by domestication “serve to shift the species 
to a new adaptive peak, characterized by the domes- 
tic habitat, and make it unsuitable for independent 
existence in nature” (Hale 1962, p. 50). Reed con- 
cludes that “in a very important sense, then, 
domestic animals (as well as plants) are a type of 
human artifact, since they exist in a form changed 
by man” (1959, p. 1638, note 7). = 

On the other hand, from the viewpoint of social 
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science, the establishment and practice of domesti- 
cation are significant for human society. The suc- 
cessful performance of the activities of domestica- 
tion imposes requirements upon a society which 
affect both its social and economic structure and 
its system of symbols and values. At the same time 
the impact of the community upon the landscape 
is greatly intensified through the destruction of 
natural vegetation by grazing, through the opening 
up of new areas for pastoral exploitation, and be- 
cause of the increased ability to travel and trans- 
port goods. Maintenance of human control over an 
animal species for several generations usually leads 
to dependence upon the animals for cultural well- 
being. The animals frequently become an integral 
part of the human habitation area, at least for part 
of the time, Such intimate accommodation and 
control are justified by the fact that the animals 
fulfill one or more roles in the cultural pattern. 
Most commonly these roles include serving as a 
source of food, fuel, power, or raw materials. 
Equally important, however, are the roles the ani- 
mals play as symbols of wealth, prestige, or re- 
ligious belief, as sources of aesthetic pleasure, and 
as a means of aggression and defense. Dependent 
upon the specific role or roles a given species plays 
in a particular society are multiple secondary 
effects concerning settlement pattern, architecture, 
and equipment, as well as the value placed on the 
animals and the elaboration of rules governing 
property rights concerning them. In addition, there 
is the important role that animals play in the verbal 
symbolism expressed in myths, songs, and the 
vocabulary of the society possessing them. 

In spite of the fundamental importance of ani- 
mal domestication to societies based upon food- 
producing economies, the study of the initial stages 
of domestication has lagged. During the early 
twentieth century the problem was discussed more 
or less independently by zoologists studying mor- 
phology without benefit of population genetics and 
by ethnologists working largely without supporting 
archeological data. The latter tried to reconstruct 
prehistoric events from observation of contempo- 
rary nonliterate peoples. The best known of these 
efforts is probably the Hi irtenkulturkreis hypothesis 
formulated by Wilhelm Schmidt of the Vienna 
school. According to this hypothesis domestication 
of the horse and reindeer preceded that of other 
animals, thus permitting pastoral nomads to pro- 
vide the organizational impetus leading to the 
higher civilizations. With more intensive archeo- 
logical field work and the growth of genetic studies 
in zoology, these earlier theoretical formulations 
have been largely discredited. Since 1950 there has 
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been a reassessment of the available evidence and 
the initiation by archeologists and zoologists of 
cooperative research aimed at the recovery of evi- 
dence specifically relating to the question of early do- 
mestication (Fiirer-Haimendorf 1955; Hale 1962; 
Reed 1960; Zeuner 1963). 


Archeological evidence 


Archeological data concerning domestication 
have been limited and difficult to interpret. The 
evidence is both direct and indirect, consisting of 
animal bones and artifacts. Of the latter, repre- 
sentational art is a major category of information 
and has been studied in the Near East and north 
Africa. Such art is often difficult to date, and its 
conventionalized style often makes specific identi- 
fication of the animals portrayed almost impossible. 
There is also the question of whether the animal 
depicted is wild or domestic. Indirect evidence— 
such as harnesses, bits, rein rings, or similar kinds 
of equipment, and refuse such as animal dung and 
straw used in plaster or to temper pottery, and ash 
from dung fires—may occasionally shed light on 
the question of domestication. 

Animal bones themselves, with their altered 
morphology, provide a better source of evidence. 
But the use of this evidence raises complicated 
problems of identification and interpretation. First 
and foremost is the problem posed by the lack of 
adequate comparative collections of bones of the 
wild ancestors. Second, there is the mass of poorly 
documented and misidentified materials from older 
archeological excavations. This mass has been 
critically reviewed for the Near East (Reed 1960) 
and reduced to a small residue of reliable identi- 
fications. In most instances these latter materials 
date from several thousand years after domestica- 
tion must have taken place and after extensive 
morphological changes had had time to establish 
themselves. Such changes often have not occurred 
to any extent in the earliest remains, and the study 
of morphology alone is unable therefore to estab- 
lish the fact of domestication. Before an interpreta- 
tion can be made for these earliest periods, a com- 
bination of morphological and statistical evidence 
(such as the relative abundance of species, the 
relative number of animals of different age groups 
within a species, and the degree of variability be- 
tween populations) must be taken into account, 
along with stratigraphic position and general cul- 
tural level. This approach attempts to establish the 
presence of domestication by demonstrating a sta- 
tistical shift from a general paleolithic hunting 
pattern, exhibiting a random killing of individuals 
of various ages in a variety of species, to patterns 
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of selective killing of yearlings within one or two 
species. It is inferred that the ability to select a 
single age group from a single species, almost to 
the exclusion of other animals, reflects sufficient 
human control over the animal population to war- 
rant use of the term “domestication” (Reed 1962). 
Preliminary application of this approach to ma- 
terials from several sites in the Near East suggests 
that the final results will prove highly significant. 

Origins. Theoretical considerations dealing with 
the problem of how domestication began have 
focused on the environment (Childe 1928), con- 
scious human behavior (Sauer 1952), or uncon- 
scious human behavior and animal behavior 
(Zeuner 1963) as the major factors. The environ- 
mental argument has sometimes been called the 
“riverine—oasis” or “propinquity” theory. It presup- 
poses a continuous period of desiccation in south- 
west Asia and northern Africa from the end of the 
Pleistocene to the present. According to this theory, 
human and animal inhabitants of these areas were 
gradually forced together in river valleys and oases, 
the end result being domestication. The theory may 
be discounted on both ecological and paleoclimato- 
logical grounds. Recent studies by geologists in the 
Near East show that snow lines and floral zones, 
depressed during the height of the last glaciation, 
have gradually shifted upward to their present loca- 
tions. Since animals occupy specific habitat niches, 
they either move where those habitats move or they 
become extinct; they do not cross over into other 
niches and crowd together. In Africa, where desic- 
cation has occurred, many of these niches un- 
doubtedly became extinct, along with the animals 
which occupied them, but there is no evidence that 
they ever crowded into oases. It should be noted 
that desiccation in Africa was interrupted by 
periods of wetter conditions and was not a single 
event. According to present evidence the appear- 
ance of domestic animals in the area seems to have 
occurred during one of these wet rather than dry 
phases. 

Arguments purporting to identify a human mo- 
tive for domestication are more difficult to deal 
with, since they are largely logical inferences which 
are not susceptible to scientific proof. They remain, 
therefore, always in the realm of speculation. It 
seems probable that domestication occurs as a re- 
sult of human and animal behavioral interaction. 
It is apparent that all hunting-and-gathering peo- 
ples of the present day possess extensive knowledge 
of the habits of wild animals as a necessity for, as 
well as a by-product of, their hunting activities. It 
is reasonable to infer that similar knowledge was 
available to late paleolithic hunters. Given this 


knowledge, there is general agreement between 
ethnologists and zoologists that, whether or not it 
was consciously aimed at domestication, the act of 
taming (that is, the elimination of the tendency to 
flee from man) must have been the initial mech- 
anism by which animals were brought into the 
human cultural environment. Since taming can be 
achieved most effectively with young animals dur- 
ing their critical periods of socialization or im- 
printing, it is thought that the raising of such young 
animals by human beings produced domesticated 
adults. In some instances it is possible that certain 
wild adults with short flight distances relative to 
man may have become partially self-domesticated 
when tolerated in the area by human beings, Other- 
wise it would have been necessary to feed infant 
animals. The custom of using a human wet nurse, 
observed among some contemporary nonliterate 
groups, suggests itself as a possible method. Sauer 
(1952) has suggested a religious motivation in- 
volving totemic animals in this process, but there 
is no agreement that such is the case and no evi- 
dence either for or against such an assertion. Aside 
from arguments attributing domestication to re- 
ligious or other purposeful behavior on the part of 
man, there is a widely shared belief that agricul- 
ture was a necessary precondition to domestication 
(Childe 1951; Reed 1959; Sauer 1952; Zeuner 
1963). The idea has long been suggested in rela- 
tion to cattle and is accepted in that context by 
Zeuner (1963). Reed (1959) extends the thesis to 
cover all four of the basic food animals—sheep, 
goats, pigs, and cattle—pointing out that they 
occur only in the context of more or less settled 
agricultural villages. His point is that such villages 
would have reached a level of organization com- 
plex enough to meet the requirements imposed by 
the keeping of animals. It must be stressed, how- 
ever, that it was still uncertain in the mid-1960s 
that the very earliest context proposed for domestic 
sheep, Zawi Chemi Shanidar in northern Iraq, was 
in fact an agricultural settlement. 


Biological considerations 

Certain biological factors have been stressed as 
fundamental to the establishment of domestication. 
Only those animals exhibiting characteristics favor- 
able to domestication were potentially adaptable. 
These factors include patterns of sexual behavior, 
social structure, parent—young interaction, response 
to man, and characteristic foods and habitat. 
Ability to satisfy dietary needs while in man’s en- 
vironment, or to live on the by-products of his agri- 
culture, was essential to complete domestication. 
Domestication became possible only because the 
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animals involved were already preadapted socially 
and psychologically to being tamed without loss of 
reproductive ability. Observable changes in the be- 
havior of animals that have become domesticated 
are nevertheless profound. Reed (1959) points out 
that while conscious selection probably occurred 
rather late, unconscious selection must have previ- 
ously led to the establishment of desirable char- 
acteristics, such as submissiveness. 

Since the domestication of animals can have 
taken place only in those areas in which the an- 
cestors of the domesticated animals lived, large 
areas of the earth’s surface are automatically elimi- 
nated from consideration. Thus, large areas of 
southern Africa, northern Asia, Australia, and 
Oceania are ruled out. Significantly, the major cen- 
ters of domestication seem to correspond to the 
more important centers of agriculture: the Near 
East and Egypt (seed cultivation—domestication of 
dogs, sheep, goats, pigs, cattle); southeast Asia (veg- 
etative planting—chickens, ducks, geese, dogs, pigs, 
certain types of cattle); Peru (vegetative planting 
—llamas, alpacas, guinea pigs, muscovy ducks); 
Mexico (seed cultivation—turkeys). The failure to 
domesticate any of the herd animals of North 
America may have been due to the lateness of the 
development of agriculture in that area. In no world 
area is there evidence of the domestication of any 
animal before the end of the Pleistocene around 
10,000 B.c. It is important to bear in mind that 
while we are able to discuss the domestication of 
specific animals in the Near East at a very early 
date, this fact does not rule out the probability that 
animals of the same or related species were do- 
mesticated at different times in other areas. Zeuner 
is of the opinion that wherever socially preadapted 
animals lived near a human society that had 
reached a certain degree of cultural complexity, 
domestication would have followed “almost auto- 
matically.” In no case, however, is it thought that 
any purposeful behavior by primitive man occurred 
with the aim of domesticating an animal, except 
in very late times, where the idea was copied from 
existing practices—as perhaps in the case of the 
reindeer. Least of all would such action have been 
taken to obtain products such as wool or fat, which 
are largely secondary products of domestication 
itself. 

Zeuner hypothesizes a progressive order through 
feeding, taming, and domestication. In the Near 
East, where the record is most clear, archeological 
evidence suggests that the scavenger dog was the 
first to be domesticated; second, the nomadic sheep 
venger pig; and fourth and 


and goat; third, the sca 
finally, cattle, domesticated only when culture was 
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effectively established. The position of the dog at 
the head of the list is based largely on theory, since 
its earliest-known remains are in fact found only 
in the seventh millennium B.C. in pre-pottery neo- 
lithic Jericho (the Natufian dog having been identi- 
fied as a wolf) and in the Maglemosian culture of 
northern Europe at about the same time. 


History of domestication 


The oldest animal of a domestic status for which 
there is actual evidence seems to be the domestic 
sheep, which is descended from the wild sheep, 
Ovis orientalis, found in the mountains of Turkey 
and Iran. Perkins (Reed 1962), on the basis of sta- 
tistics, thinks it probable that the sheep was already 
domesticated around 9000 B.C. at Zawi Chemi 
Shanidar in northern Iraq. At this site roughly 60 
per cent of the sheep bones were from yearlings, 
and the majority of bones found were of sheep; this 
is in contrast to nearby Shanidar Cave in which, in 
the late paleolithic levels, the bones of yearling 
sheep made up only about 25 per cent of the total 
number of sheep bones, which in turn made up 
only about 33 per cent of the total number of ani- 
mal bones. Thus a dramatic shift occurred with 
the settlement of the open-air site of Zawi Chemi 
Shanidar, a shift which suggests an early stage of 
domestication, Somewhat later, around 6500 B.C., 
domestic goats descended from the wild Capra 
hircus aegagrus, found in the same general area, 
make their appearance. Bones of the domestic goat, 
identified as such largely by the flattened medial 
surface of the horn cores, have been found at 
Jericho (Jordan), Jarmo (Iraq), and Guran and 
Ali Kosh (Iran) from about this time. The goat 
seems to have been the first domestic food ani- 
mal to have become widespread, moving from 
the highlands into the lowlands of Mesopotamia 
and Egypt and on into Africa, Asia, and Europe. 
Several centuries later, perhaps around 6000 B.C., 
the domestic pig was introduced to Jarmo from 
some as yet unidentified area. The domestic status 
of the pig has been established through the mor- 
phology of its teeth, which differ from those of wild 
species presently in the area. While the pigs of 
southeast Asia generally are descendants of Sus 
vittatus, the native wild pig of those areas, the Near 
Eastern and European animals appear to be de- 
scended from Sus scrofa, the local wild boar. The 
appearance of domestic cattle, on present evidence, 
occurs somewhat later, around 5500 B.C., at Tepe 
Sabz in southwestern Iran (Hole et al. 1965) and 
at Halafian sites such as Banahilk in Iraq. It is per- 
haps significant that these sites occur in the 
Assyrian steppe zone bordering the Fertile Crescent. 


254 DOMINANCE 


It may be noted that at this period ox horns were 
in general use at Catal Hiiyiik (south-central 
Turkey) in religious shrines, but as yet there is no 
evidence that they were obtained from domestic 
animals, At an even later date in Mesopotamia an 
attempt was made to domesticate the local wild 
onager (as documented in bones found at Tell 
Asmar); but by 2000 B.c., when the true horse was 
introduced from southern Russia, this attempt had 
been given up. The horse appears to have been 
domesticated in Russia about a millennium earlier. 
Fragmentary evidence suggests that it was at about 
this time that the dromedary camel was domesti- 
cated in Arabia. The ass or donkey, previously the 
main beast of burden, appears to have originated 
in Egypt in predynastic times. 

The four major food animals, together with 
wheat and barley, formed the basic economic com- 
plex underlying the higher civilizations of the Near 
East. From here various selected elements of this 
complex diffused to cultural groups in adjacent 
temperate grassland regions, Adoption of the tech- 
niques of domestication meant the constant cross- 
breeding of animal stock with local wild relatives 
of the same species, and with the occasional sub- 
stitution of other local species. In general, before 
domestication of the horse facilitated the develop- 
ment of full pastoral nomadism, these cultural 
groups probably practiced a mixed economy with a 
few upland districts specializing in local animal 
herding. In the first millennium B.C. there is some 
evidence that the practice of domestication may 
have been borrowed from sheep and cattle raisers 
and applied to reindeer in the Minusinsk and Altai 
highlands. Thus, transport animals appear to have 
been domesticated generally later than food ani- 
mals, and animals in marginal nonagricultural 
areas later still. 

A number of other animals have been domesti- 
cated for special uses, such as the elephant, the 
ferret, etc., but little is known about them archeo- 
logically. Zeuner (1963) gives the best summary 
in English with an extensive bibliography. 


RoBERT H. Dyson, Jr. 


[See also Pastoratism.] 
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DOUHET, GIULIO 


General Giulio Douhet, Italian military theorist, 
was born in Caserta in 1869 and died in Rome in 
1930. While he was a young career officer he 
studied the application of physics—especially elec- 
tricity and low-temperature physics—to military 
problems. He wrote essays on these subjects, on 
the problems arising from the motorization of the 
army, and on the tactical lessons to be drawn from 
the Russo—Japanese War. 

From 1908 on, he devoted himself, as a student 
of strategy, to the study of the military implications 


of developments in aeronautics. As an active officer 
in the Italian army, he hoped to make his influence 
felt in the arming of the Italian forces, and after 
his appointment as commanding officer of the air 
battalion in Turin, it seemed that Douhet might 
have an opportunity to put his ideas into practice. 
He was convinced by his studies and by his ex- 
perience in the Italo-Turkish War of 1911-1912 
that the airplane was far superior to the dirigible 
as an offensive weapon. This idea was reinforced 
by his work with Gianni Caproni, the aeronautical 
engineer who designed and built the first biplane. 

Douhet’s vigorously expressed views led to dis- 
agreement with the military establishment and 
with the Italian general staff. He was removed from 
his command and received an assignment unrelated 
to his professional specialization. He was even sub- 
jected to two official investigations, which con- 
cluded quite favorably for him, but he was so 
embittered that he thought of resigning from active 
service. He retracted this decision only because of 
the outbreak of World War 1. 

During the war he frequently criticized the 
Italian high command and the Allied direction of 
the war. His dissent was mainly centered upon the 
value of offensive actions, and he continued to 
advocate use of the airplane. The nucleus of his 
criticism (which he put in writing at the time, but 
which was not, of course, printed until after the 
war) was that the strategy of the Western Allies was 
basically wrong because it insisted on partial of- 
fensives, which are exceedingly costly in men and 
weapons and cannot produce a strategic success. 
He maintained that in land warfare the defensive 
side is in a far stronger position than the attacking 
side because of the progress in defensive weapons 
and ground defense systems. Thus, the Allied 
powers should conduct the war defensively, with a 
minimal consumption of men, and should launch 
an offensive only after having built up the over- 
whelming armed strength that their superior in- 
dustrial capacity permitted them. This offensive 
action should be carried out by airplanes, and 
Douhet insisted again and again that a powerful 
fleet of bombers be built and used autonomously. 
This air force should have as its targets not the 
enemy forces but the supply centers and the lines 
of communication in order to paralyze the enemy 
military system. F. 

On the demand of a member of the Italian 
Cabinet he summarized his criticism of the Italian 
high command in a memorandum. The document 
was discovered by the military authorities, and 
Douhet was court-martialed and condemned to a 
year of confinement. After he served the sentence, 
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the new Italian commander in chief, General 
Armando Diaz, appointed him director of aeronau- 
tics in the defense ministry. In this capacity he 
was, toward the end of the war, again able to de- 
vote himself to his chosen field of action. After his 
full rehabilitation, which did not take place until 
1920, Douhet tried to find employment again, even 
in a subordinate capacity, in the air force. He was 
unsuccessful and felt doomed to uselessness. 

Douhet’s theories, although to a degree based on 
his experiences in World War 1, had already been 
outlined in some of his prewar writings (e.g. 
1910). In these articles Douhet insisted on the au- 
tonomous role of the air force, then considered as 
a mere auxiliary of the army and the navy. 

The main argument of Douhet’s theory is that 
the offensive power of an air fleet of bombers is 
the most formidable instrument of war a country 
can have; that against such a force fighters are 
ineffectual, so that it is preferable to build bombers, 
which also have fighting capability; and that the 
very first aim of a fleet of bombers is to destroy the 
enemy air force on the ground in order to achieve 
air supremacy. Once this has been achieved, a 
country has to make every effort to strike the 
severest possible blow against the enemy and to 
inflict upon him the greatest possible damage. 
Road junctions, supply and production centers, in- 
deed the cities of the enemy, should be the targets, 
so that the people will be panic-stricken and ask for 
an end to the war. Therefore it is more important 
to inflict destruction on the enemy than to try to 
avoid devastation of one’s own territory. 

Douhet’s thinking was original and bore no rela- 
tionship to any previous strategic conception. In- 
deed, it very often contradicted well-established 
doctrines; this fact and the zeal of Douhet—a man 
of great intellectual and moral probity—repeatedly 
led him into trouble. 

Douhet made a major contribution to the theory 
of air warfare. In the United States, his ideas 
undoubtedly influenced General “Billy” Mitchell. 
It has been remarked that during World War u, the 
very war envisaged in one of Douhet’s works, “The 
War of 19—” (1930), some of his assumptions 
were proved wrong. Bombers had no decisive effect, 
fighters proved to be a very effective war instru- 
ment, especially in the Battle of Britain, and land 
forces were able to penetrate deeply into the 
enemy's territory and possessed a very real of- 
fensive capability. This criticism is certainly justi- 
fied as far as World War 1 is concerned. In 
particular, it has been correctly observed that 
Douhet exaggerated the power of conventional ex- 
plosives. On the other hand, there is no doubt that 
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many of his ideas seem to be more applicable to our 
nuclear age: one may detect a first outline, in his 
theories, of the counter-force and counter-city 
strategy. It may be of some interest to add that, 
in a passage in his main work, The Command of 
the Air (1921), Douhet envisaged the possibility 
that such a formidable instrument of war as an 
air force might serve an international authority in 
enforcing the observance of existing treaties. 


Fausto BACCHETTI 


[For the historical context of Douhet’s work, see STRAT- 
EGY; War; and the biographies of CLAUSEWITZ; 
Manan.] 


WORKS BY DOUHET 

1910 La possibilità dell’aereonavigazione. Rivista delle 
communicazioni [1910]: 1303-1319. 

(1921) 1942 The Command of the Air. New York; Cow- 
ard-McCann. > First published as I! dominio dell'aria. 

1921-1922 Diario critico di guerra, 2 vols. Turin (Italy): 
Paravia. 

(1930) 1942 The War of 19—. Pages 293-394 in Giulio 
Douhet, The Command of the Air. New York: Coward- 
McCann. > First published in the Rivista aeronautica 
as “La guerra aerea del 19—.” 

1951 Scritti inediti. Edited by Antonio Monti. Florence 
(Italy): Scuola di Guerro Aerea: > Published posthu- 
mously. 


SUPPLEMENTARY BIBLIOGRAPHY 

Bronie, BERNARD 1959 The Heritage of Douhet. Pages 
71-106 in Bernard Brodie, Strategy in the Missile Age. 
Princeton Univ. Press. 

EARLE, Epwarp M. 1943 Douhet, Mitchell, Seversky: 
Theories of Air Warfare. Pages 485-503 in Edward 
M. Earle, Makers of Modern Strategy: Military Thought 
From Machiavelli to Hitler. Princeton Univ. Press. 

Sicaup, Louis A, 1941 Douhet and Aerial Warfare. New 
York: Putnam, 

VauTHIER, ARSENE Mar PauL 1935 La doctrine de 
guerre du Général Douhet. Paris: Berger-Levrault. 


DRAMA 


There is a curious contradiction within theater 
arts: nothing of man originates from deeper or 
more hidden sources, and nothing surfaces to a 
more flamboyantly exposed and lasting arena for 
observation. Refracted as art may be through the 
prisms of a nation’s culture, customs, habits, and 
manners, still and all a vital pulse of universality 
beats. This is theater’s potential for evoking emo- 
tional response. Dance and drama depict moods 
and ideas, instincts even, of so private and yet so 
public a nature that no other aspect of civilized 
life seems as revelatory or explanatory. Artifacts 
and art works, of course, visually record the de- 
tails of an ancient civilization. For example, the 
poses of the dance figurines excavated from the 
site of the Indus civilization of three thousand 


years ago demonstrate graphically the stylistic con- 
tinuities between the pre-Vedic and present-day 
Indian dance, but the life of the past returns only 
when theater preserves it, no matter how faintly. 
Recorded history tells of vanished events, but in 
the drama, history becomes a living representa- 
tion. The workings of the inner mind that created 
the past and breathed in its very atmosphere emerge 
clear and true. Poetic truth, rather than logic, de- 
termines men’s actions. Our attention is held by 
considering how man feels in the midst of events 
without requiring the dry dispassion of a scholar's 
interest. Indeed, for all its license, theater gives 
us true value, as well as face value, because it 
deals with impelling forces, not surfaces. It is not 
without reason that science to a greater or lesser 
extent has always followed, rather than preceded, 
the artist's intuitive dictate or that, so often, the 
scientist is the artist, 

The dramas of ancient Greece are now in des- 
uetude: the language dead, the choral declamation 
unknown, the music lost, and the dance gestures 
forgotten. Still, when remaining texts are staged 
they tell the story of the Greek past that transcends 
time. They give thoughts to cold marble statues; 
they inhabit the ruins with meaning and motive, 
with living yet dead people. The validity of theater 
lies in its power to span distances of time and 
space and to bridge the gaps in understanding be- 
tween peoples who may be either strangers or 
neighbors. Theater is and is not life. It embraces 
man’s fantasy. If man cannot live without fantasy, 
and yet if he must remain loyal to reality, then 
theater satisfies a human need. 

Freud, oddly enough, was the first interpretative 
critic to pinpoint the secret eternity of the Greek 
drama. By disclosing Oedipus’ double realization of 
father destruction and mother possession, Freud ex- 
posed more to drama than psychiatric content. He 
defined the potent wellspring within the human 
psyche without which art must appear as false. Be- 
fore psychology, it was the artist who intuitively 
tapped these sources. 

When dealing with the theater arts of antiquity 
one touches even deeper ur memories of man. One 
mystery is the permanent hold mudra, or gesture 
language, has kept over India’s dances and for- 
merly over Sanskrit drama. To the outsider seeing 
and hearing an Indian performance laden with 
song-words performed by the musicians and the 
dancer-actor, the language seems sufficiently ex- 
pressive of meaning. However, the performer si- 
multaneously acts with his or her hands. Some- 
times the hands illustrate the words of the song. 
At other times, in a linguistic counterpoint, the 
hands enlarge its sense or tell a different story. 


At still other times, the hands in a visual parono- 
masia will pun and countermand the musical 
words. Mudras are not limited to India. They 
spread from there throughout Asia, where they 
may still be seen, attenuated sometimes to little 
more than heightened gesture but never altogether 
absent. It is a general principle of Asian theater 
arts that hands speak while the body dances the 
mood of what the music suggests. Socrates, it is 
claimed, said to his pupils, “Speak so that I may 
see you.” An Asian artist might say, “Dance so that 
I may hear you.” Such is the linguistic interde- 
pendence between hand-formed words and sung 
words. 

Why have mudras figured so prominently in 
Indian art and, by extension, in Asian art? What 
is their hold, at once so redundant and duplica- 
tive? Of course, the gesture of hands forming pat- 
terns in the air and spinning meanings in space 
is a highly delectable aspect of aesthetics. Cer- 
tainly, it arouses pleasing responses. But it is the 
specific precision of the mudras that is so startling 
and so enduring. Mudras can be used by a deaf- 
mute, The mudras, in their very nature as sym- 
bolic communication, perform the functions of art. 

Theater arts have more tangible and instantly 
comprehensible importance as well. At the risk of 
sounding pretentious, one may say with fairness 
that theater houses its nation’s soul. Think, for a 
moment, of Chekhov and late nineteenth-century 
Russia, or of Shakespeare's England, or further 
afield from our experience, perhaps, of Japan's 
kabuki, where the noblest and most vicious atti- 
tudes of the past are enshrined. Chushingura, the 
tale of the 47 lordless knights, is a case in point. 
Not only is it a national theater piece in the sense 
of actual history embodying ideals through its 
dramatizations (its eleven acts take two days to 
perform), but it combines the realism and fantasy 
of grand theater. The drama is based on a sensa- 
tional event that happened in 1701: 47 loyal re- 
tainers bided one year before avenging the unjust 
self-immolation of their lord and then committed 
mass hara-kiri. Versions of the story appeared on 
the kabuki and puppet stages within weeks. The 
dramatic masterpiece, which endures to this day, 
appeared a few decades later, after the astonish- 
ment and the dust of controversy had settled. The 
annual revival of the drama, with its magnificent 
recital of patience, hardships endured, difficulties 
surmounted, and loyalty and duty opposed to heart 
and affection, perpetuates many of the values and 
attitudes of Japanese culture. $ 

The modern Japanese are not warriors, nor are 
they full of vengeance. They do not believe in 
eighteenth-century morality, and present society 
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could not conceivably permit the events immor- 
talized in the play of Chushingura. Stomachs are 
tenderer. But just as Americans in New York City 
weep as Chushingura unfolds its story, so do Jap- 
anese at their kabuki and puppet theaters. Why? 
Because a common chord of understanding is 
struck by such themes as human suffering and the 
righting of flagrant wrong. Thus, if one wishes to 
learn about another country, the theater serves to 
characterize cultural patterns. 

One historical axiom of drama seems to be that 
early theater portrayed the gods; then, in a later 
period, the theater portrayed kings and persons of 
high rank. Finally and lastly, it touched on the 
life of the common man. The theory behind this 
is clear. If theater enlarges life so as to appeal to 
large numbers rather than to few individuals, then 
surely it must deal with the most exalted themes 
—the gods. As time sometimes proves gods less 
than omnipotent, and the strong power of tem- 
poral rulers is keenly felt, it follows that relation- 
ships between rulers and men have greater conse- 
quence. For instance, the fall of a king or a 
kingdom is more significant than the collapse of 
the blacksmith in the next village. It is a real mark 
of theatrical evolution, however, when audiences 
perceive that depth of feeling does not and should 
not depend on the importance of the character or 
his station in life. Drama which treated the sor- 
rows and sufferings of common ordinary men with 
the same dignity as that of gods and kings and 
portrayed plights other than lost thrones, aban- 
doned queens, or vengeful gods, replaced earlier 
drama. 

In Asia, the progression from gods to kings 
to men is still present in its near-original state. 
India reflects in its theater the burden that religion 
presently imposes on other aspects of its culture. 
Many Indians have never seen a theatrical repre- 
sentation of anything other than the holy books 
Ramayana or Mahabharata, Granted that these are 
also epics and adventure stories, their intent as 
theater is spiritual elevation. An even greater num- 
ber of Indians have never heard a piece of music 
that was not a devotional song to God, even though 
they have seen modern Indian movies with their 
songs and dances. In China, that most irreligious 
of countries, the classical theater is caught in its 
history of kings and generals of the past. In the 
Peking operas, they vie and contend, intrigue and 
counterplot, all in dark tragedy and high bright 
humor and to the accompaniment of raucously 
beautiful orchestration. 

In the theatrically happier countries of China 
and Japan, it is possible to see the life of gods, 
kings, and the common man of the past as well 
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as the present, depending on which theater is 
selected. One of the marvels of Asia’s theatrical 
world is Japan. There a wise spectator can see 
dances 1,500 years old. He can attend elaborately 
staged performances (the Japanese invented the 
revolving stage and the trap door) by actors whose 
lineage and names go back 350 years. The play 
in which they appear may be an adaptation of a 
novel a thousand years old. Age, in itself and 
despite the cult of antiquity so fashionable in the 
West, proves nothing. But in the theater it does 
reveal the infinitely lasting values of morality and 
humanity glowing in the fires of aesthetic trans- 
mutation. A foreigner can weep with sincerity at 
a nō play in which the pivotal point of conflict is 
the murder of a bird. He responds to the truth, in 
this instance, of pity. 

In Japan, where modern theater was inspired 
and trained by Soviet Russia, one may see reac- 
tionary plays extolling the past even at the expense 
of modern progress. Theatrical politics is, of course, 
like a football kicked toward a variety of goals. 

For example, a good amount of the communist- 
encouraged modern theater in China deals with 
the abuses of the past, such as those of widow- 
hood or landlord cruelty. But there are also plays 
that express the turmoils and difficulties within the 
minds and hearts of present-day Chinese, as they 
come into conflict or agreement with communist 
ideals. 

One surprising fact appears. Society's immediate 
problems are not always to be found on the con- 
temporary stage. For example, the postwar diffi- 
culties between the Japanese and the American 
occupation forces were never revealed in the the- 
ater. At the same time, the Japanese theater has 
reached to the core of the social problems posed 
by Westernization without getting lost in transient 
details. A somewhat similar problem was dealt 
with in a different way in India before its inde- 
pendence. There, for lack of a genuinely thriving 
modern theater, dramatists turned to ancient his- 
tory to find the reflections or resonances that could 
convey the urgent message of anticolonialism. 
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DRAMATISM 
See under INTERACTION. 


DREAMS 


Dreaming is a unique form of behavior. It ordi- 
narily occurs only during sleep and may be the only 
psychological activity that does occur during sleep. 
It is involuntary and unintentional in the usual 
meaning of these words. Customarily, it is not 
accompanied by, and does not eventuate in, ap- 
propriate, relevant, or purposeful overt activity. 
Dreaming is expressed in the form of hallucinatory 
imagery that is predominantly visual and is often 
very vivid and lifelike in nature. It is this hallucina- 
tory experience that constitutes the dream. No other 
human experience seems to have excited so much 
interest or so much speculation regarding its cause. 

Until recently, the process of dreaming was not 
directly accessible to scientific investigation; no 
one knew how to tell when a person was dreaming. 
Nevertheless, one could study the product of dream- 
ing—the dream—when it was reported or described 
by the dreamer upon his awakening. Today, we 
can tell with almost complete certainty when a 
person is dreaming. The dream itself, however, still 
remains virtually inaccessible to direct investiga- 
tion and will remain inaccessible until the inven- 
tion of some means of transcribing the dream as it 
is taking place. Until that time, investigators must 
depend upon the dreamer to communicate his 
dream verbally or through some other medium. 
Studies of dreams, therefore, are actually studies 
of reported dreams. How much correspondence 
there is between the dream as reported and the 
dream as dreamed cannot, as yet, be determined. 
We know now that failure to recall a dream does 
not mean that the person has had a dreamless 
sleep. In fact, except under certain abnormal con- 
ditions, everyone dreams every night, having from 
four to six separate dreams. 


The process of dreaming 


Objective study of eye movements. The process 
of dreaming, as distinguished from the product, 


was made available for scientific study by the dis- 
covery of objective indicators of dreaming. The 
first of these was reported in 1953 by E. Aserinsky 
and N. Kleitman, who noted while observing sleep- 
ing subjects that bursts of rapid eye movements 
(REMs) occurred periodically during sleep. Kleit- 
man later described this fruitful discovery: 


In our laboratory at the University of Chicago we lit- 
erally stumbled on an objective method of studying 
dreaming while exploring eye motility in adults, after 
we found that in infants eye movements persisted for 
a time when all discernible body motility ceased. In- 
stead of direct inspection, as was done for infant eve 
movements, those of adult sleepers were recorded in- 
directly, to insure undisturbed sleep in the dark. By 
leads from two skin spots straddling the eye to an EEG 
machine, located in an adjacent room, it was possible 
to register potential differences whenever the eye 
moved in its socket. . . . By this method . . . slow eye 
movements were found to be related to general body 
motility. In addition, jerky rapid eye movements . . . , 
executed in only a fraction of a second and binocularly 
symmetrical, tended to occur in clusters for 5 to 60 
minutes several times during a single night’s sleep. In 
order to correlate the REMs with other concomitants, 
simultaneous recordings were made of changes in the 
sleepers’ EEG, pulse, and respiration. It was soon ap- 
parent that the REMs were associated with a typical 
low-voltage EEG pattern and statistically significant 
increases in the heart and respiratory rates . . . though 
occasionally the pulse was slowed. These changes sug- 
gested some sort of emotional disturbance, such as 
might be caused by dreaming. To test this supposition, 
sleepers were aroused and interrogated during, or 
shortly after the termination of, REMs and they almost 
invariably reported having dreamed. If awakened in 
the absence of REMs, . . . they seldom recalled dream- 
ing. (Kleitman 1963, pp. 93-94) 


Like most discoveries, the correlation of eye 
movements with dreaming was not unanticipated. 
In 1892, G. T. Ladd tentatively concluded on the 
basis of introspective studies that the eyes move 
during dreaming, and many years later, E. Jacob- 
son (1938) corroborated Ladd’s introspections by 
actually observing that the eyes do move during 
dreaming. Despite these historical antecedents, it 
was the findings of Aserinsky and Kleitman that 
launched the modern era of dream research, just 
as half a century earlier it had been Freud's book 
The Interpretation of Dreams (1900) that estab- 
lished the dream as the main vehicle for studying 
the operations and products of unconscious proc- 
esses. 

Why do the eyes move during dreaming? The 
best answer seems to be that the eyes are scanning 
the dream scene, just as the eyes of a person who 
is awake scan the visual field. There is some evi- 
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dence to support this scanning hypothesis. Dreams 
that involve much action are reported after a REM 
period in which there are many large eye move- 
ments, whereas more passive dreams are correlated 
with smaller eye movements (Berger & Oswald 
1962). In some instances, the direction of the eye 
movement has been shown to correspond with the 
direction of the movements in the reported dream 
(Dement & Wolpert 1958). For example, following 
a REM period in which only vertical eye move- 
ments were recorded, the subject reported a dream 
of looking alternately down at the sidewalk and up 
toward the top of a flight of stairs. 

Dreaming and sleep cycles. Not only is dream- 
ing a unique form of behavior, but it also occurs 
during a unique stage of sleep. Unlike any of the 
other stages, the one during which dreaming occurs 
is characterized not only by bursts of rapid, conju- 
gate movements of both eyes and by a low-voltage 
EEG pattern but also by other physiological and 
behavioral concomitants. Breathing and heart rate 
tend to be irregular, and the muscles of the throat 
adjacent to the larynx become flaccid, although, 
interestingly, the penis becomes more erect, The 
electrical resistance of the skin, which is ordinarily 
high during sleep, is reported to be even higher 
during so-called dreaming sleep. All of this evi- 
dence suggests that the stage during which dream- 
ing occurs may be, as one authority has stated, 
“a separate organismic state, different from both 
‘nondreaming’ sleeping and from waking” (Snyder 
1963). 

Dreaming is cyclical throughout the sleep period, 
with from four to six such cycles occurring during 
the night. The first cycle appears approximately 
an hour after a person falls asleep, and succeeding 
ones occur about every 90 minutes. The length of 
successive REM periods increases from 5 to 10 
minutes for the first one to 30 minutes or longer 
for the last one. This finding revokes the common 
belief that a dream lasts for only a few seconds, 
In fact, some recent evidence suggests that what 
transpires in a dream occupies approximately the 
same length of time as it would were the same 
events to occur in waking life. 

Although a hallucinatory dream experience is 
rarely reported if a person is awakened when eye 
movements are lacking, some investigators have ob- 
tained thoughtlike reports from persons who were 
awakened during non-REM periods. In one such 


study the subjects compared their reports of non- 


REM periods with those obtained from dreaming 
ports of non-REM 


sleep. They found that their re 
periods were more difficult to recall, were more 
plausible and less emotional, were more concerned 
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with contemporary events, and, generally speaking, 
resembled thinking more than dreaming (Recht- 
schaffen et al. 1963.) It has been suggested that the 
thinking the subject reports on being awakened 
from a non-REM period occurs while the sleeper 
is being awakened and does not reflect mental 
activity during sleep itself. 

Universality. By employing the objective indi- 
cators of dreaming in studies involving numerous 
subjects, investigators in the United States and 
abroad have established that everyone dreams 
every night. Exceptions to this have been noted 
when the subject is in an abnormal state, such as 
during a high fever or when certain drugs have 
been administered. Even people who say they have 
never dreamed or who rarely remember a dream 
will report dreams when they are awakened during 
a rapid-eye-movement period (Goodenough et al. 
1959). 

A young adult spends about one-fifth of the 
sleep period dreaming, babies considerably more, 
and older people less, Rapid eye movements have 
been observed in blind people whose blindness is 
not of long standing. In those who have been blind 
for several years, low-voltage EEG waves still occur 
and a nonvisual type of dream is reported when 
the blind person is awakened (Berger et al. 1962). 
Other animals besides man show cyclical periods of 
rapid eye movements during sleep. 

Biological necessity? One of the most interest- 
ing findings resulting from the use of objective 
indicators of dreaming is that when a person's 
dreaming is reduced by awakening him every time 
his eyes begin to move, there is a significant in- 
crease in REM time when he is finally permitted to 
sleep undisturbed (Dement 1960). Moreover, if a 
person is deprived of dreaming for a number of 
nights, his waking behavior appears to be adversely 
affected. He manifests various aberrant “symptoms” 
that border on being pathological, and it has been 
conjectured that if he were deprived of dreams long 
enough, he might become psychotic. These results 
seem to indicate a “need to dream,” comparable in 
its psychobiological insistence to any of the other 
basic needs of the individual. 

One authority believes, however, it may be the 
kind of sleep and not the dreaming itself that is the 
biological necessity ( Snyder 1963). Support for 
this hypothesis is found in the work of the French 
investigator Jouvet, who has observed periods of 
eye movements and low-voltage waves in decor- 
ticate cats (Jouvet 1961) and in decorticate hu- 
mans (Jouvet et al. 1961). It is considered unlikely 
that either a decorticate cat or a decorticate person 
is capable of having dreams, yet the physiological 


concomitants of dreaming persist in the decorticate 
state. It is these physiological processes and not 
the dreaming that Snyder considers to be the 
biological necessity. 

External events. The availability of indicators 
of dreaming will enable investigators to observe 
the effects of certain stimuli upon dreaming. Studies 
made prior to the discovery of objective indicators 
showed that experimentally introduced discrete 
stimuli had, for the most part, very little influence 
upon the dream (Ramsey 1953). They may appear 
either directly or in distorted form in the dream, 
but they do not instigate, control, or shape the sub- 
stance of the dream. Preliminary studies using the 
objective indicators have produced similar results, 
namely, that such external stimuli as sounds, pres- 
sures, and temperature changes have little influ- 
ence on the dream. It is believed, however, that 
more complex forms of stimulation, such as movies 
shown to a person before he retires, will influence 
dreams, It is an established fact that the experi- 
mental situation for monitoring sleep becomes rep- 
resented in dreams, especially during the subject's 
initial nights in the laboratory. It will now be pos- 
sible also to determine whether the sleeping per- 
son is more sensitive to telepathic or clairvoyant 
influences, as some authorities believe, by noting 
the appearance of subliminal stimuli in his dreams. 

One question that is being investigated is 
whether the dreams obtained from persons whose 
sleep is monitored can be compared with dreams 
remembered by them upon awakening from non- 
monitored sleep. Preliminary findings indicate that 
the two samples are not comparable. A related 
question is whether the dreams of a person through- 
out the night show any consistent pattern of the- 
matic material. Again preliminary findings suggest 
that there are minimal correspondences among a 
person’s dreams of the same night. 


Dream interpretation 


Interpretation before Freud. Although Freud 
was not the first person to assume that the dream 
has a “deeper” meaning—such an assumption 
seems always to have existed—he was the first to 
develop an empirical method for “interpreting” 4 
dream. Dream interpreters before Freud—men 
like Artemidorus, who is credited with being the 
first compiler of a dream book, and Joseph, whose 
exploits are recounted in the Bible—depended upon 
intuition, wisdom, and a scholarly knowledge of 
dream lore for deriving significance from the 
dream. Elements in a dream were assumed by 
these ancient dream interpreters to have a fixed 
meaning. It is this assumption that underlies dream 


books. Although dream books, with their prophe- 
cies of good and bad fortune, have fallen into dis- 
repute among educated people, they are still pub- 
lished and purchased in large quantities and 
influence the decisions of many people (Weiss 
1944). Among societies lacking books, dream in- 
terpreters still flourish and enjoy great prestige for 
their knowledge of a phenomenon deemed to be 
steeped in personal and even social relevance. 
There are societies like the Senoi in which people 
tell each other their dreams for the express purpose 
of reducing interpersonal tensions (Steward 1951). 

That the dream among all the cognitive activities 
of man should be singled out as possessing special 
and mysterious properties, and requiring as a con- 
sequence special and often supernatural explana- 
tions, is not surprising when one considers how 
unique the dream is. For example, the ancient 
theory that the dream is a record of the experiences 
of a soul that leaves the body during sleep is based 
upon the fact that many dreams are so vivid and 
lifelike that they are mistaken for real perceptions. 

Also, in view of the fact that the dream appears 
as an alien visitation without warning and with- 
out intention during the dead of sleep, a condition 
that is itself charged with mystery, it is not dif- 
ficult to comprehend why many people construe 
the dream as a prophetic message from super- 
natural beings, from the sleeping person’s ances- 
tors, or, in the present age of science fiction, from 
people living on other planets. Other prescientific 
theories that purport to explain dreaming are 
based upon other equally singular features of the 
dream. 

Freud’s theory. Freud's empirical method for 
interpreting a dream involves free association. 
After a person reports a dream to his analyst, he is 
instructed to say everything that comes into his 
mind when each successive element of the dream 
is presented back to him. By using the method of 
free association with his patients’ dreams, Freud 
(1900) was able to formulate a comprehensive 
theory of the dream. The dream has two kinds of 
content: the manifest (conscious) content, which 
is the dream as experienced and remembered by 
the dreamer, and the latent (unconscious) content, 
which is discovered through free association. Dream 
interpretation involves replacing the manifest with 
the latent content. The nucleus of the latent con- 
tent is an unconscious infantile wish with which 
later experiences have become implicated. The 
ultimate task of interpretation is to unearth the 
nuclear infantile wish through free association. 
Much of the latent content consists, however, of 
“day residue,” that is, memories of experiences and 
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thoughts that the dreamer has had on the day 
previous to the dream. Day residue alone is not suf- 
ficient to create a dream; it must be charged by an 
infantile wish in order to transform it into the 
conscious imagery of a dream, 

When the dream thoughts (latent content) are 
transformed into the manifest content of the re- 
called dream, they are altered in certain ways. They 
are subject to condensation; an element in the 
manifest dream may be a compression of several 
dream thoughts. They are subject to displacement; 
feeling associated with a particular dream thought 
is transferred to an otherwise neutral element in 
the manifest dream, Latent thoughts may also be 
represented in the experienced dream by symbols. 
The interpretation of a dream requires, then, that 
all of the condensed manifest elements be ex- 
panded into their constituent dream thoughts, that 
all displaced affects be traced to their proper sources 
in the latent content, and that referents be found 
for all symbols. This is a formidable undertaking, 
the result of which is an interpreted version that is 
many times longer than the text of the manifest 
dream. The practicability of this method of in- 
terpreting dreams appears to be restricted to long- 
term psychotherapy. 

Freud hypothesized that the dream work, which 
consists of the operations for transforming latent 
into manifest content is governed by two aims: 
regard for representability and disguise. Regard for 
representability refers to the transformation of ab- 
stract dream thoughts into concrete dream imagery. 
The aim of disguise is protection, based on the 
assumption that the undisguised latent thoughts 
would evoke so much anxiety that the dreamer 
would awaken. Many dreams do, in fact, awaken 
the sleeper because they are not sufficiently dis- 
guised. The sleep-protection hypothesis is a bio- 
logical one, for it explains not what we dream but 
why we dream. 

Dreams, then, according to Freud's theory, are 
useful in establishing the contents of the uncon- 
scious. Since the foundation of the unconscious is 
laid down early in life, before the age of five or six, 
and contains repressed material from the psycho- 
sexual stages, the analysis of dreams constitutes 
one of the few methods for studying early psycho- 
logical development. That the unconscious may 
also contain material from the racial past was dis- 
cussed by Freud, but he neither strongly affirmed 
nor denied the notion, although he seems to have 
been sympathetic to it. 3 

Jung's theory. It remained for one of Freud's 
early associates (later an apostate), Carl Jung, to 
examine dreams for evidence of a racial uncon- 
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scious that all men share (Jung 1960). Jung was 
convinced that there was sufficient evidence in 
dreams and other types of material, e.g., myths and 
religion, to validate the concept of a collective un- 
conscious. He called the contents of this uncon- 
scious “archetypes” and identified a number of 
them: the anima, the shadow, the earth mother, 
the wise old man, and, most important of all, the 
archetype of personal unity symbolically repre- 
sented in dreams and elsewhere by the form of the 
mandala. Whereas Freud used dreams to explore 
the formative years of a person's life, Jung used 
them to explore the psychological development of 
the race. 

Jung also thought, in contradistinction to Freud, 
that dreams are oriented to the future as well as to 
the past. They mark out for the individual the 
proper path to a more complete actualization of 
personality and help reveal poorly developed parts 
of the personality. 

Other theories. In addition to the theories of 
Freud and Jung, there are a number of other the- 
ories, for example, those of Hall (1953), French 
(1954), Hadfield (1954), Boss (1953), Ullman 
(1955; 1958; 1959), and Jones (1962). These have 
several features in common. They deal more with 
the manifest than with the latent content, and they 
are more concerned with the dream as an expres- 
sion having adaptive significance for the dreamer 
than as a disguise for infantile wishes. Hall, for 
example, regards the dream as a concrete repre- 
sentation of the dreamer’s conception of himself, of 
others, and of his world. The dream reveals more 
than it conceals. French stresses the integrative 
role played by the dream. The dreamer is attempt- 
ing to solve his emotional problems. Hadfield also 
sees the dream as problem-solving activity, and 
Ullman emphasizes the dream’s adaptive function. 
For Boss, an existential—phenomenological thera- 
pist, the dream is a confrontation experience in 
which the dreamer faces directly his own questions 
of existence as a unique experiencing self. In the 
most recent of these theoretical formulations, Jones 
describes the synthesizing function of the dream 
within the context of a developmental sequence of 
critical phases through which a person passes in 
growing up. 

It would seem from these theories that the dream 
was a complex, multidimensional, multileveled phe- 
nomenon capable of supporting diverse theoretical 
superstructures. The dream may, in fact, be just 
such a complex phenomenon, although the ratio of 
research to speculation is still so small that it is 
difficult to draw any firm conclusions regarding the 
validity of these speculations. Although research is 


scanty, the usefulness of dream analysis in psycho- 
analytic and other forms of psychotherapy seems 
to be generally acknowledged by psychotherapists 
(Bonime 1962). 


Experimental studies of dreams 

The dream as a projective device. It is not pos- 
sible to say just how much research on dreams has 
been deterred by Freud's distinction between mani- 
fest and latent content and by the complex set of 
operations that must be carried out under a very 
special type of relationship between the observer 
and the subject before the operations can be suc- 
cessful. There are indications that these methodo- 
logical obstacles are being bypassed by treating the 
manifest dream as significant material in its own 
right (Hall 1947; Eggan 1952; Jones 1962). This 
approach regards the dream, or preferably a series 
of dreams, as a projective device similar to those 
of the Rorschach and story-telling techniques. It 
may be argued that the dream is almost a pure 
form of projection, since external stimuli seem to 
have so little to do with its formation or its content. 

The projective approach may be illustrated by a 
study made by Hall and Van de Castle (1965), Fol- 
lowing Freud's theory of sex differences in psycho- 
sexual development, they hypothesized that there 
would be a greater frequency of castration anxiety 
in dreams reported by males and that there would 
be a greater incidence of castration wish and penis 
envy in dreams reported by females. Scales for 
identifying castration anxiety, castration wish, and 
penis envy in dream reports were developed, and a 
large number of male and female dream series 
were scored using these scales. The hypotheses 
were confirmed at a high level of significance. 

Another study employing the same methodology 
was conducted by Beck and Hurvich (1959). They 
predicted that depressed patients would show a 
greater incidence of manifest dreams with maso- 
chistic content than would nondepressed patients. 
A collection of dreams secured from depressed and 
nondepressed patients was scored, and the hy- 
pothesis was confirmed at an appropriate level of 
significance. 

This method of dream analysis has much to 
recommend it. Dream reports can easily be col- 
lected in large numbers from groups of people 
living under different cultural, economic, social, 
educational, and geographical conditions. They can 
be subjected to quantification, and the same set of 
dreams can be analyzed in many ways to serve dif- 
ferent empirical and theoretical purposes. Norms 
for different populations can be established so that 
an individual's deviation from the norms for the 


et. 


group may be accurately described. The influence 
of experimental manipulation of variables can be 
assessed by comparing a treated group with a 
control one. 

Of particular significance for the social sciences 
is the comparison of dreams obtained from people 
living in different cultures, Dorothy Eggan’s pio- 
neering efforts in studying the dreams of Hopi 
Indians show what can be done by correlating the 
themes of reported dreams with culturally relevant 
material (1961). 

The objective method of analyzing dreams. It 
is to be expected that the availability of objective 
indicators of dreaming will prove a stimulant to 
research and increase our knowledge of dreams 
and dreaming in the future. The physiological 
emphasis so far in research probably reflects the 
fact that rapid eye movements and brain waves 
were discovered by physiologists and that their 
measurement employs apparatus that is more fa- 
miliar to physiologists than to social and behavioral 
scientists. 

CALVIN HALL 


[Other relevant material may be found in ANALYTICAL 
PSYCHOLOGY; NERVOUS SYSTEM, article on ELECTRO- 
ENCEPHALOGRAPHY; PROJECTIVE METHODS; PSYCHO- 
ANALYSIS; SLEEP; and in the biography of JuNG.] 
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DRINKING AND ALCOHOLISM 


I. PSYCHOLOGICAL ASPECTS 
mm. SOCIAL ASPECTS 


Edith S. Lisansky 
Charles R. Snyder 
and David J. Pittman 
I 
PSYCHOLOGICAL ASPECTS 


Drinking and alcoholism will be considered as 
related but distinct phenomena. The use of alco- 
holic beverages by almost all societies over recorded 
history as a stimulant, anesthetic, social lubricant, 
and ceremonial substance is well known. Ingestion 
of alcohol may be normal or pathological; here the 
problem of definition begins. There is moderate 
drinking, acceptable in many (but not all) human 
groups; there is excessive drinking, for example, 
intoxication; there are behavior problems associated 
with drinking; and there is alcoholism. A recent 
definition of alcoholism states that “alcoholism is 
a psychogenic dependence on or a physiological 
addiction to ethanol, manifested by the inability 
of the alcoholic consistently to control either the 
start of drinking or its termination once started . . .” 
(Keller 1962, p. 312). Loss of control is the patho- 
gnomic symptom here, and the definition specifies 
for the epidemiologist those behaviors that will per- 
mit him to define the alcoholic—marked and repeti- 
tive drinking—as well as the ill effects on the 
drinker’s health or on his social or economic 
functioning. 

Drinking behavior has been interpreted in terms 
of different theoretical systems in psychology: for 
example, drinking has been viewed as a reducer of 
fear and conflict, by behaviorists, or as a dissolver 
of the superego, by psychoanalysts. Studies of the 
influence of alcohol on human behavior may be 
viewed as a wedge into related areas of knowledge: 
knowledge of the effects of alcohol on learning 
efficiency or motor coordination may be considered 
data about learning efficiency or motor coordina- 
tion per se, about general drug effects, or about 
individual differences in response. The relation of 
deviant drinking behavior to human misery, per- 
sonal and social, gives some insight into the dy- 
namics of psychic pain and how ways of coping 
are passed on and learned. Alcohol studies in social 


anthropology may suggest usable hypotheses about 
the relationship of individual maladjustment and 
social organization in different societies. There are 
some groups in which drinking is virtually uni- 
versal, where the lone drinker or abstainer is un- 
known, and where frequent intoxication and spree 
drinking is not the symptomatic, deviant behavior 
of a relatively few, for example, the fiesta drinking 
of Central American and South American Indian 
groups. In some social groups and societies, such 
as nineteenth-century European Jewish communi- 
ties, alcohol may serve to reinforce social organ- 
ization. In other groups, alcohol may hasten proc- 
esses of disorganization and deculturation; the 
example of many North American Indian tribes is 
well known. Alcohol has been used occasionally 
as an instrument to achieve the submission of and 
control over others. 


Drinking 

The term “drinking” seems simple enough: an 
organism ingests an alcoholic beverage. But as a 
behavioral term, it includes many related aspects: 
in describing drinking behavior, one needs to know 
the kind and amount of beverage ingested and the 
circumstances of ingestion, such as where it is 
drunk, how rapidly, with whom, how diluted, 
whether before or during meals, etc. The term 
“drinking” encompasses the determination of the 
blood-alcohol level produced by the drinking; the 
effects on efficiency, mood, and social interaction; 
the past experiences of the drinker; and the drink- 
ing customs of the social group. 

Organic and psychological aspects. The physio- 
logical aspects of drinking have been summarized 
in an excellent review that includes much of the 
recent physiological and biochemical research on 
alcoho] (Kalant 1962, pp. 52-93). 

The psychological aspects of drinking will be 
narrowed here to a review of research on the be- 
havioral effects of alcohol on individual organisms. 

There is a vast literature involving animal sub- 
jects and the effects of alcohol on their behavior. 
Often, these are studies of alcohol effects in clas- 
sical or operant conditioning procedures, reaction 
time and maze behavior, but there have also been 
many laboratory reports demonstrating increased 
yoluntary consumption of alcohol under different 
experimental conditions, such as vitamin deficiency 
or stress, as well as some reports of laboratory work 
demonstrating genetic strain differences in alcohol 
preference. The term “experimental alcoholism” to 
describe these studies was criticized by the World 
Health Organization, Expert Committee on Alcohol 
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(1954), which expressed the opinion that the phe- 
nomenon of alcoholism was closer to experiments 
dealing with “the relief by alcohol of experimentally 
induced neurotic manifestations in animals.” Such 
experiments include studies of experimenter-in- 
duced conflicts and stress and of “experimental 
neurosis.” Whether such experiments relate to alco- 
holism or to the relief afforded by alcohol in every- 
day, nonpathological social drinking may be a 
legitimate question, but there is another caveat: 
not to go too far in generalizing from the responses 
to alcohol of the animal in laboratory-generated 
stress to the complex, socially patterned drinking 
behavior of the human subject. 

There was a sizable amount of laboratory experi- 
mentation during the first four decades of this 
century on the effects of alcohol on different psy- 
chological variables. These have been summarized 
and evaluated in a monograph by Jellinek and 
McFarland (1940). Experiments were grouped 
under these headings: chronaxy, reflexes, sensa- 
tions, perception and attention, simple reaction 
time, muscular strength and coordination, miscel- 
laneous tests of dexterity and skill, learning and 
simple learned performances, memory, the associa- 
tive functions, judgment, reasoning and intelli- 
gence, volition, emotion and personality, The gen- 
eralization made from the authors’ analysis of the 
experiments was that “alcohol has a depressing 
effect on all psychological functions yet measured, 
and . . . such stimulation as has been reported for 
some psychological variables is a pseudostimula- 
tion” (ibid., p. 362-363). The reviewers also con- 
cluded that “the simple psychological functions are 
less affected by alcohol than the complex ones” 
(ibid., p. 363), 

These conclusions are now under critical re- 
examination and challenge. There is some evidence 
that small amounts of alcohol may in some task 
situations have a facilitating effect. It turns out, 
too, that inferences about “pseudostimulation” are 
based on a melange of psychology and neurology: 
if behavior or performance improves with alcohol, 
the argument is that such improvement occurs be- 
cause cortical inhibitions are disinhibited by the 
alcohol. But studies of behavior need to be de- 
scribed in behavioral and not in neurological terms 
(particularly not in oversimplified neurological 
terms that have been superseded by newer con- 
cepts of cerebral activity). As it appears now, alco- 
hol in small and moderate amounts is frequently 
a behavioral depressant, but may—under some con- 
ditions, for some tasks, for some individuals—act 
as a facilitator (Lisansky 1964, pp. 104-121). 


The issue of simple versus complex psychological 
functions has also been critically re-evaluated: 
“The traditional idea that intellectual functions are 
particularly susceptible to deterioration by alcohol 
is questioned . . . a reevaluation of the idea that 
intellectual functions are a ‘complex’ or ‘higher’ or 
‘fragile’ activity is necessary . . .” (Carpenter 1962, 
p. 310). 

Experimental study of the effects of alcohol on 
emotional behavior and the relationship of such 
effects to personality variables is in its infancy. 
There is some research suggesting that individual 
characteristics related to the effects of alcohol may 
be a subject's general adjustment, his introversive 
or extroversive tendencies, his past experience with 
alcohol, and his familiarity with a task. Interest- 
ingly enough, when people are asked to report their 
subjective feelings about how alcohol affects them, 
they almost invariably reply in terms of stimulating 
effects; this is true whether the people are labora- 
tory subjects or participants in surveys. Experi- 
ments involving groups and the effects of alcohol 
on the emotional behavior of individuals drinking 
together have been conducted by the Finnish Foun- 
dation for Alcohol Studies. The limited number of 
group studies to date report increases in aggressive 
behavior, in the sexual content of fantasies, and 
in emotionality of behavior generally. 

Drinking practices in the United States, in some 
Central American and South American communi- 
ties, among several African peoples, in many coun- 
tries of Europe and Asia, and in preliterate primi- 
tive societies have been described (McCarthy 1959; 
Pittman & Snyder 1962), Jellinek (1960) has re- 
ported on attitudes toward intoxication and alco- 
holism in the different countries of Europe and 
the Americas. 


Alcoholism 

Any discussion of alcoholism must begin by dis- 
tinguishing between “drinking related misbehav- 
iors” (Keller 1962, p. 310) or “{ntoxication-caused 
problems” (Tongue 1962) on the one hand and 
alcoholism on the other, Jellinek commented: “In- 
ternational experience leads to the conclusion that 
in many countries more serious problems of na- 
tional magnitude arise from other types of drinkers 
than from those who are termed ‘alcoholics’ in 
America” (1960, p. 15). Other types of drinking, 
like occasional excessive drinking, may lead to be- 
haviors that present a problem to society; examples 
of these would be “explosive drinking” in Finland, 
or “fiesta drinking” in Spain, Portugal, Brazil, and 
Argentina. Jellinek (1960) described the so-called 
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“inveterate drinker” in France, who may consume 
two or even three liters of wine daily, rarely becom- 
ing intoxicated yet frequently presenting the med- 
ical symptoms of the diseases of chronic alcoholism. 

European state programs have concerned them- 
selves with both alcoholism and alcohol-related 
problems; most of the programs in the United 
States have dealt more or less exclusively with the 
treatment and rehabilitation of alcoholics. This 
difference does not seem to be related to the se- 
verity of the alcoholism problem. It is more likely 
related to differing national attitudes toward social 
legislation and government responsibility. 

Etiology. When we refer to “etiology” or “treat- 
ment,” we are speaking of alcoholism per se as it 
presents itself in the industrialized, urbanized 
countries. Alcoholism is viewed by many in these 
countries as a disease or a symptom of a disease 
process, and the concepts of etiology and treatment 
relate to this view. To perceive the most extreme 
of alcohol problems, alcoholism, as a disease re- 
quires a kind of mental-health viewpoint not uni- 
versal in the medical profession or welfare agencies, 
to say nothing of the lay public. It is a deviant be- 
havior, and as such it is seen by many as im- 
morality, weakness of will, perversity, or a bad 
habit. 

Physiopathological theories. Etiological theories 
have been primarily physiopathological or psycho- 
pathological. Among physiopathological theories, 
there are explanations in terms of allergy, brain 
pathology, biochemical substances, nutritional defi- 
ciencies, and glandular disorder. These theories 
have not gained more than limited acceptance; one 
of the major problems has been the difficulty of 
demonstrating that the particular physiopathologi- 
cal condition offered as the cause actually predated 
the onset of alcoholism. Generally, evidence has 
been based on the condition of patients with long- 
standing alcoholism. 

The World Health Organization, Expert Com- 
mittee on Alcohol (1954) placed alcohol “in a cate- 
gory of its own, intermediate between addiction- 
producing and habit-forming drugs.” The questions 
of alcohol and physical dependence or craving, 
alcohol and withdrawal, and alcohol and toler- 
ance changes (craving, withdrawal, and tolerance 
changes being the criteria of addiction) have not 
been solved and are still in debate. 

Psychopathological theories. Psychopathological 
theories were divided by Jellinek (1960) into those 
viewing alcoholism as the illness per se and those 
viewing it as a symptom of illness. Overriding 
this distinction is the common view that some per- 


sonality deviation or difficulty in social-emotional 
development is a necessary condition of alcoholism. 
The psychoanalytic view emphasized “the oral and 
narcissistic premorbid personality,” and various 
writers, theorizing from clinical data, have written 
of passivity, self-destruction, guilt, infantile traits, 
anxiety, and oral regression. There has been debate 
as to whether there is a single psychological pre- 
disposition, several such predispositions, or whether, 
indeed, any personality is vulnerable. Psychological 
test research has failed to turn up a single “alco- 
holic type.” Psychological clinical research with 
emphasis on social as well as psychological data 
about subjects and on behavioral criteria for dif- 
ferentiating among alcoholics might yet make a 
contribution, There has been some speculation by 
psychologists about how alcoholism is learned, for 
instance, whether it is a result of the reinforcement 
involved in reduction of fear, conflict, and tension. 
A little has been written about the function alcohol 
serves in solving conscious and unconscious emo- 
tional needs. This literature has been reviewed 
(Zwerling & Rosenbaum 1959; Lisansky 1960). 

There would probably be agreement by most who 
work with alcoholic patients in the United States 
that the alcoholic is an individual with low toler- 
ance for frustration and stress, that he has not de- 
veloped much by way of ego defenses other than 
denial, and that he has weak sexual drives, his 
affectional bind being largely with alcohol. 

The weakness of psychopathological theories 
standing alone as an explanation of alcoholism is 
the difficulty of answering this question: If, as 
seems to be the case, there is no absolutely unique 
set of predispositional psychological circumstances, 
why do some people become alcoholic while others 
do not? There is, for example, no absence of neu- 
rotic problems or depression or guilt among the 
Jews, who maintain an extremely low rate of alco- 
holism. Psychopathology may be a necessary but 
not a sufficient causal explanation of alcoholism. 

Sociopsychological theories. The most promise 
lies in a sociopsychological theory of etiology. One 
needs to know not only about individual tensions 
and frustrations but also about the group’s methods 
of coping and its attitudes toward and perception 
of alcohol. Jellinek (1960), for example, has 
offered a hypothesis relating a society's degree of 
acceptance of “large daily alcohol consumption 
and the psychological vulnerability of individuals. 
It is a start. In the United States, different norma- 
tive orientations toward drinking (Pittman & 
Snyder 1962) appear in different ethnic and ges 
ligious subcultures, the Irish-American orientation 
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producing higher rates of alcoholism than the 
Jewish-American. The question remains: Why do 
some Irish-Americans become alcoholics and not 
others? Perhaps the same psychological vulnera- 
bility that predisposes toward alcoholism in the 
Irish-American group predisposes toward other 
psychopathologies in the Jewish-American group. 

In several European and South American viti- 
cultural countries, there is widespread belief in the 
“economic origin” of alcoholism. This may refer to 
economic deprivation of individuals or of the coun- 
try, and the view that intoxication and alcoholism 
are related to want and economic misery may be 
valid for certain times and groups and parts of the 
world. The “economic origin” theory also refers to 
pressure from a national economy in which viticul- 
tural interests loom large, but the oversimplification 
of such an explanation is manifest in the differ- 
ences in alcohol consumption and the rates of alco- 
holism in France and in Italy, both viticultural 
countries. 

There is no all-embracing explanation of alco- 
holism in sight, but if one begins with a distinction 
between alcoholism and problems related to alco- 
hol, if one is aware of the need for a classification 
of variants within the diagnosis of alcoholism, and 
if one is willing to include physiological, psycho- 
logical, and social variables in an explanation of 
etiology, some ground is cleared. Probably no theory 
can ever account for all the problems men have 
with alcohol. 

Treatment. In spite of the lack of solutions to 
questions about alcoholism, there has been a great 
proliferation of state programs in Europe and the 
Americas since World War u. The emphasis has 
been primarily on the treatment and rehabilitation 
of alcoholics, with secondary emphasis on educa- 
tional programs and, in a few places, some support 
for a research program. With pressure coming from 
a variety of sources—the government itself, wel- 
fare workers and physicians, temperance organiza- 
tions, or those concerned with the relationships 
between alcohol and acting-out crimes—some 
attempt has been made to offer treatment. 

Treatment programs fall into three major cate- 
gories: 

(1) Residential treatments place the alcoholic 
in (a) a hospital or a ward in a general or psychi- 
atric hospital, (b) rehabilitation living quarters, 
like a halfway house, for chronic drunkenness of- 
fenders or, relatively new, day-or-night-care centers. 

(2) Among psychological and rehabilitative 
treatments are (a) individual and group psycho- 
therapy, frequently psychoanalytically oriented, 


(b) a special form of group help, Alcoholics Anon- 
ymous (a brotherhood of former alcoholics that has 
taken hold in many countries other than the United 
States, for instance, the “Ring i Ring” in Denmark), 
(c) groups of religious and quasi-religious organ- 
izations, including the churches and the Salvation 
Army, (d) the use of hypnosis and its variations, 
and (e) the use of conditioned-reflex treatment. 

(3) Drug treatments include (a) ataractic and 
antidepressant drugs in current use as the sole 
treatment or as adjunct to other forms of treatment 
and (b) the administration of disulfiram (Anta- 
buse), discovered by Dr. Erik Jacobsen of Den- 
mark, very widely used in alcoholism treatment. 

A particular treatment program is organized, 
hopefully, in terms of the needs of the patients it 
seeks to reach. Where the patients are physically 
ill or debilitated or homeless, more than outpatient 
service is needed. There are some countries, for 
example, Sweden and Norway, in which the super- 
vision and rehabilitation of alcoholics is compulsory 
under law. In most countries treatment is more or 
less voluntarily undertaken. 


Research and problems 

Research activity has been most fruitful at those 
interdisciplinary centers designed for research in 
alcohol studies. These centers, which combine the 
efforts of physiologists, physicians, and biochem- 
ists, psychiatrists, psychologists, and social work- 
ers, sociologists, anthropologists, and economists, 
demonstrate the usefulness of a multidisciplined 
research approach to problems like drinking and 
alcoholism. The Alcohol and Drug Addiction Re- 
search Foundation of Ontario in Canada, the 
Finnish Foundation for Alcohol Studies, the Depart- 
ment of Alcohol Research of the Karolinska Insti- 
tutet in Sweden, and the Rutgers (formerly Yale) 
Center of Alcohol Studies in New Jersey have been 
outstanding. These centers have produced research 
results over a wide range of problems: physiological 
and biochemical action of alcohol, the effects of 
alcohol on behavior and performance, alcohol usage 
in different cultural and social groups, attitudes, 
prevalence of alcoholism, the effectiveness of vari- 
ous drug treatments, drinking and driving, govern- 
mental policy and social controls—the gamut of 
alcohol-related problems. 

There are many unanswered questions, but the 
most pressing problems in the psychological aspects 
of drinking and alcoholism, those which should 
have priority in research development, are: first, 
basic research on the effects of alcohol on mood, 
feelings, emotional response, and group behavior 
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and, second, a meaningful classification of the 
alcoholisms, probably in behavioral terms objec- 
tively defined, which can be used internationally 
and which will, hopefully, lead to an understand- 
ing of the different etiologies and most effective 
treatments. 

EDITH S. LisANSKY 


[Other relevant material may be found in Drucs; 
SMOKING.] 
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Lii 
SOCIAL ASPECTS 


A generation ago Donald Horton (1943) pio- 
neered the first systematic cross-cultural study of 
drinking patterns in a report on the functions of 
alcohol in primitive societies. In this work he 
offered an eloquent plea for intensifying system- 
atic research on drinking behavior and for moving 
toward the goal of incorporating empirical findings 
into a general theory of the social and psychological 
functions of alcohol. Since then, the systematic 
study of drinking behavior has proceeded at an 
accelerated pace in various branches of social sci- 
ence. Yet it cannot be said that the research of the 
past two decades has produced the general theory 
which Horton had in mind. At best it has yielded 
new facts, questions, and hypotheses suggestive of 
the directions that synthesis may eventually take. 
Here we shall first consider some of the more gen- 
eral findings of comparative studies of drinking in 
primitive societies and of sociological and related 
studies of drinking in complex societies. Attention 
will then be given to the nature of alcoholism; to 
its prevalence, patterning, and genesis; and to 
social responses to alcoholism. 

Drinking in primitive societies 

Alcohol and anxiety reduction. Central to Hor- 
ton’s research was the view that patterns of drink- 
ing behavior are determined by the interrelations 
of psychological and cultural variables and, more 
specifically, the proposition that the primary func- 
tion of alcoholic beverages in all societies is the 
reduction of anxiety. Horton recognized other func- 
tions of alcohol as well as the seeming paradox 
that drinking, in net balance, may not be anxiety- 
reducing where social controls give rise to powerful 
and inhibiting counteranxieties. Yet, weaving the 
basic notion of anxiety reduction into a larger com- 
plex of psychocultural assumptions, he derived 
some important theorems, enabling indirect tests 
of his basic proposition on a sample of 56 cultur- 
ally distinct societies thought to represent a cross 
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section of primitive societies throughout the world 
(Horton 1943). 

It was reasoned, for instance, that drinking tends 
generally to be accompanied by the release of ag- 
gressive and sexual impulses; all societies must to 
some extent inhibit expression of these impulses, 
and to reduce impulse-anxiety through alcohol is, 
in effect, to release the inhibition. Moreover, the 
strength of the drinking response, which Horton 
indexed by “the degree of insobriety,” was thought 
to vary inversely with the occurrence of painful 
social experiences associated with the release of 
these impulses—experiences which, in turn, might 
vary from culture to culture. The ethnographic 
data reviewed by Horton corroborated the hypothe- 
sis that aggressive behavior tends to be associated 
with drinking, although a marked range of varia- 
tion became apparent. To explain this variation, he 
proposed, with some supporting evidence, that so- 
cieties characterized by high levels of inhibited 
aggression (indexed by the prevalence of sorcery) 
and coordinate systems of social control, as con- 
trasted with superordinate systems capable of effec- 
tively punishing aggression, exhibit a maximum of 
drunken aggression. With regard to sexuality, the 
results were not, on the whole, what had been ex- 
pected, although the suggestion of an inverse rela- 
tion between premarital sexual restrictions and the 
degree of insobriety highlighted the possible role 
of counteranxiety in restraining drinking. 

The most dramatic of Horton’s findings, how- 
ever, related to the deduction that the strength of 
the drinking response ( “insobriety” ) varies directly 
with the general level of anxiety, which is an ex- 
pression of the state of the total social structure 
and especially of the economy. As a clue to this, 
Horton referred to “subsistence anxiety” or inse- 
curity, which he indexed by the type of economy 
(scaled from hunting through higher agriculture) 
and by specific subsistence hazards, including the 
hazards of acculturation. As Horton himself put it, 
the statistical findings supported the conclusion 
that “insobriety varies directly with anxiety as 
measured indirectly in terms of the anxiety- 
provoking conditions of subsistence insecurity and 
acculturation” (1943, p. 294). A 

Patterns of drinking behavior. Because his 
method of analysis treated variables in isolation 
from one another, Horton also offered a generalized 
qualitative sketch of three distinctive drinking pat 
terns, found in different parts of the world, which 
are expressions of the similarity in the Gira 
psychocultural variables seen as systems. The a 
pattern is associated with societies having high su 
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sistence anxiety, strong belief in sorcery, weak 
sexual restraints, and coordinate social control. 
Here inebriety is extreme and, because of the 
vacuum in social power capable of restraining it, is 
almost always accompanied by extreme aggression. 
Cultural adaptation to this condition typically in- 
volves precautions of various kinds that are usually 
entrusted to the women, who drink less than the 
men. The second pattern occurs where the belief 
in sorcery is absent or relatively unimportant but 
where subsistence anxiety is also high and the 
anxiety motive too strong to be countered by sexual 
anxieties. Horton made no attempt to characterize 
the control of aggression in these societies but as- 
sumed from the unimportance of sorcery a mini- 
mum of motivation and thus little need for control, 
In any event, in the second pattern insobrietry is 
seen as invariably excessive but as accompanied 
by only moderate aggression in most cases. Finally, 
in the third pattern, fear and restraint in drinking 
are predominant, sometimes even extending to a 
complete taboo on alcoholic beverages. This sketch 
of patterns was not intended as exhaustive, but it 
offers the nucleus of a typology of patterns that 
have recurred with some frequency in primitive 
societies on a world-wide basis. Regrettably, little 
has been done to verify or modify this sketch, al- 
though it would seem to offer a fruitful point of 
departure for research. 

Social organization. A note of challenge to 
Horton's basic proposition on anxiety reduction was 
sounded in a cross-cultural study by Field (1962), 
who used Horton's original sample of primitive so- 
cieties but extended the investigation to include 
variables not previously considered. Noting the 
highly inferential nature of Horton's measures of 
anxiety, Field introduced new indexes of “fear,” 
which were thought to be more direct reflections 
of levels of anxiety, and found no consistent corre- 
lation with “extent of drunkenness.” In his recon- 
sideration, the relation between subsistence inse- 
curity and extent of drunkenness seemed more 
indicative of differences in economy, and hence in 
social organization, than of strikingly different 
levels of anxiety. Consequently, Field abandoned 
Horton’s anxiety-reduction view in favor of explor- 
ing further the relations between social organiza- 
tion and drunkenness in primitive societies, Al- 
though, for reasons given elsewhere (Snyder 1964), 
we do not share Field's apparent dismissal of the 
role of anxiety in drinking, important relations be- 
tween drunkenness and social organization are sug- 
gested by his research. For instance, he found that 
societies with strong corporate kin groups (organ- 
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ized on unilineal principles, exhibiting continuity 
in time, capable of concerted social action, and hav- 
ing elaborate collective ceremonial and symbolism ) 
seemed markedly sober, whereas those structured 
bilaterally, with amorphous, fragmented, and loose- 
ly organized social relationships, appeared marked 
by extensive drunkenness. Generalizing his find- 
ings, Field proposed a distinction between “corpo- 
rate” and “personal” types of social organization as 
the principal determinant of sobriety or drunken- 
ness in primitive societies. 

In a later large-scale research study of drinking 
in primitive society, Margaret Bacon and others 
(1965) offered an important commentary on parts 
of Horton’s and Field's earlier work and another 
hypothesis to account for part of the cultural varia- 
tion in drinking and drunkenness. Their research 
has advantages over previous work since it virtu- 
ally doubled the societies sampled (to 110), in- 
creased the number of key variables, developed a 
factor analysis of dimensions of drinking, and em- 
ployed independent ratings of variables and more 
sensitive measures of association. The findings cor- 
roborate Horton's linkage of type of economy and 
drunkenness, call into question certain other as- 
pects of his work, and diminish the significance of 
the findings reported by Field. Yet, because the 
relevant variables are treated singly rather than in 
combination, it cannot be said that the configura- 
tions of psychocultural factors and drinking pat- 
terns suggested by Horton are without substance. 
Also, the pattern of correlations established by 
Field remains, by and large, in the predicted direc- 
tion, suggesting some validity to his conclusions on 
drunkenness and social organization. 

Socialization and dependency needs. The study 
by Bacon and others took as its starting point the 
repeated clinical observation that persons with 
drinking problems exhibit marked conflict over the 
expression of dependency needs and the fact that, 
while the helplessness of the infant is a universal 
aspect of the human situation and the socialization, 
of dependence a universal cultural imperative, 
there are pronounced differences among societies 
in the attention paid to and indulgence of infants; 
in the cultural pressures toward achievement, self- 
reliance, and general independence in childhood; 
and in the extent to which cultural attitudes pre- 
scribe, tolerate, or enjoin the expression of depend- 
ency needs in adult life. These initial observations 
were expanded into the hypothesis that amounts 
and patterns of alcohol consumption have their 

roots partly in the degree of nurturing in infancy, 
the extent of demands for self-reliance and achieve- 
ment in childhood, and the extent to which the 


expression of dependent needs is permitted in adult 
life. It was assumed further that alcohol would be 
especially rewarding where dependency conflict is 
acute, because of its triple function of reducing 
anxiety and tension, permitting the satisfaction of 
dependency needs, and facilitating uncritical in- 
dulgence of unrealistic achievement fantasies. As 
a test of this view, various indexes bearing on the 
indulgence of dependency and pressures toward 
independent behavior were correlated with inde- 
pendently rated measures of alcohol consumption 
and frequency of drunkenness, and the statistical 
results support the hypothesis with consistency. 
Thus, the conclusion that “frequent drunkenness or 
high consumption, or both, tend to occur in cul- 
tures where needs for dependence are deprived or 
punished, both in childhood and in adult life, and 
where a high degree of responsible independent 
and achieving behavior are required” (M. Bacon 
et al. 1965, p. 43) seems warranted by the facts. 
Ceremonial usage and solidarity. The extensive 
ceremonial usage of alcohol among primitive peo- 
ples has been explored by Klausner (1964) in a 
comparative study of 48 societies. He classified 
ceremonies in terms of their underlying meaning 
and characteristic form, according to the basic 
problems on which they center and the means em- 
ployed, whether exorcism, scapegoating, sacrifice, 
prayer, or other. Like M. Bacon and her associates, 
Klausner found no regular bond between cere- 
monial usage and sobriety but even noted a tend- 
ency, at the cultural level, for usage in religious 
ceremonies to be associated with heavy drinking in 
nonreligious situations. The findings also pointed 
up a connection between heavy drinking in non- 
religious situations and the use of alcohol in sacri- 
fice (relative to other religious means), especially 
when religious ceremonies were oriented toward 
“moral integration and control of the human 
world.” And regardless of the ceremonial orienta- 
tion, alcohol was more likely to be used in con- 
junction with sacrifice than with any other religious 
means. To account for cultural cases in which there 
was virtually no drinking beyond the boundaries of 
ceremonial situations, Klausner suggested—in the 
light of the ancient symbolic equation of life-giving 
power, blood, and alcohol—that when blood is con- 
sidered holy, its symbolic equivalent, alcohol, will 
not be drunk heavily in secular situations. 
Sacrifice as a ceremonial means is, as Klausner 
has noted, closely linked to the establishment of 
social solidarity, enabling worshipers to transcend 
barriers when their solidarity is threatened by sin 
and guilt. Broadly, then, he sees the widespread 
use of alcohol in the sacrificial ritual of primitive 
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peoples as suggesting a link between its use and 
coping with the problem of evil in man’s relation- 
ship to man (and God) and proposes that this 
stamp of meaning most frequently extends to secu- 
lar drinking situations. Indeed, it is suggested that 
the modern “cocktail party” exhibits (in its group 
nature, normative prescriptions, sentiments of be- 
trayal for failure to participate, removal of social 
distinctions among participants) formal similarities 
to ancient sacrificial ritual and serves a “guilt- 
ridding” and integrating function equivalent to that 
formerly met through these rituals. The prolifera- 
tion of cocktail parties and related social drinking 
in contemporary society may be thus viewed as an 
integrating response to structural breakdowns in 
society that have left individuals in socially and 
normatively ambiguous situations with consequent 
high anxiety. 

Interrelationships of factors. Although the com- 
parative studies of drinking in primitive societies 
touched upon here hardly display the richness of 
anthropological investigations of particular cul- 
tures, they highlight the importance of sociocul- 
tural factors in shaping drinking behavior and, 
without offering a total explanation, are indicative 
of crucial factors in variation, Moreover, there is 
reason to think that significant interrelationships 
exist among the variables brought to light by these 
studies. For instance, socialization maximizing con- 
flict over dependency needs, as described by Mar- 
garet Bacon and her associates, may be function- 
ally linked to the characterological requirements of 
the kinds of economies identified by Horton as as- 
sociated with extensive drunkenness. Or, again, the 
amorphous social organization depicted by Field 
may intensify dependency conflict and engender 
acute problems in sustaining the social solidarity 
emphasized by Klausner. However, substantiation 
of these probable interrelationships awaits research 
that treats systematically the interaction of varia- 
bles provisionally identified as crucially affecting 
patterns of drinking behavior. 


Drinking and alcoholism in complex society 

Bold attempts to portray the functions of drink- 
ing in complex society have been rare in the writ- 
ings of social scientists. An exception is Selden 
Bacon’s (1945) essay depicting large, highly spe- 
cialized society as broadly “anomic"—characterized 
by rapid change, lack of normative integration, 
compartmentalized social controls, the paradox of 
heightened individualism and functional interde- 
pendence, intense competition, mobility, and im- 
personality. The social value of alcohol is presum- 
ably enhanced in this context not only because it 


271 


affords relief for tension-ridden individuals but also 
because it offers a needed mechanism of social in- 
tegration, and the absence of generally held drink- 
ing norms is conducive to extremes of drinking. 
This is a setting that also seems likely to precipi- 
tate rapid deterioration and isolation of the person 
who is prone to excessive drinking and to lend 
drinking a greater individualistic significance than 
it is likely to have in better integrated societies. 

Although such a portrait is admittedly specu- 
lative, there are a number of empirical investiga- 
tions which have yielded findings consonant with 
it. Studies in the United States and Canada by E. M. 
Jellinek (1947) and John R. Seeley (1962), for 
example, have demonstrated a relationship, within 
broad cultural limits, between concentration of 
population, on the one hand, and alcohol consump- 
tion and alleged rates of alcoholism, on the other, 
The research of Richard Jessor and others (1963) 
has established connections between extreme drink- 
ing and peaks of “anomie’—both in the sense of 
dissociation of cultural goals and means (after 
Robert K. Merton) and in terms of simple break- 
down in normative consensus. Excessive drinkers 
have also been found, in a scattering of studies, to 
exhibit signs of anomie and alienation. And alco- 
holics and incipient alcoholics, as studied by Ralph 
G. Connor (1962) and Peter Park (1962), evi- 
denced difficulty in structuring their social roles in 
accordance with the role requirements of an im- 
personal complex society. 

From a historical standpoint, there is evidence 
of the increasingly widespread use of alcohol in 
rapidly urbanizing societies such as the United 
States, although this does not necessarily imply an 
increase in per capita consumption among drinkers. 
The United States, like several European countries, 
experienced a strong temperance movement during 
the nineteenth and early twentieth centuries. 
Joseph Gusfield (1962) has suggested that the rela- 
tive dominance of the abstinence orientation in the 
United States signified the ascendance in the status 
structure of a middle class whose values were con- 
gruent with the needs of small-scale capitalism and 
were generally hostile to the expression of emo- 
tional impulses. Abstinence was important as a 
criterion for acceptance in the middle class, and 
membership in the Woman’s Christian Temperance 
Union was an important symbol of status in small- 


town and rural America in the nineteenth century. 
Yet, even prior to the repeal of prohibition in the 
United States in 1933 there had been a decay in 
the abstinence tradition that reflected deeper shifts 


i i f society. 
in the styles of life and the organization o! 
The cosmopolitanism of the new middle classes 
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(Stone 1962) apparently supports a relatively per- 
missive drinking norm, and thus abstinence has 
undergone devaluation in its status connotations. 

At the bottom of the modern urban social struc- 
ture lies “Skid Row,” which is increasingly an ob- 
ject of study and concern on the part of social 
scientists (for example, Jackson & Connor 1953; 
Pittman & Gordon 1958; Rubington 1958; Bogue 
1963). In broad sociological perspective, Skid Row 
may be viewed as embracing varieties of the retreat- 
ist mode of adaptation to the anomie of modern 
society. For several decades its population appears 
to have been declining, becoming less geograph- 
ically mobile, and undergoing other demographic 
changes, although it is still almost exclusively male 
in composition. A significant proportion of these 
men are permanent residents—living in the cheap 
hotels, flophouses, and missions indigenous to the 
area—and a majority of them are characterized by 
casual labor, poverty, and homelessness. Although 
it must be emphasized that only a small fraction 
of all alcoholics are inhabitants of Skid Row, the 
incidence of alcoholism and related drinking pa- 
thologies, and of other psychiatric and physical 
disease as well, is unquestionably high among ha- 
bitués, even though comprehensive statistics are 
unavailable. While men on Skid Row are largely 
alienated from the mainstream of social life, popu- 
lar imagery of complete social isolation needs to be 
corrected by awareness that Skid Row constitutes 
a subcultural system, binding men in a complex 
network of social relationships with distinctive 
norms and values. This has now been recognized 
among the better informed as having an important 
bearing on the outcome of ameliorative approaches 
to Skid Row—a matter in which the policies and 
programs of several European nations are in ad- 
vance of those prevailing in the United States [see 
HOMELESSNESS]. 

Group rates of alcoholism. For the most part, 
recent sociological investigation of drinking in 
complex society has focused on diverse, fragmen- 
tary problems, such as the examination of drinking 
patterns and pathologies of ethnic groups, age and 
sex categories, and socioeconomic strata, or has 
aimed at delimiting the parameters and gross social 
correlates of drinking through surveys of drinking 
behavior and attitude among state, regional, or na- 
tional populations (for example, Gadourek 1963; 
Lawrence & Maxwell 1962; Mulford & Miller 1960). 
Although the scope of this article does not allow 
summarization of the varied findings of these 
studies, some of them have, potentially, a broader 
relevance for understanding alcohol problems than 


might be supposed at first glance. This is because 
marked differences in rates of alcoholism have un- 
equivocally been established for various subgroups 
and categories of contemporary society. Although 
absolute rates of alcoholism are not known with 
certainty, it has been established, for instance, that 
there are sex differences in rates (and that sex 
ratios vary from one social milieu to another) and 
that there are marked differences in alcoholism 
rates among ethnic groups. These facts pose prob- 
lems for social scientists analogous to the problem 
which Durkheim saw in varying group rates of 
suicide and likewise seem to call for explanation, 
in part at least, at the sociocultural level. 

An illustration of the broader relevance of spe- 
cific sociological studies to understanding alcohol 
problems may be seen in research on the Jews, 
whose traditional drinking patterns were studied in 
contrast to those of the Irish by Robert F. Bales 
(1944) and in the aspect of change by Charles R. 
Snyder (1958). The case is instructive because 
alcoholism is rare among Jews, although virtually 
everyone in the group is exposed to alcohol, and 
there is no absence of psychic tensions of the sort 
that may play a role in alcoholism, Moreover, there 
is sufficient heterogeneity of physical type to make 
biological immunity implausible, Careful study of 
the drinking patterns, cultural settings, and inci- 
dence of psychic disorders led Bales to the conclu- 
sion that the distinctive normative orientation to 
drinking in traditional Jewish culture accounted 
for much of the difference in rates of alcoholism 
between the Jews and groups like the Irish. More 
generally, it led to the formulation that group rates 
of alcoholism may be thought of as resultants of 
the interaction of three major sets of factors: 
dynamic factors, or the group incidence of acute 
psychic tensions or needs for adjustment sufficient 
to provide the driving force in drinking pathologies; 
alternative factors, or culturally defined possibilities 
of adopting behavior patterns that are functional 
equivalents of excessive drinking from the stand- 
point of relieving acute psychic tensions; and orient- 
ing factors, or the kinds of normative attitudes 
toward drinking itself carried in the cultures of dif- 
ferent groups. Although this scheme poses method- 
ological difficulties in application, it offers, heuris- 
tically, a more satisfactory provisional framework 
for analyzing the etiology of alcoholism than do 
one-sided schemes which neglect either the psycho- 
logical or the sociocultural dimension. 

Predominant type of alcoholism. The impor- 
tant problem of the definition of alcoholism will 
not be treated in this article; here we will simply 
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note that alcoholism should be distinguished from 
drinking and even from excessive drinking, and 
that this matter of definition is very complex. 

The extent of alcoholism in primitive societies 
remains largely unknown. In complex societies, 
such as the United States and several countries of 
northern Europe, what Jellinek (1960, p. 37) has 
referred to as “gamma” alcoholism is the predom- 
inant type, “in which (1) acquired increased tissue 
tolerance to alcohol, (2) adaptive cell metabolism 
. . . (3) withdrawal symptoms and ‘cravings’. . . 
and (4) loss of control are involved.” Gamma alco- 
holism is patterned in terms of a definite progres- 
sion of symptoms (phases), moving from psy- 
chological to apparent physical dependence and 
entailing marked behavioral changes. Jellinek’s re- 
search on the drinking of male alcohol addicts 
found that the symptoms associated with gamma 
alcoholism could be sequentially arranged into four 
phases: the prealcoholic symptomatic phase, asso- 
ciated with the relief of personal tension in drink- 
ing situations; the prodromal phase, characterized 
by, among other symptoms, the appearance of re- 
peated blackouts; the crucial phase, identified by 
the drinker’s loss of control; and the chronic phase, 
marked by prolonged bouts of intoxication. This 
view of the phasic development of gamma alco- 
holism has received some support in other research 
(Trice & Wahl 1958; Park 1962) and seems to be 
in accord with typical drinking histories of mem- 
bers of Alcoholics Anonymous. It has certainly col- 
ored the thinking of much social science research 
bearing upon alcoholism and has led not only to 
the construction of indexes of incipient alcoholism, 
enabling study of the illness at its inception, but 
also to the possibility of therapeutic intervention 
before alcoholism has run its course. 

Personality factors. The sociological and related 
literature on alcoholism runs the gamut of assump- 
tions regarding the relevance of personality factors 
to the etiology of alcoholism and about their vari- 
able or unitary nature, aside from speculation as to 
the specific factors involved. Moreover, critical re- 
views by sociologists such as Edwin H. Sutherland 
and others (1950) and Leonard Syme (1957), who 
focused on the methodological inadequacies of a 


host of investigations purporting to identify person- 
ality factors in alcoholism, serve to reinforce an 
this regard. Nevertheless, 


attitude of caution in sar a 
recent longitudinal and quasi-longitudinal resear 


that avoids the pitfall of confusing possible conse- 
quences of alcoholism with conditions at or prior to 
its inception (for example, Park 1962; McCord & 
McCord 1962; Robins et al. 1962) tends, in our 


273 


judgment, to support the view that disturbances in 
socialization experience and personality prior to 
alcoholism are the rule rather than the exception 
(at least in so-called “Anglo-Saxon” cultures). Taken 
together with a growing body of retrospective analy- 
ses of life histories of alcoholics, these studies 
strengthen the impression that, without viewing al- 
coholism as the invariable outcome of a single per- 
sonality type, certain types of personality under 
certain environmental stresses are particularly prone 
to alcoholism as a means of adjustment. Howard 
Jones’s (1963) diagnostic grouping of alcoholics 
into types characterized by acute adjustment needs 
centering on maternal dependence, ego need, social 
inadequacy, social dependence, escapism, latent 
homosexuality, and Oedipal fixation is suggestive 
in this connection. Rather typically, current studies 
of alcoholism are, like Jones's, replete with allu- 
sions to socialization experiences and personality 
dynamics in relation to the root problems of early 
emotional deprivation and dependency conflict. 
Prevalence of alcoholism. While there are a 
variety of ways of estimating the extent of alco- 
holism in small, circumscribed populations (for 
example, field studies, analyses of hospital admis- 
sions, arrest records), the Jellinek Estimation For- 
mula has been widely used in estimating the prev- 
alence of alcoholism in large populations. This 
formula is A = (PD/K)R, where A signifies all 
alcoholics, D reported deaths from cirrhosis of the 
liver in a given year, P an assumed constant per- 
centage of such deaths attributable to alcoholism 
(different for men and women), K a constant rep- 
resenting the percentage of all alcoholics-with-com- 
plications who die of cirrhosis, and R a presumed 
ratio of all alcoholics to alcoholics-with-compli- 
cations in the given time and place. This formula 
has been criticized (Seeley 1959; Brenner 1959), 
but Mark Keller (1962) has maintained that the 
underlying theory of the formula is sound, From 
statistical data on alcohol consumption and num- 
bers of drinkers, Keller has contended that current 
alcoholic rate estimates have approximate validity. 
Although the available data are insufficient to per- 
mit exact international comparisons, alcoholism 
rates for a dozen countries estimated by means of 
the Jellinek formula for various years in the decade 
following World War 1 show the following rank 
order (from higher to lower): France, United 
States, Chile, Sweden, Switzerland, Denmark, Can- 
ada, Norway, Finland, Australia, England and 
Wales, Italy (Alcoholism Research Foundation of 
Ontario 1958). Keller (1962) has concluded that 
there were approximately 4,470,000 alcoholics in 
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the United States in 1960. This figure is suggestive 
of the magnitude of the alcoholism problem, but 
the costs of alcoholism in all its ramifications from 
the standpoint of human values are not amenable 
to precise calculation, although they are certainly 
enormous. 

Social responses to alcoholism. Legislation in 
the Netherlands and Norway aimed at treating al- 
coholism dates back to the turn of the century (for 
accounts of the development of national programs 
on alcoholism and alcohol problems in Europe, see 
Tongue 1962; Pittman & Tongue 1963). However, 
it was not until the 1930s in the United States that 
a large-scale movement with a treatment orienta- 
tion toward alcoholism first took shape, with the 
appearance of Alcoholics Anonymous in 1934 and 
the establishment of an American Research Coun- 
cil on Alcoholism four years later. In 1940 the first 
issue of the Quarterly Journal of Studies on Alcohol, 
a scientific journal dealing with such problems as 
the physiological, psychosocial, and cultural rami- 
fications of alcohol and alcohol usage, was pub- 
lished at Yale University’s Laboratory of Applied 
Physiology by the Journal of Studies on Alcohol, 
Inc. In 1943 the Yale Summer School of Alcohol 
Studies was established to train persons interested 
in this field, and in 1945 the National Council on 
Alcoholism was organized to disseminate informa- 
tion to the general public. This new approach 
focused on removing the stigma from alcoholism 
and on making it the object of scientific investiga- 
tion. In the past the state and local governments 
had conceived of their roles as controlling the sale 
of alcohol and punishing the alcoholic offender, 
whereas the new emphasis predicated alcoholism 
as a problem of public health. In accordance with 
this outlook, Oregon, in 1943, became the first state 
to institute a public health program aimed at alco- 
holism. By 1965, 42 American states, the District 
of Columbia, and seven Canadian provinces had 
official agencies on alcoholism. 

It would be a mistake, however, to imply that the 
public health approach to alcoholism was wide- 
spread and always accepted with little or no resist- 
ance. Although the repeal of prohibition in 1933 
dealt a serious blow to the older “moralistic” view 
of alcoholism, attitudes which had predominated 
for generations were not easily discarded. Resist- 
ance to the treatment of the alcoholic is indicated 
by the slow acceptance by many professional health 
and social welfare personnel of the proposition that 
alcoholism is a disease requiring their special 
knowledge and skills. It was not until 1956 that 
the American Medical Association expressed the 
opinion that alcoholism should come under the 


scope of medical practice, that hospitals should 
make provisions for the care of alcoholics, and that 
medical interns should be trained in the treatment 
of alcoholics. In 1957 the association finally gave 
its official approval to the statement that alcoholism 
is a disease, and the American Hospital Association 
also urged that each case of alcoholism be exam- 
ined individually to determine whether or not the 
condition was amenable to medical treatment. This 
represents a notable change from the situation pre- 
vailing a generation ago. 


CHARLES R. SNYDER AND DAVID J. PITTMAN 


[See also ANx1ETY; Drucs.] 
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DRIVES 


Paul Thomas Young 
Judson S. Brown 


1, PHYSIOLOGICAL Drives 
mm, ACQUIRED DRIVES 
1 
PHYSIOLOGICAL DRIVES 


Physiological drives are regarded as persisting, 
organic motivations: conditions that arouse, sus- 
tain, and regulate human and animal behavior. In- 
sofar as drives are based upon metabolic conditions 
they are common to all men in all societies and to 
many animals. Examples of drives originating in 
metabolic conditions are: general hunger, specific 
food appetites, thirst, air hunger, the urges to uri- 
nate and defecate, the urge to maintain a constant 
internal temperature and to sleep, to rest when 
fatigued, to be active when rested. 

Carl J. Warden defined “drive” empirically as 
persisting goal-oriented behavior, and he measured 
the strength of hunger, thirst, sex, maternal, and 
exploratory drives by observing the frequency with 
which a rat approached and crossed a charged 
grille to reach an incentive object: food, water, 
mate, etc. Most psychologists, however, regard 
drive as an explanatory concept. Behavior is driven 
by certain internal conditions called drives; drive 
is a hypothetical construct employed to explain 
activity. 

Physiological drives must be distinguished from 
external determinants, including the whole gamut 
of social goals, interests, values, attitudes, and per- 
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sonality traits. When, for example, Buddhist monks 
in Vietnam committed suicide by burning, their 
motivation was derived from their complex social 
situation and not from physiological drives. 

Drive versus drives. The term “drive” was intro- 
duced to American psychology by Robert S. Wood- 
worth in 1918. He derived the term from mechanics 
rather than biology. A machine, he claimed, is a 
mechanism that can accomplish work only if there 
are energy transformations within its parts. The 
drive of a machine is the physical energy that 
makes it go. 

Following Woodworth, other psychologists (Fred 
A. Moss, John F. Dashiell, Edward C. Tolman, Curt 
P. Richter, Carl J. Warden, Calvin P. Stone, and 
others) adopted the term but began experimenting 
with drives (in the plural), such as the hunger, 
thirst, and sex drives. These drives were not dis- 
tinguished as forms of physical energy (mechan- 
ical, thermal, electrical, chemical, photic) but, 
rather, in terms of the external and bodily condi- 
tions that underlie and regulate behavior. 

Energizing and regulating functions. Some psy- 
chologists restrict the concept of drive to the ener- 
getic aspect of motivation. Judson S. Brown (1961) 
clearly distinguished drive as an energizing func- 
tion from the functions of directing and regulating 
behavior. Regulation is thought to be accomplished 
by associative bonds, habit systems, innate struc- 
tures. Drive stimuli originate in tissue conditions 
and as stimuli they have a common energizing func- 
tion. Brown wrote of different sources of drive 
rather than of different drives, Again, Elizabeth 
Duffy distinguished between the activating and di- 
recting aspects of motivation. She identified drive 
with activation and defined it in terms of energy 
potentially available for behavior (1962, p. 17). 

The distinction between energizing and regulat- 
ing functions is of prime importance in any defini- 
tion of the drive concept. But since energy trans- 
formations always occur within bodily mechanisms 
and since these mechanisms regulate the pattern 
and direction of activity, it is difficult, if not impos- 
sible, to consider energizing apart from bodily 
structures. 

The stimulation theory of drive. In the writings 
of Clark L. Hull, drive (D) is a nonspecific ener- 
gizing factor that can raise the excitatory potential 
of various habits ( 1943). In addition to D, Hull 
postulated various specific drive stimuli (symbol- 
ized by Sz or D,) arising from tissue needs. Sa 
strengthens only those responses that are relevant 
to satisfying a specific need, but D strengthens all 
responses that happen to be aroused when D is 
present [see HULL]. 

Some behavior theorists, for example, Neal 


Miller and Kenneth W. Spence, have staunchly 
supported Hull's postulate of two drive factors— 
general drive and specific drives—but others have 
questioned the need for a nonspecific drive factor. 
Thus, Edwin R. Guthrie argued that the concept 
of stimulus is fully adequate to account for all 
drives. A persisting stimulus from a tense bladder, 
he wrote, is adequately described as a stimulus and 
nothing is gained by designating it a drive. William 
K. Estes (1958 ) wrote that drive stimuli are similar 
to all other stimuli, and he suggested that selective 
behavior can be explained without the benefit of a 
nonspecific, energizing drive factor. Robert C. 
Bolles (1958), after reviewing the positions taken 
by behavior theorists, questioned the utility of the 
drive concept. Use of the drive concept, he wrote, 
reflects an ignorance of the stimulus; if more were 
known about the stimuli associated with each spe- 
cific drive, one could dispense with a general non- 
specific drive factor [see GUTHRIE]. 

Despite these views, there is sound physiological 
evidence, as Donald O. Hebb has pointed out 
(1955), for the existence of a general, nonspecific 
factor of arousal or activation. The evidence will be 
considered below [see STIMULATION DRIVES]. 

Chemical motivation. Every theory of drive 
must take account of persisting chemical sensi- 
tizers, excitants, inhibitors, and tranquilizers that 
act directly upon neural centers. A single example 
of hormonal control will illustrate the nature of 
chemical motivation: 

In a series of studies upon male sexual behavior, 
Frank A. Beach demonstrated a positive relation 
between the rat’s sexual reaction to an estrous 
female and his speed of locomotion to the goal box 
in which she is held. The active males, after dem- 
onstrating normal sexual activity, were castrated. 
Following the operation there was a gradual loss of 
sexual ability, shown by decrease in the intensity, 
completeness, and frequency of mating responses 
and by a steady lengthening of the time required to 
traverse a runway to reach a mate. Copulatory re- 
sponses were finally abolished, and many males 
remained in the starting box, failing to enter the 
runway. At this stage of the experiment the inactive 
males were given daily injections of testosterone 
propionate. While injections continued there was 
an increase in sexual reactions to the estrous 
female and a steady decrease in the time required 
to reach the goal box. The experiment (Beach 
1956, p. 12) clearly demonstrated the importance 
of hormonal factors in maintaining sexual re- 
sponsiveness. 

Homeostatic needs and drives. Claude Bernard 
pointed out that the cells of the body can survive 
only if the fluid matrix within which they exist 


remains relatively stable in its physical and chem- 
ical constitution. Following Bernard, Walter B. 
Cannon coined the term “homeostasis” to designate 
the steady physical state essential to life. Within 
the blood, for example, there are homeostases of 
oxygen content, water content, temperature, acidity, 
glucose, fat, proteins, and calcium, sodium, and 
other minerals. 

The principle of homeostasis provides an im- 
portant basis for defining objectively the metabolic 
needs of men and animals. To maintain homeo- 
stasis an organism needs (requires) certain nu- 
tritive substances: oxygen, water, proteins, fat, 
carbohydrate, minerals, and vitamins. Also, an 
organism needs to maintain a constant internal 
temperature despite wide variations in external 
temperature [see HomxosTAsIs]. 

Some psychologists have equated needs and 
drives, but the concepts are distinct. Homeostatic 
needs may affect growth, reproduction, metabolism, 
and health with little or no influence upon behavior. 

Curt P. Richter extended the homeostatic doc- 
trine by showing ways in which behavior aids the 
organism in maintaining homeostasis (1943). When 
homeostasis is disturbed, he pointed out, behavior 
compensates to restore a steady state. To illustrate: 
If the adrenal glands of rats are surgically re- 
moved, the adrenalectomized animals die in 10 to 
15 days as a result of loss of sodium chloride 
through the urine. But if the operated rats are 
given free access to a 3 per cent sodium chloride 
solution, they ingest several times the normal 
amount of salty fluid and survive indefinitely in 
seemingly good health. The increased appetite for 
salt serves to maintain homeostasis and to pre- 
serve life, Richter (1943) believed that the effort 
to maintain homeostasis is one of the most uni- 
versal and powerful of all behavioral urges, oF 
drives, in animals and human beings. 

Other psychologists have extended the homeo- 
static doctrine far beyond its physiological limits. 
Attempts have been made to extend the doctrine 
to the maintenance of stability within the social 
order, to the perceptual constancies of the physical 
and social worlds, to constancies in personality, 
and to psychophysical judgments. It seems prefer- 
able, however, to hold to the original physiological 
meaning of homeostasis and to use some other 
concept, for example, adjustment, in considering 
the dynamic relations between an organism and 
its environment. [See PERCEPTION, article on PER- 
CEPTUAL CONSTANCY.] 

The homeostatic doctrine has met with a good 
many criticisms. For example, in a review of the 
literature on thirst, Lawrence I. O'Kelly (19€3) 
pointed out that the homeostatic doctrine does not 
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greatly help a physiologist who is dealing with the 
specific processes that regulate body water. It is 
difficult to identify thirst with any one specific 
bodily event. What is needed is patient collection 
of empirical data rather than speculation about a 
homeostatic thirst drive. 

Motivation and functional capacity. Robert S. 
Woodworth (1958) argued that when an active 
organism is dealing with its environment, no ex- 
trinsic drive is required to explain activity, The 
organism observes, explores, manipulates, and 
studies its environmental surroundings with intrin- 
sic motivation. 

Behavior manifests the innate capacities of the 
creature. The bird is hatched with an innate ca- 
pacity to fly. From this fact one can safely predict 
that the bird will fly. The young bird, in fact, makes 
considerable progress in mastering its mode of 
locomotion before using this skill in hunting food. 
The creature exercises its capacity to fly first for 
its own sake and later in the service of food seek- 
ing. Again, the chimpanzee has considerable in- 
clination to climb trees and to brachiate—a per- 
formance for which he has considerable talent 
thanks to his bodily proportions and lever sys- 
tems, bones, tendons, and muscular and nervous 
structures. 

Henry W. Nissen (1954), Robert W. White 
(1959), Daniel E. Berlyne (1960), and others 
argued that human sense organs, muscles, and 
brain have the functional capacity to observe, €x- 
plore, manipulate, test, reflect, think. These innate 
structures require activity for health and growth. 
No extrinsic motivation is needed to explain the 
functioning of an active human brain. Man, in 
fact, seeks stimulation. He rides a roller coaster, 
engages in dangerous sports, plays games that in- 
volve risk and suspense, works endlessly to solve 
puzzles. The motivation is neurogenic, and the pat- 
terns of behavior are regulated according to a 
structure-function principle. 

White emphasized the human tendency to ac- 
quire competence in dealing with the environment, 
Underlying this tendency is a form of motivation 
he labeled effectance. Effectance motivation is se- 
lective, directed, persistent. Its energies are neuro- 
genic, being the energies of the living, active cells 
that make up the nervous system. 

Similar views have been expressed by Harry F. 
Harlow, who demonstrated that manipulative and 
exploratory behavior does not depend on homeo- 
static drives, and by Gordon W, Allport in his doc- 
trine of the functional autonomy of motives [see 
PERSONALITY: CONTEMPORARY VIEWPOINTS]. 

Neurogenic drive. Under the influence of clas- 
sical associationism and the stimulus-response 
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formula an organism was regarded as a passive 
structure aroused to action by hunger, thirst, pain, 
sexual stimulation, and other primary drives and, 
secondarily, by acquired drives associated with 
these primary motivations. This view has proved to 
be inadequate because the fact that the nerve cell 
is not inert has become increasingly evident; the 
nerve cell does not have to be excited from the out- 
side in order to discharge. The cells of the brain 
are spontaneously active even during deep sleep. 
It is not necessary to postulate an extrinsic stimulus 
or drive to activate them. 

In his outstanding book The Organization of Be- 
havior (1949), Hebb did not use the term “drive,” 
He believed that the main problem of motivational 
psychology was to account for the direction of be- 
havior and that this could be done in terms of his 
physiological theories. He changed his opinion, 
however, when it became clear that there exists an 
anatomical and physiological basis for a nonspecific 
drive (1955), The evidence for neurogenic drive 
rests upon several lines of investigation, particu- 
larly upon work in the field of electroencephalog- 
raphy, as well as studies of the reticular activating 
system, 

Brain waves and levels of activation. Donald B. 
Lindsley (1957) distinguished different levels of 
activation, or arousal, in terms of brain wave char- 
acteristics. When the subject is calm his brain 
waves reveal a smooth, rhythmic pattern of about 
ten oscillations per second, known as the alpha 
rhythm. When the subject is excited, as by the bang 
of a gun or by pain or anxiety, the alpha rhythm is 
inhibited and instead there are fast waves of low 
amplitude. This change of rhythm is known as the 
activation pattern, 


Strong excited emotion 
Alert attentiveness 
Relaxed wakefulness 
Drowsiness 

Light sleep 

Deep sleep 

Coma 


Death 


Figure 1 — Continuum showing levels of activation 
Source: Based on Lindsley 1957. 


Different levels of activation are revealed in be- 
havior and in awareness, as well as in the pattern 
of brain waves. Figure 1 represents some of these 
levels of activation [see NERVOUS SYSTEM]. 


The reticular activating system. The reticular 
activating system is a network of fibers and 
synapses located at the levels of the medulla, hypo- 
thalamus, and subthalamus. It consists of two main 
parts: (1) the brain stem reticular formation, a 
network with many synapses and neural subsys- 
tems; and (2) a system of fibers that project dif- 
fusely from the thalamic nuclei to many regions 
of the cerebral cortex. 

It is known that every sensory stimulation has 
two kinds of effects upon the cerebral cortex: 
impulses are discharged through thalamic nuclei 
directly to the cortical areas where they provide 
sensory information; and impulses are sent through 
collaterals into the reticular activating system and 
are conducted over diffuse multisynaptic pathways 
to all parts of the cortex—to both sensory and non- 
sensory areas—where they have a nonspecific 
activating effect. 

The reticular activating system is not limited to 
one-way conduction—from sensory nerves to cor- 
tex. The system is excited by cerebral action, which 
in turn produces motor effects over efferent path- 
ways. It is likely that when one anticipates mis- 
fortune, as in anxiety, the cerebral cortex excites 
the reticulum. The motor effects are general, such 
as increasing muscle tonus in widespread regions. 

The feedback from all reactions has a dual effect. 
When, for example, the organism maintains a set 
of attention or when effort is expended, as in driv- 
ing an automobile against a blinding light, there is 
a sensory feedback from the proprioceptors. This 
feedback conveys sensory information and also has 
a nonspecific activating influence. 

Cue function and arousal function. Hebb dis- 
tinguished between cue function and arousal, or 
vigilance, function. Cue function exists when sen- 
sory stimulation transmits information directly and 
efficiently to the cerebral cortex over the great 
sensory projection systems: from sensory nerve to 
sensory tract, thence to the corresponding sensory 
nucleus of the thalamus, and thence directly to one 
of the projection areas of the cortex. Arousal func- 
tion is different. Collaterals from sensory nerves 
feed excitations into the reticular activating system. 
These excitations trickle through a tangled thicket 
of fibers and synapses. There is mixing of mes- 
sages, and the scrambled messages are delivered 
indiscriminately to wide areas of the cortex. In 
fact, the excitations are no longer messages. In- 
stead of conveying information they serve to tone 
up the cortex with supporting excitations that are 
necessary if the messages are to have their effect. 
Without the arousal system, the sensory impulses 
over the direct and quick routes would reach the 
sensory cortex and go no further; the rest of the 


cortex would be unaffected; learned associations 
would be unavailable for behavior [see ATTENTION]. 

The arousal function implies a nonspecific, neu- 
rogenic drive. This drive is an energizer, not a 
guide; an activator, not a steering mechanism. 
Hebb (1955) claimed that direction and regula- 
tion are dependent upon cue function, but cue 
function is useless and impotent apart from arousal, 
or drive. 

This view is important because it points to a 
source of motivation underlying observant, manipu- 
lative, exploratory, playful, and other forms of 
behavior. One does not have to look outside the 
nervous system for sources of physiological drive. 

Affective arousals. A complete theory of physi- 
ological drives must take account of affective 
arousals associated with stimulation, frustration, 
play, rewards bestowed, and punishments inflicted. 

Traditionally, the affective processes have been 
regarded as conscious experiences of pleasantness 
and unpleasantness that are present in simple feel- 
ings, emotions, moods, and sentiments. The affec- 
tive processes vary in sign (positive or negative), 
intensity, and temporal course. Sign and intensity 
can be represented on a bipolar continuum as in 
Figure 2. 


— 
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Figure 2 — The hedonic continuum 
Source: Young 196), p. 153. 


The hedonic continuum extends from the extreme 
of negative affectivity (distress) to the extreme of 
positive affectivity (delight), with indifferent 
processes at the midpoint. Different intensities 
of affective arousal are represented by arbitrary 
units marked off upon the continuum. Arrows indi- 
cate two important directions of hedonic change: 
toward the positive pole and toward the negative 
pole. f 
Young and others have shown that affective 
processes can be studied objectively and quanti- 
tatively with need-free rats. As a single example, 
consider a study by Young and Madsen (1963). 
Using a brief-exposure preference technique, they 
obtained data for plotting a curve that showed the 
hedonic equivalence of simple solutions of sucrose 
and saccharin. Using compound solutions, they 
plotted isohedonic contour Jines on a surface de- 
fined by the concentrations of sucrose and sac- 
charin. The contour lines showed the hedonic 
equivalence of simple sucrose standards and com- 
pound sucrose-saccharin solutions. 
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The affective arousals are in the nervous sys- 
tem. They energize behavioral patterns that lead to 
approach and withdrawal and that influence choice 
and the development of preferences. The physiol- 
ogy of affective processes is under extensive in- 
vestigation. 

James Olds (1955) demonstrated that if bipolar 
needle electrodes are implanted within the limbic 
system of a rat's brain, electrical stimulation of 
subcortical points may be either rewarding or pun- 
ishing. When stimulated in the septal area, the rats 
acted as if they liked the internal excitation. For 
example, if a hungry rat was stimulated in the 
septal area while he approached food, he would 
stop at the place where internal stimulation oc- 
curred rather than proceed to the food. Again, an 
apparatus was arranged so that pressing a bar stim- 
ulated points within the septal area. Rats repeatedly 
pressed the bar, but bar pressing ceased promptly 
when the circuit was broken and there was no 
longer septal stimulation. The stimulation of other 
points was punishing, For example, if the needle 
electrode was implanted in the medial lemniscus, 
a rat acted as if stimulation hurt him, In the ab- 
sence of internal stimulation a rat pressed the bar 
in about 2 to 10 per cent of the total time during 
a two-hour test, but with self-stimulation in the 
medial lemniscus the bar-pressing score dropped to 
zero, The rat avoided the lever. Self-stimulation at 
other subcortical points was neutral—neither re- 
warding nor punishing. 

Similar results have been obtained by other 
workers and with cats, monkeys, and other sub- 
jects. There can be no doubt that affective arousals 
have an objective existence within subcortical cen- 
ters [see NERVOUS SYSTEM, article on BRAIN STIM- 
ULATION]. 

Physiological drives and reinforcement. Major 
research problems center on the relation between 
physiological drives and learning. It should be clear 
from this discussion that a psychologist must con- 
sider the nature of drive when exploring problems 
of learning and development. 

From the point of view of stimulation theory the 
major emphasis has been upon conditioning, Thus, 
John F. Dashiell hypothesized that when an ex- 
ternal stimulus frequently occurs with a drive stim- 
ulus, the external stimulus becomes a substitute for 
the drive stimulus and tends to elicit the same re- 
sponses. Again, Edward E. Anderson argued that 
in the early stages of learning, behavior is con- 
trolled by internal drives, but as learning progresses 
there is increasing environmental control of be- 
havior—a transition that he called the externaliza- 


tion of drive. 


From the point of view of homeostatic need the 
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emphasis has been upon need and drive reduction. 
Thus, Clark L. Hull argued that responses are re- 
inforced when they lead to drive reduction and the 
meeting of a need; and secondary drive is acquired 
through association with primary drive reduction. 

The neurogenic theory of drive implies that drive 
reduction is not necessary to learning. And the 
affective arousal theory implies that behavioral pat- 
terns are organized in a way that leads to:hedonic 
change toward the positive pole of the bipolar con- 
tinuum (Figure 2), independently of drive reduc- 
tion. A hedonic principle is presupposed by Edward 
L. Thorndike’s well-known law of effect. 


PAUL THOMAS YOUNG 


[See also Mortvation. Other relevant material may be 
found in Emotion; LEARNING; and in the biogra- 
phies of CANNON; THORNDIKE; WOODWORTH.) 
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nH 
ACQUIRED DRIVES 


As used by contemporary behavioral scientists, 
the term “acquired drive” (synonyms: secondary mo- 
tive, psychogenic drive or need, acquirable or learn- 
able drive, and sociogenic need ) refers to a motive, 
need, or source of motivation (rarely defined) that 
is a product of learning. In principle, animals as 
well as man can acquire drives, but man’s oppor- 
tunities for such learning during the process of 
socialization are held to be far more numerous, and 
the complexity and diversity of his activities are 
thought more urgently to demand an acquired-drive 
interpretation. 

Typically, some form of learned drive is invoked 
in accounting for such behavior as working for 
money; seeking power, affection, status, or security; 
and striving for artistic creativity or other “higher” 
goals. Behavioral phenomena involving the motiva- 
tional consequences of incentives and the acquisi- 
tion by neutral objects of rewarding power are 
closely linked with those subsumed under the head- 
ing of acquired drives. 

Precise, carefully worded definitions of the phrase 
“acquired drive” are not often encountered in the 
writings of behavior scientists, nor is it customary 
to find clear descriptions of exactly what is being 
learned or of how behavior is affected “motivation- 
ally” thereby. This looseness of conception and ex- 
pression permeates writing in this field, and the 
multiplicity of different views resulting therefrom 
Suggest that acquired-drive concepts are still in 
primitive stages of development. 

Historical notes. Philosophical, literary, and 
religious writings from the earliest times contain 


references to the desires, strivings, wishes, and 
motives that have been said to govern man’s be- 
havior. And from the Middle Ages onward, refer- 
ences to the will and to conation became increas- 
ingly common in the theories of the famous 
philosophers. But it was not until the latter part 
of the nineteenth century that the concept of mo- 
tivation entered boldly and systematically into 
science-oriented psychological concepts of behavior. 
Many factors exerted an effect at that time. Dar- 
win’s views advanced the growing tendency to re- 
gard unlearned predispositions as the most impor- 
tant sources of motivation, a tendency strengthened 
by William James's insistence on the multiplicity of 
man’s instincts and by William McDougall’s em- 
phasis on innate propensities and their correlated 
emotional qualities. Freudian theory likewise under- 
scored the importance of unlearned primitive mo- 
tivational forces and may have been the first to 
emphasize a nonspecific driving agency, the libido. 
Students of animal learning, notably Edward L. 
Thorndike, found the primary drives and rewards 
of tremendous practical importance in teaching 
animals to make new responses, and the “law of 
effect” emerged as the conceptual counterpart of 
efficient training procedures. 

During the first two decades of the twentieth 
century, Watsonian doctrine was accompanied by 
a decline of interest in the introspectionistic em- 
phasis on the will as the precursor and cause of 
action, a shift that was reflected in the rather wide- 
spread substitution of the term “reaction” for 
“action.” Robert S. Woodworth, in 1918, was appar- 
ently the first in America to use the specific term 
“drive.” He distinguished drive, as the energy that 
moves the organism, from the other mechanisms 
that direct and control reactions, a distinction that 
has been widely adopted. Moreover, his notion that 
habits can be converted into drives may constitute 
a historically memorable reference to acquired-drive 
phenomena. 

Although O. H. Mowrer’s classic paper on the 
motivating effects of learned anxieties appeared in 
1939 and Henry A. Murray published his list of 
psychogenic needs in 1938 (Explorations . . . 1938), 
Neal E. Miller was apparently the first to make use 
of the term “acquired drives,” in the title of a paper 
published in 1941. Moreover, the chapter on ac- 
quired drives in Miller and Dollard’s Social Learn- 
ing and Imitation (1941) looms large as the first 
general theoretical treatment of such drives. The 
recent emphasis upon se 
part, the intermingling 0 
response doctrines, a gro 
idea that intervening variab 


ondary drives reflects, in 
f Freudian and stimulus- 
wing confidence in the 
les could be given pre- 
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cise operational meanings, and an increasing tend- 
ency to extrapolate to humans theories based on 
animal behavior. Whereas experimental studies of 
learned drives have burgeoned during the past dec- 
ade, systematic theoretical analyses have not. Such 
cogitations will have to wait upon further devel- 
opments in the broad field of motivation in general; 
there are strong signs that cherished conceptions in 
the latter area are being seriously challenged. Prom- 
inent theories of motivation have been expertly 
summarized by Madsen (1959) and the University 
of Nebraska Symposia on Motivation, published 
yearly since 1953, provide an excellent over-all 
view of the thinking of American psychologists on 
the general topic of motivation. 

Current trends and basic problems. The present- 
day tendency to appeal to a concept of acquired 
drives can be ascribed to a variety of factors, Some 
champions of the acquired-drive view justify their 
stand on the ground that learned human motives 
are manifestly visihle “on the face of behavior.” 
This attitude may mirror the fact that the language 
to which most of us have been exposed during the 
early years of life is not the language of science, 
particularly behavioral science, but that of the arts, 
literature, and religion. This nontechnical lexical 
heritage, which we learn first and best, contains a 
wide variety of such “motivational” synonyms as 
needs, wishes, impulses, desires, motives, aspira- 
tions, and hopes. Having matured within this lin- 
guistic environment, the student of behavior is 
almost certain to believe that these well-worn words, 
like other words learned at the parental knee, de- 
note existential realities. He is also quite likely to 
feel that each of these “entities” is an important 
determinant of behavior and that his task as a sci- 
entist is simply to find ways of identifying and 
quantifying them. 

A second reason for the widespread acceptance 
of the acquired-drive notion is the general convic- 
tion that biological drives, although capable of ex- 
plaining certain instances of animal behavior, are 
inapplicable to human activities. Homeostatic regu- 
latory mechanisms and biological deficiency states 
are deemed impotent in accounting for the alleged 
fact that man is motivated to work for praise, 
power, and recognition. It is considered self-evident, 
therefore, that man is driven by nobler, more in- 
tellective, more spiritual tendencies. Perhaps this 
is why acquired-drive concepts are never invoked 
to explain the persistence of morally or socially 
undesirable behavior. The activities of theologians, 
and scientists are held to reveal the 
tives but the actions of dope 
arbage collectors are not. 


musicians, 
power of sociogenic mo 
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A third reason for stressing the role of learning 
in motivation is that individuals are seen to vary 
among themselves to a surprising degree. This vari- 
ation, both within a single society and from one 
society or culture to another, exceeds expectations 
based solely on genetic mechanisms. Attention is 
drawn, therefore, to what the individual learns dur- 
ing the course of his development when multitudes 
of objects and varieties of behavior are allegedly 
acquiring value. It is a sensible next step, some 
maintain, to assume that as values are being ac- 
quired, needs or drives for objects or for courses of 
action are also being learned. 

Reappraisal. Although these arguments carry 
some conviction, there is a growing body of opinion 
favoring the conclusion that learned-drive con- 
structs may not be essential to adequate theories 
of human behavior. For example, Estes (1958), 
Brown (1961), and others have seriously consid- 
ered the possibility that motivational constructs, at 
least those of the traditional biogenic variety, may 
be superfluous and that adequate interpretations of 
alleged motivational phenomena may in some cases 
be based solely upon associative learning concepts. 
These views have not been extended, as yet, to 
secondary-drive manifestations, but there is no 
reason, in principle, why this could not be done. 

The central theme, of course, is simply that 
complex human behavior is learned and that its 
acquisition is governed by known learning proc- 
esses. If power-seeking behavior, for instance, were 
under scrutiny, one would aver that reinforcement 
(as yet largely unspecifiable) is responsible for the 
acquisition and maintenance of the actions. In- 
stances in which motivational processes seemed to 
be operative would be termed illusory, the illusion 
being laid to the fact that some of the variables of 
which behavior is a function have been errone- 
ously dubbed “motivational” by uncritical precur- 
sors. The known capacity of such variables to affect 
the organism would be put down to altered stimu- 
lation, a stratagem that reduces all so-called mo- 
tivational variables to stimulus variables, According 
to this view, in a word, man learns to react but not 
to be motivated. Neither the artist, nor the novelist, 
nor the office secretary need be regarded as having, 
respectively, drives to paint, write, and type. Each 
has learned to perform certain acts because these 
have been repeatedly reinforced by the complex 
societal machinery in which they are embedded. 

Actually, a theorist would not be required to 
abandon the concept of motivation altogether. He 
might maintain, for example, that a theory should 
include primary but not secondary sources of drive. 
Thus, whatever motivational process the theory de- 


mands could be provided by primary sources of 
drive, such as mild hunger, thirst, or sexual needs. 
These and other biogenic sources of drive, oper- 
ating in conjunction with learned reactions, could 
easily give one the impression that persisting be- 
havior reflects a learned drive for a goal object of 
some sort. 

Those who accept, as a working hypothesis, the 
view that some kind of learned-drive concept is 
desirable or necessary must wrestle with many sub- 
sidiary problems. Chief among these is the issue 
of whether a learned drive, or for that matter any 
drive, should be conceived as having both energiz- 
ing (activating) and behavior-directing functions, 
Hull (1943), Spence (1956), and Duffy (1951) 
may be seen as representing those who have, in 
part or in whole, regarded drive as a unitary, non- 
specific, energizing agency and have attributed the 
directionality or specificity of behavior to associa- 
tive tendencies or habit strengths. The fact that 
such variables as food deprivation seem to affect 
the direction of behavior is handled by assuming 
that such variables influence not only drive level 
but also the intensity, quality, or number of depri- 
vation-generated stimuli. 

Many other writers, of course (Explorations... 
1938; Young 1961), are insistent that drives 
(sometimes motives) must be directing, response- 
selecting agents, as well as activators. Young states 
repeatedly that motivation “is the process of arous- 
ing action, sustaining the activity in progress, and 
regulating the pattern of activity” (1961, pp. 23- 
24). Moreover, he adds that he would broadly de- 
fine the study of motivation as “a search for the 
determinants (all determinants) of human and 
animal activity” (ibid., p. 24). To adopt such a 
view, of course, is to rob the term “motivation” of 
all unique characteristics. As a search for all the 
determinants of activity, motivation becomes co- 
extensive not only with all of psychology, but with 
all other sciences, natural or social, that contribute 
to the understanding of behavior. 

Closely allied with the question of whether a 
drive or motive can guide or steer behavior is the 
problem of whether the drive itself can properly 
be said to have direction. Here again those who 
embrace such broad definitions as the foregoing 
tend also to say that one has drives for particular 
goal objects or conditions. This view may well be 
a carry-over from purposive conceptions of behav- 
ior and appears to be plagued with many of the 
ancillary difficulties that teleological notions must 
surmount. For example, how can one decide, un- 
equivocally, that one particular object or bit of 
behavior, rather than something else, is the goal? 


Consider the man who works hard in his office, is 
praised by his boss, gets a raise in salary, buys a 
new car, gets a date because of the car, and con- 
sequently is able to frustrate a rival suitor. In this 
case, the praise, the raise, the car, the date, or the 
satisfaction of thwarting the rival might each be 
described as a goal. But it would make little sense 
to say that the man had acquired a drive for each 
of these goals or a drive for one to the exclusion 
of all others. Actually, advocates of the position 
that one may have a drive for a specific goal object 
have never defended this concept on systematic or 
theoretical grounds. The only support has been of 
the “we know this is true” variety. Nor has the 
attribute of “having a drive for something” ever 
been assigned specific action-determining proper- 
ties within a formal behavior system. 

Drive as dissatisfaction with status quo. Inter- 
estingly enough, although fear is widely regarded 
as a good example of an acquired drive, one never 
says that fear is a drive for anything specific. Such 
an assertion would make sense only if one were to 
argue that the drive of fear is a drive for a non- 
fearful status. Since many different actions may be 
equally effective in getting an organism away from 
fear-arousing stimuli, the “goals” of the fear drive, 
if this manner of speaking is permissible, are in- 
credibly numerous. But in the same way, of course, 
one may say that the drive for food is a drive to get 
away from hunger. Apparently all so-called drives 
or sources of drive share the property of being states 
or conditions from which the organism would pre- 
fer to escape. Even the emotion of hope, regarded 
by some as a “positive” drive, may be seen as indi- 
cating a lack of the thing hoped for and, hence, as 
essentially unpleasant. Hope, to be sure, may in- 
volve pleasant anticipations, but since it is always 
characteristic of the pregoal attainment periods, it 
is undesirable relative to the actual goal. Perhaps, 
therefore, the only meaningful sense in which a 
drive may be said to be a drive for something is in 
the sense in which it must be a drive to be anything 
but what it is at the time it is functioning as a 
drive, This holds for the drive to be an artist, to 
convert the heathen, or whatever. All involve dis- 
satisfaction with the status quo. Every assertion to 
the effect that behavior is activated by positive, 
future goals can thus be transformed gnto the 
counterproposition that the behavior is driven by a 
contemporary state of unrest or imbalance. Such a 
translation is, it will be seen, consistent with the 


view that empirical variables can be identified as 


motivational provided (1) they serve as relatively 


nonspecific energizers, (2) their elimination fol- 
lowing a response is reinforcing, and (3) their 
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appearance following a reaction is punishing. Con- 
ditions that are inherently “positive” would not meet 
the second and third of these rather widely ac- 
cepted criteria for identifying motivational vari- 
ables. 

The incentive—motivation construct. An inter- 
esting example of what appears, at first sight, to 
be an acquired drive for a specific goal is provided 
by the Hull—Spence construct of incentive—motiva- 
tion. Suppose that one has been fed at a given 
region in space and is now situated at a point from 
which a food reward is invisible. At such a time 
the visual stimuli provided by food can serve no in- 
centive function and cannot elevate drive level 
directly. Hidden rewards cannot affect an organ- 
ism’s prereward behavior save through the agency 
of a stimulus-evoked anticipatory response. Accord- 
ing to Spence, when a subject is rewarded, a con- 
summatory goal response becomes classically condi- 
tioned to the cues provided by the food and to the 
surrounding environmental stimuli. Similar stimuli 
at a distance evoke fractional parts of the goal 
response, and these components of the total goal 
reaction are held to contribute in some measure to 
total drive level. Thus, even prior to reaching the 
reward, the subject reacts anticipatorily and appro- 
priately with respect to it, and these reactions are 
said to increase drive. Inasmuch as these com- 
ponents of the total goal response are uniquely 
relevant to the particular reward experienced on 
previous trials, it might be tempting to speak of an 
acquired drive for that rewarding object. Nonethe- 
less, whereas the anticipatory responses of sali- 
vating, chewing, and the like are appropriate to 
food, the increment in drive level, or incentive- 
motivation, that they are postulated to generate is 
not. Within the Hull-Spence theory, incentive- 
motivation, like drive, is a multiplier of habit 
strengths and contains no integral label directing 
it toward or away from any particular goal object 
[see HULL]. 

What is learned in acquired drives. Little can 
be said that is definitive in reply to the general 
question, “What does one learn when one learns to 
be motivated?” Typically, phasic or rhythmical skel- 
etal movements have rarely been nominated for 
positions of central importance in secondary-drive 
theory. Such reactions are not specifically excluded; 
the Miller-Dollard view suggests that any learned 
reaction capable of producing strong internal stim- 
uli qualifies as an acquired drive. Emotional reac- 
tions are most frequently cast in the role of drives, 
with the accent placed on learned expectations of 
pleasant events (hopes) and anticipations of nox- 
ious events (fears or anxieties). Emotional systems 
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are favored because they are relatively diffuse (as 
is also the case with such primary motivating con- 
ditions as hunger), they can be conditioned readily 
and, if the unconditioned stimulus is strong, resist 
extinction remarkably well. Verbal reactions, par- 
ticularly those involving self-directed, “try hard” 
instructions have only recently been suggested for 
consideration as learned responses potentially capa- 
ble of functioning as sources of drive. Provided 
these responses can be woven securely into well- 
established learning theories, they may serve a 
valuable function as contributors to drive. 

Inadequacies of theoretical formulations. We 
terminate this section by noting again that 
acquired-drive theorists have been gravely remiss 
in formulating specific theoretical presuppositions 
and details. Few have wrestled with the question 
of whether it is profitable to postulate the existence 
of more than one acquired drive and, should this 
question be answered in the affirmative, in what 
ways the drives resemble or differ from one another 
and from primary sources of drive. Similarly, little 
attention has been paid to the puzzles of whether 
drives should be treated as having directing as well 
as activating functions, whether whatever is learned 
in the process of acquiring drives is similar to other 
learned behaviors, how such new learnings are 
brought about, and how an acquired drive functions 
when it is alleged to be operating “motivationally.” 
Some of the solutions proposed as answers to these 
queries are noted in the following review of re- 
search and theory in the field of conditioned 
emotionality. 

Conditioned emotional responses. This discus- 
sion begins with, and concentrates on, an examina- 
tion of the acquired drive of fear (sometimes 
termed “anxiety”) because this exemplifies the most 
tightly reasoned theory and the most convincing 
experimental buttressing in the acquired-drive lit- 
erature, 

Whereas Freud clearly saw that a learned anxi- 
ety might have motivating properties and that its 
reduction would be rewarding, Mowrer (1939) crys- 
tallized these notions by translating them into the 
newly developing stimulus—response-—reinforcement 
language of the 1930s. Linking Freud’s conception 
with Pavlovian conditioning principles, Mowrer 
proposed, in brief, that the repeated paired pres- 
entation of a neutral and a noxious stimulus would 
result in the conditioning of anxiety to the neutral 
cue. After a few such trials, presentations of the 
neutral stimulus alone should evoke the learned 
emotional state. Moreover, this state ought to serve 
as a drive, because its presence should exert a 
dynamogenic effect upon other responses and its 


diminution, following a response, should be rein- 
forcing. 

Experimental data from a variety of sources in- 
dicate that conditioned fear exhibits these motiva- 
tion-like properties. The conclusion, therefore, seems 
firmly established that reactions leading to the 
cessation of fear-arousing cues will be strengthened. 
Fear has also been shown to have an energizing or 
activating effect under rather specialized circum- 
stances; the augmentation of startle reactions to 
auditory stimuli provide the clearest example, Am- 
ple support for the view that fear also manifests 
dramatic inhibiting properties comes from studies 
showing that lever-pressing reactions in the rat 
cease almost entirely in the presence of stimuli 
formerly paired with shock. This response-suppres- 
sing function, taken at face value, does not seem 
to square with the concept of fear as an acquired 
drive, but one might assume in these instances that 
the onset of fear has followed and has blocked bar 
pressing because of a punishing property character- 
istic of all drive states [see EMOTION; PHOBIAS]. 

Generalization to human behavior. Extensions 
of these experimentally demonstrated principles to 
human behavior have been common. Usually the 
accent has been placed on the position that anxi- 
eties are probably entirely acquired and that 
anxiety-reducing behavior tends to be perpetuated. 
Moreover, it has been argued that many human 
activities that appear to reflect acquired drives for 
particular goals are actually motivated by anxiety. 
Thus, for example, behavior suggesting a secondary 
drive for money has been reinterpreted as reflect- 
ing a learned tendency to be anxious in the absence 
of money. The receipt of money necessarily re- 
duces this “deficiency anxiety,” which, in turn, 
reinforces money-obtaining actions, such as work- 
ing. Comparably, the gregarious reactions of both 
humans and animals have been regarded as re- 
flecting an anxiety-like state produced by physical 
isolation and reduced by companionship [see 
ANXIETY]. 

Perhaps the most successful application of the 
conditioned fear paradigm has been its use in the 
interpretation of behavior that repeatedly succeeds 
in enabling one to avoid an impending traumatic 
event. The problem to be surmounted here is that 
each anticipatory avoidance reaction prevents the 
noxious stimulus from impinging on the subject 
and, hence, constitutes an extinction trial. Avoid- 
ance reactions, however, are sometimes remarkably 
resistant to extinction, and it is this observation 
that is explained by saying (Mowrer 1939) that 
avoidance responses are strengthened by virtue of 
their own anxiety-reducing function. The condi- 


tioned anxiety hypothesis also has furthered our 
understanding of guilt reactions. Here it is assumed 
that the emotional reaction becomes conditioned 
to internal, response-produced cues as well as to 
external ones. Therefore, when the first elements 
of a punished response sequence are evoked, the 
cues generated by these reactions evoke conditioned 
fear, guilt, or apprehension. Interrupting the chain 
by doing something else eliminates the guilt and 
thereby reinforces the competing “virtuous” re- 
sponses. 

Variables contributing to fear. Concerning the 
variables of which fear is believed to be a function, 
there appears to be rather general agreement that 
fear increases with the intensity of the noxious 
stimulus; with the number of conditioning trials, 
for a time at least; and perhaps also with leng- 
thened intertrial intervals and with the strength of 
an appetitive source of drive such as hunger. The 
principal procedure for eliminating or reducing fear 
is that of the Pavlovian extinction trial, on which 
the conditioned stimulus is presented without the 
unconditioned stimulus. Extinction may be accel- 
erated by massing trials and by structuring the 
situation to permit competing activities, for exam- 
ple, when one eats in the presence of the condi- 
tioned stimulus. Incidentally, the empirical finding 
that learned fears are extremely resistant to ex- 
tinction when very noxious stimuli have been used 
has stilled the earlier criticism that acquired anxi- 
eties are too transitory to provide a stable base for 
persisting human reactions. Indeed, it has been 
suggested by Solomon and Wynne (1954) that 
anxieties acquired under these “traumatic” condi- 
tions may never be extinguished completely, be- 
cause the changes produced by the learning process 
may be irreversible. 

Need for achievement. In recent years an am- 
bitious theoretical and experimental program con- 
cerned with the psychogenic drive, “need for 
achievement,” and other motives from Murray's 
list has been carried out by D. C. McClelland, J. W. 
Atkinson, R. A. Clark, and E. L. Lowell (1953). 
The conceptions of these investigators are included 
here inasmuch as their theory gives central staging 
to an affective arousal model of motivation. Two 
kinds of affective conditions are described. One, a 
negative state (anxiety), differs in no discernible 
way from the conditioned fear concepts of motiva- 
tion discussed above. The other, a positive ces 
(appetite), is essentially an anticipatory reaction 
to the coming of a pleasant stimulus and is the con- 
sequence of the pairing of neutral cues with pleas- 
ant ones. A motive, for these investigators, is the 
arousal of either a positive or negative affect by 
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the presentation of a conditioned stimulus. Thus 
all motives are necessarily learned, but it is not 
clear whether the primary affective reactions that 
serve as the basis for the conditioning of motives 
are, or are not, motivating. Part of the difficulty 
here arises from a failure to spell out precisely the 
effects of motives on behavior in general. Appar- 
ently, a motive (the affective expectancy ) operates 
as a selective facilitator of actions appropriate to 
the motive. These motives thus seem to function 
as though they provided cues appropriate to the 
criterion reactions rather than as general activa- 
tors. This has led to the suggestion that the term 
motivation may be inappropriate to this work, al- 
though obviously so long as the study of motivation 
is as broadly defined as it often is, hardly any usage 
can be pegged as inappropriate [see ACHIEVEMENT 
MOTIVATION; FIELD THEORY]. 


This review has presented only the barest out- 
line of the acquired-drive problem in all its com- 
plexity. Contemporary conceptions stress the view 
that an acquired drive is, in essence, a learned 
response, often an operationally definable emotion, 
such as fear, hope, or anxiety, capable of affecting 
other responses “motivationally.” Because the func- 
tions most commonly described as “motivational” 
are those of energization, reinforcement, and pun- 
ishment, a learned response seems to qualify as a 
motivational variable and, hence, as a secondary 
source of drive, provided it displays these functions 
with respect to other responses. Contemporary 
writers, however, also seriously entertain the notion 
that acquired-drive concepts are not required even 
when complex human behavior is being interpreted. 
For these theorists, the principles of associative 
learning theory are adequate to explain phenomena 
that, to others, are patently motivational. 

Junson S. BROWN 


[Directly related are the entries LEARNING, article on 
REINFORCEMENT; MOTIVATION, article on HUMAN 
MOTIVATION. Other relevant material may be found 
in LEARNING THEORY; MOTIVATION; PERSONALITY : 
CONTEMPORARY VIEWPOINTS, Article on A UNIQUE 
AND OPEN SYSTEM; STIMULATION DRIVES; and the 
biographies of THORNDIKE and WoopwortH.) 
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I 
PSYCHOPHARMACOLOGY 


Although psychopharmacology is a relatively new 
scientific word, the interactive effects of drugs and 
behavior are perhaps as old as man himself, This 
presentation will describe contemporary interest 
in psychopharmacology and will emphasize the 
social research frontiers. It will not attempt to 
describe the emerging body of human and animal 
research results or the treatment aspects of drugs. 

Although there are clearly a number of focuses 
for discussion, the individual human has been se- 
lected as the center of analysis; this article will 
deal with a number of individuai and social factors 
relating to the effects of drugs. These factors are 
complex and generally poorly identified and weakly 
conceptualized. The ramifications of the questions 
lead one to consideration of the pathways of ac- 
tivity in the nervous system, the complexities of 
biochemical analysis, the assessment of human 
performance in laboratory or simulated situations, 
the management of patients, and the social matrix 
in which all of human behavior is embedded. 

Historical considerations. Drug utilization for 
health, religious, or individual purposes is one of 


the many primary characteristics of human be- 
havior in a wide variety of cultures. Drugs thus 
constitute an almost universally present element 
in human social behavior, and one to which 1m- 
dividual reinforcement, social training, and pri- 
vate meaning are attached. One of the world’s 
oldest professionals, the medicine man, has long 
made use of a variety of chemical substances in 
the technology of his affairs. 

Although the term “psychopharmacology” has 
been used with increasing frequency over the past 
few years, the concept is by no means new to psy- 
chology. Studies of the effects of drugs on the 
behavior of animals were undertaken in the nine- 
teenth century, and the classic investigations of 
the effects of caffeine on human behavior were 
carried out by Hollingworth (1912) in the first 
decade of the twentieth century. This series of 
studies embodied many of the aspects of good de- 
sign and may still have value as a model for anal- 
ysis of the effects of drugs on human behavior. 

The current dramatic expansion of interest in 
psychopharmacology as a special area dates from 
the early 1950s, In its early phases it was a product 
of the joint action of pharmacologist and psychi- 
atrist in their attempts at modification of the be- 
havior of psychiatric patients. 

Since the early 1950s, when drugs such as 
reserpine and chlorpromazine were introduced, the 
list of compounds has grown, although some cen- 
tral questions surround these drugs and their uti- 
lization. 

Problems. Any real understanding of drugs that 
affect behavior depends upon the development of 
adequate theory relating biochemical or pharma- 
cological events to behavioral events. Callaway and 
Stone (1960) have examined the nature of theory 
in this area and find that an integrating theory is 
still to be developed. On the biochemical side they 
find that the problem is formidable and aggravated 
by the lack of appropriate behavioral referents. 
Theories proposed by behavioral scientists may 
greatly oversimplify the underlying biological com- 
plexities. On the other hand, theories constructed 
by biochemists and pharmacologists may be naive 
about the behavioral complexities. No available 
theory is consistent with the available data, and 
many approaches use ambiguous and subjective 
terms. Callaway and Stone express the hope that 
the new psychotropic drugs can serve as a valuable 
class of independent variables in an integrated 
theory of behavior. 7 

The task of analyzing the specific pharmacologi- 
cal effects of a drug is confounded by the history, 
learned expectations, and social roles of the indi- 


viduals involved. Poffenberger (1942, pp. 200-202) 
discussed several major sources of error in drug 
investigations. He expressed serious concern about 
the “suggestibility of people who have a knowledge 
of the effects to be expected,” the lack of control 
subjects, the difficulties in the selection of appro- 
priate measures of drug effects, the short duration 
of drug studies, and, finally, the facts of interindi- 
vidual and intraindividual differences [see Suc- 
GESTION]. 

Modell has recently reviewed the problem of 
experimental controls in clinical pharmacology. He 
writes: 


It may surprise some laboratory workers as well as a 
large number of clinicians that the pharmacologic ex- 
periment in man is really far more difficult than in the 
animal. The training and the orientation of the physi- 
cian, as well as his relationship to the patient, make 
him an interested and purposive observer rather than 
a precise and objective one. Also realistic restrictions 
on the techniques that may be used in making observa- 
tions on man make the clinical experiment difficult to 
perform and limit the depths which can be proved di- 
rectly. Then there is the subject, whose human mind, 
human culture, human society, and human frailties 
make it uniquely difficult to experiment on him. All of 
these factors make the establishment of controls in a 
human experiment quite a different kettle of fish from 
the laboratory procedure of dividing littermates into 
two equal groups. (1963, p. 372) 


Methodological developments 
Experimenter and subject bias are presumably 
dealt with by the double-blind control, in which 
both E and S are prevented from knowing who has 
been assigned to a specific experimental group. 
The placebo presumably is utilized to deal with the 
implications of medication, and randomization of 
other variables permits dealing with extraneous 
factors. The advantages and liabilities of the 
double-blind procedure have been discussed by 
Nash (1962) and need not be pursued here [see 
EXPERIMENTAL DESIGN]. 
Placebo effects and research design. The placebo 
issue has also been of much interest and has been 
reviewed by Wolf (1959) and by Roueché (1960, 
pp. 85-98). Beecher (1955) has stressed the im- 
portance of the placebo concept, distinguishing 
between the effects of suggestion and the direct 
pharmacological effect. Woif defines placebo effect 
as “any effect attributable to a pill, potion, or pro- 
cedure, but not to its pharmacodynamic or specific 
roperties” (1959, p. 689). 
; re ental design involving an extra aed 
of subjects in addition to the usual drug, placebo, 
and untreated groups has been developed (Ross 
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Table 1 
DRUG NO DRUG 
PILL Drug Placebo 
NO PILL Drug disguised Nothing 


et al. 1962; Lyerly et al. 1964). This extra group, 
called the “drug disguised group,” receives the drug 
in a disguised form. Table 1 outlines this fourfold 
design. 

The main question to which this design is di- 
rected is the analysis of drug effects. Most studies 
are based on a drug group—placebo group design 
or, less frequently, a drug group-placebo group- 
untreated control group design. In the typical 
drug—placebo experiment, both drug and placebo 
groups are given pills with the same instructions. 
The effects observed in the drug Ss are generally 
attributed to the pharmacological properties of the 
agent after subtraction of the effects found in the 
placebo Ss. But these “drug” effects may be partly 
a function of Ss’ expectations or of the experimen- 
tal or therapeutic setting. When a nontreatment 
group is added to the design, however, the differ- 
ences between the untreated and the placebo 
groups can be attributed to the placebo effect. The 
special advantage of the four-group design is that 
it permits direct assessment of drug and placebo 
effects. In a typical experiment the drugs and 
placebos were given to the Ss in capsules, and all 
Ss received orange juice, which was also the vehicle 
for the disguised medication. The untreated group 
received orange juice only. 

One experiment has been reported on the effects 
of d-amphetamine sulfate. The results of this ex- 
periment, carried out with neutral instructions, 
indicated a positive pill effect on mood, that is, 
the S felt more “comfortable,” and a negative effect 
of the drug when administered in the disguised 
condition. The untreated group provided results 
similar to those of the group that received the drug 
as a pill. Motor performances of both the drug and 
the drug-disguised groups were significantly poorer 
than those of the placebo and untreated groups. 

Effects of expectancies. A second experiment 
was designed to determine the specific effects of 
direct and conflicting instructions and the effects 
of two drugs (amphetamine sulfate and chloral 
hydrate) that might be expected to produce dif- 
ferent pharmacological and perceptual effects. Two 
sets of instructions were developed to lead Ss to 
expect that the capsules they swallowed would yield 
the specific effects of one of the two drugs. Groups 
similar to those in the previous experiment were 


used. 
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Instructions alone affected motor performance 
but had little or no effect on mood. Direct instruc- 
tions, appropriate to the presumed drug effects, 
produced performance deterioration with the sim- 
ple motor tasks used. Conflicting instructions, inap- 
propriate to the presumed drug effects, counter- 
acted much of the drug-produced decrement. A 
slight decrement in performance was found in the 
placebo group which received instructions appro- 
priate to the effects of amphetamine. Amphetamine 
produced reports of greater comfort on the “mood 
index” than did chloral hydrate. On the other hand, 
the chloral hydrate instructions resulted in greater 
comfort than the amphetamine instructions. There 
was no interaction between drug effects and in- 
structional effects. The two placebo groups did not 
differ significantly on the mood index. The effects 
of instructions on mood were found only when the 
drug was present. 

The general problem of placebos has been ana- 
lyzed from a variety of viewpoints (Fisher 1962; 
Gorham & Sherman 1961; Hawkins et al. 1961; 
Kast 1961; Wilson & Huby 1961) and is far from 
settled. 


Personality and social aspects 


Increased sophistication has been emerging in 
the field of human experimentation, stimulated by 
analyses of the social psychology of the psycho- 
logical experiment (Orne 1962; Rosenthal 1963). 
The issue of ecological validity and the appropriate 
generalization from the laboratory to nonexperi- 
mental situations were treated by Brunswik (1947). 
Perhaps it is fair to say that the problem has not 
received its appropriate emphasis to date. Orne has 
suggested that the subject must be recognized as 
an active participant in any experiment and that it 
may be fruitful to view the psychological experi- 
ment as a special form of social interaction. He has 
proposed that the behavior of the subject is a func- 
tion of the total situation, which includes the ex- 
perimental variables being studied and at least 
another set of variables. The latter group has to 
be put under the heading of “demand characteris- 
tics” of the experimental situation. Rosenthal has 
described a program of research dealing with ex- 
perimenter bias. It is shown that the expectation 
of the experimenter was a partial determinant of 
the results of behavioral research. The issues in 
these studies are vital ones for psychopharmacol- 
ogy, and it is hoped that future research will attend 
to these problems. 

Drug therapy. A recent analysis of psycho- 
pharmacology and personality takes stock of per- 
sonality measurement in relation to drugs. It reports 
the rather meager progress in psychopathology and 


deduces a static plateau in personality measure- 
ment “cushioned by factor analysis on the one hand 
and by psychodynamics on the other” (Zubin & 
Katz 1964). 

In the specific domain of the treatment of hos- 
pitalized psychiatric patients, additional aspects of 
the effects of drugs become of interest to the social 
scientist. Klerman (1961) states that before the 
1930s major interest lay in such disorders as gen- 
eral paresis or epilepsy. Metrazol, electroconvulsive 
therapy, insulin coma, and lobotomy made their 
appearance at this time. It was in the decade before 
World War n that schizophrenia became the center 
of attention. Psychotherapeutic innovation also ap- 
peared under the leadership of Harry Stack Sulli- 
van, Jacob Moreno, Frieda Fromm-Reichmann, and 
others. Klerman indicates that it is unfortunate for 
drug evaluation that the advent of the newer tran- 
quilizing agents in the 1950s coincided with a wave 
of hospital reforms and innovations in the treat- 
ment of psychotic patients. This interaction is a 
complex one, and it is difficult to assess the effects 
of “drugs” on hospital admission, readmission, and 
other rates. 

In addition to affecting the individual patient or 
subject the psychotropic drugs also affect the hos- 
pital milieu and functional organization. Elkes has 
described this interaction as follows: 


In an age of revolutions, one is apt to get used to 
revolutions. In considering the impact of the pharma- 
cotherapies on the management of the psychoses in a 
mental hospital, one is apt to be reminded of the vio- 
lent changes brought about in the past by the intro- 
duction of other somatic therapies, such as deep insulin 
coma, electroplexy, and the lobotomies. Each of these 
has wielded changes, based essentially upon relatively 
short-term results; only in recent years could these be 
viewed against the perspectives of long-term follow-up 
studies. The changes in management, and the attend- 
ant changes in attitude, however, have stayed and 
become cumulative. It is quite possible that a similar 
fate may befall the pharmacotherapies of mental dis- 
order, though three important differences distinguish 
them from earlier treatments and encourage a more 
hopeful view. The first is the fact that these therapies 
can be graded and made individual in a way which 
was quite impossible with the less variable and massive 
procedures used in the past. Secondly, they can be 
used on a scale far larger than the earlier therapies. 
Thirdly, whereas in the past systematic studies of 
mode of action followed essential empirical findings 
and measures, the empirical findings of the pharmaco- 
therapies are being increasingly related to a growing 
body of modern theoretical neuropharmacology and 
neurochemistry. Theories are thus likely to keep abreast 
of clinical findings in a way never before witnessed in 
psychiatry, and are likely to make these ever more 
precise, discriminate and long-term. 
The impact of the drug therapies on mental hospital 


population is essentially threefold. They have altered the 
immediate management and treatment of certain types 
of acutely disturbed psychotic patients; they have mobi- 
lized large chronic populations hitherto secluded in the 
continued-treatment units of the average mental hos- 
pital; and, in individual cases, have made possible 
measures of rehabilitation which would have been very 
difficult to achieve in their absence. Lastly, they have 
increased and made more urgent the contacts between 
the mental hospital and the community. In terms of all 
these effects the drugs are wielding profound changes 
in staff attitudes at all professional levels. The precise 
pattern of these changes at present is far from clear, 
and for the moment can only be discussed in the 
broadest terms. (1961, pp. 91-92) 


With respect to the clinical treatment of the 
chronically hospitalized patient, there has long 
been evidence that increases in staff attention 
alone will produce symptomatic improvement 
(Galioni et al. 1953), A recent study by Bullard, 
Hoffman, and Havens (1960) indicates that drugs 
may be more important than environment in pro- 
ducing changes in chronic patients. An excellent 
review of this problem by Hordern and Hamilton 
(1963) indicates its complexities and the lack of 
clarity regarding the contributions to patient re- 
sponse of drugs and special nondrug treatments 
and of staff and patient attitudes. It is unfortunate 
but true that the effects of the drug and/or envi- 
ronment found in any single study represent only 
a single case in the larger sense, since it is almost 
impossible at present to measure or even meaning- 
fully hypothesize what aspects of the staff—patient— 
milieu—drug context are responsible for the effects 


produced. 


The introduction to a recent volume on specific 
and nonspecific factors in psychopharmacology 
states the central issue cogently: “The concept of 
specificity of drug action is at the heart of all phar- 
macological enquiry and speculation, and the stated 
hope of chemotherapy: Yet, in the unfamiliar ter- 
rain now extending between biochemistry and be- 
havior, the term is assuming new and unexpected 
meaning. The interaction between the somatic and 
the symbolic compels a revision of premises, a 
recalibration of tools, and a reassessment of rules 
(Elkes 1963, p. v). In this revision, recalibration, 
and reassessment, all of the sciences of man are 
involved. The social sciences have their special 
assignment and central contribution. 

SHERMAN Ross 
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hg 
DRUG ADDICTION: 
ORGANIC AND PSYCHOLOGICAL ASPECTS 


Common to all of the conditions to which the 
term “drug addiction” has been applied is the cri- 
terion of habitual use of chemical agents (natural 
or synthetic) in spite of harmful consequences to 
the individual, society, or both. The present article 
deals mainly with the organic (pharmacological 
and physiological) and psychological aspects of 
the problem, particularly with reference to the 
opioids. 

In part, harmful consequences may arise from 
the pharmacological and toxicological properties 
of the drug employed. These vary greatly, depend- 
ing on the class of drug, the dose, and regularity 
and frequency of administration. Thus the toxic 
states induced by opioids (all chemical substances 
with morphinelike properties, including morphine 
itself) are characterized by respiratory depression, 
stupor, or coma; barbiturates, certain nonbarbitu- 
rate sedatives, and alcohol can produce mental 


confusion, incoordination, and coma; cannabis 
products (marijuana, hashish), amphetamine and 
allied “stimulant amines,” and cocaine, as well as 
a number of “psychosomimetic” agents (mescaline, 
d-lysergic acid diethylamide), can give rise to psy- 
choses of various sorts. 

Repeated administration of opioids, barbiturates 
and allied drugs, or alcohol can give rise to a state 
of “pharmacogenic” (or “physical” ) dependence— 
i.e., a state such that abrupt withdrawal of the 
drug is followed by physiological and psychological 
disturbances (“abstinence syndromes”), which are 
usually transient but may be fatal. These also 
differ, depending on the class of drug involved— 
for example, predominantly autonomic disturb- 
ances in the case of the opioids, and convulsions 
and delirium in the case of barbiturates, nonbarbi- 
turate sedatives, or alcohol. Associated in varying 
degrees with the development of pharmacogenic 
dependence is the acquisition of “tolerance”—i.e., 
progressive decline in degree of effect produced by 
a given dose of the drug—which can be almost 
complete in opioid users but is achieved only par- 
tially by regular use of the barbiturate and allied 
drugs or alcohol. With the acquisition of tolerance, 
the user is impelled to increase progressively the 
dosage and/or frequency of administration in striv- 
ing for the original effects, thereby incurring the 
tisk of inducing toxic reactions and intensifying 
still further the degree of pharmacogenic depend- 
ence—a state that in itself has disastrous conse- 
quences for the psychological as well as the physi- 
cal functioning of the user 

Social consequences. In part, the harmful con- 
sequences of habitual drug use are social in nature. 
Even at subtoxic dose levels the effects of a given 
drug may impair the user's usefulness to the s0- 
ciety in which he lives or may otherwise modify 
his behavior in unacceptable ways. In addition, 
harmful consequences may result from the user's 
attendant neglect of the welfare of his dependents, 
through either indifference to them, diversion of 
income for purchase of drugs, or adverse effects 
on his own health caused by inattention to proper 
nutrition, hygiene, and treatment of disease. Since 
in most societies it has been found necessary to 
place legal restrictions on the manufacture and 
distribution of many drugs of abuse, criminalism 
constitutes another of the consequences of this 
practice. 


Clinical features of opioid addiction 

The more commonly abused opioids include 
opium itself (containing 10 per cent morphine); 
morphine and certain of its derivatives, such as 
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codeine, diacetylmorphine (heroin), dihydromor- 
phinone (Dilaudid), dihydrohydroxycodeinone 
(eucodal); and a number of synthetic analgesics, 
such as meperidine (pethidine, Demerol, dolantin), 
ketobemidone, methadone (Dolophine, Amidone), 
and dextromoramide (Palfium). Heroin is the most 
common opioid purveyed illicitly throughout the 
world, although opium is still widely used in the 
Orient. Important variations occur, depending upon 
the particular opioid and the mode of utilization 
(smoking, sniffing, ingestion, injection), but the 
effects of morphine under experimental conditions 
are quite illustrative of the specific patterns of 
chronic intoxication, tolerance, and pharmacogenic 
dependence that develop with repeated use of 
opioids in general. 

In nontolerant persons a single injection of a 
“therapeutic” dose of morphine (e.g, 15 milli- 
grams) commonly produces a state of drowsiness 
from which the subject can be readily aroused. 
Pain, if present, is relieved and anxiety allayed. 
The sensorium remains intact, although some sub- 
jects may report that they feel “mentally clouded.” 
Contrary to popular notions, lurid fantasies are 
rare, and sexual desires are not increased, although 
in the male potency may be enhanced by delay of 
orgasm, Particularly if injected intravenously, mor- 
phine may produce transient sensations in the 
abdomen that are sometimes described as akin to 
orgasms. Flushing of the skin and itching, as well 
as giddiness and vomiting, are also common effects. 
Characteristically, the pupils are constricted, respir- 
atory rate is depressed, the pulse is slowed, and 
body temperature is lowered slightly. These effects, 
which last four to six hours, may be evaluated in 
different ways by different subjects, depending 
upon circumstances, but even among future 
habitués the effects of the first injection of mor- 
phine (or heroin) are often judged as unpleasant, 

With further experience, however, some indi- 
viduals come to evaluate the constellation of effects 
more positively, and their observers describe the 
subjects as “euphoric.” With more frequent repeti- 
tion, however, tolerance develops, albeit at different 
rates for various effects. Thus, the degree and 
duration of drowsiness, relief of pain and anxiety, 
depression of respiration, slowing of the pulse, 
nausea, and vomiting decrease progressively, 
whereas pupillary constriction and lowering of Re 
body temperature decline at less rapid rates, an 
spasm of smooth muscle continues to be very pro- 
nounced with the resultant production of ee 
constipation. In the tolerant Sey ty on 
adrenocorticotrophic hormone ( 


pituitary gland is reduced, sexual interest is lost, 
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males become impotent, and females cease to 
menstruate. In attempts to regain “euphoria” ad- 
dicts have been known to increase the daily dose 
to 5,000 mg. or more, with only partial and transi- 
tory success. If the accustomed dose of morphine 
is then withheld, a fairly stereotyped abstinence 
syndrome ensues, Beginning 12-16 hours after the 
last dose, restlessness, anxiety, frequent yawning, 
rhinorrhea, lacrimation, pupillary dilation, sweat- 
ing, and piloerection are noted first. In addition, 
muscular aches and twitches, abdominal cramps, 
vomiting and diarrhea, elevated blood pressure, 
insomnia, agitation, weight loss, spontaneous ejac- 
ulations in the male, and profuse menstrual bleed- 
ing in the female supervene later, developing max- 
imally during the next two or three days and then 
subsiding gradually, although physiological nor- 
mality may not be achieved for six months or 
longer. Such abstinence phenomena are quickly 
alleviated by injection of the accustomed dose of 
morphine (or an equivalent dose of another 
opioid), only to reappear some hours later. The 
untreated morphine abstinence syndrome is rarely 
fatal, except in patients with cardiac disease, active 
tuberculosis, or other debilitating illness. 

Of clinical importance are some special features 
of addiction to a number of other opioids. In the 
case of meperidine ( Demerol ), myoclonic jerks or 
generalized convulsions (with characteristic elec- 
troencephalographic abnormalities) may occur if 
the total daily dose level attained in the tolerant 
state is over 2,000 mg. Although the over-all effects 
of methadone are practically identical with those 
of morphine, the duration of action of methadone 
is somewhat longer, and therefore cumulative de- 
pressant effects on the central nervous system may 
occur if the drug is administered several times 
daily. The methadone abstinence syndrome is 
slower in onset and milder in degree (with respect 
to autonomic disturbances) than that observed 
after withdrawal of morphine. For these reasons 
methadone may be substituted for morphine and 
other opioids of addiction in the management of 
drug withdrawal. Despite widespread assumptions 
to the contrary, neither the intensity of “euphoric” 
effects, the rate of development of tolerance, nor 
the peak intensity of the abstinence syndrome is 
greater for heroin than for morphine when these 
two drugs are administered in equipotent doses 
(1.80 to 2.66 mg. of morphine sulfate is equal to 
1 mg. of heroin), although some opioid addicts 
can distinguish morphine from heroin by their 
subjective effects when these drugs are adminis- 


tered intravenously. 
The diagnosis of pharmacogenic dependence on 
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opioids can be made in two ways: (a) by observa- 
tion of the abstinence syndrome that ensues when 
the drug is withheld; and (b) by subcutaneous 
injection of a specific morphine antagonist, such 
as nalorphine (3-15 mg. in divided doses), With 
the latter procedure an “acute abstinence syn- 
drome” may be precipitated within minutes after 
injection in persons addicted to any of the opioids 
(except possibly meperidine), the severity of which 
depends upon the daily dose level and duration of 
addiction. Experimentally it has been shown that 
mild abstinence phenomena, such as yawning, 
lacrimation, rhinorrhea, piloerection, sweating, and 
pupillary dilatation, can be precipitated by subcu- 
taneous injection of 15 mg. of nalorphine in per- 
sons who have been receiving as little as 15 mg. 
of morphine four times daily for two or three days 
or equivalent doses of heroin or methadone on the 
same schedule. 

Treatment. Wikler (1962) provides the details 
of treatment. Evaluation of the results of treatment 
has been extremely difficult. The best evidence 
available, that of a large group of addicts returning 
to New York after treatment at the U.S. Public 
Health Service Hospital in Lexington, Kentucky, 
indicates that over 90 per cent of the patients be- 
came addicted again, more than 90 per cent of 
these within six months after their discharge (Hunt 
& Odoroff 1962, p. 53). On the other hand, there 
is some evidence suggesting that opioid addiction 
may run a self-limiting course after a number of 
years. Thus, it has been found that at the end of 
five years of follow-up, 25 per cent of a sample 
of the group described above were “voluntarily absti- 
nent,” and attention has been directed to the 
“35-40 age drop-off” in names of “active addicts” 
in the files of the Bureau of Narcotics. This phe- 
nomenon requires further evaluation. 


Organic aspects of opioid addiction 

Physical characteristics of addicts. In a study 
of four hundred opioid addicts (Caucasian) at the 
U.S. Public Health Service Hospital, Lexington, 
Brown (1940) found them to be of average or 
slightly superior height and weight, with normal 
distribution of body build, although there was a 
trend toward the pyknic habitus. Electroencephalo- 
graphic studies by Andrews (1941) on fifty addicts 
(one year after drug withdrawal) revealed only 
that the groups differed from published data on 
“normals,” in that among the addicts, individuals 
with occipital alpha indices either of 90-100 per 
cent or of 0-10 per cent were more common. 
These deviations from “normal” have no known 
clinical significance. In the personal experience of 


the author, there are no physical traits or diseases 
among opioid addicts in the United States that 
distinguish them from nonaddicts except for those 
consequent to drug abuse, such as cutaneous scars 
and pigmentations, phlebosclerosis, and a relatively 
high incidence of systemic infectious diseases 
(malaria, bacterial or mycotic endocarditis) result- 
ing from use of contaminated needles and more 
common diseases allowed to progress through 
neglect. No irreversible organic damage to the 
nervous system or viscera is known to occur as a 
result of opioids per se. 

Effects of single doses of opioids. Since one of 
the major actions of morphine and other opioids 
is the production of analgesia, it might be supposed 
that these drugs block transmission or otherwise 
exert depressant effects somewhere in the direct 
sensory pathways presumed to mediate pain. How- 
ever, the evidence bearing on this supposition is far 
from conclusive. In the cat morphine prolongs 
recovery time of interneurons in the thalamic “re- 
lay” nuclei (responding to electrical stimulation 
of the sciatic nerve), but in the dog morphine does 
not depress amplitude or latency of responses, 
evoked by electrical stimulation of the tooth pulp, 
in the contralateral coronal gyrus, the contralateral 
nucleus ventralis posteromedialis, or the medial 
lemniscus. Apart from the question of species dif- 
ferences, resolution of this problem is complicated 
by uncertainty about the specific modalities of sen- 
sation that are activated by the stimuli used in 
animal research on pain and analgesia, In this 
regard, electrical stimulation of the tooth pulp 
would seem to be the least ambiguous. Tentatively, 
therefore, it may be assumed that actions on the 
direct pain pathways cannot account for the anal- 
gesic effects of these drugs. 

Insofar as the “indirect” sensory pathways are 
concerned, a depressant action of morphine on the 
ascending midbrain reticular “arousal” mechanism 
has been found in the rat and the rabbit, but in 
the case of the cat the evidence is conflicting. In 
both the rabbit and the dog morphine enhances 
recruiting responses evoked by stimulation of 
thalamic intralaminar nuclei, but in the cat such 
responses are said to be depressed. These discrep- 
ancies may perhaps be related to the differences 
in general behavioral effects of morphine in the 
rabbit and dog (depression) on the one hand and 
the cat (excitation) on the other. In the dog, how- 
ever, the effects of morphine on potentials evoked 
in the “indirect” pathways by stimulation of tooth 
pulp were found to be exceedingly complex, aug: 
mentation as well as depression or no effect being 
observed in different components of this system. 
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In any case, the relevance of these data to the 
problem of morphine analgesia is open to question 
because the “indirect” pathways are selectively 
affected by small doses of barbiturates and other 
drugs that are not classified as analgesics. 

Effects on reflex activity. More consistent, and 
perhaps more relevant to the problem of analgesia, 
are the effects of morphine on certain reflex activi- 
ties at various segments of the neuraxis. Thus, in 
adequate doses, morphine depresses “sham rage” 
in decorticated cats and dogs, flexor and crossed 
extensor reflexes in spinal cats and dogs, the flexor 
reflex in spinal man, and mandibular response to 
stimulation of the tooth pulp in dogs. In addition, 
morphine exerts depressant actions on the “skin- 
twitch” in the dog and the “tail-flick” in the rat, 
both of which have been used successfully in the 
screening of drugs for analgesic properties in man. 
In part, morphine affects these reflexes by a direct 
depressant action of spinal interneurons, but a 
major component of its action in the intact animal 
appears to be augmentation of supraspinal inhibi- 
tion. From the neurophysiological standpoint, 
therefore, it would appear that morphine alters 
the “reaction” to noxious stimulation at several 
levels of the neuraxis rather than affecting propa- 
gation of impulses generated by such stimuli from 
the periphery to the cerebral cortex. However, there 
is no reason to assume that the selective actions 
of morphine on nociceptive reflexes are related 
solely to the analgesic actions of this drug. In fact, 
it may be that what is termed “morphine analgesia” 
merely refers to the component of the total pattern 
of effect produced by this drug (indifference to 
ordinarily painful conditions, drowsiness, “eupho- 
ria”) that interests the clinician under a given set 
of circumstances. As Schaumann (1954) has 
pointed out, the analgesic action of morphine is 
but part of its “antiprotective” spectrum of effects 
—quite similar to the concept “primary-need reduc- 
ing,” which has also been applied in this connec- 
tion (Wikler 1952). 

Sites of morphine action. Much research has 
also been devoted to the problem of accounting for 
the selectivity that characterizes the sites of action 
of morphine in the central nervous system. In 
isolated smooth muscle preparations, morphine has 
been shown to reduce both the resting output of 
acetylcholine and the output from postganglionic 
cholinergic nerve endings when they are stimu- 
lated, and to inhibit the release of sympathetic 
transmitter substance from postganglionic nerve 
endings in rats. It was found that single doses of 
morphine decrease the brain concentrations of 
norepinephrine, but repeated administration of mor- 
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phine increases these concentrations. The possible 
significance of this finding will be discussed further 
in connection with the mechanisms of pharmaco- 
genic dependence. 

Chronic effects. Not only have abstinence phe- 
nomena been observed in babies born of mothers 
addicted to opioids, but tolerance to many of the 
effects of opioids after repeated administration of 
these drugs and clear-cut abstinence phenomena 
following their abrupt withdrawal have been dem- 
onstrated in intact monkeys, dogs, and rats, in 
long-surviving decorticated dogs, in chronic spinal 
dogs, in a chronic spinal man, and even in unicel- 
lular organisms and tissue cultures. It has also 
been reported that in vitro, cortical slices from 
brains of rats treated with morphine daily for one 
week exhibit tolerance to the depressant effects of 
morphine on respiration (oxygen consumption ) 
stimulated by potassium chloride. Since there is 
no evidence that the development of tolerance to 
opioids is associated with an increased capacity of 
the organism to destroy the drug (Woods 1954) 
or that “immune” or “antimorphine” circulating 
substances are formed during chronic morphine 
intoxication, these findings suggest a cellular site 
of origin for both tolerance and pharmacogenic 
dependence. Presumably, such cellular processes 
involve changes in enzymatic activity, but the 
identity of the enzymes concerned is not yet known. 

Axelrod (1956a; 1956b) found that in rats the 
development of tolerance to morphine is associated 
with progressive decrease in the capacity of the 
liver to N-demethylate this drug, as well as other 
agents with morphinelike actions, and that chronic 
administration of N-allylnormorphine with mor- 
phine reduces both the rate of development of 
tolerance to morphine and the decrease in N- 
demethylating capacity of the liver in this species. 
Using the hepatic N-demethylating enzyme system 
as a model for the study of tolerance, it was sug- 
gested that a decreased response to narcotic drugs 
on repeated administration may develop as a result 
of unavailability of receptor sites, due to deactiva- 
tion of these sites through continuous interaction 
with opioids. Other data suggest that enzyme sys- 
tems concerned with the metabolism of catecho- 
lamines may be critically involved in the genesis 
of the abstinence syndrome. In the rat brain con- 


centration of catecholamines is supranormal dur- 


ing chronic morphine intoxication. It has also been 
reported that in both the dog and the rat urinary 
excretion of catecholamines is markedly increased 
during the acute abstinence period. At the present 


time the precise manner in which these changes 


are related to pharmacogenic dependence is diffi- 
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cult to establish because of some differences in 
regard to details that depend on the species used 
and the techniques employed by different investi- 
gators. 

Although the phenomena of tolerance and phar- 
macogenic dependence appear to be due to changes 
at the cellular level, it is quite likely that certain 
features of the clinical opioid abstinence syndrome 
are modulated by “counteradaptive” changes in- 
volving systems not directly affected, or only 
slightly affected, by this class of drugs. 

Electroencephalograms of addicts “stabilized” on 
morphine are characterized by abnormally high 
alpha output, with marked slowing in some cases, 
but the electroencephalogram returns to the 
control (normal or high alpha index) pattern 
promptly after abrupt withdrawal of the drug 
(Andrews 1941)—a surprising finding in view of 
the intense autonomic and emotional disturbances 
that occur during the early abstinence period. In 
man, bilateral prefrontal lobotomy has little effect 
on the “nonpurposive” features of the morphine 
abstinence syndrome (preponderantly autonomic) 
but does attenuate “purposive” abstinence phenom- 
ena, such as “craving” and drug-seeking behavior 
(Wikler et al. 1952). In monkeys experimentally 
addicted to morphine, bilateral section of the 
cingulum (a fiber bundle connecting the frontal 
cortex with the limbic system) does attenuate the 
morphine abstinence syndrome (Foltz & White 
1957), but certain details of the findings (White 
1953) suggest that in this species also the “attenu- 
ating” effects of the operative procedure were ex- 
erted mainly upon withdrawal signs analogous to 
human “purposive” abstinence phenomena. Re- 
moval of facilitatory frontal cortical influences on 
subcortical autonomic centers may also have con- 
tributed to the “attenuation” of the simian morphine 
abstinence syndrome as a whole. In addition, it 
appears that at least in man the hypothalamic— 
pituitary system is depressed during maintained 
morphine addiction and becomes hyperactive dur- 
ing the acute phase of the abstinence syndrome, 
Eisenman, Fraser, Sloan, and Isbell (Eisenman 
et al. 1958) having shown that during chronic 
morphine intoxication, urinary excretion of 17- 
ketosteroids is significantly depressed although the 
adrenal glands remain responsive to ACTH. In 
contrast, both plasma levels and urinary excretion 
of 17-ketosteroids increase far above control values 
after abrupt withdrawal of morphine, along a time 
course roughly parallel to that of the acute absti- 
nence syndrome, Further studies on animals are 
needed to determine whether these endocrinal 
changes are consequent to changes in neural activ- 


ity during chronic morphine intoxication and dur- 
ing the abstinence period, or whether the reverse 
is the case. 

Of special importance to theories of opioid ad- 
diction and relapse are data acquired with the use 
of N-allylnormorphine (nalorphine), a specific 
morphine antagonist. As has already been noted, 
this agent is capable of precipitating clear-cut ab- 
stinence phenomena in man after four-times-daily 
administration of “therapeutic” doses of morphine, 
heroin, or methadone for as brief a period as two 
or three consecutive days. Martin and Eades (1961) 
have shown that in dogs nalorphine can precipitate 
fairly severe abstinence phenomena when given in 
a single subcutaneous dose (20 mg./kg.) within 
15 minutes after termination of a single continuous 
intravenous infusion of morphine at the rate of 
3 mg./kg. per hour for 7.5 to 8 hours, In addition, 
it was demonstrated that such animals had devel- 
oped a considerable degree of tolerance to several 
of the effects of morphine during the single con- 
tinuous infusion of the drug. It is probable, there- 
fore, that the processes of tolerance and pharmaco- 
genic dependence begin to develop in man with the 
first dose of morphine even though these changes 
may not be clinically detectable. Also, evidence is 
accumulating that full recovery from the morphine 
abstinence syndrome may require much longer 
periods of time than has heretofore been believed. 
As has already been noted, minor but persistent 
deviations from normal have been detected in man 
for as long as six months after withdrawal from 
morphine following a period of addiction (Him- 
melsbach 1942). Martin, Wikler, Eades, and Pescor 
(Martin et al. 1963) have demonstrated that in 
the rat an early “primary” phase of the morphine 
abstinence syndrome is succeeded by a “secondary” 
phase, lasting over six months, during which the 
morphine-abstinent rat continues to display mild 
but definite signs of hyperirritability in comparison 
with concurrently observed control rats. 


Psychological aspects of opioid addiction 


Personality characteristics of addicts. The most 
extensive studies on the personality characteristics 
of opioid addicts are those which have been con- 
ducted over a period of about twenty years on the 
population of the U.S. Public Health Service Hos- 
pital in Lexington. Although other characteristics 
of this population have undergone changes in the 
course of two decades (shift in racial distribution 
from predominantly white to predominantly Negro, 
increase in proportion of patients from large metro- 
politan centers, and decrease in median age), the 
large majority of individuals that constitute this 
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population have consistently been classified as 
“psychopathic.” Thus, on the basis of their study 
with the Minnesota Multiphasic Personality In- 
ventory (MMPI) on 270 subjects, Hill, Haertzen, 
and Glaser (1960) concluded that narcotic addicts 
are often psychopathic, that hospitalized adolescent 
and adult addicts do not differ in their MMPI pro- 
files, that great similarities exist between adoles- 
cent addicts and delinquent nonaddicts, and that 
psychopathology is an important etiological ele- 
ment in addiction (p. 138). 

However, very similar MMPI profiles have also 
been found by this group of investigators to char- 
acterize not only juvenile delinquents but also hos- 
pitalized chronic alcoholics (Hill, Haertzen, & 
Davis 1962). Speculating on the commonness of 
elevated scores for psychopathy in the MMPI 
among narcotic (opioid) and alcoholic subjects, 
Hill (1962) has suggested that the “social deviant” 
does not engage in the daily activities that are ordi- 
narily reinforced by and satisfy the larger society. 
Counteranxieties and inhibitions that deter unusual 
behavior in the mature adult do not do so in the 
social deviant. Thus, he is particularly vulnerable 
to short-term satisfactions and can readily manipu- 
late his personal affairs if drugs are available 
(p. 573). The deviant who is immature and inade- 
quate and is unable to solve problems of adult 
life independently may find temporary freedom 
from frustration and problems in alcohol or com- 
plete elimination of such problems in opiates 
(p. 578). 

Other investigators have stressed “passive de- 
pendency” as a dominant trait among opioid ad- 
dicts, and Wikler and Rasor (1953) have suggested 
that such individuals choose opioids for repeated 
use from among a number of other drugs, includ- 
ing alcohol, with which they have had experience 
precisely because the opioids facilitate their pre- 
ferred mode of dealing with frustrating and 
anxiety-arousing situations—namely, by promoting 
“indifference” to them rather than by increasing 
aggressiveness, as would the use of alcohol. How- 
ever, a major difficulty in relating personality char- 
acteristics found in opioid addicts to the addiction 
process is the paucity of suitable control data. 
Gerard and Kornetsky (1955) compared the person- 
alities of 32 young addicts treated at the Lexington 
Hospital with those of 23 nonaddict acquaintances 
named by addicts who were undergoing treatment 
at another hospital. They concluded that although 
the addicts were more disturbed (with a high inci- 
dence of overt or incipient schizophrenia), about 
half of the control group also showed evidence of 
significant psychiatric disorder. Further research 
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in this area, especially on the personality character- 
istics of parents and siblings of opioid addicts, may 
help clarify many questions about the relation be- 
tween personality characteristics and addiction that 
are still unanswered. 

Single doses of opioids. Whether the nontoler- 
ant opioid addict is more susceptible to morphine 
“euphoria” than the nonaddict individual is a ques- 
tion of considerable importance for the dynamics 
of addiction. Lasagna, Von Felsinger, and Beecher 
(1955) compared the effects on mood of single 
doses of placebo, amphetamine, pentobarbital, mor- 
phine, and heroin in a group of “postaddict” pris- 
oners and a group of normal volunteers, as well as 
another group of chronically ill patients. As ex- 
pected, these investigators found that heroin pro- 
duced the greatest incidence of “euphoric” effects 
in the postaddicts, but only a few individuals in 
the other groups responded to either morphine or 
heroin in this manner. In another study on the 
volunteer group alone, Von Felsinger, Lasagna, and 
Beecher (1955) obtained data on personality dy- 
namics indicating that typical or atypical reactions 
to the drugs used in the previous study could be 
correlated with differences in mature controls and 
emotionality (p. 1119). However, it is difficult to 
evaluate the significance of these findings for the 
addiction process because the postaddict and non- 
addict groups in the study of Lasagna, Von Fel- 
singer, and Beecher (1955) differed in ways other 
than presumed personality structures (prior experi- 
ence with drugs, sociolegal status). In further 
studies on normal subjects, Smith and Beecher 
(1959) found that morphine (10 mg./70 kg.) pro- 
duced a variety of unpleasant subjective effects, 
among which “mental clouding” was especially 
prominent. 

As judged by objective performance tests, how- 
ever, the mental impairment produced by small 
single doses of opioids would seem to be much less 
than is implied by the term “mental clouding.” 
Bauer and Pearson (1956) studied the perform- 
ance of 96 nonaddict male subjects on a difficult 
perceptual-motor task involving manipulation of 
throttle, stick, and rudder controls to compensate 
random movements of four instrument pointers 
from the null position and found that the effects 
of 8 mg. of morphine could not be differentiated 
from those of placebo (saline) injections. Kor- 
netsky, Humphries, and Evarts (1957) reported 
that 100 mg. of meperidine produced only a slight 
and nonsignificant impairment of performance on 
a variety of mental tests of nonaddict individuals. 
Smith, Semke, and Beecher (1962) did find that 
single doses of morphine (10 mg./70 kg.) or 
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heroin (4 mg./70 kg.) produced significant im- 
pairment of performance on a number of tests of 
mental function but noted that the impairment 
was primarily one of speed rather than accuracy. 
Studies with comparable techniques have not been 
made in postaddicts, but the earlier studies of 
Brown (1946) suggest that the effects of opioids 
relevant to the addiction process lie in the “emo- 
tional” area rather than in “cognitive” functioning. 
Thus, in a long-term study on two postaddicts dur- 
ing experimental readdiction to morphine, he found 
that although this drug did produce a mild degree 
of impairment of performance on code learning, 
continuous subtraction, and immediate and delayed 
recall of nonsense syllables, as well as on the speed 
of voice and hand responses to word stimuli, the 
most striking effect was the reduction of differ- 
ences in speed between voice responses to “indif- 
ferent” word stimuli and those to “disturbing” word 
stimuli (drugs, sex, and crime), suggesting the 
possibility that morphine might alleviate the dis- 
turbances of personal and social conflicts. Such 
circumstances might be the reason for morphine 
addiction and relapse after periods of abstinence 
(p. 52). In addition, Brown (1943) found that 
Rorschach responses of postaddicts were generally 
“constricted” and immature; single doses of mor- 
phine reduced the degree of constriction and in- 
creased the occurrence of fantasy and human 
movement responses. 

Other data on the “emotional” effects of mor- 
phine have been obtained by Hill and others in a 
series of studies on “anxiety associated with the 
anticipation of pain” in postaddicts, Noting that 
the effects of single doses of morphine on the so- 
called pain threshold are unpredictable (Andrews 
1943; Denton & Beecher 1949), these investigators 
hypothesized that one important action of mor- 
phine in the production of analgesia is dissociation 
of “anxiety” from perception of the noxious stimu- 
lus. In agreement with this hypothesis, others 
found that morphine (15 mg.) had no significant 
effect on the ability of postaddicts to estimate in- 
tensities of brief but painful electric shocks under 
“nonanxious” conditions; under “anxious” condi- 
tions untreated subjects overestimated the intensity 
of such stimuli, whereas after injection of mor- 
phine the estimated and actual intensities of these 
stimuli did not differ significantly. Likewise, it has 
been reported that single doses of morphine (15 
mg.) significantly reduced the disruption of per- 
formance on a visual—manual reaction time test, 
produced by repeated self-inflicted electric shock 
penalties for “slow” reaction times—an action not 
shared by pentobarbital (250 mg.). Further evi- 


dence of the effectiveness of morphine in reducing 
“anxiety associated with anticipation of pain” was 
obtained by Kornetsky (1954) in a study on post- 
addicts, employing radiant heat stimuli instead of 
electric shocks. In addition to measurement of 
“stronger or weaker than standard” judgments of the 
noxious stimulus, several measures of “anticipa- 
tion” were included, under both “anxious” and 
“nonanxious” conditions. It was found that mor- 
phine decreased significantly the number of 
“stronger” responses, as well as anticipatory psycho- 
galvanic responses, only under “anxious” conditions. 
That reduction of “anxiety associated with antici- 
pation of pain” may be but one example of a more 
general effect of morphine on “motivation” is sug- 
gested by the results of another study by Hill, Belle- 
ville, and Wikler (1957) on postaddicts. In this 
study, visual-manual reaction times were meas- 
ured under four different incentive conditions (de- 
fined as such in terms of different schedules of 
morphine rewards for participation in the experi- 
ments) in untreated subjects, one hour after ad- 
ministration of morphine (15 mg.), and after 
intramuscular injection of 250 mg. of pentobarbi- 
tal. In untreated subjects mean reaction times 
varied from “slow” to “fast,” depending upon the 
incentive condition under which they were meas- 
ured. Morphine significantly reduced the “range of 
change,” while pentobarbital had the opposite 
effect. It would thus appear that, at least in post- 
addicts, single doses of morphine reduce respon- 
sivity to a variety of stimuli that have in common 
the property of producing “emotional” arousal. 
Chronic effects. Although morphine-produced 
“euphoria” would seem to be a decisive factor in 
reinforcing repeated use of this drug, the rapid de- 
velopment of tolerance to this effect results in 
marked changes in the behavior of the addict. Thus, 
Brown (1946), in his long-term study on experi- 
mental readdiction in two subjects, noted that 
there was seldom a level of dosage that maintained 
satisfaction for a period longer than three weeks. 
There were no signs that morphine had increased 
the general sense of pleasantness, although there 
were short periods of euphoria, The general mood 
level during the period of addiction was one of un- 
pleasantness (p. 42). Similar observations were 
made by Wikler (1952) in a study on self-regulated 
experimental readdiction to morphine in a single 
postaddict. Over a period of three and a half 
months this subject progressively increased the 
size of each dose and the frequency of self- 
injections (intravenous) to a total daily dose level 
of about 1,350 mg. of morphine taken in 12 di- 
vided doses throughout the day and night. Never- 
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theless, he was unable to regain the euphoric 
effects he had experienced initially after a single 
dose of 30 mg. of morphine, and his prevailing 
mood was one of dejection, remorse, guilt, and 
anxiety, which were also reflected in his dream 
material. The experimental arrangements pro- 
vided for self-reduction of the dose of morphine 
and eventual withdrawal by the methadone substi- 
tution method whenever the subject might request 
such treatment, but he gave no evidence of enter- 
taining such an idea at any time before the date 
of the termination of the study, of which, by prior 
agreement, he received one month’s notice. In the 
absence of “euphoria,” the subject’s persistence in 
self-administration of morphine was difficult to ex- 
plain, but a striking feature of his “free associa- 
tions” was the frequency with which he compared 
craving for morphine with hunger and the relief 
of craving with satiation. It appeared, therefore, 
that instead of being a deterrent, the physical de- 
pendence brought about by repeated morphine 
injections seems to be one of the drug’s attractive 
attributes, since it is a new source of gratification. 
The new pharmacogenic need may in some cases 
become strong enough to displace other primary 
needs, 

In terms of instrumental conditioning theory, 
this inference can be restated as the proposition 
that in the state of pharmacogenic dependence, 
drug-acquisitory behavior is reinforced by the effi- 
cacy of opioids in reducing the drive engendered 
by early abstinence phenomena following the last 
previous dose of the drug. There is evidence that 
such, indeed, may be the case and also that rein- 
forcement resulting from reduction of abstinence 
phenomena may outlast the acute (“primary”) 
abstinence period. However, as Martin, Wikler, 
Eades, and Pescor (Martin et al. 1963) have dem- 
onstrated, the “primary,” or acute, abstinence syn- 
drome is succeeded by a long-enduring “secondary” 
abstinence syndrome (six months or more) in the 
rat, and therefore the possibility cannot be excluded 
that such evidence of “relapse” is based upon per- 
sistence of “unconditioned” abstinence phenomena 
that are relieved by morphine. 

To account for “late” relapse, Wikler (1948) 
suggested that under certain circumstances mor- 
phine abstinence phenomena may become condi- 
tioned in the classical manner during periods of 
pharmacogenic dependence and may therefore 
reappear as conditioned responses long after com- 
plete subsidence of the abstinence syndrome fol- 
lowing withdrawal of the drug. Some experimental 
evidence that such conditioning is possible has 
been acquired in studies on the rat, and Wikler 
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(1961; 1965) has proposed a “two-factor learning 
theory” of relapse, in which classical conditioning 
of the abstinence syndrome is postulated as the 
basis for recurrence of a drive state similar to that 
operating during previous episodes of addiction, 
and operant conditioning, also taking place during 
previous episodes of pharmacogenic dependence, is 
postulated as the basis for renewed drug-acquisitory 
behavior long after “cure.” In addition, Wikler has 
suggested (1955) that an important factor in re- 
lapse may be “the need for continuous activity 
directed toward attainable, but recurring goals” 
(p. 567), which, in the case of the addict, is fur- 
nished by pharmacogenic dependence on opioids. 
That such “hustling” activity is highly valued by 
opioid addicts is suggested by their preoccupation 
with this topic even when confined in an institu- 
tion. Viewed as an “effort” variable, “hustling” may 
increase resistance to extinction of drug-acquisitory 
behavior. 

Although the various roles assigned to pharma- 
cogenic dependence are still highly speculative, 
they would seem to deserve intensive research, 
since one obvious fact about “drug addiction” in 
general is that the drugs which create the most 
persistent degrees of “habitual use with harmful 
consequences” are precisely those which engender 
pharmacogenic dependence. 

ABRAHAM WIKLER 


[Other relevant material may be found in DRINKING 
AND ALCOHOLISM and SMOKING.] 
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DRUG ADDICTION: SOCIAL ASPECTS 


The taking of drugs is culturally patterned be- 
havior. Both the prevalence and the consequences 
of drug use in any society depend as much upon 
social norms as upon physiological responses to 
drugs or general psychological characteristics of 
drug users. The ends sought through use of drugs 
are varied: relief from pain, fatigue, or anxiety, 
the celebration of social solidarity, “kicks,” and 
enhanced mystical experience. Beliefs about the 
effects of the substances used and the specific ends 
sought through such use are closely linked with 
more general cultural goals and orientations. A 
particular drug—for example, marijuana (also 
called hashish and bhang)—may be accepted as 
an appropriate adjunct to sociability in one society, 
used as an invaluable ingredient in religious con- 
templation in another, and banned by law as dan- 
gerous in a third. 

There is no evidence that addiction to drugs is 
favorably regarded in any society or culture, but 
the status accorded the addict varies markedly. 
In the United States he has been defined as a crim- 


inal and stereotyped as a “dope fiend.” In much of 
Europe, on the other hand, the addict is regarded 
as an unfortunate person whose problem is pri- 
marily psychological and medical. Having stated 
that great differences exist in cultural orientations 
to specific drugs or drug effects, we are, however, 
far from being able to explain them. Opium and 
hashish have been widely used without extreme 
devaluation in Muslim society, although the Koran 
proscribes the use of substances that alter the state 
of consciousness. In traditional interpretations, it 
appears that the Koran’s injunction has been lim- 
ited to alcohol. 

To cite another example, a student of culture 
and personality has suggested that opium smoking 
was prevalent in China because it afforded a means 
of achieving the cultural goal of individual har- 
mony with the environment. But opium smoking 
did not exist in China until it was introduced by 
European traders in the seventeenth century (Son- 
nedecker 1963, p. 16). Opium was subsequently 
forced upon the Chinese by the East India Com- 
pany, despite protests by the Chinese government. 
An adequate understanding of drug use within a 
given culture requires a knowledge of historical 
facts that is seldom available. 

To the extent that the beliefs and practices sur- 
rounding drug use reflect different perspectives and 
value orientations, the social correlates and the 
consequences of drug use will themselves vary sub- 
stantially from one country to another and from 
time to time within any given country. Coffee and 
tobacco have, in various societies, been proscribed 
as debilitating or debauching; the smoking of 
tobacco was a crime punishable by death in Russia, 
Persia, Turkey, and parts of Germany (Lewin 
1924), The term “drug” refers to a wide variety of 
chemical substances consumed by man, not merely 
to narcotics or “dangerous drugs.” When applied 
to substances consumed outside of medical chan- 
nels, however, “drug” tends to connote something 
negatively valued. Hence, we do not normally apply 
the term “drug habit” to the smoking of tobacco, 
but we do to the smoking of marijuana (a non- 
addicting drug produced from the hemp plant), 
even though many more persons may be psycho- 
logically habituated to tobacco smoking than to 
marijuana. Because of common usage, this article 
is primarily concerned with narcotics addiction. 


Historical background 


From the earliest record in antiquity through 
the Middle Ages, it appears that the primary use of 
opium was medical. The letters and records of 
European travelers in the Orient during the Middle 
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Ages occasionally refer to opium as a drug used 
by the people to overcome fatigue, but they do not 
mention chronic intoxication or indicate recogni- 
tion of the phenomena of tolerance and depend- 
ency (Sonnedecker 1963, p. 16). Available records 
suggest that opium consumption was not a wide- 
spread practice or problem anywhere until the 
beginning of the seventeenth century, when opium 
smoking was introduced into China. Although the 
pleasures of opium smoking and opium eating were 
extolled by writers like De Quincey and Coleridge 
in the nineteenth century, in general neither the 
British nor other Europeans were drawn to opium; 
it was primarily an article for trade with alien 
peoples. 

Drug use in the United States. Attitudes were 
in general no more favorable in the United States 
than in Europe, but for several reasons opiate 
addiction was a more frequent occurrence in the 
former (Terry & Pellens 1928, pp. 66-90). Among 
these was the reliance on patent medicines con- 
taining opium and its derivatives. Such medicines 
were widely used in nineteenth-century America 
to combat a variety of ailments, especially “female 
disorders.” Another source of addiction in the 
United States was the excessive use of hypodermi- 
cally administered morphine in medical practice. 
Many soldiers wounded in the Civil War became 
addicts. Of quite a different nature was the third 
general influence, the introduction of opium smok- 
ing by devotees of the sporting world on the west 
coast in the last quarter of the nineteenth century. 
In this instance, opium was used purely for 
pleasure. 

Although strongly disapproved of by the conven- 
tional citizenry (local legislation against opium 
dens became widespread), the practice was taken 
up by many persons on the fringes of polite society, 
and it flourished among prostitutes and underworld 
characters. Imports of smoking opium generally 
exceeded 100,000 pounds a year between 1880 and 
1909, when they were finally cut off. As a conse- 
quence of these influences, opium consumption 
tripled in the period from 1870 to 1909. Simultane- 
ously, the image of the addict changed rapidly as 
opium use became more widespread in the under- 
world (Eldridge 1962, pp. 9-10). Prior to the 
enactment of the Harrison Act, most users of nar- 
cotics were, however, conventional members of 
society. 

The Harrison Act, passed by Congress in 1914, 
was basically designed to eliminate the nonmedical 
use of narcotics by providing controls and a care- 
ful system of accounting for and taxing all drugs 
defined as addicting that were produced or im- 
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ported into the United States. With the enforce- 
ment of this act, as interpreted by the narcotics 
section of the Treasury Department, an addict 
could no longer secure drugs legally. In the first 
two decades after passage of the Harrison Act, the 
number of addicts in the United States markedly 
declined. Since then, however, a persistent problem 
of somewhat changing character has emerged in 
the United States. 


Extent of drug addiction 


The extent of drug use and drug addiction, his- 
torically or currently, is known only in very general 
terms, Since the use of opiates and of marijuana 
and the so-called “dangerous drugs” is devalued by 
the great mass of the population in Western soci- 
ety, and since the possession of these drugs is 
illegal in many countries, it is not possible to get 
a direct count of drug users but only an indication 
of those who come to official attention in one way 
or another. An ad hoc panel on drug abuse 
reported in 1962 that discrepancies between esti- 
mates provided by federal, state, and local enforce- 
ment agencies and other sources of data were so 
great that they precluded any adequate estimate 
of the number of addicts in the United States 
(White House Conference 1963, pp. 290-292). 

The Federal Bureau of Narcotics, which has at- 
tempted to maintain a register of “active addicts,” 
listed therein nearly fifty thousand names. There 
is no way, however, of knowing how many of the 
persons so listed are actively addicted at any given 
time or, indeed, how many are still living. From 
various fragments of available data, it appears that 
the number of persons in the United States actively 
addicted to opiates within the past decade is almost 
certainly not less than 50,000 and probably not 
more than 100,000. By contrast, it does not appear 
that any other Western country has as many as 
five thousand opiate addicts, according to a review 
of foreign experience by the Joint Committee on 
Narcotics of the American Bar Association and the 
American Medical Association (Joint Committee 
1961, pp. 121-153), 

Great Britain has long reported an addict pop- 
ulation of between three hundred and four hun- 
dred. None of the Scandinavian countries has more 
than a few hundred addicts, nor does France or 
Italy. Only in West Germany does the number of 
known addicts go much higher, and even there the 
estimate is well under five thousand (Joint Com- 
mittee 1961, P. 151). 

Among Western countries, a significant fraction 
of all addicts is found among physicians and other 
medical personnel. Beyond this fact, very little is 


known about the social characteristics of drug 
users and addicts except for the United States, 
Because the problem is more prevalent in the 
United States than in any other Western country, 
and because in recent years detailed studies have 
been undertaken to establish the characteristics of 
drug users, the following discussion of patterns of 
drug use and of characteristics of drug users will 
be confined almost entirely to the United States, 

Changing character of drug use. When the 
Harrison Act was passed, drug addicts were widely 
dispersed in American society. Impressions of the 
dominant characteristics of the addict population 
varied according to the segment of the population 
seen in treatment—highly successful individuals, 
middle-class neurotics, criminals, and degenerates 
(Terry & Pellens 1928, pp. 513-516). It seems 
probable that female addicts outnumbered male 
addicts at the turn of the century. Moreover, the 
large majority of addicts were native-born whites 
of mature years. In the following two decades, the 
new addicts were largely white males. Most of them 
were introduced to drug use as young adults al- 
ready somewhat detached from conventional soci- 
ety. They tended to frequent or to take up residence 
in areas in which illegal drugs could be purchased, 
chiefly the most disorganized areas of the largest 
cities (Dai 1937). These were areas characterized 
by overcrowding, high rates of crime, and other 
social problems; they were areas that afforded 
anonymity in the pursuit of illicit activity. 

Drug use and minority group status. By the 
early 1950s, many of the areas in which illicit 
narcotics had long been available were occupied 
by new waves of migration from the southeast 
section of the United States, from Puerto Rico, and 
from Mexico. The new residents were at a consid- 
erable disadvantage economically and were poorly 
equipped to cope with the demands of an indus- 
trial society. Many adolescents and young adults 
were exposed to the availability of marijuana and 
heroin in their home neighborhoods. Opportunities 
for drug use were afforded, whether or not the 
individual was favorably oriented toward drug use. 
In some neighborhoods, as many as 10 per cent 
of all males in late adolescence were officially 
recorded as drug users (Chein et al. 1964, pp. 
40-41). 

In recent years nearly three-fourths of all addicts 
recorded on the register of the Bureau of Narcotics 
have been Negro, Puerto Rican, or Mexican-Amer- 
ican in extraction. One-third of these recent drug 
users have been under 25 years of age. Thus there 
has been a marked shift in the character of the 
problem of drug addiction in the United States in 


Sr a ss SS eee 


recent years; it is now entwined with minority 
group status. While the psychological needs and 
frustrations of minority group members undoubt- 
edly contribute to the attractiveness of drugs, 
addiction among them cannot be adequately ex- 
plained or dealt with in terms of individual psy- 
chology. 


Becoming a drug user 


It is obvious that narcotics must be available 
before there can be narcotics users; it is perhaps 
less obvious that an individual must learn the 
techniques of drug use and to some degree the 
proper way to perceive and enjoy drug effects be- 
fore he becomes a regular drug user. The process 
of becoming a user is closely related to patterns 
of association and access to drugs. 

In the period between initial enforcement of the 
Harrison Act and World War 11, most persons who 
became addicted to opiates through nonmedical 
channels probably did so either by virtue of close 
affiliation with another addict or in the course of 
thrill-seeking behavior (Dai 1937, p. 173). Follow- 
ing World War 1, however, a high proportion of 
those who became drug users had been introduced 
to the use of narcotics in the slum areas in which 
their families lived. These recent addicts have 
tended to come from family backgrounds and life 
circumstances conducive to the production of psy- 
chopathy. There is much evidence, however, that 
psychological difficulties are far more widespread 
than is illicit drug use. Recent research in New 
York City and Chicago suggests that the use of 
heroin and other opiates, in most instances, is 
learned through association with peers in the sub- 
culture of “street-corner society.” The norms of this 
subculture are generally inconsistent with and 
often openly hostile to those of conventional so- 
ciety. The orientation on the part of substantial 
numbers of adolescents is manifested in delin- 
quency and in the search for and exploitation of 
“kicks” (Finestone 1957a). 

In general, the prevailing sentiment toward drug 
use, even on the part of residents of slum areas, is 
decidedly negative. Most children learn that heroin 
and marijuana are considered “bad” by most adults. 
In areas of highest drug use, however, rejection 
of the standards of conventional society, distrust 
of policemen, and relatively favorable attitudes 
toward drugs tend to be much more widely prev- 
alent, even among a cross section of school chil- 
dren, than in other areas of the city (Chein et al. 
1964, p. 102). A substantial proportion of young 
people are likely to have friends or associates who 
use marijuana or heroin. 
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The subculture of addiction. Many young peo- 
ple have their initial drug experience with mari- 
juana reefers provided by older companions. The 
neophyte who likes the experience and wishes to 
move toward regular use must have a more stable 
source of supply than can be provided by chance 
encounters with other users. He is likely to spend 
more time with persons who use marijuana and 
to avoid those who strongly disapprove (Becker 
[1953b] 1963, pp. 62-72). This reduces the need 
for secrecy in smoking marijuana and enhances the 
pleasures of use. Simultaneously, changing con- 
tacts help to negate the popular stereotype of the 
marijuana “addict.” It becomes apparent that mari- 
juana does not completely transform the person- 
ality; the new user learns a series of positive beliefs 
about the beneficial effects of marijuana, beliefs 
constantly being reinforced by their verbalization 
within the group (Becker [1953b] 1963, pp. 72- 
78). 

The use of heroin is a step further along the 
path of alienation from conventional values. Here 
the prevailing attitudes, even in delinquent sub- 
cultures, are much more negative. Here there is 
the promise of a bigger kick but also at much 
greater cost. Most of those who try heroin for the 
first time, however, are aware that addiction does 
not come from a single or an occasional trial. It 
is likely that few anticipate that they will be 
“hooked.” Some—no one knows what proportion— 
manage to use the drug for a time on an occasional 
basis and then stop altogether without ever be- 
coming addicts (Chein 1964, p. 159). Others, espe- 
cially those for whom heroin use leads to a marked 
increase in feelings of adequacy, move quickly to 
the subculture of the addict, where “connections” 
can be made and where the drug itself becomes the 
central fact of existence. 

Lindesmith (1947, chapter 4) has noted the 
critical importance of the individual’s recognition 
that withdrawal sickness can be warded off by use 
of the drug. It is at this time that he becomes fully 
aware of the nature of addiction and of his de- 
pendency upon the opiates. Both his patterns of 
association and his self-image change markedly as 
he becomes assimilated into the subculture of ad- 
dict society. 

Social backgrounds, personality, and drug use. 
Certain personality characteristics of narcotics ad- 
dicts are markedly different from those of normal 
middle-class persons, and to a lesser degree they 
are different from those of nonaddicted persons 
coming from the same social backgrounds as the 
addicts. One major expression of the personality 
differences is in attitude. Addicts are characterized 
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by attitudes of pessimism and futility on the one 
hand and of distrust and rejection of the standards 
and representatives of middle-class society on the 
other, These attitudes do not merely characterize 
addicts, however. As indicated above, Chein found 
them to be rife among eighth-grade pupils in 
schools located in areas of high drug use. To a 
substantial degree, then, they would seem to be 
prevalent social orientations which are conducive 
to drug use. 

Another major aspect of personality difference 
between the addict and the nonaddict is the low 
self-esteem and high degree of social immobiliza- 
tion of the narcotics user by virtue of anxiety. A 
major appeal of the opiates is that they permit the 
constricted ego greater scope and freedom (Chein 
1964, chapter 9). But again the personality at- 
tributes appear to reflect aspects of the social en- 
vironment in which the addict has been nurtured, 
particularly the devaluation and deprivation that 
are experienced by residents, especially minority 
group members, of urban slums. The same terms 
relating to personality characteristics have been 
used to describe slum dwellers elsewhere who were 
not involved in drug use (Clausen 1957, pp. 263- 
266). Within the areas of highest rates of drug 
use, users tend to come from families lacking a 
stable father figure, lacking warmth between par- 
ents, and characterized by vague or inconsistent 
Standards (Chein 1964, chapter 10). All of these 
factors would tend to contribute to psychopathy 
in the child; families with such characteristics are 
in part a product of migration to and life in the 
urban slum. 


The world of addiction 


Criminality and drug use. Although narcotics 
use has long been prevalent in the underworld, 
prior to the passage of the Harrison Act the great 
Majority of opiate users and addicts in the United 
States were law-abiding citizens. Thereafter, addic- 
tion or occasional use of opiates for nonmedical 
reasons could be maintained only by criminal 
means—either the illegal purchase of smuggled 
drugs or the theft of drugs. Moreover, few addicts 
could afford the high cost of illegal drugs without 
resorting to theft or other criminal activities. 

In general, addiction to narcotics reduces the 
inclination toward aggressive or violent behavior 
(Joint Committee 1961, p. 68). In this respect, 
heroin is far less likely to lead to violence than is 
alcohol. It is the lack of the drug, rather than its 
consumption, that is most closely linked with crim- 
inal activities on the part of the addict. Larceny, 


or theft for money to buy drugs, is the dominant 
crime of the narcotics offender (Finestone 1957b, 
p.71). 

At the same time, drug use provides one type 
of adjustment within a delinquent milieu. Many, 
perhaps most, of the young drug users growing up 
in the worst urban slums are involved in delin- 
quency long before they try marijuana or heroin. 
There is some evidence that those whose drug use 
is part of a subcultural pattern that begins with 
delinquency may be psychologically more sound 
than those whose delinquent behavior did not start 
until they had become drug users and needed 
money in order to support the habit (Chein 1964, 
chapter 6). 

The important point to be made is that drug 
effects, even when harmful, are not the primary 
cause of criminal behavior. British opiate addicts 
are not criminals to any significant degree (Schur 
1962, pp. 135-140). The meaning of drug use and 
its linkage with crime depends largely on the laws 
relating to narcotics and on public attitudes toward 
addiction. 

Employment and drug use. To the extent that 
narcotics or other drugs become the central pre- 
occupation of an individual, as is true of an addict, 
that individual is likely to be an undependable em- 
ployee. A substantial proportion of physician ad- 
dicts are reported to have functioned adequately 
for long periods while addicted to narcotics, but the 
great majority of addicts are, at best, irregularly 
employed (Schur 1962, p. 131). Except for enter- 
tainers and physicians, the population from which 
addicts are drawn in the United States tends to 
be relatively disadvantaged in the job market. Since 
possession of a drug calls for more serious penal- 
ties than do most forms of theft, there is little 
incentive for the active addict to work. Even in 
Britain, however, where addiction and drug pos- 
session are not defined as crimes, most addicts 
have unsatisfactory work records (Schur 1962, p. 
134). It is not possible to state to what degree their 
poor work records reflect the personality problems 
that gave rise to their addiction or that are a direct 
consequence of the use of drugs. 

Contextual supports. By virtue of his alienation 
from conventional norms and his stigmatization, 
the narcotics addict tends, even when he is not 
using drugs, to have his closest associations with 
persons who speak his argot and accept him with- 
out regard for his criminal record. Many addicts 
voluntarily undertake “cures” in one of the hospi- 
tals established to provide for drug withdrawal and 
treatment in a drug-free environment. Few appear 


to be content with continual addiction (Lindesmith 
1947, chapter 6; Ray 1961, p. 134), but the rate of 
relapse to drug use is exceedingly high. Follow-up 
studies suggest that relapses tend to occur very 
soon after discharge from a treatment center. Of 
patients returned to New York City from the Public 
Health Service Hospital at Lexington, Kentucky, 
during the period from July 1952 to December 
1955, only 9 per cent were voluntarily abstinent 
six months later and less than 3 per cent were 
abstinent for a full five years (Duvall et al. 1963). 
Yet at the end of the five-year period, nearly 25 
per cent of the study group had been voluntarily 
abstinent for three months or more. 

Ray (1961) notes that the abstinent former 
addict is likely to be viewed with distrust by rel- 
atives and representatives of conventional society. 
Even though not using drugs, he may be period- 
ically picked up by the police for questioning. 
Under such circumstances, he is likely to turn again 
to his association with other addicts. 


Narcotics control 


The primary objective of efforts in the United 
States to control drug use has been to stamp out 
the trade in illicit drugs. Efforts to mobilize public 
sentiment against drug addicts and the drug traf- 
fic have often relied on the dissemination of mark- 
edly distorted information and on attempts to 
suppress consideration of alternative points of view 
(Eldridge 1962, pp. 39-40, 78-80). Legislation 
has provided severe penalties for the sale or pos- 
session of drugs, but this approach has neither 
eliminated illicit drug use nor contributed to the 
rehabilitation of addicts. 

Most addicts engage in small-scale peddling of 
drugs on those occasions when they are able to 
purchase more than enough for their own needs. 
They are then subject to the same penalties for 
drug sale and possession as ate the nonaddicted 
distributors of narcotics. As a consequence of 
longer prison sentences, the number and propor- 
tion of narcotics offenders, chiefly addicts, in prison 
populations has more than doubled. Increasingly, 
professional groups have called attention to the 
lack of success of punitive measures (Joint Com- 
mittee 1961; Eldridge 1962; White House Confer- 
ence 1963). 

It is too soon to predict the effect of such re- 
evaluations, but attention has been focused on 
alternative approaches. One such approach is the 
so-called British system of control (Schur 1962). 
The basic legislation relating to narcotics use in 
England is not markedly different from that in the 
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United States, except that penalties are far less 
stringent. Moreover, in interpretation of the law, 
the medical profession has been left free to deal 
with addiction as a medical problem. The addict 
has neither been stigmatized nor forced into crim- 
inal activity. 


Hallucinogenic and other “new” drugs 


In recent years a new group of drugs, the hal- 
lucinogens (also called psychedelic or psychotomi- 
metic drugs), has come increasingly into attention 
and use, especially among young intellectuals in- 
terested in deepening their psychic experiences 
(Barron et al. 1964). In the case of these drugs, 
as with alcohol, it appears that the effects are 
markedly influenced not only by the nature and 
amount of the drug taken but by the personality 
and current mood of the subject and by the con- 
text in which the drug is used and the expectations 
held. In a few places, cults have arisen around the 
use of the hallucinogens. Where this has occurred, 
some of the opprobrium directed toward illegal 
drug use has been noted. 

More closely linked with the orientation toward 
thrill-seeking and deviant behavior that character- 
izes much opiate addiction in the United States has 
been the increasing use of amphetamine and other 
stimulants (White House Conference 1963, pp. 
286-289). Although many of the new drugs are 
not addicting, their use is to a high degree expres- 
sive of rebellion and problematic behavior. It is 
not unlikely that experience with an ever-increas- 
ing number of new drugs will in time force a re- 
consideration of both the criteria and the standards 
underlying legislation for the control of drugs. 


JoHN A, CLAUSEN 
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DvuBOIS, W. E. B. 


William Edward Burghardt DuBois (1868- 
1963), American Negro leader and sociologist, was 
born in Great Barrington, Massachusetts. He was 
trained at Fisk University, at Harvard, where he 
received a PH.D. in 1895, and at the University of 
Berlin. DuBois turned from history to sociology in 
the hope of providing accurate information as the 
basis for public policy on Negro rights. 

He set himself against the enigmatic abstrac- 
tions, the “verbal jugglery,” of Herbert Spencer. In 
The Philadelphia Negro (1899), his finest scholar- 
ly work, he used the technique of Charles Booth’s 
Life and Labour of the People in London to yield 
a pioneering door-to-door survey of the occupa- 
tions, values, political influence, social intercourse, 
and family life of the 45,000 Negroes in the 
seventh ward of Philadelphia. With the research 
for this book, he tried to uncover information to 
replace the “fantastic theories, ungrounded as- 
sumptions, or metaphysical subleties” that passed 
among white men for knowledge about the Negro. 

As professor of economics and sociology at At- 
lanta University, from 1897 to 1910, he edited the 
Atlanta University Publications, a series of yearly 
pamphlets on different aspects of Negro life— 
artisans, college graduates, schools, groups organ- 
ized for social betterment, criminals, etc. The 
volumes were partly hortatory, partly factual. 
The factual section summarized DuBois’ year-long 
research, generally based on questionnaires distrib- 
uted in Negro communities throughout the coun- 
try. His results, although fragmentary, contained 
valuable data not available elsewhere. DuBois was 
the first student of Negro life to make empiricism 
the core of his work. For years he stood almost 
alone. 

The failure of scholarly research to budge the 
prejudices of white Americans moved him to more 
active agitation, Contemptuous of the gradualism 
advocated by Booker T. Washington, DuBois called 
for full equality as the immediate demand of jus- 
tice. To lead the Negro’s fight he looked to a col- 
lege-trained Negro elite, which he called the “tal- 
ented tenth” (1903a), and in 1905 organized the 
Niagara movement, consisting of a handful of edu- 
cated northern Negroes committed to his program. 
The National Association for the Advancement of 
Colored People (NAACP), a biracial group, ab- 
sorbed most of these men and the lean, aloof 
DuBois became director of research and editor of 
its monthly magazine, Crisis. He spoke from this 
podium during the 24 years from 1910 to 1934, 
making himself, after Washington’s death in 1915, 


the most significant spokesman for Negro rights in 
America and the greatest single influence on the 
generation of leaders that followed. DuBois sup- 
ported the association’s program of organization, 
publicity, and legal action; at the same time, he 
urged Negroes to free themselves from the white 
man—create their own ideals and assert control 
over their own organizations. He broke with the 
NAACP in 1934, when under the pressure of the 
great depression he argued for voluntary segrega- 
tion in a self-sufficient Negro community within 
America. He again became a professor at Atlanta, 
where he completed Black Reconstruction (1935), 
an eccentric blend of racial chauvinism, unassimi- 
lated Marxist terminology, and melodramatic fan- 
tasy, that nevertheless summarized for the first 
time the striking achievements of black-and-white 
governments in the South after the Civil War. 

DuBois felt a special kinship with nonwhite 
peoples, especially black men in Africa, whom he 
regarded as victims of Western capitalism and 
white prejudice. To publicize their needs he organ- 
ized Pan-African congresses, the first of which met 
in Paris in 1919. DuBois nursed the movement 
through the 1920s, but it went into eclipse until 
1945, when, again at his call, the sixth congress 
met as Asian and African peoples were breaking 
out of European colonial systems. Jubilant, DuBois 
found himself the intellectual “father of Pan- 
Africanism.” He drew away from the United States. 
Unsympathetic to American policies and despair- 
ing of the Negro leadership of the time, he looked 
upon the Soviet Union’s campaigns for peace and 
for world socialism as the most promising paths 
to the future. 

In his later years DuBois traveled widely in 
China and in eastern Europe, where he was cor- 
dially acclaimed at a time when his reputation at 
home had faded. He was awarded a Lenin peace 
prize in 1959. The following year he went to Ghana 
to fulfill a half-century-old dream of editing the 
first African encyclopedia. Late in 1961, DuBois 
joined the American Communist party. Finally, 
caught in the movement of African nationalism, 
he became a Ghanaian citizen just before his 
death, in Accra, Ghana, in 1963. 


Francis L. BRODERICK 


(For the historical context of DuBois’ work, see the 
biography of Boorn. For discussion of the subse- 
quent development of his ideas, see RACE RELATIONS.] 
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DUGUIT, LEON 


Léon Duguit (1859-1928) was a professor in 
the Faculté de Droit at Bordeaux from 1886 until 
his death. The portion of his work that is relevant 
to social science in general is contained essentially 
in two major works: one is the two-volume Etudes 
de droit public (1901-1902); the other is Traité 
de droit constitutionnel (1911). 

Duguit’s predominant concern was to discover 
the rules that should “control, direct [and] limit” 
the actions of those who govern, He sought to 
ascertain how those who hold political power and 
the force to compel compliance should use that 
power and that force—given that they hold them 
not as their personal right but merely as a means 
of performing a social function. They should not, 
therefore, use them arbitrarily but solely according 
to the interests of society; otherwise their actions 
would be unlawful, and hence invalid. 

In the modern state, political power is primar- 
ily exerted through legislation—that is, by the 
establishment of rules of conduct for the governed. 


306 DUGUIT, LEON 


Consequently, to determine “the norm for govern- 
ment action” means, in the first place, to establish 
what laws the political authority should and should 
not lay down, This is, then, a much larger and 
more general task than Duguit implied when he 
wrote in traditional liberal terms of “limiting the 
State,” of “limitation of the power of those who 
govern.” Actually, what is involved is nothing less 
than determining the entire juridical order that 
those in power shall create: all the rules of law 
that go to make it up, whatever their object, 
whether it be relations among private persons 
(private law) or between those who govern—or 
their agents—and private persons ( public law). 

Duguit did make a basic distinction, however, 
between two sorts of legal rules, the “normative 
rules of law,” or “legal norms properly so called,” 
and the “constructive or technical rules of law.” 
The former rules constitute prohibitions or com- 
mands; in particular, injunctions on individuals 
to act or abstain from acting—e.g., not to commit 
murder, to pay taxes, to respect property. The 
latter rules enact measures to ensure that the 
former are respected and enforced, defining the au- 
thority entrusted with applying a sanction to per- 
sons breaking them. There is no doubt that 
Duguit intended to determine only what normative 
rules control the actions of those who govern. 

For Duguit the “constructive rules of law,” as 
their name indicates, are consciously made by 
man, whereas the “legal norms properly so called” 
are quite different. Another fundamental element 
of his theories on law is the distinction (in a way 
more basic than the previous one) between “objec- 
tive law” and “positive law.” It is not difficult to 
define positive law: it is those rules of law laid 
down, applied, and enforced by “the public power” 
or state. But what did Duguit mean by “objective 
law”? 

On some occasions, he defined it by a socio- 
logical criterion: a juridical norm is a social norm 
in that the bulk of the members of a group accepts 
as legitimate its regular enforcement by those in 
power. A rule of law obtains if the conscience of 
the bulk of the people desires and demands this 
sanction as necessary to the maintenance of social 
solidarity, In short, “objective law” consists of the 
rules of law that in the opinion of the majority 
should have an organized sanction. Thus, juridical 
norms are a spontaneous product of mass con- 
science, 

On other occasions, however, Duguit gave a gen- 
uinely objective definition of objective law: a legal 
rule is one that is derived from the current and 
changing conditions of the life of a given society 


and determined by observation and rational anal- 
ysis, and that prescribes certain actions and pro- 
hibits others to all the members of the society, 
Viewed in this way, objective law is contingent on 
social fact, is in fact a spontaneous creation of 
social life. 

Regardless of his uncertainty about the exact 
nature of positive law, Duguit considered the task 
of operating with these notions of law as belonging 
to the science of law, in cooperation, to be sure, 
with other social sciences. In fact, he regarded this 
as the chief mission and raison d'être of the social 
sciences; if they fail to perform this task, they 
convict themselves of futility: “Social research has 
reason for existing and value only insofar as it 
can draw up rules of conduct that gain the respect 
of those who govern” (Duguit 1902, p. xxiii). 
“[All] juridical speculations are vain unless they 
succeed in determining . . . the solid basis of a 
juridical limitation on the action of those who 
govern” ([1911] 1921-1925, vol. 1, p. 65). Duguit 
further admitted that it is possible to establish 
these rules of objective law by observation alone, 
by “direct determination of facts” perceived by the 
senses, i.e., by the scientific method. Bent on being 
a positivist and realist, he rejected, in the tradition 
of Auguste Comte, all theological or metaphysical 
speculation. He insisted that the legal rules are 
present in social reality and that the role of the 
jurist consists in “discovering them beneath the 
social facts” and in noting them, rather than in 
any way promulgating them because he happens 
to believe that they are good. 

This scientific task will by no means produce 
“higher principles,” but rather rules that vary from 
one society to another and, over time, in the same 
society. Duguit categorically rejected the idea that 
there are rules of law that are valid absolutely, 
eternally, and universally. He asserted that all that 
we observe are rules that are diverse, relative, and 
contingent. In a word, he rejected the idea of 
natural law. The higher principle or principles 
posited by its advocates are posited a priori; they 
may indeed be “objects of metaphysical or religious 
belief,” but there is “nothing scientific” about them. 

Given these two kinds of law—objective and 
positive—how did Duguit conceive of their norma- 
tive relationships? His position is very explicit: 
only the rules of objective law (which alone are 
“legal norms properly so called”) have an obliga- 
tory force of their own, i.e., validity for those 
whose conduct they claim to determine. The rules 
of positive law (what we would call, basically, the 
law), on the other hand, are not binding in and 
of themselves, but only if, and hence because, they 


sanction a rule of objective law. When a legisla- 
tive rule is not backed up by a rule of objective 
law, it is invalid, has no obligatory force, and is 
not juridically binding on the citizens, or on the 
courts, for that matter. “Obedience is due not to 
legislation as such, but only to legislation as ex- 
pressing and executing a juridical norm. . . . 
Legislation can only be obligatory if it expresses 
a juridical norm that is prior to it, created by 
the very conscience of those to whom it is ad- 
dressed, i.e., spontaneous” (ibid., vol. 1, p. 153). 
Logically, then, it is the duty of the state “when 
it makes a law, to formulate an already existing 
rule of law that is binding in and of itself, or to 
take constructive measures to assure its enforce- 
ment” (ibid., vol. 3, p. 555) and never to set 
itself up in opposition to objective law, the “higher 
law,” which is rigorously binding on it, even if 
that higher law has not been sanctioned by a 
source of positive law. Thus, objective law is 
higher than positive law and is the condition of its 
validity, its obligatory character, its very char- 
acter as a rule or norm. At bottom, it alone is truly 
law or, in any event, original law. The “sources of 
positive law”—legislation, jurisprudence, custom 
—are not as such sources of rules of law; they 
only become that on the condition that the rules 
they profess to lay down conform to the rules of 
objective law, which is the supreme principle of 
all juridical validity. 

As Duguit saw it, his theory of the state and his 
theory of law were positivistic, realistic, and socio- 
logical. It was a “sociological—juridical doctrine” 
that he had “tried to base on pure observation of 
social facts,” sticking to “the observable facts” and 
therefore rejecting “metaphysics and all a priori 
concepts.” He liked to call himself a “sociological 
jurist.” The two thinkers who influenced him 
most were Auguste Comte, the positivist philoso- 
pher and precursor of sociology, and Emile Durk- 
heim, the great French sociologist of the early 
twentieth century. These influences can be noted in 
some of his general views on law and government. 

He rejected the idea of natural law and refused 
to idealize a fortiori or to deify the state and its 
rulers. He rejected all systems that present the 
state as an organic unity or even as an ideal unity, 
and he declared that the state is merely a society 
in which there is a differentiation between the 
governed and the governors (i.e., those who hold 
political power). The state is nothing more or less 
than “a fact of greater force.” He constantly re- 
peated that “those who govern are individuals like 
the others,” that they are not higher in nature or 
qualities than those who are governed, that their 
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will is not higher in essence or virtue than the will 
of the latter. The only constant difference between 
those who govern and those who are governed is 
that the former control the force of constraint. 

Duguit’s system did not succeed in establishing 
scientifically the proper juridical order. Rather 
than a theory of positive science (whether juridical 
or sociological), Duguit presented a doctrine of 
political philosophy or political ethics. His central 
theses clearly are meant to act on people’s minds, 
to help anchor in them certain beliefs that will 
determine their behavior. The absolute intellectual 
acceptance of facts that he advocated has come 
off second best. Duguit himself went so far as to 
admit that the statement that the state is limited 
by objective law is only an academic formula and 
that this limitation cannot in fact exist at all, “for 
there is no law without a material sanction and 
there can be no sanction against the state” (ibid., 
vol, 3, p. 549). But he nonetheless advocated that 
“we should never tire of energetically asserting the 
subordination of the state to a higher law, chiefly 
in order to impress that idea more deeply into the 
consciousness of the collectivity, so that those who 
hold power will hesitate to oppose [this law] 
openly” (ibid., vol. 3, p. 550). These dogmatic 
assertions patently contradict nearly all the basic 
principles of positive law. Duguit acknowledged 
this explicitly. 

Duguit also failed to found “objective law,” that 
is, positive law supported by scientific method. He 
never even provided examples that might indicate 
how this could be done. He confined himself to 
propositions just as formal and dialectical as those 
of the traditional theorists of the “common weal”: 
social law, which is binding on all, is that law that 
governs the formation and development of the 
group; the members of the group, and the gover- 
nors first of all, should strive to maintain and de- 
velop their interdependence, their solidarity with 
the group, by promoting justice and welfare. At 
best, these are theoretical principles from which 
no concrete rules can be deduced, with the possible 
exception of a few rules that are self-evident in 
any society. 

The great error that vitiates this system is Du- 
guit’s belief in the possibility of establishing, as a 
scientific truth, that individuals are bound by con- 
science to observe social rules emerging directly 
from social facts or, at a deeper level perhaps, the 
belief that rules of conduct exist in social facts— 
i.e., that they can in some way be read there, dis- 
covered there, and that men, or rather the “learned,” 
need only declare their obligatory force. He ac- 
knowledged these beliefs; they contain in embryo 
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the failure of his enterprise, and the weakness of 
the major part of his theory of law and the state. 

Although many of Duguit’s theories have been 
criticized, there is no doubt whatever that they 
exerted a considerable influence on several gen- 
erations of professors in French law faculties, from 
the time of their first publication early in the 
twentieth century. He not only gained wide ac- 
ceptance among legal scholars for methods of jurid- 
ical analysis, but he also won adherents among 
the politically oriented for his realistic approach 
to the state and for his assertion that the power 
of those who govern is to be considered simply a 
social function, existing only in the interest of the 
nation and to be exercised only in that interest. 
(This position was the very antithesis of the Hegel- 
ian or Prussian tradition that dominated German 
theories of public law, with their metaphysical or 
political exaltation of the power of the state.) 
Duguit’s best-known follower in the theory of inter- 
national law was Georges Scelle; in the field of 
administrative law, his most important intellectual 
descendants were Gaston Jéze, Roger Bonnard, and 
Louis Rolland. If Duguit’s doctrine as a whole, 
then, no longer has the impact it once had, his 
name is nevertheless a great one in the history 
of French juridical thought. 


CHARLES EISENMANN 


[For the historical context of Duguit’s work, see the 
biographies of Comte; DURKHEIM, For discussion of 
similar ideas, see JURISPRUDENCE; Law, article on 
THE SOCIOLOGY OF LAW; and the biographies of 
BRECHT; EnRLICH; HAURIOU; HELLER; JELLINEK; 
Kantorowicz; KELSEN; LLEWELLYN; Pounp.] 
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Arsène Jules Étienne Juvénal Dupuit (1804- 
1866), usually subscribed J. Dupuit, was a French 
civil engineer and economist; he is known today 
chiefly for his development of the concept later 
termed “consumer’s surplus.” 

Dupuit was born in Fossano, in the Piedmont, 
and moved with his family to France in 1814. He 
studied at the colléges of Versailles, Louis le Grand, 
and Saint Louis, at the Ecole Polytechnique, and 
at the Ecole des Ponts et Chaussées. After serving 
in the Corps des Ponts et Chaussées in various 
outlying departments, he became chief engineer for 
the city of Paris in 1850 and, later, inspector-gen- 
eral of the Corps des Ponts et Chaussées. 

Early in his career he became interested in the 
problem of determining under what conditions the 
construction of bridges or other public works could 
be justified, becoming probably the first person to 
pursue what is now known as cost-benefit analysis 
on a rigorous basis. In the course of this analysis he 
developed the concept that the benefits are to be 
measured not by what is actually paid but by the 
amount that might be paid under conditions of 
perfect discrimination—as measured by the area 
under the demand curve. He seems to have devel- 
oped the concept of the demand curve independ- 
ently of Cournot. [See the biography of CourNoT.] 
His measure of the “utility” of a facility was 
the excess of the benefits so measured over the 
costs or tolls paid by the user, later called “con- 
sumer's surplus” by Marshall. 

Dupuit was highly critical of attempts to justify 
public works by reference to vague “secondary 
benefits”; he reiterated that “Il my a d'utilité que 
celle qu’on consent à payer,” by which he meant 
that unless a benefit is somehow reflected in the 
demand curve, it is illusory. He was undoubtedly 
correct in his criticism of the double counting 
often found in justifications for some of the more 
grandiose schemes of his contemporaries. His 
formulation did, however, exclude “neighborhood 
effects” from his evaluation, which would be justi- 
fiable either on the ground that they were, in this 


context, negligible, or that they could, in principle, 
be charged for, even though not includable in the 
usual type of demand curve. 

These concepts were first published in 1844 in 
“De la mesure de l'utilité des travaux publics” 
([1844-1854] 1933, pp. 29-65) and in 1849 elabo- 
rated in “De l'influence des péages sur l'utilité des 
voies de communication” (ibid., pp. 97-162) and in 
1853 in “De l'utilité et de sa mesure” (ibid., pp. 
165-181), This last was apparently prepared as an 
article for Coquelin and Guillaumin’s Dictionnaire 
d'économie politique (1852-1853), which in- 
cluded articles by Dupuit on “Eau,” “Poids et 
mesures,” “Ponts et chaussées,” “Routes et che- 
mins,” “Voies de communication,” and an article 
on “Péage,” which contains cross references to an 
omitted article on “Utilité”; it is somewhat ironical 
that this article, which contains the ideas for which 
Dupuit is best remembered, was omitted in favor 
of a relatively conventional one by H. Passy. 

Beginning about 1850 Dupuit became increas- 
ingly interested in economics, and his name, which 
had since 1838 been appearing fairly regularly as 
contributor to the Annales des ponts et chaussées, 
began appearing in the Journal des économistes, 
while his engineering articles became less frequent. 
Dupuit became an ardent advocate of laissez-faire. 
He wrote a vigorous tract, “La liberté commerciale,” 
in defense of free trade, which was published in 
parts in the Revue européenne in 1860 and as a 
separate volume in 1861. He was a staunch Mal- 
thusian, arguing that standards of living were ob- 
servably higher in areas with low density of popu- 
lation and low rates of population growth and even 
going so far as to condemn charitable distributions 
in times of famine on the ground that while the 
direct objects of charity were helped, such activi- 
ties, since they would drive up food prices, would 
be primarily at the expense of the nourishment of 
the remaining poor classes, 

True to his engineering background, however, 
Dupuit remained essentially a pragmatist. He did 
not let his antagonism toward government interfer- 
ence in general inhibit his recognition that in the 
case of natural monopolies a government would be 
likely to set a price more conducive to economical 
utilization of the facility than would a private 
monopoly, He did not go so far as to advocate pricing 
at marginal cost, as A. P. Lerner and Harold Hotel- 
ling later did, and indeed at points he seems to 
Suggest that on occasion private enterprise, since 
it has greater incentive to ferret out more profitable 
schemes of discrimination, might produce a better 
utilization of the facility than would a government 
agency. He vigorously attacked the natural law 
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theory of property rights, asserting that the par- 
ticular forms that property rights took, particularly 
in relation to mineral rights and the rights of 
authors and inventors, were the result of man-made 
law that could and should decide in favor of those 
forms of property tenure that would be most con- 
ducive to economic efficiency. While he was reluc- 
tant, in many cases, to specify in detail what pattern 
of property rights this principle called for, he did 
indicate that patents should have a shorter term 
than copyrights and that discoverers of minerals 
should have some (not very clearly indicated) 
rights vis-a-vis the landlord. On the other hand, he 
considered that property rights legitimized by con- 
tract or created by government sanction must be 
respected, even if such rights were plainly of a 
sort leading to inefficiency and even if the contract 
was only implicit. Postmasters, having enjoyed the 
right to a toll on all traffic over their route whether 
or not the post relay facilities were used, could 
not be deprived of this right without compensation; 
on the other hand, their claim to a similar toll on 
the growing traffic on parallel railroads should not 
be allowed. Unlimited freedom of contract was not, 
however, desirable in all cases; in particular, free- 
dom of bequest was undesirable in that parents 
should not be allowed to bring children into the 
world and then disinherit them and cause them to 
become a burden to the rest of the community. 
Indeed, Dupuit at one point suggested that parents 
of uneducated children should be required to pay 
a special tax. Dupuit’s pragmatism showed itself 
again in his attitude toward the tobacco tax; while 
he regarded this tax as essentially unjust in singling 
out a particular taste for a special burden, and 
doubly so in that the grades consumed by the poor 
were taxed most heavily, ad valorem, the tax was 
nevertheless a good one in that it did not impair 
productivity. 

During his later years Dupuit was an active 
member of the Société d'Economie Politique, and 
he often engaged in vigorous disputation at its 
meetings. He argued that economics was attaining 
the status of a science, by which he meant that 
there was a growing body of universally accepted 
principles, a penumbra of matters still under debate 
and investigation, and a vast domain of unexplored 
matters. He regarded free trade as one of the uni- 
versally accepted principles and went so far as to 
take editor Henri Baudrillat to task (1863) for 
stooping to include in the Journal des économistes 
such “unscientific” articles as the review by R. de 
Fontenay of a French translation of Henry Carey’s 
Principles of Social Science, in which free trade 
and Malthusian doctrines were strongly attacked. 
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On other occasions he spoke out vigorously against 
controls over banks and dealers in foreign ex- 
change, against licensing of doctors, against the 
promotion of workers’ cooperatives and labor 
unions, against child labor laws, government crop 
predictions, and property transfer taxes. While he 
felt that woman's place is in the home, he opposed 
legal barriers to the employment of women that 
existed in certain occupations. However, he ex- 
pressed himself in favor of government ownership 
of railroads. He considered the premature redemp- 
tion of bonds as inequitable if not clearly provided 
for in the indenture and preferred that government 
bonds take the form of nonredeemable perpetuals 
or “consols,” arguing that to provide explicitly for 
redemption was unduly costly. He objected to the 
state monopoly of education, holding that it made 
it impossible to teach economic doctrines that im- 
plied criticism of state action, and he felt, more- 
over, that too much time is wasted in higher edu- 
cation as compared with apprenticeship. 

Dupuit was no less a center of controversy in his 
engineering role. In 1842 he published the results 
of a series of experiments on friction in wagon 
wheels, in which he came to the conclusion, backed 
by theoretical analysis, that the rolling component 
of the friction varied inversely as the square root 
of the radius of the wheel, contradicting both the 
relation inherited from the eminent Coulomb and 
the empirical work of his contemporaries, who 
claimed the component varied inversely as the 
radius itself. Over the next decade he actively advo- 
cated more liberal traffic rules, particularly those 
relating to maximum loads on vehicles, on the 
basis that road maintenance, consisting in those 
days mainly in the replacement of material ground 
into dust by passing traffic, varied very nearly in 
proportion to the gross weight of the traffic and 
hence that heavy vehicles would not be uneconom- 
ical for the nation as a whole if thev were profitable 
to the operator. This effort met with only limited 
success. In 1850 he published the results of an in- 
vestigation of the collapse of a suspension bridge 
at Angers. During the crossing of an army detach- 
ment in a heavy storm, 226 lives were lost; Dupuit 
blamed the failure on the ineffectiveness of meas- 
ures taken to protect the suspension cables from 
rust as they passed through the anchorages, which 
were subject to flooding. Since the same methods 
had been fairly widely used elsewhere, his findings 
were attacked with some vigor. 

With regard to his economic activities, it is his 
work on utility and consumer's surplus that consti- 
tutes his lasting contribution. While there is no 


mention of Dupuit in the first edition of Jevons’ 
Theory of Political Economy (1871), the second 
edition contains an extensive discussion of Dupuit’s 
contribution in the preface, and full references in 
the bibliographical appendix. That Jevons highly 
respected Dupuit’s contribution is shown further 
by the extensive comments interpolated by H. $. 
Jevons from his father’s manuscript notes in the 
bibliographical appendix for the fourth edition, 
Maffeo Pantaleoni refers frequently to Dupuit in his 
Principles of Pure Economy, but he is concerned 
primarily with the concept of utility as an element 
of value theory and makes little prescriptive use of 
Dupuit’s concepts. Marshall also refers to Dupuit 
but indicates that both he and Jevons originally 
arrived at their own formulations independently of 
Dupuit. And while Marshall does discuss the pos- 
sibility that it might be desirable to subsidize de- 
creasing cost industries, his exposition lacks the 
incisiveness of that of Dupuit. It remained for 
Pigou in 1912 in his Wealth and Welfare and in 
1920 in his Economics of Welfare to elaborate the 
prescriptions to which Dupuit pointed. Pigou, how- 
ever, does not refer to Dupuit as a source, nor do 
most of the writers on the theme of marginal cost 
pricing or subsidy to decreasing cost industries 
throughout the 1920s and 1930s until Hotelling, in 
1938, possibly stimulated by the republication of 
Dupuit’s salient works by Mario de Bernardi in 
1933, based his mathematical analysis of the case 
for marginal cost pricing squarely on Dupuit as a 
precursor. Thus, although Dupuit must clearly be 
given priority in the formulation of some of the 
fundamental concepts of modern welfare eco- 
nomics, his work appears to have had singularly 
little actual influence on the way economic thought 
developed over the next century. 


WILLIAM S. VICKREY 


[For discussion of the subsequent development of 
Dupuitť’s ideas, see CONSUMER'S SURPLUS; UTILITY; 
WELFARE ECONOMICS. ] 
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Emile Durkheim (1858-1917) may be called 
one of the two principal founders of the modern 
phase of sociological theory, the other being his 
somewhat younger contemporary Max Weber. In 
his four major works, starting with The Division 
of Labor in Society of 1893 and ending with The 
Elementary Forms of the Religious Life of 1912, 
and in a large number of articles, monographs, 
and carefully worked out courses of lectures (sev- 
eral of which have been published posthumously), 
Durkheim established a broad framework for the 
analysis of social systems that has remained cen- 
tral to sociology and a number of related disci- 
plines, particularly anthropology, ever since. Even 
those who basically disagree with it take it as a 
major point of reference. This frame of analysis 
underwent substantial development in the course 
of Durkheim's own career, but it focused continu- 
ally on the nature of the social system and the 
relation of that system to the personality of the 
individual. 

Durkheim was born in the town of Epinal in the 
Vosges, not far from Strasbourg. He was of Jewish 
parentage, and some of his forebears were rabbis. 
Indeed he was expected to be a rabbi himself until 
he became an agnostic. He attended the famous 
Ecole Normale Supérieure in Paris, together with 
such luminaries as Henri Bergson, Jean Jaurés, 
and Pierre Janet. His primary focus was on phi- 
losophy, but he already had the strong concern with 
political and social applications that he retained 
throughout his life. He was too rebellious to rank 
high among the agrégés of his year, and his first 
academic appointments were as teacher of philos- 
ophy in several provincial lycées. 

In 1885-1886 Durkheim took a year’s leave of 
absence to study in Germany, where he was par- 
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ticularly impressed by the work of the psychologist 
Wilhelm Wundt. A professorship of sociology (com- 
bined with education), the first in France, was 
created for him in 1887 at Bordeaux, and he re- 
mained there until, in 1902, he realized the am- 
bition of all French academics: he was called to a 
professorship in sociology and education at the 
Sorbonne in Paris. There he gathered round him- 
self a distinguished group of younger men, includ- 
ing Henri Berr, Marcel Granet, François Simiand, 
Maurice Halbwachs, and, not least, his own neph- 
ew, Marcel Mauss. In the most intimate relation- 
ship to his own work, Durkheim founded and 
edited the very important journal, L’année socio- 
logique. On two significant occasions he became 
very much involved in political affairs: during the 
Dreyfus case and during World War 1. And over a 
considerable period he was actively concerned 
with applied sociology, most notably perhaps in the 
field of education. 

The first three of Durkheim’s four books, the 
Division of Labor, the Rules of Sociological Method, 
and Suicide, were all published during his Bordeaux 
period, in 1893, 1895, and 1897 respectively. Then 
there was an interval of 15 years before the Ele- 
mentary Forms (1912) appeared. After the move 
to Paris, Durkheim was deeply involved both with 
his teaching and with the group discussions and 
activities centering on L’année sociologique. It is 
clear, however, that his thought was developing 
very rapidly and continuously during this period: 
witness such fundamentally important articles as 
“The Determination of Moral Facts” (1906) and 
“Primitive Classification” (Durkheim & Mauss 
1903). The great book on religion, then, was 
the ripe harvest of a long process of intensive 
cultivation. 

There is evidence that the war was a very great 
blow and strain to Durkheim. Not only was the cost 
to France high indeed: Peyre tells us (1960) that 
over half the class that entered the Ecole Normale 
in 1913 was killed before the war ended; but Durk- 
heim also lost his only son in 1916. These strains 
may well have helped to cause his own death from 
a heart attack, on November 15, 1917, at the age 
of 59. 

Intellectual background. Despite some con- 
troversy about the influence of his stay in Germany, 
the evidence shows that Durkheim’s thought was 
rooted overwhelmingly in French intellectual his- 
tory. In the remoter background, Descartes and 
Rousseau were the most important, although in 
quite different ways. Much closer to him were 
Saint-Simon, Auguste Comte, and his own teacher, 
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Fustel de Coulanges, as well as such others as 
Emile Boutroux. 

Durkheim’s deep concern with the prominent 
contemporary intellectual currents of other coun- 
tries, especially England and Germany, was authen- 
tically French: it is no disparagement of the origi- 
nality of French thought on problems of man and 
society to say that it filled a mediating position be- 
tween the two wings of the main European trends 
of thought, British empiricism and utilitarianism 
and German idealism. In a crucial sense, modern 
sociology is a product of the synthesis of elements 
that have figured most prominently in these two 
traditions, and it seems to have been the mediating 
character of his French background that gave 
Durkheim a distinctive “place to stand,” from which 
he contributed so effectively to this synthesis. 
Hence a brief sketch of both “wings” will help in 
the understanding of Durkheim's own orientation 
and statement of problems (see also Parsons 
1937; 1965). 

As both these traditions developed, perhaps the 
crucial problem was what happened as the Car- 
tesian approach to the problem of knowledge was 
adapted to the analysis of action. The British posi- 
tion is clearest in the economic branch of utilitarian 
thought, although it dates from the earliest utili- 
tarian formulation by Hobbes: man is conceived as 
having not only “sensations” or “ideas,” in the 
epistemological sense of Locke, but also what the 
economists called “wants” (and what Hobbes, 
speaking in a political context, went so far as to 
call “passions”), The wants define the goals of 
action, whereas knowledge of the situation in 
which action takes place provides guidance for the 
instrumental use of resources (including the indi- 
vidual’s own capacities) toward the satisfaction of 
these wants. Mere knowledge of the situation clear- 
ly does not suffice to satisfy wants; the situation 
must be changed in desired ways and prevented 
from changing in undesirable ways. Throughout, 
the point of reference is the conception of an indi- 
vidual acting in pursuit of his own “interests.” 

This frame of reference provided the background 
for a most important development in the analysis 
of action, namely, a first technical analysis of the 
structuring of social means for the satisfaction of 
wants. The economists, by considering how a plu- 
rality of individuals, as producers and consumers, 
interact in the division of labor and exchange, 
ingeniously extended Hobbes’s formulation—of 
men interdependent in their interest in “power”— 
to a conception of social systems of action coordi- 
nated by the market and the monetary mechanism 
rather than of action by discrete individuals. As far 


as it went in classical economics, this conceptual 
venture was brilliantly successful; but its exceed- 
ingly limited scope gradually became evident in 
two borderline contexts. 

One concerned the analysis of the bases of action 
of the individual. The inadequacy of classical eco- 
nomics here lay not only in its tendency to assume 
“wants” as given but also in its lack of a clear-cut 
way of establishing relationships among the dif- 
ferent wants of a single individual, to say nothing 
of the different wants of several persons interact- 
ing in the same social system. Without concepts to 
establish these relationships, the treatment of 
wants as given easily shaded over into the assump- 
tion of their randomness. Likewise, shaky as- 
sumptions were made with respect to the problem 
of “rationality,” that is, of the relation between 
means and wants conceived as ends. In this con- 
text, the empiricist—utilitarian tradition tended to a 
reductionism that is still very much with us: it 
moves from consideration of the characteristics of 
the social system (in the economic case, a market 
system) to the consideration of the properties of 
constituent units (i.e., individuals rationally en- 
gaged in want-satisfaction), then to the wants, 
next to the psychological determinants of the 
wants, and eventually to their biological conditions. 

The second problematic context bordering on 
classical economic analysis concerned what we now 
call the problem of order. How could the relational 
structure of a market economy be expected to have 
even a minimum level of stability when the indi- 
vidual participants were in the first instance bound 
to that structure only by “self-interest,” i.e., by their 
interest in the effective satisfaction of their several 
wants? Hobbes had presented a radical solution 
to this problem—the establishment of an abso- 
lute sovereign authority—in Leviathan, but, as 
Halévy made clear (Halévy 1901-1904), Hobbes’s 
influence was pushed aside by that of the Lockean 
wing of the utilitarian tradition, which assumed a 
“natural identity of interests.” The Lockean tra- 
dition did not really attempt to solve the problem 
of order but instead tried to justify the refusal to 
consider it. Although the Lockean approach facili- 
tated certain valuable developments in economic 
analysis and in some forms of political analysis, it 
failed to provide that solution to the problem of 
order which was needed before a generalized inter- 
pretation of modern “economic individualism” 
could be developed. A notable version of the prob- 
lem, which greatly influenced Ricardo and indi- 
rectly influenced Marx, was advanced by Malthus, 
but it remained for Durkheim to make a funda- 
mental direct attack on the problem. In terms of 
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substantive sociology, this is the main starting 
point of his more technical theory. 

Before taking this up, however, a few words 
must be said about the other current of thought 
converging on the French “middle ground,” namely 
German idealism and the movements stemming 
from it. 

The problems that social science must explain, 
in Durkheim’s view, lie on the subjective side of 
the Cartesian dichotomy, since the entire main 
tradition of epistemology, of which Descartes’s 
work was the focal point, virtually limited the ex- 
ternal world to the world of objects as understood 
in terms of the new physical science. (It was, of 
course, possible—witness biologically based psy- 
chology and anthropology—to move into social sci- 
ence from the base of the object world, but this 
path was relatively unimportant to Durkheim. ) 
Whereas the empiricist utilitarians had used this 
“subjective” element merely as a reference point 
for the study of behavior, failing conspicuously to 
structure it on its own terms, the idealists increas- 
ingly focused upon it and tended to treat it as a 
category of objects. In this respect Kants philos- 
ophy seems to have been transitional, while the 
Hegelian “objective spirit” (objektiver Geist) is the 
focal idealistic conception relevant here. This con- 
ception of Geist was primarily cultural, somewhat 
in the tradition of Platonic Ideas. As such it was 
transindividual, on quite a different level from the 
discrete wants of utilitarianism. 

The Hegelian conception underwent various 
changes, only two aspects of which require men- 
tion here, One was the abandonment of the grandi- 
ose Hegelian Weltgeist in favor of the more 
restricted “spirits” of what many late nineteenth- 
century German scholars called discrete “historical 
individuals,” such as particular cultures or civiliza- 
tions in particular epochs. This modification was 
perhaps most consistently expounded by Wilhelm 
Dilthey. The other was that developed by Marx. As 
the one who “set Hegel on his head,” Marx was 
ostensibly a materialist rather than an idealist. 
Nevertheless, his materialism belongs to the idealist 
tradition in that it treats human culture and moti- 
vated action as objects, and it tends to be “histor- 
ical” in the special sense of handling “history” as 
a series of ideographic exemptions from treatment 
in terms of generalized analytical categories. 

Durkheim accepted the crucial Cartesian state- 
ment of the problem of knowledge in terms of the 
relation between the knowing subject and the 
known world of external objects. In his initial 
orientation he was a Cartesian “rationalist,” in the 
sense that he approached the sociological problem 
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as a problem of knowing “social facts” in terms of 
their place in the object world. However, as he 
shifted from the problem of knowledge to that of 
action, he became concurrently concerned with 
social facts as both the social scientist and the 
actor in society, as subjects, know them. The prob- 
lem of the relation between the two references was 
the core problem of Durkheim's scheme. Thus, al- 
though basically Cartesian, this scheme could not 
be developed without going beyond a Cartesian 
position in several respects. 

Rousseau, as the primary philosopher of “demo- 
cratic individualism” in his time, influenced Durk- 
heim by his special point of view about the 
characteristics of social phenomena. While Rous- 
seau shared the frame of reference of natural law 
and natural rights that was so prominent in the 
seventeenth and eighteenth centuries (and which, 
in important respects, came to France from Eng- 
land through Locke), he handled the problem of 
the social integration of those “born free” into a 
society without invoking the predominantly coer- 
cive sovereign of Hobbes or assuming the natural 
identity of interests, as did Locke. Rather, he postu- 
lated a resolution of interests at the level of inte- 
grative action processes in terms of the concept of 
“will.” More than any other, Rousseau’s famous 
concept of the volonté générale provided a concep- 
tion of social solidarity that was neither economic 
in the sense of classical economics nor political in 
the sense of Hobbes or Austin. It was not a given 
“identity” of interests, but one achieved and institu- 
tionalized in the course of social process. Comte’s 
concept of “consensus,” which stood more immedi- 
ately in Durkheim’s background and was explicitly 
defined as sociological, was transitional between 
Rousseau’s “general will” and Durkheim’s concep- 
tion of solidarity, which lay at the core of his 
sociology. 

The problem of order. Durkheim’s initial ori- 
entation to the study of society was twofold. The 
substantive aspect was developed in the Division 
of Labor and concerned the problem of order in a 
type of system we might call economic individu- 
alism. The methodological frame of reference was 
developed more fully in the Rules of Sociological 
Method, published two years later. 

The critical starting point of the Division of 
Labor is its discussion of Herbert Spencer's con- 
ception of a system of contractual relations (Divi- 
sion of Labor, book 1, chapter 7). Durkheim clearly 
understood that order in a concrete system of con- 
tractual relations—in which the market figured 
prominently—could not be accounted for in the 
terms set forth by Spencer, whom Durkheim treated 
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as a representative utilitarian. Unless controlled 
by other factors, a society dominated by the pure 
pursuit of self-interest would dissolve into a Hob- 
besian state of nature, a complete breakdown of 
order. The other factor or set of factors Durkheim 
formulated in two different ways, and on different 
levels. Closest to Spencers analysis was the con- 
ception of the “non-contractual elements of con- 
tract,” the important idea that contracts, i.e., the 
ad hoc agreements between parties, are always sub- 
ject to generalized norms. These norms are not 
open to negotiation between parties; they exist 
prior to any such agreements, having evolved over 
time. In more comprehensive systems, these rules 
or norms are part of the formal law and are en- 
forced by the legal sanctions of public authority. 
Their subject matter is the definition of the in- 
terests for which contracts may be entered into 
(for example, a man may not contract away his 
basic civil rights), the means by which such inter- 
ests may legitimately be pursued (in general terms, 
coercion and fraud are excluded), and the bearing 
on contracts of interests other than those of the 
contracting parties (both the public interest and 
those of third private parties must be protected). 

As noted, at one level the institution of contract 
is a prominent part of the legal system. Durkheim, 
however, wanted to go behind the establishment of 
norms by political authority to societal structures 
that may be said to “underlie” the mobilization of 
political authority for the enforcement of contracts. 
He introduced the concept of organic solidarity 
essentially to designate the capacity of a social 
system to integrate the diverse interests inherent 
in qualitative structural differentiation. Durkheim 
related solidarity, in turn, to a conception of its 
underlying ground, which he called conscience col- 
lective—translatable as either collective conscience 
or collective consciousness. The normative em- 
phasis of the first translation was important to 
Durkheim himself: the conscience collective was a 
“system of veliefs and sentiments” held in common 
by the members of a society and defining what 
their mutual relations ought to be. 

Clearly the conscience collective is a derivative 
of Rousseau’s “general will” and Comte’s “consen- 
sus.” Equally clearly, it is not purely cognitive in 
reference. The most important step that Durkheim 
took beyond his predecessors, however, was to treat 
solidarity and with it, presumably, the conscience 
collective, not simply as given, but as variable 
entities. He made a distinction, therefore, between 
organic solidarity and mechanical solidarity. Or- 
ganic solidarity is the analytical type characterized 
by the structural differentiation of the division of 


labor; modern society represents a case of predom- 
inantly organic solidarity. Mechanical solidarity, by 
contrast, is characterized by uniformity and lack 
of differentiation. With this distinction, Durkheim 
from the beginning built both historical—indeed, 
evolutionary—and comparative dimensions into his 
sociological analysis (Bellah 1959). 

There is an initial difficulty in interpreting the 
relation between Durkheim’s two types of solidarity, 
on the one hand, and the concept of the conscience 
collective, on the other. Since the conscience col- 
lective stresses the commonness of the beliefs and 
sentiments that constitute it, this seems to identify 
it with mechanical solidarity and suggests that 
organic solidarity, associated as it is with differen- 
tiation in the social structure, must develop at the 
expense of the conscience collective. The broad 
solution to this difficulty, which becomes clearer in 
Durkheim’s later work, hinges on the functions 
attributed to values and norms in social systems of 
different degrees of differentiation. The focus of 
the conscience collective seems to be what we have 
come to call the values common to the members of 
any relatively well integrated social system; the 
sharing of common values is a constant feature of 
all such systems—at whatever level of differentia- 
tion. In the case of mechanical solidarity, these 
values are not clearly differentiated from the norms 
through which they are implemented, but in the 
organic case the norms come to have independent 
salience. In the relatively less differentiated social 
systems characterized by mechanical solidarity, 
common, in the sense of uniform, sentiments tend 
to be implemented directly in collective action, 
while in the case of organic solidarity the common 
element lies at a more general level and must be 
implemented in relation to different functions in 
the system through norms that are not identical for 
different sections of the collectivity. 

Sociological method. The second main line of 
development of Durkheim's analysis has to do with 
the fitting of these broad empirical considerations 
into what I have called his Cartesian frame of 
reference. The starting point is the conception of 
the actor as member of a social system and as 
oriented to the environment in which he acts. This 
actor, conceived on the model of the philosopher— 
scientist, observes and interprets the facts of the 
external world: the distinctive problem is not their 
status as facts (of the environment), but as social 
facts. Here Durkheim self-consciously and explicitly 
denied the physical environment its unique “real- 
ity.” The milieu social—for him the relevant en- 
vironment—is, as part of society, a “reality sut 
generis,” to be studied in its own right. The central 


problem concerns the properties of this category of 
“reality.” 

This problem in turn has two principal aspects. 
From the viewpoint of the scientific observer, this 
reality is clearly factual, or as we would say, em- 
pirical. But what was it from the viewpoint of the 
actor, in the second sense in which Durkheim was 
using the Cartesian scheme? A society is a given 
reality (it has “exteriority”) from the point of view 
of its own members, but it also regulates (“con- 
strains”) their action. This it does not only in the 
sense in which the physical environment sets con- 
ditions that action must take into account but also 
by defining goals and normative standards for 
action. Durkheim quite early conceived of this con- 
straint as more than a matter of given conditions; 
he saw it rather as a system of rules enforced by 
humanly imposed sanctions. In this theoretical de- 
velopment, Durkheim was evidently following up 
his previous analysis of the law (in the Division of 
Labor) as both an index of the structure of the 
society (e.g., of the nature and extent of its dif- 
ferentiation) and, when it is considered together 
with the beliefs and sentiments of the conscience 
collective, as a very important normative com- 
ponent of all societies. 

There is, however, a still deeper aspect of the 
problem. A scientific observer of physical events is 
not in quite the same sense a “member” of the 
system he observes as is the social actor, although 
it is not acceptable to suggest that there is no sense 
in which they are similarly “members.” It was 
necessary, therefore, to relativize the sense in 
which the system that Durkheim calls society con- 
stitutes only an environment to the individual 
actor-members that compose it. This problem, then, 
came to be intertwined with that of the status of 
the normative aspects of a society. 

The essential conclusion of Durkheim’s thinking 
is that for the sociologist the boundary between 
“individual” and “society” cannot be that of com- 
mon sense. If we interpret the former concept as 
something like the human personality, it must 
include a sector of the social system, most specifi- 
cally, the normative aspect of that system, the 
shared beliefs and sentiments that constitute the 
conscience collective. By this path Durkheim ar- 
rived at the crucially important view that essential 
elements of culture and social structure are inter- 
nalized as part of the personality of the individual. 
In this he converged notably with Freud and with 
the movement in American social psychology from 
Charles H. Cooley to George Herbert Mead and 
W. I. Thomas. Durkheim's quite revolutionary con- 
clusion now seems to follow more or less inevitably 
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from his premises, once he tried seriously to fit into 
the Cartesian frame of reference a distinctive nor- 
mative level of the social system both as a “reality 
sui generis,” for the actor as well as for the ob- 
server, and as an environment that is much more 
than just an environment. 

This meant a radical reinterpretation of Durk- 
heim’s original criteria of social facts—constraint 
and exteriority. The concept of social facts was de- 
veloped, then, through three phases: first, exterior- 
ity, or the givenness of empirical existence, as in 
the case of the physical environment; second, con- 
straint, or the effect of a normative rule to which 
sanctions are attached; and now, third, what Durk- 
heim called the “moral authority” of internalized 
values and norms, which “constrain” the individual 
to conform by arousing guilt in his own conscience 
if he does not conform. An element of exteriority 
is involved in moral authority because, although 
internalized, the normative system must also ob- 
jectively be part of a system extending beyond the 
individual. It is not “subjective” in the sense of 
being purely private to the individual, for it is also 
a “cultural object” in a sense relevant to the ideal- 
istic tradition. 

The theoretical development at this highest level 
of generality—Durkheim’s decisively new concep- 
tion of individuals interacting in a social system— 
did not fully crystallize until the early years of the 
present century, when Durkheim gave primary at- 
tention to the relations between moral norms and 
the process of education (1902-1906). Some of its 
roots in the more empirical emphases of the Divi- 
sion of Labor have already been indicated, Cer- 
tainly the most notable transitional formulation of 
the concept of social fact is in his study of suicide. 
Durkheim's sensitivity to the major problems of 
suicide went back to the Division of Labor and its 
critique of utilitarianism, more specifically the utili- 
tarian claim that an increasing division of labor 
and the resultant economic progress would be ac- 
companied by increasing “happiness,” Durkheim 
was struck by the fact that the economic progress 
of newly industrialized societies was everywhere 
accompanied by a rise in suicide rates. This was 
clearly an anomaly from the point of view of utili- 
tarian theory and stimulated Durkheim to a major, 
if not complete, theoretical reconstruction in his 
classic monograph Suicide (1897). 

Very advanced for its time as an empirical study, 
Suicide established a most important link between 
Durkheim's theoretical work and the traditions of 
empirical research that have since hecome promi- 
nent, especially in the United States. Durkheim's 
essential method was systematically to mobilize 
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available statistical information on suicide rates 
and to relate their variations to a whole series of 
characteristics of the populations involved. In the 
nature of the case, he was limited to the modern 
Western world, which alone provided the kind of 
information he sought. With this limitation, he 
studied nationality, religion, age, sex, marital 
status, family size, place of residence, economic 
status, and variations in economic conditions, as 
well as the seasons of the year and even the times 
of day when suicides occurred. He showed great 
ingenuity and a capacity to take pains—for exam- 
ple, in breaking down the data published for France 
by départements into arrondissements, in order to 
reveal important variations masked in the larger 
units. As Bellah points out (Bellah 1959), Durk- 
heim brought together what information he could 
find from the broadest possible comparative range, 
even when it could not be stated quantitatively. 
For instance, he cited voluntary self-immolation on 
the part of Buddhist zealots as an example of what 
he called “altruistic” suicide. 

Durkheim found the conventional classification 
of the “causes” of suicide, in terms of which the 
data were generally reported, quite unhelpful for 
his purposes. He introduced a highly original 
scheme of his own, built about the problem of the 
individual's relations to the normative structuring 
of the social system in which he is involved. This 
scheme embodies two pairs of polar extremes, at 
which suicide rates are relatively high, and median 
continua between the poles, in which suicide rates 
are relatively low. The first pair of poles has 
altruisme and égoisme at the extremes, the second 
anomie and fatalisme. 

That Durkheim was no mere extoller of the vir- 
tues of solidarity (as is sometimes alleged) is 
shown by his conception of the first polar pair in 
general and of the concept of “altruistic” suicide in 
particular. In this type the claims of the collectivity 
are so strong that there is a repeated tendency to 
subordinate personal interests to them to the extent 
of sacrificing life, even when there does not appear 
to be a practical emergency that requires such sac- 
rifice. Durkheim found military officers most prone 
to this in modern societies, but adduced numerous 
other examples from other societies. The antithesis 
of this type is “egoistic” suicide, which, for exam- 
ple, results in a higher rate of suicide among Prot- 
estants than among Catholics. This Durkheim 
explained by the social pressure inherent in Prot- 
estant norms toward a higher order of individual 
religious responsibility. It is a remarkable interpre- 
tation, both in itself and because it converges with 


the theme developed a few years later by Max 
Weber concerning the importance of the Protestant 
ethic in modern society. There is also, interestingly, 
an echo of Rousseau, in that Durkheim seemed to 
be citing an instance of the famous paradoxical 
formula about a man being “forced to be free,” 
adding that this enforced freedom may become too 
hard to bear. 

The second pair of polar concepts that Durkheim 
advanced in this connection was that of anomie 
and fatalisme (the latter concept not being devel- 
oped). Anomie has become one of the small num- 
ber of truly central concepts of contemporary social 
science. It is best interpreted in terms of Durk- 
heim’s Cartesian reference. The observer as actor 
is naturally concerned with the definiteness of the 
“reality” with which he is confronted. In a purely 
cognitive context, this is a matter of the adequacy 
of his information and analysis. Insofar, however, 
as the “reality” is man-made and, in one aspect, is 
normative for the actor, the problem of definiteness 
becomes that of “definition of the situation” in the 
sense established by W. I. Thomas and by refer- 
ence-group theory more generally [see REFERENCE 
GROUPS]. 

The focus, then, is on what is expected of the 
actor and on the problem of the definiteness of 
expectations. In the case of the physical conditions 
of, for example, technological procedures, expecta- 
tions can reasonably be defined in terms of the 
goals of the actors; they do not pertain to the ex- 
ternal processes and technically defined probabili- 
ties concerning the environment itself, since it does 
not “act.” In a system of social interaction, on the 
other hand, “success” cannot simply be a function 
of “control” over the environment, but necessarily 
involves also the “sense” it makes to exert effort 
and, generally, to expend resources, unless the out- 
come to which the actor is committed is clearly 
desirable. The sense to ego of his goal-striving 5 
thus a function both of alter’s action and of ego's 
expectation concerning it. The meaning of success 
cannot be established without understanding the 
interplay between the motivation of the actor and 
the normative claims impinging upon him from his 
social environment. At the same time, the social 
environment of any given actor of reference is com- 
posed of other actors whose action must be an- 
alyzed in the same terms as the first. In this 
interactive framework anomie may be considered 
that state of a social system which makes a par- 
ticular class of members consider exertion for suc 
cess meaningless, not because they lack capacity 
or opportunity to achieve what is wanted, but be- 


cause they lack a clear definition of what is desir- 
able. It is a “pathology” not of the instrumental 
system but of the collective normative system. 

Spelling out this concept leads to many refine- 
ments. In more contemporary terms, what is ill- 
defined may be ultimate beliefs, values, norms, or 
goals. Anomic uncertainty may affect either very 
generalized orientations or relatively specific goals; 
or the difficulties may arise from conflicting ex- 
pectations, as in the classic instance of “cross- 
pressures.” 

The two concepts of égoisme and anomie epito- 
mize Durkheim's concern over the state of modern 
society, Because of the current preoccupation with 
problems of “meaning” in contemporary life, it is 
not surprising that anomie has attracted far more 
attention than égoisme. It is my own view that the 
balance is in need of being redressed. Egoisme, in 
Durkheim's special sense, is a designation for one 
aspect of a prominent feature of modern social 
structure that can be called, more generally, “insti- 
tutionalized individualism.” Another context in 
which Durkheim emphasized égoisme was his dis- 
cussion of the “cult of individual personality” 
(Neyer 1960). At least some aspects of the subject 
of “alienation” (discussed so often and with so 
much confusion) may also be interpreted in terms 
of égoisme and altruisme. Thus, alienation appears 
to be the pathological extreme (anomic in certain 
aspects, cf. Tiryakian 1962) of institutionalized 
individualism at which “conformism” becomes asso- 
ciated with the altruistic tendencies, in Durkheim's 
sense. The alienated person, then, is under such 
pressure to establish his independence from pres- 
sures to conform that he becomes unable to accept 
the essential normative conditions of a stable sys- 
tem of organized individual freedom. 

Theory of culture. In the last major phase of 
his intellectual career, Durkheim dealt mainly with 
another set of themes that grew out of, but were 
distinct from, those outlined so far. These con- 
cerned religion, symbolic systems, and his some- 
what new conception, “collective representations.” 
In short, he emphasized the theory of culture in 
relation to that of the social system. As early as 
the Preface to Volume 2 of L’année sociologique 
(1899), Durkheim acknowledged the strong em- 
phasis on religion in that publication and outlined 
his conception of religion as the primordial “ma- 
trix,” out of which the principal elements of culture 
emerged by the process of differentiation. His con- 
cern with primitive religion, as well as with an 
articulated evolutionary perspective, was already 
clear in this statement. It is important for these 
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later developments in Durkheim's thought that the 
relatively new science of anthropology had arisen 
as a kind of mediator between utilitarianism and 
Darwinian biology. Anthropology became the “study 
of man” as part of the organic world, concerned 
especially with primitive societies, particularly with 
their magic and religion. 

We have noted that Durkheim’s conception of 
society as a “reality sui generis” steadily changed; 
he placed an increasing emphasis on the normative 
components. While the legal norms constituted his 
initial prototype, he gradually focused upon more 
general aspects, moving toward the conception of 
what we would now call institutionalized values. 
He particularly stressed the attitude of moral re- 
spect as a component of internalized norms. 

What is perhaps Durkheim’s most important 
single step in extending this perspective was stated 
as one of the primary orienting perspectives of the 
Elementary Forms. This was a double proposition: 
first, that the attitudinal distinction between treat- 
ing things as sacred and as profane is basically 
the same as that between moral obligations and 
expediency or utility; and second, that the quality 
of sacredness does not reside in the intrinsic prop- 
erties of the object treated as sacred, but in its 
properties as a symbol, From this it was a short 
step to relate sacred physical and social objects to 
the whole world of “cultural” objects, which, al- 
though very close to Durkheim’s early category of 
beliefs and sentiments, he increasingly formulated 
as “representations.” We may certainly interpret 
them as symbolic systems, leaving open the ques- 
tion of the meaning references of various categories 
of symbols. 

Durkheim was greatly impressed by the close- 
ness of integration between the religious system of 
representations and the structure of the society 
itself, the attitude of moral respect being, as noted, 
the main connecting link. This integration seems 
particularly close in the case of primitive religion 
but it also exists in others. It justifies Durk- 
heim’s emphasis on collective representations. In- 
deed, we can say that any symbolic system that can 
justifiably be called “cultural” must have a collec- 
tive aspect; symbolization that is autistic—in the 
sense of being wholly private to one individual (the 
limiting case )—is no longer cultural, if indeed it 
can be truly symbolic. Language is perhaps the 
prototype here. 

It seems to have been Durkheim’s view, a strongly 
defensible one, that the more primitive the society 
and the culture, the less differentiated they are 
from each other. He extensively analyzed the case 
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of the Australian aborigines on the strength of this 
theory: the phenomena of the integration of cul- 
ture and society could be seen there in their “ele- 
mentary forms.” But his interest in these elementary 
forms does not mean that Durkheim did not have 
a broad understanding of the possibility and im- 
portance of differentiating conceptually between 
cultural and social systems, even though, as Bellah 
points out (Bellah 1959) he somewhat obscured 
this vital point by using the term “social” for both. 
Unfortunately, he never worked out a thorough 
analysis of the place of religion in a highly differ- 
entiated society—a task that might well have led 
him to clarify his conception of the relations be- 
tween “representations” and social structure. 

The problem of integration also arises internally 
to the system of collective representations, to cul- 
ture itself. Durkheim presented the broad perspec- 
tive on this problem several years before Elementary 
Forms in the monograph on primitive classification 
written in collaboration with Mauss (1903), al- 
though he developed it further in his book. His 
main point is that in primitive systems all culture 
is at the same time both religious and social, in a 
sense not true of more advanced systems, A particu- 
larly telling example is the categorization of physi- 
cal space in terms directly corresponding to the 
arrangement of kinship groups in the camp. 

This conception of twofold integration, between 
a cultural and a social system and among the dif- 
ferent elements of a cultural system, is particularly 
significant for the broad problem area we now gen- 
erally call the sociology of knowledge. Undoubtedly 
Durkheim was at least as important a founder of 
this discipline as was Karl Mannheim, and in many 
respects his views were the clearer and better 
analyzed. 

Durkheim's combined interest in cultural prob- 
lems, religion, and evolutionary origins had a series 
of implications for the development of social sci- 
ence theory. Both the utilitarian tradition and that 
stemming from the French Enlightenment had 
tended not only to disparage traditional religion but 
even to deny its substantive importance. Evolution- 
ary perspectives, however, focused attention on 
religion, partly because of the sheer empirical 
prominence of religion and magic in nonliterate 
societies, which were becoming increasingly well- 
known. The early Tylor-Spencer phase of social 
science tended strongly to consider these phenom- 
ena characteristic of the early stages of sociocul- 
tural evolution and destined to disappear with 
advancement—a position shared by Marx. Durk- 

heim’s position established a quite new order of 


functional significance for religion in society, 
Durkheim made it clear that even at the later 
stages of sociocultural development, every society 
would require the “functional equivalent” of a re- 
ligious system (whether or not it is called “religion” 
is primarily a semantic issue). 

Beyond this, Durkheim established the ground- 
work for an exceedingly valuable conception of the 
morphology of social development—the conception 
of processes of structural differentiation and of at- 
tendant new, more general levels of integration, 
The conception of religion as the original matrix 
of both society and culture suggests further that 
society and culture themselves tend to become more 
completely differentiated from each other and that 
“secular” elements develop from this matrix on 
both the social and cultural levels. An important 
semantic point is that just because a relatively un- 
differentiated complex is called “religion” for an 
earlier stage of development and only one of its 
two or more differentiated derivatives retains that 
name for a later stage, it is not legitimate to assert 
that “religion has declined.” Thus, Durkheim viewed 
the secularization of education as an imperative of 
the stage of social differentiation that France had 
reached in his time, but he denied that this meant 
that the function of religion in French society had 
therefore been downgraded. 

Durkheim’s combination of a comparative and 
evolutionary perspective with a special concern for 
cultural-symbolic systems should have been con- 
nected with a theoretical analysis of the processes 
of social and cultural change. Durkheim did not, to 
be sure, give this as much explicit attention as he 
did problems of social morphology, but the con- 
tributions he did make to an understanding of the 
process of change seem not to have been under- 
stood as fully as his more “static” analysis. In any 
case, it is clear that Durkheim provided the 
groundwork for a major theory of developmental 
change in societies and that he made important 
direct contributions to it himself. 

His later work, in particular, tended to stress the 
importance of cultural creativity as a factor is 
change; one of the clearest statements is in his 
late essay “Value Judgments and Judgments of 
Reality” (1911), which stresses the incidence and 
salience of “effervescence” in periods of crisis in 
the development of the sociocultural system. At the 
same time, Durkheim was quite clearly a “multi- 
factor” theorist of social change. 

Conclusion. Durkheim contributed substantially 
—and very eminently for the time in which he 
worked—to relatively specific empirical problems 


in sociology. To this day, analyses of the nature of 
contractual systems, of suicide rates, and of prim- 
itive religions cannot ignore his contributions. 
Equally, he was a highly effective entrepreneur of 
sociology—as teacher, as editor of a distinguished 
periodical, and as leader of a highly talented and 
creative group of research scholars. 

These are not, however, the achievements that 
place him in the top rank among the founders of 
a scientific discipline. This higher eminence stems 
from the fact that Durkheim used the framework 
of solidly established intellectual traditions—those 
of English utilitarianism, in certain respects of 
German idealism, and of his own French back- 
ground—to formulate a theoretical framework that 
was both solidly grounded in those traditions and 
yet highly original. As grounded in tradition, it was 
capable of taking full account of established 
knowledge; but it also went far beyond this. It was 
precise and clear in its logical structures and imag- 
inative in opening up new ways of considering 
social phenomena, defining problems, and devel- 
oping patterns of interpretation. In his special con- 
ception of the nature of “social reality,” which 
emphasized the involvement of normative com- 
ponents in both social reality and, through internal- 
ization, the personality of the individual, Durkheim 
was following, along with a few others, the major 
line of the theoretical development of social sci- 
ence. But he went even beyond this to link the 
social and personality systems thus conceived with 
a highly sophisticated analysis of cultural symbolic 
systems and to set the whole action structure in a 
comprehensive evolutionary framework. The result- 
ing enrichment of the theoretical resources of the 
field of social science, of its insight into significant 
problems and its capacity to deal determinately 
with them, is incalculable. Only a very select few 
among the figures in intellectual history have con- 
tributed so crucially—at such a significant junc- 
ture—to the development of scientific culture. 
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[For the historical context of Durkheim’s work, see 
KNOWLEDGE, socioLocy oF; LAw, article on THE 
LEGAL SYSTEM; RELIGION; SocioLocy, article on 
THE DEVELOPMENT OF SOCIOLOGICAL THOUGHT; SUI- 
CIDE, article on SOCIAL ASPECTS; SURVEY ANALYSIS, 
article on METHODS OF SURVEY ANALYSIS; UTILI- 
TARIANISM; and the biographies of ComTE; DES- 
CARTES; FUSTEL DE CoULANGES; HEGEL; HOBBES; 
Locke; Rousseau; SAINT-SIMON. For discussion of 
the subsequent development of Durkheim’s ideas, 
see COMMUNITY-SOCIETY CONTINUA; INTEGRATION; 
INTERACTION, article on SOCIAL INTERACTION; PUN- 
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“Duty is a thing-which may be exacted from a 
person, as one exacts a debt. Unless we think that 
it may be exacted from him, we do not call it his 
duty” (Mill [1861] 1957, p. 60). The word “duty” 
is derived through the French devoir from Latin 


debitum, which means what is owed by or due 
from one person to another, and as Mill saw, 
analogy with a debt is still a most important feature 
in the various legal, moral, and other social phe- 
nomena which are referred to as duties. Yet not- 
withstanding this apparently unifying analogy, 
there has been much dispute as to whether or not 
there really is a single generic concept of duty 
with specific applications in the different fields of 
law, morality, and so on, and also much contro- 
versy over the analysis of the notion within these 
fields. Since these issues and other problematic 
features of the notion of duty have received the 
clearest and most detailed attention from legal 
theorists, legal duties are first considered here. 


Legal duties 


Whenever legal rules require or prohibit certain 
conduct, the notion of duty has application. The 
plainest examples of such requirements and pro- 
hibitions are the duties created by the criminal law 
when it provides for the punishment of certain 
forms of conduct. The duties it thus creates may 
be either negative (e.g., to abstain from murder or 
theft) or positive (e.g., to report for military service 
or pay taxes). Most legal theorists, however, extend 
the notion of duty to cases where the law provides 
for compensation to be paid to those who have 
suffered damage through the failure of another 
either to perform a contract or to abstain from any 
of the variety of civil (as distinguished from 
criminal) wrongs, which are called torts. Here con- 
ventional jurisprudence distinguishes the primary 
duty (e.g., to perform the contract or to abstain 
from the tort) from the secondary or “sanctioning 
duty to pay compensation for the breach of the 
primary duty. 

Though the word “duty” refers to a constantly 
recurring situation created by the existence of legal 
rules, it is in fact rarely used in the authoritative 
text of legal codes or statutes. Thus, English and 
American criminal codes and statutes create duties, 
not by the use of the expression “duty,” but by 
such formulas as “whosoever shall . . . shall be 
guilty of an offense . . .” or “whosoever shall . . « 
shall be liable to a penalty of. . . .” Similar formulas 
avoiding the equivalent of the word “duty” are com- 
monly used in Continental legal codes. “Duty 
therefore is primarily part of the language used in 
the exposition and theoretical discussion of the 
law to designate a specific type of effect or situa- 
tion created by the existence of legal rules. In the 
wording of these rules, however, the expression 
“duty” does not usually appear. 


Obligation and duty. By jurists of some sys- 
tems, mainly those descended from Roman law, 
a distinction is drawn between obligation and duty, 
the former being reserved for cases such as those 
where a determinate person is bound as a result 
of some past transaction or relationship to pay or 
render some service to another determinate in- 
dividual who has a corresponding legal right to 
such payment or service. Obligations are typically 
“incurred” or created by a man for himself, whereas 
duties “arise” from his position under the law with- 
out any act on his part. Typical examples of ob- 
ligation in this sense are cases where a person is 
bound by his contract to pay or render services to 
another or is bound as the result of some tort to 
pay compensation to another. Some jurists dis- 
tinguish between the requirements of the criminal 
law or the primary duties arising under the law of 
tort and obligations, since the first two cannot be 
incurred by the person under the duty. Though 
there are still traces of this distinction in Anglo— 
American jurisprudence, it has very largely disap- 
peared and is generally ignored in philosophical or 
jurisprudential discussions concerning the general 
nature of law. It is however still of importance in 
the classification of other nonlegal social norms. 

The notion of duty is generally considered by 
jurists to be both essential and fundamental. Some 
(e.g., Austin and Kelsen) treat it as essential in a 
strong sense of insisting that every legal rule creates 
a duty; other jurists insist only that every legal 
system includes some rules which create duties. 
Duty can be considered fundamental as well as es- 
sential, because other notions, such as legal rights, 
powers, crimes, and torts, require in their analysis 
the use of the notion of duty. Thus, to have a legal 
right is (in one important sense) to be free from 
some specific duty to act in certain ways or (in a 
second important sense) to have the option of in- 
sisting on the performance by another person of 
his duty or releasing him from it. 

Some jurists (e.g., Oliver Wendell Holmes) have 
looked upon the idea of duty as a potent source of 
confusion tending to obscure the radical differences 
between the requirements of legal and moral rules. 
Holmes at one time proposed to dispense with “all 
talk of duties and rights” and to substitute for it 
simply “a statement of the circumstances in which 
the public force will be brought to bear upon a man 
through the Courts” (Holmes & Pollock [1874— 
1932] 1961, p. 307). Many legal theorists of the 
Positivist tradition (e.g., Bentham and Austin as 
well as Holmes himself) have put forward similar 
views in the form of analysis of the notion of duty 
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rather than a proposal to dispense with it. Bentham 
({1789] 1948; [1843] 1962) defined legal obliga- 
tion or duty to do or abstain from some action as 
a chance or likelihood of suffering at the hands of 
officials in the event of failing to do so. Holmes 
similarly defined a statement of duty as a prophecy 
that if a man does certain things, he will be subject 
to disagreeable consequences by way of imprison- 
ment or compulsory payment of money ([1897] 
1952, p. 173). 

The predictive analysis of duty. The form of 
analysis favored by Holmes and Bentham has com- 
mended itself to many jurists and some sociologists. 
It is, however, open to the following objections: 

(1) The statement that a man has a legal duty 
to do a particular action may without any contra- 
diction or absurdity be conjoined with the state- 
ment that he is not likely to suffer at the hands of 
officials in the event of his failing to do the action. 
Indeed there will be frequent occasions—especially 
in a system such as English law, where there are no 
minimum penalties but only prescribed maxima— 
for asserting just this conjunction of statements. 
This is so when, for example, it is known or thought 
likely that someone who proposes to disobey the 
criminal law will not be caught or if caught will 
not be convicted (perhaps because of lack of rele- 
vant evidence) or if convicted will not in fact be 
punished (perhaps because of the clemency or 
corruption of the court). 

(2) Similarly, there is no redundancy in con- 
joining with the statement that a man has a legal 
duty to do a particular action the statement that 
he is likely to suffer at the hands of officials in the 
event of his failing to do it. 

(3) If it is allowed that a statement that a man 
has a legal duty to do a particular action may be 
assessed as either true or false, we can summarize 
the above two criticisms by saying that Bentham’s 
analysis fails to state even a necessary condition 
for the truth of such a statement. But the following 
considerations show that it also fails to state a 
sufficient condition for the truth of such a state- 
ment. This is so because the notion “of suffering 
at the hands of officials in the event of doing (or 
failing to do) a particular action” has no specific 
connection with the idea of duty unless it is con- 
ceived or interpreted in a restricted way. Such 
suffering is wholly irrelevant to the idea of duty 
unless the suffering is conceived of as punishment 
or compulsory compensation supporting or securing 
conformity with legal standards of conduct, that is, 
as a means for discouraging or preventing actions 
of a certain sort or restoring (so far as compensa- 
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tion can) the status quo disturbed by such actions. 
Hence suffering, in the form of compulsory pay- 
ment of money, though provided for by legal rule, 
is irrelevant to the idea of legal duty if it is inter- 
preted as a tax on certain conduct (and not as a 
punishment by way of fine or as compensation for 
injury done) and does not render abstention from 
that conduct a duty. 

The normative analysis of duty. The principal 
rival to the predictive analysis may be termed 
normative. This approach, without identifying 
moral and legal duties by insisting on any common 
content, stresses certain formal features which 
these duties share as forms of rule-guided conduct 
involving the idea of what “ought to be done.” From 
this point of view an act is called a legal duty not 
merely because failure to do it renders likely 
sanctions or suffering at the hands of officials but 
because, owing to the existence of the relevant 
rules of law, it is also an act which like a debt may 
be correctly or justifiably demanded of those con- 
cerned as something which ought to be done. 
Similarly, failure to do the act required is significant 
because punishment or compulsory compensation 
is properly applicable or justified according to the 
legal rule even if predictions that it will in fact 
follow are falsified. The normative analysis there- 
fore reproduces the point of view of those who, 
though they may not regard the law as the moral 
or final arbiter of conduct, in general accept the 
existence of legal rules as a guide to conduct and 
as legally justifying demands for conformity, 
punishment, or enforced compensation. Attention 
to these normative features of the idea of legal 
duty seems essential for understanding the way in 
which law in fact operates in a modern legal 
system, 

The predictive analysis of duty has found favor 
among positivist writers for a variety of theoretical 
and practical reasons, On the one hand, it has 
seemed to free the idea of legal duty both from 
metaphysical obscurities and irrelevant associa- 
tions with morals and, on the other hand, to pro- 
vide a realistic guide to life under law. For it 
isolates what for some men is the only important 
fact about the operation of a legal system and what 
for all men is always one important fact: the oc- 
casions and ways in which the law may work ad- 
versely to them. This is of paramount importance 
not only to the malefactor—“if you want to know 
the law and nothing else, you must look at it as a 

bad man [would]" (Holmes [1897] 1952, p. 171)— 
but also to the utilitarian critic and reformer of the 
law who is concerned to balance the costs of the 


law in terms of human suffering against the bene- 
fits which it brings. A reconciliation between the 
two types of theory is, however, possible, though 
exclusive correctness has often been attributed to 
one or the other. The normative account might be 
said to give correctly the standard meaning of the 
statement that a person has a legal duty to do a 
certain action, and the predictive theory may be 
taken as stating the point or purpose of making a 
Statement of duty (i.e., to warn that suffering is 
likely to follow upon disobedience). This distinction 
between the meaning of a statement and what is 
implied by the making of it in different contexts is 
of general importance in legal philosophy. 

Duties and sanctions. Many, perhaps most, 
legal theorists have argued that legal and moral 
duties can only be distinguished by the provision 
made by legal rules for coercive sanctions. How- 
ever, this argument seems mistaken, since there 
are other important features which may be used 
to distinguish between them. It is conceptually im- 
possible to hold (1) that moral duties may be ex- 
tinguished by legislative repeal or fiat, as legal 
duties can, (2) that moral duties may relate to 
activities which are not considered in any way im- 
portant as some legal duties may be, and (3) that 
general moral duties could be expressly created 
simply for the purpose of advancing some specific 
objective or social aim as general legal duties fre- 
quently are. Furthermore, lawyers and jurists some- 
times use the expression “duty” of actions where no 
sanction is provided. These considerations suggest 
that it is mistaken to define the idea of duty in 
terms of coercive sanctions but that a wider notion 
of social demand, pressure for conformity, and the 
liability of the lawbreaker to different forms of ad- 
verse criticism and treatment should be used in- 
stead. The coercive sanctions of the law may then 
be more illuminatingly regarded as the character- 
istic, but not invariant, legal form of this wider 
notion. 


Moral duties 

In considering moral duties it is important to 
distinguish the accepted or conventional positive 
morality of an actual social group from the moral 
principles and ideals which may govern an indi- 
vidual’s life but which do not exist as a shared 
code of a social group. The duties of positive 
morality are those forms of conduct (negative or 
positive) which like a debt are, according to the 
rules of the accepted moral code, held to be justi- 
fiably exacted or demanded from individuals. 
Though organized coercive sanctions may not be 


used to enforce conformity or to punish deviation, 
various distinctively moral forms of pressure are 
available. In the case of actual or threatened 
breach of the moral code, appeal will usually be 
made to the sense of respect for the rules as im- 
portant in themselves, and reminders are given of 
the moral character of the action. These reminders 
assume that respect for the rules is shared by those 
addressed, and it is also assumed that they will 
excite in those addressed a sense of shame or 
guilt. Deviations from the positive moral code may 
lead to many different forms of hostile reaction, 
ranging from expressions of blame and contempt 
to severance of social relations. But reminders of 
what the rules demand and appeals to “conscience” 
and reliance on the operation of a sense of guilt 
and shame are the characteristic forms of pressure 
used in the support of conventional moral duties, 

In spite of the differences noted above between 
moral and legal duties, there are still certain strik- 
ing similarities between them. Both are supported 
by serious social pressure for conformity, though 
this pressure characteristically takes different 
forms. Compliance with both legal and moral duties 
is generally taken as a matter of course and not as 
a matter for praise except when marked by excep- 
tional difficulties or efforts, since performance is 
thought of as a minimum contribution to social 
life. Further, legal and moral duties relate to 
actions the opportunities for which constantly re- 
cur throughout life and not only on special occa- 
sions. Although legal rules and conventional moral- 
ities may include much that is peculiar to different 
societies, both make demands which must obviously 
be satisfied if any considerable number of human 
beings are to succeed in living together. Hence in 
almost all societies where a legal system can be 
distinguished from a moral code, there are both 
legal and moral duties to abstain from violence and 
theft and to show some minimum forms of honesty 
and truthfulness in social intercourse. 

When we turn from the positive morality of 
Social groups to morality in the sense of the rules, 
or principles, or ideals of individuals to which they 
Subscribe on religious or theoretical grounds, the 
Scope of the notion of duty appears a matter of 
more philosophical controversy, Some philosophers 
extend it so as to include not only the minimum 
demands of an individual's morality, which may 
be formulated in general rules, but any action 
which for any moral reason “ought” to be done. 
Thus extended, it includes both the action which 
is held to be “the best on the whole” in cases of a 
Conflict of duties or otherwise problematic situa- 
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tions and also works of supererogation or the satis- 
faction of ideals (e.g., of heroism or saintliness ). 
This extension of the idea of duty is no doubt 
guided by the analogy between legal punishment 
and other forms of social pressure and the experi- 
ence of guilt and remorse which follow on devia- 
tion from individual morality. This extension seems 
confusing, since the minimum demands of a moral- 
ity that can be formulated in general rules referring 
to constantly recurring situations in daily life must 
always present important features different from 
the requirements of individual morality. 


Duty in other social contexts 


Legal and moral rules are of course not an ex- 
haustive dichotomy, and the notion of duty also 
refers to the requirements of rules which fall into 
neither of these two categories (e.g., the rules of 
voluntary associations like clubs or business or- 
ganizations, rules governing activities for which 
there are only intermittent opportunities such as 
ceremonies, or activities like games which are held 
to be voluntary and from which withdrawal is per- 
mitted), 

Broadly speaking, the notion of duty or, some- 
times, obligation may be used where there is a 
relatively enduring office or social role in which the 
occupant performs a specific function calling for 
specific forms of behavior, Thus, it is common to 
speak of the duties of a host or of the captain of a 
team or the chairman of a committee, where rules 
which are neither legal nor moral specify the forms 
of behavior required. Sometimes however such 
duties attached to roles or offices may be supported 
by specifically moral pressure, as in the case of the 
duties of a father or husband, In all such cases the 
analogy of a debt is still present, for the duties 
attached to such roles or offices are those actions 
which it is held may be demanded from the oc- 
cupant, even though neither legal sanctions nor 
moral pressure is available to support the demand. 

Finally, it is salutary to remember that even in 
these wider social contexts there is no simple 
equivalence between the notion of duty and of what 
ought to be done according to social rules. This is 
evident from the fact that there are many rules, 
such as those of etiquette or correct speech, where 
it would be misleading to refer to duties. In such 
cases the correct action or mode of speech is not 
conceived of as something which may be peremp- 
torily demanded, no doubt because of the lesser 
importance to others or to social life of the ac- 
tivities in question. 

H. L. A. HART 
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[See also POLITICAL THEORY; RESPONSIBILITY; and the 
biographies of AUSTIN; BENTHAM; HOLMES; KEL- 
SEN. A guide to other relevant material may be 
found under Law.] 
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EBBINGHAUS, HERMANN 


Hermann Ebbinghaus, German psychologist, was 
born on January 24, 1850. He was the son of Carl 
Ebbinghaus, a merchant in the town of Barmen 
near Bonn, Germany. Of his infancy and child- 
hood it is known only that he was brought up in 
the Lutheran faith and was a pupil at the town 
Gymnasium until he was 17. In 1867 he went to 
the University of Bonn and somewhat later to 
Berlin and Halle, Although his initial interest was in 
history and philology, he was gradually drawn to 
philosophy. When the Franco-Prussian War broke 
out in 1870 he joined the Prussian Army. In the 
Spring of 1871, however, he left the army to con- 
tinue his philosophical studies at Bonn, He com- 
pleted his dissertation, Uber die Hartmannsche 
Philosophie des Unbewussten (1873), and received 
his PH.D. on August 16, 1873, passing his exam- 
ination with distinction. 

In 1880 he received his habilitation at Berlin. 
In 1885 he published Memory: A Contribution to 
Experimental Psychology. He was made a professor 
in the same year, probably in recognition of this 
publication. While at Berlin he founded the psycho- 
logical laboratory, and in 1890, in association with 
Arthur König, he founded the Zeitschrift fiir Psy- 
chologie und Physiologie der Sinnesorgane. 

He was called to Breslau in 1894 to become a 
full professor in the chair left vacant by Theodor 
Lipps’ departure for Munich. (Lipps replaced 


Stumpf, who, in turn, was bound for Berlin.) At 
Breslau, Ebbinghaus again founded a psychological 
laboratory. In 1905 he moved to Halle to succeed 
Alois Riehl, who was going to Berlin. He remained 
there as professor of philosophy until his death 
from pneumonia on February 26, 1909. 

Ebbinghaus found his own way to psychology. 
None of his instructors determined in any marked 
way the direction of his thinking, even though they 
included such eminent persons as Johann E. Erd- 
mann, Friedrich A. Trendelenburg, and Jiirgen B. 
Meyer. A major influence, however, was the com- 
bination of philosophical and scientific points of 
view that he found in Fechner, a copy of whose 
Elemente der Psychophysik he picked up in a 
Parisian secondhand bookstall. He acknowledged 
his debt in the Grundzüge (1897-1908), which he 
dedicated to the memory of Fechner. 

One leitmotiv runs through his work: psychology 
is Naturwissenschaft. The very first thesis in his 
dissertation sets forth the proposition that psychol- 
ogy (in the broadest sense) belongs no more to 
philosophy than does natural science (1873, p. 2). 
His goal was the establishment of psychology on a 
quantitative and experimental basis. This focus is 
well brought out in the short historical sketch that 
introduces his Abriss der Psychologie. “When Weber 
in 1828 had the seemingly petty curiosity to want 
to know at what distances apart two touches on 
the skin could be just perceived as two, and later, 
with what accuracy he could distinguish between 
two weights laid on the hand . . . his curiosity re- 
sulted in more real progress in psychology than all 
the combined distinctions, definitions, and classi- 
fications of the time from Aristotle to Hobbes 
(inclusive )” (1908, p. 17). Ebbinghaus’ desire to 
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bring into psychology clear and exact methods 
resulted in his extreme carefulness in experi- 
mental technique and his considerable interest in 
apparatus. 

Ebbinghaus was an unusually good lecturer. His 
buoyancy, his humor, and the unusual clarity and 
ease of his presentation assured him of large audi- 
ences, Another outstanding trait, especially valu- 
able for a journal editor, was his Jamesian tolerance 
(Boring [1929] 1950, p. 390). This capacity led 
him to publish widely diverse opinions—a policy 
vital to a young science, In contacts with his stu- 
dents, he invariably showed great interest in their 
problems, 

It may seem surprising that Ebbinghaus had 
so few disciples. In his obituary of Ebbinghaus, 
Jaensch attributed this to Ebbinghaus’ lack of in- 
terest in developing them (1909). He never urged 
others to undertake investigations; in fact, to work 
with him one had to obtrude oneself upon him with 
determination. Some of his better known students 
are Arthur Wreschner, Louis W. Stern, and Otto 
Lipmann. 

Ebbinghaus did psychology a great service in 
founding and editing the Zeitschrift für Psycholo- 
gie. The 50 volumes published up to his death 
present a practically complete portrait of psychol- 
ogy in the two decades from 1890 to 1910. A brief 
selection of names from the index—Hermann von 
Helmholtz, Carl Stumpf, Georg E. Müller, Friedrich 
Schumann, Theodor Lipps, Johannes von Kries— 
is convincing evidence that the Zeitschrift was the 
most important psychological organ in Germany 
and therefore in the world. 

Ebbinghaus published relatively little. His own 
point of view with regard to print is expressed in 
a passage quoted by Woodworth (1909, p. 255) to 
the effect that “the individual has to make innumer- 
able studies for his own sake. He tests and rejects, 
tests once more and once more rejects. For cer- 
tainly not every happy thought, bolstered up per- 
haps by a few rough and ready experiments, should 
be brought before the public. But sometimes the 
individual reaches a point where he is permanently 
clear and satisfied with his interpretation. Then 
the matter belongs to the scientific public for their 
further judgment.” The seriousness of Ebbinghaus’ 
attitude in this regard is shown by his memory 
experiments. Although they were completed in 
1880, he did not report the results until 1885, after 
having repeated them in their entirety in 1883. 

The interest aroused by Edward von Hartmann’s 
Philosophie des Unbewussten, which appeared in 
1869, testifies to the general interest in the uncon- 


scious at that time. The unconscious was a popular 
dissertation subject among doctoral candidates. 
Ebbinghaus’ treatment of it in his own dissertation 
was very critical, in line with his views concerning 
the essential similarity of psychology and the nat- 
ural sciences and the excessively abstract and 
verbal nature of the then existing psychology. Two 
of his verdicts on contemporary psychology were: 
“Wherever the structure is touched, it falls apart” 
(1873, p. 57); and “What is true is alas not new, 
the new not true” (ibid., p. 67). 

It is unfortunate that Ebbinghaus left no record 
of the work he did before he began his work on 
memory, which was published in 1885. In the in- 
troduction to the section on nonsense syllables he 
made the bare statement, “In order to test prac- 
tically, although only for a limited field, a way of 
penetrating more deeply into memory processes , . . 
I have hit upon the following method” ({1885] 1964, 
p. 22), and he went on to discuss the nature and 
mechanics of nonsense syllables. As nearly as we 
can tell, he conceived of nonsense syllables for the 
investigation of the nature of memory between 
1875 and 1879. 

Before the publication of Memory, exact work 
on the mind had been limited to problems of pre- 
dominantly physiological affinities. Not that inter- 
est in more strictly psychological phenomena had 
been lacking; rather, the means for their study 
had not been easily available. Now, however, a 
fundamental central function had been subjected 
to experimental investigation. This must have 
meant a good deal to the young science, although 
comparatively little of the contemporary effect can 
be discovered in print. James ([1890] 1962, p. 443) 
was impressed with the “heroic” nature of the ex- 
periment, as was Tanzi (1885, p. 598), who char- 
acterized it as “truly worthy of a Carthusian monk. 
A later opinion was expressed by Titchener: “It is 
not too much to say that the recourse to nonsense 
syllables, as means to the study of association, 
marks the most considerable advance, in this chap; 
ter of psychology, since the time of Aristotle 
([1909] 1928, pp. 380-381). However, Titchener 
also thought that the “introduction of nonsense 
syllables . . . has nevertheless done psychology & 
certain disservice. It has tended to place the em 
phasis rather upon organism than upon mind 
(ibid., p. 414). Murphy later described this investi- 
gation as “one of the greatest triumphs of original 
genius in experimental psychology” ([1929] 1949, 
p. 174). 7 

In 1894, Dilthey’s “Ideen über eine beschrei- 
bende und zergliedernde Psychologie” appeared. 


This amounted to an attack on the very keystone 
of Ebbinghaus’ faith. Dilthey claimed that the new 
psychology could never be more than descriptive 
and that attempts to make it explanatory and con- 
structive were wrong in principle and led to nothing 
but confusion of opinion and fact. He asserted that 
we “explain nature, but we understand psychic 
life,” and that any psychology which is modeled 
after atomistic physics—as is that of Ebbinghaus— 
can never “understand,” for in the final analysis 
the process of “understanding” has to be experi- 
enced (erlebt) and cannot be inferred logically 
(erschlossen). 

Although Ebbinghaus was reluctant to enter into 
controversy, he did undertake to defend psychology 
as he understood it. In an article in the Zeitschrift 
for 1896, “Über erklärende und beschreibende Psy- 
chologie,” he justified the use of hypothesis and 
causal explanation in psychology. He claimed that, 
insofar as Dilthey was attacking explanatory psy- 
chology, he was attacking the old associationists, 
who had indeed failed. He felt their difficulty had 
arisen because they had analogized psychology to 
the fields of chemistry and physics rather than to 
biology. Dilthey, as Ebbinghaus saw it, was not 
actually discussing modern psychology; what he 
identified with explanatory psychology was actually 
only the work of Johann Herbart—and Herbart 
was no longer read, even in Germany. To Ebbing- 
haus, Dilthey’s point that explanatory psychology 
works, like physics, on the principle that cause is 
exactly equal to effect was incorrect; rather, all 
that psychology can and does say, according to 
Ebbinghaus, is that “the contiguity of two sensa- 
tions is considered as causal relationship because 
later a representation of one sensation results in a 
Vorstellung of the other” (1896, p. 186). This con- 
troversy has yet to be settled. 

In 1895 the school authorities of Breslau were 
interested in the advisability of holding longer 
school sessions. Ebbinghaus was appointed to a 
commission that was created to investigate this 
problem. His contribution was the Kombinations- 
methode, a form of completion test (1897, pp. 401- 
459) designed to measure intellectual fatigue. Al- 
though it did not serve its original purpose, it 
proved very valuable as a measure of general intel- 
lectual capacity, since scores on it correlated highly 
with the rank and scholarship of the pupils. There 
are many current adaptations of the tests prin- 
ciple. 

When Ebbinghaus died in 1909, the systematic 
treatise—the Grundziige—that he had begun early 
in the 1890s was only a little more than half com- 
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pleted. The first half of Volume 1 had come out 
in 1897. This volume was published as a whole 
only in 1902, and a second edition of it followed 
in 1905. In 1908 the first section of Volume 2 (96 
pages) appeared. On Ebbinghaus’ death Ernst Dürr 
took over the editing of his works and completed 
Volume 2. The major virtues of the Grundzüge lie 
in its readableness and convenient format rather 
than in any radical approach to psychology, but 
these, together with its comprehensiveness and its 
minor innovations, were sufficient to produce an 
enthusiastic reception. 

Abriss der Psychologie (1908), an elementary 
textbook of psychology, achieved considerable suc- 
cess, as is evidenced by the fact that on the average 
more than one new edition appeared every two 
years until 1922. Additionally, an English trans- 
lation by Max Meyer appeared in 1908, and French 
editions were published in 1910 and 1912—all of 
which attests to the value and appeal of the volume. 
The introduction consists of an admirable short his- 
tory of psychology and begins with the well-known 
statement, “Psychology has a long past, yet its real 
history is short.” 

Ebbinghaus’ influence on psychology, great as it 
was, has been mostly indirect. Memory is undoubt- 
edly his outstanding contribution. His work on 
memory was the starting point not only for prac- 
tically all the studies that have followed in this 
field but probably also for much of the work on the 
acquisition of skill. His Grundziige is next in im- 
portance, not for its new system (which is very 
much like that of his contemporaries) but for its 
clear and concise treatment of the literature and 
its experimental emphasis. His Kombinations- 
methode has been valuable to the field of mental 
testing. His editing of the Zeitschrift did much to 
advance psychology during a very productive pe- 
riod. His emphasis on experiment and his faith 
in the laboratory approach led to his personally 
establishing at least two laboratories and develop- 
ing a third. His qualities as a lecturer and writer 
helped to spread a knowledge of orthodox psy- 
chology. 

Despite an early training in philosophy, he was 
one of the leaders in the movement to emancipate 
psychology from philosophy. He belongs funda- 
mentally in the tradition that leads from prepsy- 
chological science, to physiology and the work of 
Helmholtz and Fechner, to Wundt and “content 
psychology.” Dunlap (1927) would give him, to- 
gether with Aristotle and Binet, the credit for 
making psychology “behavioristic,” but that is prob- 
ably going too far, His psychology does, however, 
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have a functional emphasis, as suggested by his 
constant reference to the biological affinity of psy- 
chology, his nativism in the matter of general at- 
tributes of sensation, and his contribution to the 
problem of individual differences. 

As Boring (1929) has pointed out, the history 
of general experimental psychology has passed 
through three successive phases: (1) sensation 
and perception; (2) learning; and (3) motivation, 
The landmark for the first is Fechner’s Elemente 
der Psychophysik of 1860 and for the last is 
Freud's Die Traumdeutung of 1900. Ebbinghaus’ 
Uber das Gedächtnis of 1885 stands as the middle- 
phase landmark. If he had produced nothing else, 
this work would assure Ebbinghaus an important 
place in the history of psychology. 

Davip SHAKOW 


[For the historical context of Ebbinghaus’ work, see 
the biographies of DiLTHEY; FecHNer. For discus- 
sion of the subsequent development of Ebbinghaus’ 
ideas, see FORGETTING; LEARNING, article on TRANS- 
FER; PsycHopHysics.] 
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1, Human Ecotocy 
m. CULTURAL ECOLOGY 


I 
HUMAN ECOLOGY 


The term “ecology,” which has its root in the 
Greek word oikos (household or living place), came 
into use in the latter part of the nineteenth century 
in the works of zoologists and botanists to describe 
the study of the ways in which organisms live in 
their environments. Soon two branches of ecology 
were distinguished: autecology, the study of the 
individual organism's interaction with environment, 
and synecology, the study of the correlations be- 
tween the organisms engaged with a given unit of 
environment. The latter study has prevailed, how- 
ever, and has become the principal connotation of 
ecology, since it became evident in numerous field 
studies that organisms, whether plant or animal, 
establish viable relationships with environment, not 
independently but collectively, through the mech- 
anism of a system of relationships. Bioecologists 
were thus led to employ a set of concepts and tech- 
nigues of investigation that imparted a markedly 
sociological coloration to their work. 


Origins and history 

The ecological approach was introduced as a 
man ecology into the field of sociology at a critical 
period in the development of the latter discipline. 


In the 1920s the reformistic phase of sociology was 
drawing to a close, and the subject was gaining 
acceptance as a respected discipline in the curricula 
of American universities. That the transition would 
have been effected so quickly without the aid of a 
theoretical framework lending itself to empirical 
research seems doubtful. Ecology opportunely pro- 
vided the necessary theory. A period of vigorous 
research followed that was to prove instrumental 
in launching sociology on its career as a social 
science, 

Sociologists made free use of analogy as they 
borrowed heavily from the concepts of plant and 
animal ecology. The Darwinian notion of animate 
nature as a web of life became at once a general 
orienting concept and a basic postulate; it directed 
attention to the necessary interdependence among 
men as well as among lower forms of life. A sec- 
ond concept, the balance of nature, denoting a 
tendency toward stabilization of the relative num- 
bers of diverse organisms within the web of life 
and of their several claims on the environment, 
provided human ecology with its characteristic 
equilibrium position. The more or less balanced 
web of relationships, when viewed in a specific 
local area, presented the aspect of community, a 
concept with obvious appeal for students of human 
social life. 

According to plant and animal ecologists, the 
community, or ecosystem, is a population com- 
prising a set of species whose reactions to the 
habitat and coactions between each other consti- 
tute an integrated system having some degree of 
unit character. Coactions involve members behav- 
ing both with reference to their similarities in an 
intraspecific relationship known as commensalism 
and with reference to their differences in what is 
called symbiosis, an interspecific relationship. The 
community develops from simple to more complex 
forms through a sequence of stages described as 
Succession. Each stage in the sequence is marked 
by an invasion of a new species or association of 
Species, the series culminating in a climax stage 
in which a dominant species appears. The domi- 
nant species is related to the environment in such 
a way that it is able to control and maintain 
the community indefinitely. The community, then, 
tends to approximate a self-maintaining, or “closed,” 
system, 

Community and environment. The application 
of the concepts from plant and animal ecology to 
the human community carried with it the implica- 
tion that the community was essentially a natural 
Phenomenon, which meant that it had developed 
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independently of plan or deliberation. From this it 
was a short, though uncritical, step to the inter- 
pretation of human ecology as a study of the biotic 
or subsocial aspect of human social organization 
(Park 1936), a view that was elaborated at some 
length by Quinn (1950), Not only did the subsocial 
characterization convey an excessively narrow con- 
cept of social organization, but it posed an opera- 
tional problem for which there was no workable 
solution. 

A somewhat different definition of human ecol- 
ogy, which ignored any reference to the cognitive 
level of events, was enunciated by McKenzie ([1924] 
1925, pp. 63-64), whose formulation of the sub- 
ject as a study of the spatial and temporal relations 
of human beings, affected by the selective, dis- 
tributive, and accommodative forces of the environ- 
ment, was widely accepted as authoritative. Al- 
though McKenzie’s definition inspired a large 
amount of fruitful research effort, it had the unfor- 
tunate effect of concentrating attention almost 
exclusively on spatial distributions and correlations. 
In consequence, many promising implications of 
ecological assumptions were neglected. 

Hawley, in attempting to restore a conceptual 
continuity with plant and animal ecologies, ad- 
vanced the view of human ecology as the study of 
the form and development of the human com- 
munity (1950, p. 68). Community, in this connec- 
tion, is construed as a territorially localized system 
of relationships among functionally differentiated 
parts; human ecology, then, is concerned with the 
general problem of organization conceived as an 
attribute of a population—a point of view that has 
been shown to be consistent with a long-standing 
sociological tradition (Schnore 1958). Although 
the emphasis is centered on the functional system 
that develops in a population, it is not intended to 
exclude concern with spatial and temporal aspects; 
rather, these aspects are regarded as useful dimen- 
sions for the measurement of organization. 

A further step in making the orientation of hu- 
man ecology explicit within the larger context of 
general ecological theory was made by Duncan 
(1959, pp. 683-684 ), who described four principal 
variables of human ecology—population, organiza- 
tion, environment, and technology—that constitute 
an ecosystem. In other words, while any one of the 
four may be treated as a dependent variable for 
certain purposes, it is also reciprocally connected 
with each of the other variables. The virtue of this 
perspective lies in the range of problems it opens 
to the student of human ecology. Yet it seems un- 
likely that that advantage can be fully enjoyed 
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without a clear notion of how organization is 
constituted, 

Other applications. While sociologists were at 
work defining human ecology for their purposes 
and pursuing many of its research leads, the con- 
cept spread into various other fields of inquiry. 
Human geographers wondered if the term “human 
ecology” was not a more apt characterization of 
their discipline; but their historic preoccupation 
with the landscape and their general addiction to 
a macroscopic treatment of occupance led them 
to discard this notion. Archeologists, in their efforts 
to reconstruct population distributions, have made 
use of ecological concepts and techniques, but 
without attempting to give formal statement to the 
approach, 

Studies of human evolution by anthropologists 
involved questions of the man-environment rela- 
tionship, which, in turn, led them to describe their 
work as human ecology; indeed, social or cultural 
anthropologists have long engaged in various eco- 
logical studies, although not until recently have they 
so defined their activity. One definition by an 
anthropologist is that of Steward (1955), for whom 
human ecology is the study of the adaptation of 
Specific items of culture to particular environ- 
ments, This conception, which reduces ecology to 
something akin to a research technique, is shared 
by a number of Steward’s contemporaries. 

The language of human ecology has also made 
its appearance in economics, psychology, epidemi- 
ology, and other fields. In some instances, the term 
is used merely as a label for an environmental 
emphasis; in others, it is put forward in an effort 
to broaden the purview of a discipline. 

But in spite of the widespread diffusion of 
ecological concepts, responsibility for systematic 
development of human ecology has been left to 
sociologists, who have drawn heavily on related 
fields for both theoretical and technical aid. The 
writings on real estate, finance, public administra- 
tion, demography, planning, history, and other 
areas of inquiry, in addition to the literature of the 
fields previously mentioned, have at one time or 
another been exploited in the interest of human 
ecology. The reason for that catholicity of taste is 
not hard to find; human ecology is concerned with 
sociological problems in their fullest breadth. It 
overlaps, therefore, all the spheres of learning that 
concern the social life of man. 


Distinctive features of human ecology 

As human ecology has moved toward a major 
concern with the general problem of social organ- 
ization and thus closer to the central concern of 


sociology, it has retained certain distinctive fea- 
tures. Foremost among these is the importance 
attributed to environment. 

The broad, positional hypothesis is that organ- 
ization arises from the interaction of population 
and environment. Environment, however, defined 
as whatever is external to and potentially or actu- 
ally influential on a phenomenon under investiga- 
tion, has no fixed content and must be defined 
anew for each different object of investigation. 
Environment is seen both as presenting the prob- 
lem of life and as providing the means for its reso- 
lution; to adopt this position is to place the problem 
in a time-space context. 

A second distinguishing characteristic is the 
emphasis on population as the point of reference; 
organization, it is contended, is exclusively the 
property of a population taken as a whole and not 
of an assemblage of individuals. Obvious as this 
position may be, it has profound methodological 
and theoretical implications for the manner in 
which the ecological problem is put, the variables 
employed, and the data to be observed. The con- 
creteness of population, however, in contrast with 
the seemingly ephemeral nature of organization, 
tends to beguile the student to a view of population 
as the independent variable in all things pertain- 
ing to organization. It is obviously more convenient 
to proceed from the more accessible to the less 
accessible by asking how a population makes a 
unitary response to its environment; yet that is 
merely a common-sense way of approaching the 
problem, and investigation soon makes it apparent 
that population is for many purposes better re- 
garded as the dependent variable, delimited and 
regulated by organization, 

A third characteristic is the treatment of organ- 
ization as a more or less complete and self-sustain- 
ing whole. The interaction of population and en- 
vironment is seen as culminating in a system of 
relationships between differentiated parts which 
gives the population unit character and enables it 
to maintain its identity. As the property of a popu- 
lation, organization lends itself most readily to 4 
morphological, or structural, analysis. The parts 
are the units—individuals or clusters of individ- 
uals—that perform functions and the relationships 
by which these units are linked. Differing config- 
urations of unit functions and relationships are 
expected to occur with differences in relationship 
to environment and at different stages in devel 
opment. 

i morphological concern does not exclude the 
problem of development. Presumably, any given 
form of organization had an earlier form and is 


capable of having a later form, Every organization 
has a history, perhaps a natural history, the knowl- 
edge of which may shed light on the nature of 
organizations by indicating, for example, at what 
points they are vulnerable to change and how 
change spreads through them. 

Related to the conception of organization is a 
fourth distinctive, although by no means unique, 
attribute of human ecology: the central position 
given to an equilibrium assumption. Morphological 
change is assumed to be a movement toward an 
equilibrium state, whether through a succession- 
like sequence of stages or through a process of 
continuous modification, Unlike the equilibrium 
notion in some of its other applications, such as in 
functionalist theory, the ecological usage of the 
term harbors no teleological overtones; on the con- 
trary, this usage merely implies that as an organi- 
zation attains completeness, it acquires the capacity 
for controlling change and for retaining its form 
through time, although the interval need not be 
Specified, To put it differently, to the extent that an 
organization possesses unit character, an approxi- 
mation to equilibrium obtains, 

There is a further implication that a stable rela- 
tionship with environment is contingent on relative 
Stability in the relationships between the parts of 
an organization. A population always remains open 
to environment, but the formation of organizations 
canalizes environmental influences and makes for 
increasing selectivity of response, 

The term “community” has commonly been used 
to denote the unit of organization for ecological 
purposes, Operationally defined as that population 
which carries on its daily life through a given sys- 
tem of relationships, the community is regarded 
48 the smallest microcosm in which all the param- 
eters of society are to be found, For reasons that 
were largely fortuitous, human ecologists at first 
focused their attention on the city and its tributary 
area as the prototype of community; later, in an 
effort to encompass the antecedents of cities, they 
broadened their consideration to include all forms 
of nucleated settlement. The term “community,” 
however, has the disadvantage of referring to the 
Organization of a more or less localized settlement 
unit that does not always approximate a self-main- 
taining whole, For example, in an extensively ur- 
banized society, local settlement units are usually 
components of more inclusive systems; in that 
event, the entire system must be treated as the 
object of study, But the difference between a sim- 
ple, compact organization and a large, diffuse one 
is primarily a difference in scale; accordingly, the 
Principles of organization should be the same in 
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each. Since the designation of both simple and 
complex systems as communities threatens con- 
fusion, a more neutral term, such as “social sys- 
tem,” is to be preferred. 


Principles of ecological organization 

Inasmuch as principles of organization hinge on 
what is meant by population, it is imperative that 
the concept be developed more fully. 

A human population is an aggregate of individ- 
uals who possess the following characteristics. As 
a living organism, every individual must have 
access to environment, for that is the only possible 
source of sustenance, Moreover, the interdepend- 
ence of individuals is necessary; this condition, 
which obtains for all forms of life, holds true to an 
exceptional degree in the case of man, because of 
the naked state in which he comes into the world 
and his long period of postnatal maturation, In- 
terdependence is the irreducible connotation of 
sociality, 

The human being also possesses an inherent 
tendency to preserve and expand his life to the 
maximum permitted by prevailing circumstances; 
this is a general motive of which all other motives 
are special cases, In its most elementary sense, 
expansion of life refers to the multiplication of 
man-years through either the leaving of progeny or 
the extension of longevity; but it also includes all 
that is involved in the realization of that objective. 
Another important characteristic is the indetermi- 
nacy of the human being’s capacity to adapt; there 
is no known restriction on the kind or extent of 
refinement of activity in which he can engage. 
Finally, the human individual, again like other 
organisms, is time-bound; the recurring needs for 
food and rest fix the fundamental rhythm of life 
and regulate the allocation of time to all other 
activities, Accordingly, the time available for move- 
ment is limited and, in consequence, the space over 
which activities can be distributed is correspond- 
ingly limited. 

These several attributes of individuals not only 
define the kind of population with which human 
ecology deals, but they also constitute a cardinal 
set of assumptions from which principles of eco- 
logical organization may be deduced. The following 
are some of the more salient of these principles of 
organization, 

Principle of interdependence. Interdependence 
develops between units on each of two axes; the 
symbiotic (on the basis of their complementary 
differences) and the commensal (on the basis of 
their supplementary similarities), That is, units 
that combine in a symbiotic union may also enter 
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into other combinations of a commensal character; 
the effect of each type of union is to raise the power 
of action above what it would be were the units to 
remain apart. 

The effect, however, is not the same in each case, 
The symbiotic union enhances the efficiency of pro- 
duction or creative effort; the commensal union, 
since its parts are homogeneous, can only react 
and is suited, therefore, only to protective or con- 
servative actions. Although commensalism is an 
elemental form of union, it is applicable to a wide 
array of situations, The point of importance at 
present is that a population tends to be knit to- 
gether through an interwoven set of symbiotic and 
commensal relationships. 

It should be evident that interdependence has 
temporal and spatial implications for the units in- 
volved. Relations of functional complementation 
and supplementation require mutual accessibility 
among units; since this is contingent on the time 
available for movement, the distance separating 
related units is always subject to some limitation. 
In general, for every set of related units, there 
should be, other things being equal, an appropriate 
pattern of distribution in the temporal and spatial 
dimensions. 

Principle of the key function. A second prin- 
ciple may be described as the principle of the key 
function. That is, in every system of relationships 
among diverse functions, the connection of the 
system to its environment is mediated primarily 
by one or a relatively small number of functions, 
the latter being known as the key function or func- 
tions. To the extent that the principle of the key 
function does not obtain, the system will be tenu- 
ous and incoherent; in the extreme case, in which 
no system exists, every function has the same rela- 
tionship to environment. Given a functional system, 
then, there are always some functions or functional 
units directly involved with environment and others 
that secure access to the environment indirectly, 
through the agency of the key function. 

The notion of key function invokes the question 
of how to define the notion of environment, which 
can refer to many different kinds of things. For 
present purposes, these things may be classified in 
two broad categories: the natural and the social. 
Although every organized aggregate must contend 
with both, the relative importance of each may 
vary over a wide range. In some instances, because 
of the inaccessibility of the settlement, activities 
of necessity center on the exploitation of the local 
natural environment, while influences from the 
social field are relatively infrequent or of no great 

consequence. In this event, the key function is the 


activity that extracts the principal sustenance sup- 
ply from local resources. But where the product 
from local resources, or a substantial part of it, is 
exchanged for other sustenance materials, whether 
through trade or other distributive mechanisms, 
the key function is determined by the comparative 
importance of production and of trade as sources 
of sustenance, 

In many such instances, no distinction is neces- 
sary because the producer is also the trader—two 
functions combined in one functionary. But even 
before the two functions appear as separate spe- 
cialties, the requisites of trade or distribution begin 
to regulate the uses of local resources. As the reli- 
ance on exchange advances, the social environment 
actually displaces the natural environment as the 
critical set of influences. A population is never 
emancipated from its dependence on physical and 
animate matters, but the importance of locale de- 
clines with increasing involvement in a network of 
intersystem relations; the natural environment is 
extended and diffused, and contacts with it are 
mediated through a variety of social mechanisms. 
Hence, the functions that link the local system to 
the social environment come to occupy the key 
position. 

Principle of differentiation. The extent of func- 
tional differentiation varies with the productivity 
of the key function or functions; this is the prin- 
ciple of differentiation. A corollary is that the size 
of population supportable by the system varies with 
the productivity of the key function. For given the 
simplifying assumption that each unit is fully oc- 
cupied, the number of people is determined by the 
number of functions to be performed. , 

In a hunting and gathering community, for in- 
stance, productivity is usually low, even though 
the physical environment is richly endowed; hence, 
there is little time or opportunity available for the 
cultivation of more than a few specialized func- 
tions. Nor is it possible to support enough people 
to staff even a moderate extent of specialization. 
By contrast, where the key function is devoted to 
stable agriculture, the range of possibilities is much 
greater, while in an industrial system productivity 
is so great that there are no known upper limits on 
either the number of specializations or the size of 
the population that can be supported. 

The productivity of the key function, then, con- 
stitutes the principal limiting condition on the 
extent to which a system can be elaborated, on the 
size of population that can be sustained in the sys- 
tem, and on the area or space the system can 
occupy. 4 

ca tO of some importance has to do with 


the relative number of units engaged in each of 
several interrelated functions. That question re- 
mains unanswered at present, It may be suggested, 
however, that the number of units engaged in any 
given function is inversely proportional to the pro- 
ductivity of the function and directly proportional 
to the number of units that utilize the product of 
the function, 

It follows, of course, that functional differentia- 
tion involves a differentiation of environmental 
requirements. As the materials and conditions used 
by diversely specialized units differ, so also will 
their needs for location in space and time, In gen- 
eral, units performing key functions have the high- 
est priority of claim on location. Other units tend 
to distribute themselves about the key function 
units, their distances away corresponding to the 
number of degrees of removal separating their func- 
tions from direct relation with the key functions. 
The temporal spacing may be expected to reveal a 
similar pattern. Special location requirements, how- 
ever, as for type of soil or other resource, may 
obscure the tendency to a symmetrical arrange- 
ment of functions by degree of indirectness of rela- 
tionship with key functions. 

An important implication of the principles of the 
key function and of differentiation is that of transi- 
tivity in the relation among functional units. Rela- 
tions with environment are necessarily transitive 
for some units. By the same token, relations among 
many units are transitive. The advance of special- 
ization increases transitive relations more than pro- 
portionally and lengthens the transitive sequences. 
Thus, it is possible for functional units widely re- 
moved from one another so far as direct encoun- 
ters are concerned to be inextricably linked through 
their respective linkages with one or more units 
performing intervening functions. All functions, re- 
gardless of kind, are subject to the environmental 
nexus, They differ only in the immediacy with 
which influences reach them. Those which operate 
at or near the ends of chains of relationships may 
appear to have large degrees of independence of 
environment. The appearance is illusory, however. 
It is due, rather, to the time required for effects to 
reach them and, in complex systems, to their hav- 
ing positions in two or more relational sequences 
which expose them to countervailing influences. 

Principle of dominance. Given the principles 
Stated above, it is a simple inferential step to a 
principle of dominance, According to this principle, 
functional units having direct relations with en- 
Vironment, and thus performing the key function, 
determine or regulate the conditions essential to 
the functions of units having indirect relations with 
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environment. Dominance, in other words, is an 
attribute of function. 

But while the power inherent in a system is un- 
evenly distributed, it is not confined to the key 
function unit. Power is held in varying degrees by 
all other units, in measures that vary inversely with 
the number of steps of removal from direct relation 
with the unit performing the key function. A single 
power gradient running through all the units in- 
volved in a system assumes a very simple situation, 
of course, one in which a single unit occupies a key 
function position. Before introducing complications 
into this overly simplified conception, it is oppor- 
tune to note some further implications of the dom- 
inance principle. 

Where the progress of differentiation has distin- 
guished a relatively large number of units, they 
tend to form clusters or complex units. A corollary 
to the dominance principle sheds light on how that 
comes about: the greater the extent to which vari- 
ously specialized units are subject to the environ- 
mental conditions mediated by some one unit, the 
greater their tendency to coalesce in a corporate 
body. The interdependence among such units is 
manifestly close, and their requirements for mutual 
accessibility are correspondingly acute. A hierarchic 
pattern emerges in which the number of strata 
corresponds to the number of degrees that the parts 
are removed from direct relations with the key 
function; thus, symbiotic subsystems appear as 
components of a parent system. The nuclear family, 
although its origins are obscure, fits this principle. 
But instances with more proximate origins are 
found in the combinations of specialists to form 
producing enterprises, welfare agencies, governing 
bodies, etc., and again in the combinations of pro- 
ducing enterprises, retail establishments, or govern- 
ments to constitute larger, more complex units. 
There is no restriction of scale or complexity in the 
formation of symbiotic or corporate units. 

On the other hand, units that are of a given 
functional type and therefore occupy equivalent 
positions in the power hierarchy may raise their 
power potential by the formation of categoric 
unions. Any threat to a function or to the condi- 
tions of its performance can provoke such a 
response; groups of elders, the medieval guilds, 
labor unions, professional associations, councils of 
churches, retail associations, and associations of 
manufacturers are examples of categoric units. A 
social class is at most a loose form of categoric 
unit, 

As with the corporate union, the categoric union 
may be composed of units of any size or kind. It 
may appear as a federation of categoric units or 
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as an association of corporate units. So long as it 
retains its pure categoric form, however, such a 
unit can do little more than react to circumstances 
affecting it. Nor can it have more than a transitory 
existence, since in order to engage in positive action 
of any kind and to attain some measure of per- 
manence, it must develop at least a core of spe- 
cialists. Although the categoric unit tends to assume 
the characteristics of the corporate unit, it remains 
distinctive as long as its criterion for membership 
is possession of a given, common characteristic. 
In any event, the categoric unit is a source of 
rigidity in a social system, and once formed, its 
effect is to preserve the position of a functional 
category in the system, 

The concept of dominance has been widely em- 
ployed for the purpose of delimiting the boundaries 
of systems. It is argued that centers of settlement, 
such as cities, exercise dominance over their sur- 
rounding areas, in diminishing degrees as the dis- 
tance from the center increases; the margin of 
influence marks the boundary of the system. Em- 
pirical support for this proposition is provided by 
the evidence of nonrandom distribution of related 
functions and by the gradient pattern that appears 
in the frequencies with which outlying functions 
are involved with central functions. 

Two qualifications of this conclusion are needed; 
first, dominance is exercised from and not by cen- 
ters, since it resides in functional units rather than 
in the places where they are located. Further, the 
apparent decline of dominance results less from 
an effect of distance in reducing control over re- 
lated functions than from the increased difficulty 
of establishing dominance uniformly over an ex- 
ponentially widening area. 

Isomorphism. A principle of isomorphism has 
been implied in much that has been said and needs 
now to be stated. Units subject to the same environ- 
mental conditions, or to environmental conditions 
as mediated through a given key unit, acquire a 
similar form of organization. They must submit to 
standard terms of communication and to standard 
procedures in consequence of which they develop 
similar internal arrangements within limits im- 
posed by their respective sizes. Each unit, then, 
tends to become a replica of every other unit and 
of the parent system in which it is a subsystem. 

Since small units cannot acquire the elaborate 

organizations of which large units are capable, 
they jointly support specialized functions that com- 
plement their meager organizations. For example, 
whereas a large unit may include among its func- 
tionaries accountants, lawyers, engineers, public 
relations experts, and other specialists, the small 


unit must purchase comparable services from units 
specializing in each of the relevant functions, The 
principle of isomorphism also applies to the size 
of units, at least as a tendency; that is, all units 
tend toward a size that enables them to maintain 
contact with all relevant sectors of the parent 
system. 


Closure and social change 


Operation of the several principles mentioned 
thus far moves a system toward a state of closure. 
This term must be employed here with circumspec- 
tion, for it cannot have its usual connotation of 
independence of environment. Closure can only 
mean that development has terminated in a more 
or less complete system that is capable of sustain- 
ing a given relationship to environment indefinitely. 
For closure to be realized, it is required not only 
that the differentiation of function supportable by 
the productivity of the key function has attained 
its maximum but also that the various functions 
have been gathered into corporate and categoric 
subsystems; moreover, the performance of the key 
function should have been reduced to one unit or 
to a number of units united in a categoric federa- 
tion. Then a system is highly selective of its mem- 
bership and capable of exercising some control over 
factors that threaten change in the system. 

Under these circumstances, certain conditions of 
equilibrium are held to obtain: the functions in- 
volved are mutually complementary and collectively 
provide the conditions essential to the continuation 
of each; the number of individuals engaged in each 
function is just sufficient to maintain the relations 
of the functions to each other and to all other 
functions; and the various units are arranged in 
time and space so that the accessibility of one to 
others bears a direct relation to the frequency of 
exchanges between them. Needless to say, equi- 
librium as thus defined is a logical construct; the 
conditions express an expected outcome if the prin: 
ciples of organization are allowed to operate with- 
out any external disturbances. - 

Origins and effects of change. Every social ike 
tem is continuously subject to change, for since 
the environment is always in some state of flux, 
the equilibrium that can be attained is seldom more 
than partial. A system founded on nonreplaceable 
resources is faced with “immanent change”; sooner 
or later it will either pass into decline or shift per- 
force to a different resource base. Such, for in- 
stance, has been the experience of innumerable 
mining communities; in a similar manner agricul- 
tural communities often alter the soil composition 
of their lands by the uses they practice, with the 


ultimate result that the lands will no longer sup- 
port the systems as they are constituted. Instances 
of maladaptation, such as the reliance of the Irish 
on the potato as a food staple in the nineteenth 
century, can lead to catastrophic consequences. 

In general, however, change has an external ori- 
gin, a proposition that follows by definition from 
an equilibrium position, Some influences emanate 
from the physical or biotic environment, such as 
variations in the growing season or invasion by 
parasites. To the extent that episodic occurrences 
of that order fail to modify one or more functions 
comprised in the system, their effects are transi- 
tory; the system returns to its original form. But 
where a function, particularly a key function, is 
substantially modified, the system must be recon- 
stituted. The disappearance of a game supply, the 
silting of an estuary, or the eruption of a volcano 
may render a key function inoperative in its usual 
location. The population must relocate and work 
out a new system of activities. Unless there is an 
increase of productivity, new ways of acting will 
merely displace old ways, and the change will not 
be cumulative. 

Cumulative change. Cumulative change, or 
growth of the system, presupposes an increase in 
the productivity of a key function. Only in this way 
is it possible to multiply specialization, to employ 
a greater variety of techniques, and to support a 
larger population. The probability of the occurrence 
of disturbances having that effect rises with the 
number of points of contact with a social environ- 
ment; location, therefore, is an important factor. 
A site on a traveled route is more exposed to ex- 
ternal influences than one situated at a distance 
from an avenue of movement; a site at a conflux 
of routes is much more vulnerable to disturbances 
from without. Any location that fosters frequent 
meetings of people from diverse backgrounds is a 
gateway for the infusion of alien experiences and 
techniques into the social system centered there. 
Whether change is released through a deposit of 
numerous small additions to the culture or through 
a simple, dramatic innovation is immaterial. 

The process of cumulative change may be gen- 
eralized as a principle of expansion. Expansion is 
a twofold process involving, on the one hand, the 
growth of a center of activity from which domi- 
nance is exercised and, on the other hand, an 
enlargement of the scope of the center's influence. 
The process entails the absorption and redistribu- 
tion of the functions formerly carried on in outlying 
areas, a centralization of mediating and control 
functions, an increase in the number and variety 
of territorially extended relationships, a growth of 
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population to man a more elaborate set of activities, 
and an accumulation of culture together with a 
leveling of cultural differences over the expanding 
domain. After the revival of trade in Europe, from 
the tenth century onward, the favorably situated 
village with its narrow vicinage grew into a market 
town that served as a center of an enlarged terri- 
torial organization. That gave way, in turn, to the 
emergent city capable of exercising an integrating 
influence over an area of regional scope. Most 
recently, the metropolis has superseded the city 
and has brought under its dominion a vast inter- 
regional territory, Although the process is as old as 
recorded history, it has not advanced in a simple 
linear progression, It has moved and then stalled 
in one place, only to surge ahead in another; it has 
faltered and even on occasion has seemed to turn 
back upon itself, but it has always resumed its 
course with renewed vigor. 

The limits to expansion are sometimes fixed by 
the facility of movement between center and pe- 
riphery—by the maximum distance over which the 
exercise of dominance is feasible, More often than 
not, however, the limits are drawn at the points of 
juncture with the expanding domains of social sys- 
tems in neighboring regions. As a system encoun- 
ters its limits, however they are fixed, it loses 
capacity to absorb further change, and equilibrium 
tendencies begin to assert themselves once more. 

Under conditions of closure, environmental ef- 
fects, and particularly cultural innovations, would 
be expected to enter a system through the key 
function, for the obvious reason that it has the 
most direct connection with environment. And, 
presumably, change would spread through the sys- 
tem by affecting units successively in the order in 
which they are removed from direct relation with 
a key function. But in a system centered on a con- 
vergence of routes, many units may have direct, 
although not equal, access to the outside world. 
Change may therefore enter the system at many 
points, at least until its structure is fully developed 
with the parts systematically arrayed relative to a 
key unit. An expanding system, in other words, is 
an open system, and it remains so until the limits 
to expansion are reached. 

In many instances, however, the first symptoms 
of change are experienced at the periphery of a 
system. Since the effects of dominance grow more 
uneven with increased distance from a center, 
boundaries are apt to be permeable at many points. 
Yet, in the degree to which a system is integrated, 
events at the boundary are transmitted directly to 
the key unit, from which they are communicated 
to ancillary units; it is always at the boundary that 
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one system begins its absorption of another. Expan- 
sion may be resumed at any moment and in any 
of the systems that hem in one another. An innova- 
tion, even though it might present itself to all 
systems about the same time, gains admission to 
one or another by virtue of a more favorable loca- 
tion for its use, a more appropriate organization 
for its acceptance, or some other local advantage. 
The renewed expansion encroaches upon the terri- 
tories of adjoining systems, sometimes reducing 
them to mere components of a single, greatly en- 
larged system. 

Burgess’ hypothesis of city growth. A hypothesis 
of city growth, stated by E. W. Burgess ([1925] 
1961, pp. 37—44), pertains to a special case of the 
more general principle. According to that proposal, 
city growth takes the form of expansion from a 
zone centered on a highly accessible location. 
Growth involves increasing density of occupance 
of the central zone and, at the same time, a redis- 
tribution of activities or land uses scattered around 
the center to conform to a gradient pattern of 
variation of intensity of land use according to dis- 
tances from the center. Redistribution results from 
increasing pressure at the center and a consequent 
encroachment by high-intensity uses into the spaces 
occupied by lower-intensity uses in a succession- 
like manner. Alternating periods of redistribution 
and stabilization of distribution, and those of growth 
and partial equilibrium, create a wavelike effect 
more or less visible in a set of concentric zones. By 
venturing to describe, in rather specific terms, the 
content of the zones and by thus reifying a set 
of statistical constructs, Burgess diverted attention 
from a growth process and caused it to be fixed on 
a specious distribution pattern. His argument, 
therefore, seemed to acquire a historical limitation 
that it need not have had, 

The miscarriage of the import of the Burgess 
hypothesis is evident in criticisms that have op- 
posed the preindustrial city to the industrial city as 
a qualitatively different phenomenon. Whereas 
Burgess suggested that the social-economic status 
of residents is higher in each successive concentric 
zone, critics have shown that in the preindustrial 
city the social-economic gradient runs in the oppo- 
site direction. Useful as that finding may be, it 
misses the essential point. That is, the significant 
gradient is one of dominance; units tend to dis- 
tribute themselves over space in a way that reflects 
their relationships to the dominant unit. In this 

respect, both industrial and preindustrial cities are 
similar. The qualitative difference lies in the kinds 
of units that exercise dominance. In the preindus- 


trial city, all functions are carried in familial or 
household units, and power is unevenly distributed 
among them. But in the industrial society, the 
household unit has been relegated to a minor posi- 
tion; specialized functions are performed by extra- 
familial units, and the separation of functions from 
the household has involved a spatial separation as 
well. 

Furthermore, the notion of a monocentered sys- 
tem is applicable only to the simplest instances, All 
others include a constellation of settlement nuclei, 
that is to say, subsidiary service centers within 
cities, and villages, towns, or cities within hinter- 
lands. Each serves as a locus of influence over a 
localized area, varying in scope with the types of 
functions centered in the nucleus. Thus, as Christal- 
ler observed, the constellation of nuclei forms a 
hierarchy by size and number of places and by 
order of functions performed (1933). Small places 
provide low-order, or ubiquitous, functions, whereas 
each larger place performs, in addition to low-order 
functions, higher-order functions for broader do- 
mains. At the apogee of the hierarchy is the metrop- 
olis, in which the integrating and coordinating 
functions for the entire system are domiciled. Thus, 
dominance is exercised downward and outward 
through nested sets of subsidiary centers [see CEN- 
TRAL PLACE]. 

The least satisfactory aspect of the theory of 
change concerns its temporal incidence. The idea 
of succession, borrowed initially from bioecology, 
lingers in the dictionary of human ecology. Change 
as cyclical in form, consisting in movements be- 
tween equilibrium stages, is clearly the most intel- 
ligible conception. Nevertheless, apart from the 
difficulty in empirically identifying an equilibrium 
stage, there are unsolved conceptual problems of 
the spacing of stages, and of the factors governing 
the intervals between stages. Thus far, succession 
has been applied only in retrospect. Its utility will 
remain uncertain until it can be projected into 
prediction. Quite possibly, that may have to wait 
for more extensive work on social system taxonomy. 


The limits of human ecology 


Human ecology has progressed since its incep- 
tion from an effort to apply the concepts of plant 
and animal ecology to human collective life, through 
an extended period of preoccupation with spatial 
configurations, to an increasing concern with the 
form and development of territorially based social 
systems. In the last phase, human ecologists have 
sought to clarify the assumptions of ecology and to 
draw out their implications for organization. 4 


though the results of that work are far from com- 
plete, it seems clear that they indicate the direction 
in which human ecology will continue to develop. 

As with most approaches in social science, hu- 
man ecology has limited objectives. It seeks knowl- 
ledge about the structure of a social system and 
the manner in which the structure develops. Hence, 
it is not prepared to provide explanations for all of 
the manifold interactions, frictions, and collisions 
that occur within the bounds of a social system. 
The findings of human ecology, however, define the 
context in which all such phenomena take place, 
and which is therefore pertinent to their full 
understanding. 

Human ecology is not qualified to deal with the 
normative order in a social system. Yet consistent 
with its position is the expectation that a normative 
order corresponds to and reflects the functional 
order. The two are different abstractions from the 
same reality. 

Amos H. HAWLEY 


[See also Crry, article on COMPARATIVE URBAN STRUC- 
TURE; ENVIRONMENT; ENVIRONMENTALISM; GEOG- 
RAPHY; and the biographies of BURGESS; DARWIN; 
Park.) 
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I 
CULTURAL ECOLOGY 


Cultural ecology is the study of the processes by 
which a society adapts to its environment. Its 
principal problem is to determine whether these 
adaptations initiate internal social transformations 
or evolutionary change. It analyzes these adapta- 
tions, however, in conjunction with other processes 
of change. Its method requires examination of the 
interaction of societies and social institutions with 
one another and with the natural environment. 

Cultural ecology is distinguishable from but 
does not necessarily exclude other approaches to 
the ecological study of social phenomena, These 
approaches have viewed their special problems —for 
example, settlement patterns, the development of 
agriculture, and land use—in the broad context of 
the complexly interacting phenomena within a 
defined geographical area. Explanatory formula- 
tions have even included the incidence of disease, 
which is related to social phenomena and in turn 
affects societies in their adaptations. This modern 
concept of ecology has largely superseded other 
concepts, such as “urban,” “social,” and “human” 
ecology, which employed the biological analogy of 
viewing social institutions in terms of competition, 
climax areas, and zones. 

Cultural ecology is broadly similar to biological 
ecology in its method of examining the interactions 
of all social and natural phenomena within an 
area, but it does not equate social features with 
biological species or assume that competition is 
the major process. It distinguishes different kinds 
of sociocultural systems and institutions, it recog- 
nizes both cooperation and competition as proc- 
esses of interaction, and it postulates that environ- 
mental adaptations depend on the technology, 
needs, and structure of the society and on the 
nature of the environment. It includes analysis of 
adaptations to the social environment, because an 
independent tribe is influenced in its environmental 
adaptations by such interactions with its neighbors 
as peaceful trading, intermarriage, cooperation, 
and warfare and other kinds of competition, in the 
same way a specialized, dependent segment of a 
larger sociocultural system may be strongly influ- 
enced by external institutions in the way it utilizes 
its environment. 

The cultural ecological method of analyzing cul- 
ture change or evolution differs from that based on 
the superorganic or culturological concept. The 
latter assumes that only phenomena of a cultural 
level are relevant and admissible, and it repudiates 
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“reductionism,” that is, consideration of processes 
induced by factors of a psychological, biological, 
or environmental level. The evolutionary hypoth- 
eses based upon this method deal with culture in 
a generic or global sense rather than with indi- 
vidual cultures in a substantive sense, and they 
postulate universal processes, Cultural ecology, on 
the other hand, recognizes the substantive dis- 
similarities of cultures that are caused by the par- 
ticular adaptive processes by which any society 
interacts with its environment. 

Cultural ecology does not assume that each case 
is unique. Its method, however, requires an em- 
pirical analysis of each society before broader 
generalizations of cross-cultural similarities in 
processes and substantive effects may be made. 
Cultural ecologists study highly diversified cultures 
and environments and can prescribe neither spe- 
cific analytic formulas nor theoretical or ideal 
models of culture change; there can be no a priori 
conclusions or generalizations concerning evolu- 
tion. The heuristic value of the ecological view- 
point is to conceptualize noncultural phenomena 
that are relevant to processes of cultural evolution. 

Empirical studies disclose that among the simpler 
and earlier societies of mankind, to whom physical 
survival was the major concern, different social 
systems were fairly direct responses to the exploita- 
tion of particular environments by special tech- 
niques, As technological innovations improved 
man’s ability to control and adjust to environments, 
and as historically derived patterns of behavior 
were introduced, the significance of both the en- 
vironment and the culture was altered and the 
adaptive processes not only became more complex 
but also acquired new qualities. 

The ecological concept of interacting phenomena 
draws attention to certain general categories of 
relevant data, The resources, flora, fauna, climate, 
local diseases and their vectors of occurrence and 
many other features of the environment constitute 
potential factors in one part of the interacting 
system, The nature of the culture, especially its 
exploitative and adaptive technology but also fea- 
tures of the internal and external social environ- 
ment, constitutes the other part. The interaction 
involves the social arrangements that are re- 
quired in land exploitation; population density, 
distribution, and nucleation; permanence and com- 
position of population aggregates; territoriality of 
societies; intersocietal relationships; and cultural 
values. In each case the empirical problem is 
whether the adaptation is so inflexible as to permit 
only a certain pattern or whether there is latitude 


for variation which may allow different patterns to 
be developed or borrowed. 

Explanations in terms of cultural ecology require 
certain conceptual distinctions about the nature of 
culture. First, the various components of a culture, 
such as technology, language, society, and stylistic 
features, respond very differently to adaptive proc- 
esses. Second, sociocultural systems of different 
levels of integration profoundly affect the inter- 
action of biological, cultural, and environmental 
factors. Societies having supracommunity (state or 
national) institutions and the technological ability 
to expand the effective environment beyond that 
of the local or primary group can utilize resources 
within and outside the area controlled by the larger 
society. The adaptive responses of complex societies 
are thus very unlike those of a tribal society, which 
adapts predominantly to its own environment. 


Culture history and ecological adaptations 


The historical processes by which a society ac- 
quires many of its basic traits are complementary 
to studies of adaptive processes, The historical 
processes include the extensive borrowing of many 
cultural traits and trait complexes from diverse 
sources; the migrations of people; the transmission 
of cultural heritages to successive generations; and 
local innovations or inventions. Recognition of 
these historical processes, however, does not rele- 
gate environment to the circumscribed role of 
merely permitting or prohibiting certain cultural 
practices so that all origins must be explained by 
such history. It is obvious that fishing will have 
minor importance in desert areas and that agri- 
culture is impossible in arctic regions, It is equally 
clear that abundant fishing resources cannot be 
exploited without appropriate techniques and that 
agricultural potentials cannot be utilized without 
domesticated plants. The presence of gold, oil, and 
uranium is unimportant until the society has de- 
mands for them and means of extracting them. 

The Indians of the Southwest had at one time 
been predominantly food collectors, but the intro- 
duction of food crops, largely from Mesoamerica, 
provided the basis for the development of the more 
complex Pueblo village culture. Many details of 
this culture, such as ritual elements, were bor- 
rowed from the south, but the development of 
Pueblo sociocultural patterns are comprehensible 
only in terms of the processes of population growth, 
extension of biological families into lineages, 
eventual consolidation of lineages into larger set- 
tlements, and the appearance of many village 
institutions that cut across kinship groups. 


Dissimilar ecological adaptations may also occur 
among societies that have been subjected to similar 
historical influences, The Indians of California and 
of the semiarid steppes and deserts of the Great 
Basin shared substantially the same devices for 
collecting wild foods and for hunting, but the vastly 
greater abundance of flora and fauna in California 
supported a population thirty times that of the 
Great Basin, The California Indians lived in fairly 
permanent villages which had developed some 
social elaborations, whereas the Great Basin In- 
dians were divided into independent family units 
which foraged over large territories during most of 
the year and assembled only seasonally in encamp- 
ments that did not always consist of the same 
families. 

The investigation of cultural ecological processes 
must consider the possibility that basic sociocul- 
tural patterns may have diffused or been carried 
by migrations from one kind of environment to 
another, but it also must examine whether these 
patterns have been modified, Assessment of modifi- 
cations requires a distinction between the outer 
embellishments of the culture, such as ritual ele- 
ments, art styles, and kinds of architecture, and 
those social patterns which are human arrange- 
ments for self-perpetuation. 

Biological factors. Man's adaptations to his nat- 
ural environment cannot be comprehended in purely 
culturological terms because man not only shares 
basic biological needs with all animals but also has 
distinctively human characteristics. All activities 
are culturally conditioned, but ecological factors 
cannot be wholly distinguished from inherent bio- 
logical and psychological factors which are the 
basis of behavior. Analyses that ascribe importance 
to inherent human qualities bear directly upon the 
validity of common hypotheses and raise new 
problems, 

It has generally been assumed that the nuclear 
family consisting of parents and children has al- 
ways been the irreducible social structure because 
the sexes serve complementary functions in meet- 
ing procreational and subsistence needs and in 
caring for and training the young during their long 
period of dependency. This assumption is generally 
valid for all ethnographically known societies, espe- 
cially for primitive societies, among whom there is 
a clear sexual division of labor in subsistence, 
maintenance of the household, and child rearing. 
In contemporary industrial societies, however, the 
biological family has surrendered many of its func- 
tions, and cultural differences between sex roles 
have been diminished. Conceivably, the nuclear 
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family might be reduced to little more than a pro- 
creational unit if its cultural functions are lost. 

In primitive societies, larger suprafamilial bands 
or other groups that developed in response to coop- 
erative needs, especially in subsistence activities, 
had a biological basis, especially through lineages. 
The patrilineages of hunting bands, for example, 
had the advantage that particular skills, knowledge 
of the environment, and obedience to authority were 
transmitted from father to son in a male-dominated 
group. Biological relationships have so long been 
fundamentally important in culture history that 
they may be ascribed excessive importance, as 
when the position of head of a family or even 
kingship is transmitted through primogeniture. The 
biological basis survived because of the lack of 
patterned alternatives, 

Another basic biological factor in cultural eco- 
logical adaptations is the prolonged period of hu- 
man growth. The factors of age and environment 
interact in many ways to affect behavior patterns. 
The Shoshoni child may help in rabbit drives but 
not in big game hunting; the east African boy may 
herd cattle but not go to war; modern American 
youths attend school for a required number of 
years; in areas of precarious subsistence the aged 
may be abandoned to die, while in other areas 
they may be accorded special consideration; and 
everywhere the marriage age group is determined 
by biological development, culture, and adaptations 
to the environment. 

There are, however, many problems concerning 
the interaction of biological and cultural ecological 
factors, One of these arises from the assumption 
that the nuclear or biological family is a basic part 
of every human society. This assumption might 
rest on the obvious biological nature of the family 
or upon the universal cultural functions the family 
is believed to serve. The higher primates usually 
live in fairly small bands that consist of females 
and their dependent offspring, a single powerful 
male, and subordinate adult males. In certain de- 
pressed segments of modern society, however, the 
matrifocal family or kin group, which consists of 
women and dependent children and somewhat sub- 
ordinate males who are loosely attached to the 
family, appears to be the irreducible social unit. 
This family represents an adaptive response to re- 
stricted territoriality, low income, and lack of op- 
portunity for improvement. 

Although the transition from a precultural pri- 
mate society to the nuclear family has not been 
explained, the contemporary existence of the matri- 
focal kin group raises the question of origin. How 
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did the nuclear family develop from an early kind 
of society which may have lacked the cultural eco- 
logical processes that created and have subsequently 
supported it? 

Speculation about the origin of the family must 
be based on what is known archeologically about 
the origins of technology and on ethnographic evi- 
dence. In all ethnographic cases of prefarming soci- 
eties the nuclear family is based on clear comple- 
mentarity of the sexes in subsistence activities. 
Women are food collectors because they must not 
leave their children, who are their inescapable 
responsibility, whereas men may spend long periods 
away from their families hunting large game or 
fishing, if the technology makes this possible. Prior 
to the invention of such distinctly male-associated 
hunting devices as spears, bows, nets, and traps, 
both sexes may have collected food in so similar a 
way that they were no more differentiated than 
were the higher primates, If so, there may have 
been a long transitional stage when some type of 
matrifocal kin group persisted. The hand ax, which 
was long the principal weapon, may attest to men’s 
role in protecting the group but not specialization 
in hunting, for its use was in close combat. 

Settlement patterns and the adaptive process. 
A culture that is introduced into a wholly new 
environment must adapt in some ways to local con- 
ditions, but only empirical research can determine 
whether the adaptations are unalterably fixed or 
whether there is latitude for variations. In some 
instances, the case for narrow limits of variation 
seems clear. Simple societies that exploit only 
Sparse and scattered resources obtained by food 
collecting must obviously fragment into small 
groups, for members of the society are in competi- 
tion with one another. More abundant resources, 
such as rich areas of vegetable foods, large game 
herds, fisheries, or intensive farming, may permit 
variation in some features and thus allow latitude 
for borrowing or for local innovations, 

There can, however, be no a priori conclusions: 
whether a society is dispersed over its land or clus- 
tered in large settlements may result from adap- 
tive factors, from historical patterns, or from both. 
The Indians of California and the Northwest Coast 
had very similar native population densities, but 
the dispersed acorns and game in California led to 
considerable dispersal of the villages, whereas the 
rich salmon fisheries of British Columbia and Alaska 
required the people to concentrate on the main 
streams in fairly large settlements. Similarly, agri- 
cultural areas dependent upon irrigation concen- 
trated the population within the network of canals. 
On the rainless and barren deserts of coastal Peru, 


the people had to live in dense settlements along 
or near the rivers. The large communities on the 
coasts of British Columbia and Peru, however, were 
parts of unlike sociocultural systems. Each devel- 
oped from a different historical background, and 
each had a distinctive local adaptation; there were 
fisheries on the Northwest Coast and intensive, irri- 
gation farming together with ocean resources in 
Peru. 

A dense population supported by a rich economy 
is not always concentrated in large centers. The 
native agricultural peoples of the northern Andes 
of Colombia and the Araucanian Indians of the 
central valley of Chile had very similar population 
densities, but the northern Andean peoples were 
organized in strong, class-structured chiefdoms 
with communities of five hundred persons or more, 
whereas the Araucanians were dispersed in many 
closely spaced villages, each a patrilineage of no 
more than one hundred persons. This contrast is 
not explained by farm productivity or other tech- 
nological factors involved in maintaining large 
communities. The northern Andean chiefdoms 
were the result of a diffused cult-complex of tem- 
ples, priests, warriors, and human sacrifice, which 
may have originated in Mesoamerica but was re- 
adapted to the diversified environment of high 
mountains and deep valleys. Presumably, the Arau- 
canians could have supported chiefdoms, but the 
nucleating factors were absent. A limited pastoral 
nomadism based on llama herding may have been 
an ecological deterrent to community growth, al- 
though this factor did not inhibit the development 
of states and empires in Peru. 

Nontechnological features of culture may also 
affect the adaptive arrangements through the ex- 
ternal social environment of the society. Peaceful 
interaction of societies through marriage, trade, 
visiting, and participation in ceremonies, games, 
and other activities may take various patterns 
within the limits imposed by the environment. 
Intergroup hostilities, however, may have a decisive 
influence, because fear of warfare often causes 
peoples to cluster in compact settlements that are 
not required by subsistence patterns and sometimes 
do not give optimum access to resources. Thus, 
the palisaded villages of parts of the Amazon some 
what inhibited farming of the frequently shifting 
slash-and-burn farm plots. To judge by the pre- 
history of the Pueblo Indians, the nature of farm- 
ing permitted considerable variation. Early settle- 
ments were small and widely scattered, but in later 
periods the population declined as the settlements 
became large and tightly nucleated. This was the 
result of increased precariousness of farming in an 


area that had frequent droughts and was inhabited 
by marauders. In Tanganyika, people did not dare 
disperse along the fertile lands near the water- 
ways until colonialism largely eliminated warfare. 
Throughout much of history, warfare has been a 
major factor in the interaction of culture and 
environment, 

Exploitation of an environment by means of cer- 
tain cultural devices may also drastically affect the 
environment, which again reacts upon the culture. 
In native California, as in many other areas, the 
Indians deliberately fired the grass in order to kill 
seedlings of brush and trees and thus increase the 
grazing land for game. Deforestation, overgrazing 
with concomitant soil erosion, the damming and 
rechanneling of rivers, drainage of swamps, and 
conversion of rural areas to urban or suburban 
land-use patterns also alter the environment. In 
such instances as the firing of grasslands, the de- 
velopment of irrigation works, or reclamation of 
swamps, the culture increases its basic resources. 
In others, such as overgrazing, lack of conservation, 
or the exhaustion of basic minerals, the culture 
destroys or impairs certain aspects of its local 
foundations. 


Cultural variables versus holism 


Although the culture of any society constitutes a 
holistic system in which technology, economics, 
social and political structure, religion, language, 
values, and other features are closely interrelated, 
the different components of a culture are not sim- 
ilarly affected by ecological adaptations. Technol- 
ogy, which exemplifies progress in man’s control 
of nature, tends to be cumulative. A language, 
unless replaced by another one, slowly but con- 
tinuously evolves into divergent groups of lan- 
guages. Humanistic and stylistic cultural mani- 
festations may retain their formal aspects during 
Social transformations but acquire new functions. 
Societies change through a series of structural and 
functional transformations, 

Social structures respond most clearly to envi- 
ronmental requirements. This basic structuring is 
related most immediately to cooperative productive 
activity, and it is manifest in community and band 
Organization and in essential kinship systems. 
Among simple societies, any interpersonal or inter- 
familial arrangements necessary for survival in 
Particular areas are virtually synonymous with 
Social organization. Because food collecting, such 
as in the case of seed gathering, is competitive, 
Societies in unproductive environments tend to be- 
come fragmented into nuclear family units. Soci- 
eties of hunters are more productive under cooper- 
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ative arrangements and attain various patterns of 
cohesion. Societies that depend primarily upon 
farming tend to have permanent community organ- 
ization, whether in dispersed or nucleated settle- 
ments, because cooperation in such activities as 
clearing plots and irrigation projects facilitates pro- 
duction. Increase in productivity is delimited by 
the environmental potentials, crops, and farm 
methods, but it may lead to larger communities 
and to internal specialization of role and status. 

Productive increases achieved by social coopera- 
tion and improvements in exploitative technology 
became the bases of the transformation of small, 
homogeneous societies into larger societies that 
were internally specialized by occupational role 
and social status. Structural aspects of cooperation 
are reasonably well known, but quantitative state- 
ments of productivity, potential and actual sur- 
pluses, manpower hours that might be available 
for pursuits other than production of basic necessi- 
ties, numbers of persons engaged in special occu- 
pations, and other measurable activities are rarely 
available. These serious lacunae in our data leave 
open the question of the part that sheer quantities 
of people and things played in the transformations 
from simple to complex societies. 

Even within the category of culture subsumed 
under social patterns, it cannot be assumed that all 
features are equally fixed by a given ecological pat- 
tern. Among simple societies, residence, kinship, 
and subsistence patterns are more fundamental 
and less alterable or variable than clans, moieties, 
religious and secular associations, and other elab- 
orations which are secondary embroideries on the 
basic social fabric. Distributional evidence in many 
parts of the world clearly shows that such elabora- 
tions have diffused widely across different kinds 
of environments and to fundamentally different 
social structures adapted to use of the environ- 
ments. In the Southwest, for example, matrilineal 
clans occurred among the western Pueblo farm 
villages and the seminomadic pastoral Navajo, and 
moieties were a secondary adjunct of the patri- 
lineal hunting bands in southern California and 
some of the eastern Pueblo agriculturalists. 

A final problem concerning the response of dif- 
ferent aspects or components of culture to adaptive 
processes is whether old or traditional forms may 
acquire new functions. Among the Sonjo of Tan- 
ganyika, the men’s age-group that served as war- 
riors in precolonial times now spends several years 
performing wage labor in the new nonmilitaristic, 
cash-oriented context. Clan-owned or lineage-owned 
land and traditional types of ownership and in- 
heritance of herds may persist under production 
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for an outside market. Although traditional struc- 
tures and trait-complexes tend to perpetuate them- 
selves in all stages of cultural development, the 
evidence seems clearly to indicate that the new 
functions of the changing context eventually alter 
old forms beyond recognition, or that these forms 
wither and are replaced by new ones. 


Levels of sociocultural integration 


The adaptations of a complex or highly devel- 
oped society differ in many ways from those of a 
simple society, The internal specialization that de- 
veloped after the agricultural revolution and more 
so after the industrial revolution has affected the 
adaptive processes of the local segments of states 
and nations, Land use has increasingly reflected 
the importance of external economic institutions 
rather than local subsistence goals. Trade, im- 
proved transportation, mechanization, and other 
factors related to industrialization have made each 
local social group a more highly specialized and 
dependent part of the larger sociocultural system. 
There is a tendency toward monocropping in areas 
best suited to production of crops for external mar- 
kets and toward specialization of local extractive 
industries, such as mining, oil, and timber, which 
have little intrinsic value to the local subsociety, 
Larger sociocultural systems have also created 
metropolitan and industrial centers and set aside 
special recreational areas. 

Owing to technological achievements, the im- 
pact or conditioning effects of nature upon society 
are far less direct and compelling in a complex 
society than in a simple one. Culture increasingly 
creates its own environment. Foods and other ne- 
cessities can be transported, water diverted, and 
electric power transmitted great distances. Reason- 
able comfort can be provided even for those who 
live for a time in Antarctica. 

In a more fundamental sense, however, the dis- 
tinction between primitive and developed societies 
is not merely one of social complexity or of tech- 
nological knowledge. Among primitive peoples, the 
family, extended kin group, or fairly small village 
or band adapts directly to its own territory, Among 
more complex societies, there are suprafamilial and 
supracommunity institutions that are impersonal 
in that they do not involve the total cultural behav- 
jor of the people connected with them. State eco- 
nomic institutions—corporations, for example, that 
extract new materials and manufacture and dis- 
tribute the products—serve the varied needs of the 
many subcultural groups within the state. They 
also extend the interaction of culture and environ- 

ment far beyond state boundaries through the ex- 


ploitation of distant resources and through exten- 
sive commerce. These vast economic extensions of 
the more developed societies penetrate local soci- 
eties, and by creating new uses for land and other 
resources and imposing outside political and eco- 
nomic patterns, they fundamentally transform the 
local societies. 

Within a modern, complex state, there is increas- 
ing specialization of land use within environmental 
potentials, but distinctiveness of local subcultures 
is partly leveled by the impact of national influ- 
ences. The subcultures of ranchers in Nevada, sugar- 
cane workers in Puerto Rico, tea-plantation workers 
in Kenya, and other subsocieties who live on the 
land are shaped by the outside economic institu- 
tions to which they are linked, as well as by the 
local environment. As technology develops and sub- 
societies become more dependent upon the larger 
society, direct adaptation to the local environment 
decreases. The way of life, or subculture, of busi- 
ness executives of New York City, for example, 
has become dichotomized; in part it is derived 
from their highly specialized occupational role in 
a city and in part from their residence in suburbia, 
where the family and neighborhood culture differs 
from that of their profession and neither is closely 
adapted to the local environment. Even more re 
mote from direct environmental impact are certain 
urban institutions found in contemporary Nevada, 
where the divorce, gambling, and entertainment 
complexes are linked to special aspects of the total 
national culture rather than to the semideserts of 
the Great Basin. 


Some substantive applications 


The concept of cultural ecology may be clarified 
by a number of substantive applications. These 
cases range from simple to complex societies, from 
the most elementary level of sociocultural integra- 
tion to higher levels. 

The precursors of man lacked culturally derived 
devices for killing game and for gathering, storing, 
and preparing food. They were food scavengers, 
and any food they collected was preserved without 
culturally prescribed techniques. Because animals 
can best forage a known habitat, they may have 
lived in groups or bands that were somewhat terri- 
torially delimited. Any social structuring was prob- 
ably biologically oriented around the male's dom- 
inance and protection of the group and the female's 
role of caring for the young. E 

When culture provided more efficient techniques 
for survival, the essential human biological facts 
of sex, age, and kinship continued to affect the 
nature of society but were patterned in various ways 


by cultural ecological adaptations. Where the prin- 
cipal resource was seeds, roots, fruits, insects, small 
mammals, or shellfish, which occur sparsely, food 
gathering was necessarily competitive rather than 
cooperative because the yield decreases with the 
number of families that exploit the same site. If 
the local occurrence of principal resources varied 
each year, as it did among the Shoshoni Indians of 
Nevada, a multifamily society of permanent com- 
position was not possible. Each Shoshoni family 
was linked with other families through intermar- 
riage, through largely fortuitous association at a 
winter campsite, and through brief cooperation 
during antelope and rabbit hunts wherever there 
happened to be game, There were no bands of con- 
stant membership and no claims to territory or to 
resource areas. These families were not free wan- 
derers, however, for any human society more effec- 
tively exploits a territory familiar to it. 

The Chilean archipelago is completely unlike 
the Great Basin, but the erratic occurrence of shell- 
fish, which was the principal food of the Alacaluf 
Indians, was functionally similar to Great Basin 
resources in preventing the formation of perma- 
nent social units larger than the nuclear family. 
That the nomadic, food-collecting family was the 
irreducible social unit in both cases is a function 
of the biological factors underlying marriage, divi- 
sion of labor, and child rearing. The independent 
nuclear family, however, is rarely reported ethno- 
graphically, and its occurrence in history is an em- 
pirical question rather than a matter of deduction 
from evolutionary principles, 

Many societies fragment seasonally into family 
units of food collectors, but if the resources are 
sufficiently abundant and their whereabouts is pre- 
dictable, the same families maintain contact with 
one another in loose groups that associate for supra- 
familial activities. Examples are the Indians of Cal- 
ifornia, who subsisted on acorns, and the Indians 
of Lake Superior and the upper Paraguay River, 
who gathered wild rice. It appears that such groups 
tended to consist of extended families or lineages, 
although the evidence is not clear on this point. 

Hunting generally requires cooperation, but there 
are various kinds of hunting societies, each reflect- 
ing special ecological adaptations. The Central 
Eskimo, whose population density was one person 
to 250 square miles or more, lived in small, some- 
what isolated family clusters. Since some coopera- 
tion was necessary in arctic hunting, nuclear fam- 
ilies could not well have survived alone. 

In societies that hunted large herds of bison, the 
concentrated resources supported bands of several 
hundred persons that exceeded the size of trace- 
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able lineages and consisted of many unrelated kin 
groups, The clans among certain Plains Indians 
were fictitious kinship extensions and an embellish- 
ment of the basic subsistence group. The Plains 
bands were more tightly integrated than the hunt- 
ing bands of Canada, because game herds were 
larger, cooperative hunts were more highly devel- 
oped, and warfare, especially after horses were 
acquired, became a major factor. By contrast, 
hunters of small and less-migratory game herds 
cooperated in bands of thirty to sixty persons who 
constituted a patrilineal lineage that controlled ter- 
ritorial resources, married outside the band, and 
brought wives to the husband's band. 

The basic social structure of these types of bands 
is essentially an interfamilial arrangement that is 
a necessary—or an optimum—organization for 
survival, depending on the conditions, A historical 
explanation that they originated elsewhere and 
were introduced to the areas through diffusion or 
migration is not credible, especially in the case of 
the patrilineal hunting bands that occurred in South 
Africa, Australia, Tasmania, southern California, 
and Tierra del Fuego. Each is a special kind of 
evolutionary development, 

Societies of early farmers seem also to have been 
small, except where intercommunity warfare forced 
the people to concentrate in protected villages, and 
each probably consisted of lineages that tended to 
bud off as long as land was available, Although few 
analyses of the ecological adaptations of such farm- 
ers have been made, several types are suggested. 
In tropical rain forests such as the Amazon basin, 
where root crops were staples, slash-and-burn cul- 
tivation together with rapid soil exhaustion required 
the frequent shift of plots and sometimes villages. 
In addition, dietary needs made riverine protein 
foods of major importance, and canoe transporta- 
tion facilitated the concentration of the population 
along rivers, In temperate areas, such as the United 
States, there was a marked contrast between the 
small villages of woodland farmers, who contended 
with prairie sods or forests, and the river-bottom 
farmers, as in the Mississippi system, whose fertile 
soils, once cleared of vegetation, supported large 
communities that were able to adopt the temple- 
mound complex from Mesoamerica, 

In several temperate, arid areas, early farming 
was restricted to flood plains or to the rainy high- 
lands, Increased population density in the valleys— 
made possible by irrigation in such cases as Peru 
—led to theocratic and militaristic states, whose 
power extended over wide areas, and to urban 
centers, which were the containers of civilization. 
It is not now clear to what extent trade, based upon 
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increasingly specialized local production in adjoin- 
ing areas, religious cults, and militaristic controls 
regimented the people to achieve maximum pro- 
duction and fill special statuses and roles. 

Animal domestication entailed other kinds of 
ecological adaptations. Herds require more land for 
subsistence purposes than farming, have to be 
tended and moved about, and may be subject to 
theft. Claims to exclusive grazing areas were diffi- 
cult to enforce until barbed wire was mass- 
produced in the last century. Cattle stealing and 
consequent intergroup hostility could therefore de- 
velop, as they did among many cattle breeders of 
east Africa, Free-roaming livestock, moreover, is a 
threat to crops, and until recently, farmers had to 
fence out the animals, 

When societies became mounted, for example, 
the horse or camel nomads, other special adapta- 
tions occurred, They became more efficient hunters 
or herders and could transport foods a greater dis- 
tance, thus extending their areas of exploitation 
and permitting larger social aggregates to remain 
together. They could also engage in predatory activ- 
ities against one another and their settled neigh- 
bors. Mounted predators even created several ruling 
dynasties in the empires of the Middle East, China, 
and India, 

After the agricultural revolution, the develop- 
ment of supracommunity state- or imperial-level 
institutions extended the areas of resource exploita- 
tion beyond those occupied by any of the subsoci- 
eties that were welded into the larger sociopolitical 
unit. Each local subsociety, although adapted in 
some measure to its terrain, became a specialized 
part of a new kind of whole to which it contributed 
different foods, raw materials, products, and even 
people—all for state purposes. 

The industrial revolution enormously expanded 
the areas of exploitation through its improved trans- 
portation, mass manufacturing, communications, 
and economic and political controls. Its technology 
also gave importance to many latent resources. 
Modern nations and empires embrace highly diver- 
sified environments and draw upon areas beyond 
their political boundaries. Their technology may 
modify environments to meet their cultural needs, 
and, above all, any localized subsociety reacts to a 
complex set of state institutions, to a diversified 
social environment, and to a large number of goals 
other than survival, Although people necessarily 

live in particular places, members of highly indus- 
trialized nations must be viewed as increasingly 
nonlocalized to the extent that their behavior is 
determined by a great overlay or elaboration of 
cultural patterns that are only remotely connected 


with particular environments and are even min- 
imizing some of the child-rearing and economically 
complementary functions of members of the nu- 
clear family. It can be imagined that nuclear power, 
hydroponic and synthetic food production, and 
other technological developments might create 
wholly artificial environments, as in a permanent 
space station. 

JULIAN H, STEWARD 


[See also CULTURE, article on CULTURAL ADAPTATION; 
EVOLUTION, articles on HUMAN EVOLUTION and CUL- 
TURAL EVOLUTION. Other relevant material may be 
found in HUNTING AND GATHERING and KINSHIP.] 


BIBLIOGRAPHY 


BARTH, FREDERIK 1956 Ecologic Relationships of Ethnic 
Groups in Swat, North Pakistan. American Anthropol- 
ogist New Series 58:1079-1089. 

HELM, June 1962 Ecological Approach in Anthropology. 
American Journal of Sociology 67:630-639, > In- 
cludes a review of the literature of cultural ecology. 

Lattimore, OWEN (1940) 1951 Inner Asian Frontiers 
of China. 2d ed. Irvington-on-Hudson, N.Y.: Capital. 

SAHLINS, MARSHALL D.; and SERVICE, ELMAN R. (editors) 
1960 Evolution and Culture. Ann Arbor: Univ. of 
Michigan Press. 

Srewarp, Jurian H. 1955 Theory of Culture Change: 
The Methodology of Multilinear Evolution. Urbana: 
Univ. of Illinois Press. 

WASHBURN, SHERWOOD L, (editor) 1961 Social Life of 
Early Man. Viking Fund Publications in Anthropology, 
No. 31. Chicago: Aldine, 


ECONOMETRIC MODELS, AGGREGATE 


An econometric model is a set of equations de- 
signed to provide a quantitative explanation of the 
behavior of economic variables. This article dis- 
cusses models that focus on the behavior of an 
economy in the aggregate, especially on the time 
paths of variables such as national income and 
product, consumption, investment, employment, 
the price level, the interest rate, etc. The pioneer- 
ing model of this type was constructed by Jan 
Tinbergen (1939). The leading aggregate model 
builder for some years has been Lawrence R. Klein. 

Aggregate econometric models grew out of a 
blend of several different streams of work, One 
is the mathematical stream springing from the 
work of Léon Walras, which represents the econ- 
omy by a system of simultaneous equations. An- 
other is the work of Ragnar Frisch and others in 
the theory of economic dynamics. A third is the 
work in statistical inference associated with Karl 
Pearson and his successors, showing how to esti- 
mate the value of unknown parameters with the 
aid of prior information and observed data. A 
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fourth is the development by Willford King and 
Simon Kuznets and others of numerical estimates 
of national income and expenditure and their com- 
ponents [see NATIONAL INCOME AND PRODUCT 
ACCOUNTS]. A fifth is the formulation of aggre- 
gative economic theories of income and employ- 
ment, by R. F. Kahn, John Maynard Keynes, and 
others. 

General features. The general characteristics 
of aggregate econometric models are described in 
the following sections. Then a very simple example 
is given, and contemporary models are discussed. 

Definitional equations (identities). In any ag- 
gregate model some of the equations are definitions 
(usually called identities), of the type arising in 
national accounting; they are supposed to hold 
exactly and contain no unknown parameters, Ex- 
amples are, “Consumption plus net investment 
plus government purchases plus exports minus 
imports equals net national product,” and “Total 
money wage bill equals average money wage rate 
times quantity of labor input.” 

Stochastic equations. The remaining equations 
are stochastic. They are supposed to hold only 
approximately, and they contain disturbances that 
are assumed to be unobservable, small, and ran- 
dom, with expected values of zero. An example is, 
“Consumption during any period equals a constant 
proportion of that period’s disposable income plus 
a constant proportion of the preceding period's 
consumption plus a third constant plus a random 
disturbance.” In some models the disturbances take 
the form of random errors in the measurement of 
the variables, The assumption of random disturb- 
ances is very convenient for statistical estimation 
of the values of the unknown constants, called 
parameters. It is sometimes justifiable even if the 
disturbances have systematic components; for if 
those components are small and numerous and 
independent of each other, their total effect be- 
haves approximately as if it were random. In 
formulating a model of this kind, one hopes to 
include explicitly in each equation all the impor- 
tant systematic influences that are present, so that 
the disturbances will be small and at least ap- 
proximately random. 

Structural equations, Some of the stochastic 
equations describe the behavior of a group in the 
economy, such as consumers (as in the foregoing 
example), investors in real capital goods, etc. Some 
describe institutional or technological restraints, 
such as the tax laws or the so-called production 
function, which indicates the maximum output 
that can be produced with any given quantities of 
inputs. Some describe adjustment processes that 


take place in particular markets (e.g., for labor 
or goods) when there is excess demand or supply. 
(A special case of an adjustment equation is 
an equilibrium condition asserting that demand 
equals supply.) These four types of equations 
(definitional, behavior, restraint, and adjustment) 
are called structural equations, for each is sup- 
posed to describe some more or less well-defined 
part of the structure of the economy. 

Types of variables. In addition to constant pa- 
rameters and unobservable random disturbances, 
the equations contain observable variables, usually 
more than there are equations in the model. Some 
of the variables are supposed to be determined by 
forces completely outside the model, and their 
values are assumed to be given; these are called 
exogenous. Variables often regarded as exoge- 
nous are government policy variables, population, 
foreign countries’ actions, etc. The other variables, 
whose values are determined by the system when 
parameters, disturbances, and exogenous variables 
are given, are called endogenous. Typically, in a 
complete model there are just as many equations 
as endogenous variables. In many cases the equa- 
tions for a given period will contain both current 
and lagged (i.e., past) values of the endogenous 
variables. The current endogenous variables are 
known as jointly dependent variables. The exoge- 
nous variables and lagged endogenous variables to- 
gether are known as predetermined variables, for 
their values are determined as of any time period 
(either outside the system or by the past operation 
of the system) when the system goes to work to 
determine the jointly dependent variables for that 
time period. 

The reduced form—forecasting, Suppose that 
the system of structural equations is solved for the 
jointly dependent variables, each being expressed 
as a function of structural-equation parameters, 
predetermined variables, and disturbances. The 
result is called the reduced form of the model. It 
could be used to forecast future values of the 
jointly dependent variables if its parameters and 
the future values of disturbances and predeter- 
mined variables were known in advance. In prac- 
tice these are unknown, so that parameters must 
be estimated, future disturbances must be approxi- 
mated by estimates of their expected values (zero 
is often used for these estimates, since the dis- 
turbances are assumed to have zero expected 
values), and future values of predetermined vari- 
ables must be assumed. Thus, forecasts made from 
the reduced form are necessarily approximate. 

When the exogenous and lagged endogenous 
variables are taken as given, reduced-form fore- 
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casts based on them are said to be conditional upon 
the values of the exogenous and lagged endoge- 
nous variables. For example, a model might forecast 
that if tax rates are cut 10 per cent at the end of 
this year and other predetermined variables are 
unchanged, then national income next year will 
be 7 per cent higher than this year; whereas if 
tax rates are not changed, other things being the 
same, then national income next year will be only 
4 per cent higher than this year. When the un- 
known future values of predetermined variables 
are forecast in some way (for exogenous variables 
this involves using information from outside the 
model), reduced-form forecasts of jointly depend- 
ent variables are said to be unconditional. For 
example, one might forecast that tax rates will 
be cut by 10 per cent at the end of this year, and 
then use a model to forecast that next year’s na- 
tional income will be 7 per cent higher than this 
year's. 

Dynamic features. If a model contains lagged 
endogenous variables, it has a dynamic character, 
for its jointly dependent variables are affected not 
only by parameters, disturbances, and exogenous 
variables but also by the past history of the system. 
Simple systems containing lagged values or year- 
to-year changes of endogenous variables can gen- 
erate cycles and/or long-term growth or decline, 
even with no changes in parameters, disturbances, 
or exogenous variables. There are other devices for 
introducing dynamic effects, e.g., time-trend vari- 
ables, derivatives with respect to time, and cumu- 
lative variables such as capital stock that is the 
sum of past net investments. [See TIME SERIES.] 

Linearity versus nonlinearity. If the structural 
equations are linear in the jointly dependent vari- 
ables and if the matrix of parameters of those 
variables is nonsingular, then the solution (the 
reduced form) is linear in the jointly dependent 
variables and is unique. If the structural equations 
contain nonlinearities in the jointly dependent vari- 
ables, their solution will be nonlinear and may fail 
to be unique. In that case, one may use additional 
information to rule out the spurious solutions and 
find the one that represents the behavior of the 
economy (for example, any solution giving a nega- 
tive national income would be spurious). Alterna- 
tively, a nonlinear model may be approximated by 
a linear one, with results that are acceptable as 
long as the range of variation of the variables being 
studied is small relative to their extreme values 
(this is likely to be so over short periods, but not 
over long periods). A model thus linearized has a 

linear reduced form. While models nonlinear in 
variables are fairly common, almost all models are 


built to be linear in unknown parameters, because 
that makes estimation of the parameters vastly 
simpler. 

Model building and estimation. Numerical es- 
timates of the unknown and supposedly constant 
parameters of reduced-form or structural equations 
are obtained by statistically fitting the equations 
to past data for the jointly dependent and prede- 
termined variables, provided that the parameters 
have the property of identifiability [see SIMULTA- 
NEOUS EQUATION ESTIMATION; STATISTICAL IDEN- 
TIFIABILITY]. 

In this estimation process, one uses specifica- 
tions indicating what variables appear in the 
model; which are endogenous and which are ex- 
ogenous; what lags appear, if any; which variables 
appear in each of the structural equations; what 
the mathematical form of each structural equation 
is; and what properties the probability distribution 
of the random disturbances is assumed to have. 
In principle, these specifications are supposed to 
represent the model builder's prior knowledge of 
the economy—prior in the sense of arising from 
sources other than the observed data that are to 
be used in estimating the parameters. In practice, 
the model builder is often uncertain about some 
of the specifications of the model, and so he may 
try fitting several differently specified theoretically 
plausible models to the data and then choose the 
one that offers the best combination of (a) good- 
ness of fit to the data and (b) consistency with 
any knowledge that may not have been incorpo- 
rated into the formal specifications of the models. 
Such knowledge may come from economic theory, 
cross-section studies, results obtained for other 
countries or time periods, or other sources. Re- 
cently developed methods of Bayesian inference 
are suitable for incorporating probabilistic prior 
knowledge into the estimation process [see BAYES- 
IAN INFERENCE]. 

Forecasting and model testing in practice. Con- 
ditional forecasts can be expected to be quite ac- 
curate if (a) the specification of the model is sub- 
stantially correct for both the sample period and 
the forecast period; (b) the data sample and esti- 
mation technique used are such as to give ap- 
proximately correct estimates of the reduced-form 
parameters under assumption (a); and (c) @ 
highly accurate explanation of the sample-period 
jointly dependent variables is provided by the esti- 
mated reduced-form equations, in the sense that 
when the observed values of the predetermined 
variables are substituted into the reduced form to 
get calculated sample-period values of the jointly 
dependent variables, the calculated values are close 
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to the observed values. Conditional forecasts can 
be expected to have substantial errors if the fore- 
going conditions are not met. 

Condition (c), above, can readily be tested by 
substituting sample-period data into the estimated 
reduced form. Condition (b) can be tested in a 
probabilistic sense, with the aid of statistical 
inference techniques that reveal the degree of 
confidence one should have in the estimated pa- 
rameters, on the assumption that the model is 
correctly specified. Condition (a), the correct spec- 
ification of the model, is more difficult to test. 
Economic theory provides some information con- 
cerning the adequacy of a model's specifications, 
but the most powerful test is the indirect one based 
on the quality of forecasts that a model makes 
when conditions (b) and (c) are reasonably well 
satisfied. Individual structural equations can also 
be usefully tested by similar procedures, although 
the typical structural equation contains more than 
one jointly dependent variable and hence is not 
capable of making forecasts in the same sense as 
is the reduced form. 

Unconditional forecasts, of course, can go wrong 
if the three foregoing conditions are not met, and 
also if the values of the predetermined variables 
used in the forecasts are not substantially correct. 

A simple example. A very simple three-equa- 
tion model will illustrate many of the foregoing 
points. Equation (1), below, is the accounting 
definition mentioned earlier: net national product 
(NNP) equals consumption plus net private do- 
mestic investment plus government purchases plus 
exports minus imports. Equation (2), below, is the 
consumer behavior equation mentioned earlier, 
specifying that consumption is a linear function 
of disposable income and lagged consumption plus 
a random disturbance. A third equation is needed 
to relate disposable income to NNP. Assume an 
economy in which (a) the whole of government 
revenue is raised by an income tax whose yield is 
a linear function of NNP; (b) there are no transfer 
Payments; and (c) all business income is paid 
Out to individuals, so that disposable income is also 
a linear function of NNP. Equation (3), below, 
expresses this. The three structural equations of 
this model are as follows (the notation and units 
are explained below: 


(1) y=c+it+g, 

(2) c =a + Bd + yc + u = 5.7 + 69d + 2501, 
(3) d=y(1—m)—h. 

This model specifies further that there are three 
endogenous variables (c= consumption, d = dis- 
Posable income, and y = NNP); and that there are 


four exogenous variables (i = net private domestic 
investment, g = government purchases plus ex- 
ports less imports, h = the fixed part of tax reve- 
nues independent of NNP, and m = the marginal 
tax rate on NNP). Lagged consumption is denoted 
by c: or in brief by c_,; a, 8, and y are three un- 
known parameters; and u is a random disturbance 
with zero mean. All these quantities are expressed 
in billions of real (i.e., deflated) dollars per year, 
except for m, 8, and y, which are pure numbers 
between 0 and 1. 

The approximate numerical estimate of the con- 
sumption equation (2), above, was obtained by the 
two-stage least squares estimation method, from 
United States data (expressed in billions of 1954 
dollars per year) for the years 1929-1941 and 
1946-1959. The ratios of the three estimated pa- 
rameters to their estimated standard errors are 
respectively 1.8, 12, and 3.5; and the standard 
error of estimate is 2.7 billion 1954 dollars per year 
(as compared with the maximum and minimum 
observed consumption values of 288.9 and 103.5 
billion respectively ). 

The tax variable h may be negative if the in- 
come tax allows for a fixed total exemption. If 
equation (3) were to be applied to the United 
States economy, it would have to include a dis- 
turbance, for the United States tax structure is 
only very roughly described by a linear function of 
NNP. 

The reduced form of this model is obtained by 
solving the three structural equations for y, d, and 
c, thus: 
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Note that the model and its reduced form are 
linear in endogenous variables, but not in all vari- 
ables because of the term containing ym in equa- 
tion (3). If in the reduced form one substitutes 
estimated values for the three structural parame- 
ters, zero for the disturbance u, and numerical 
values for the five predetermined variables for a 
certain year, one obtains estimates of the expected 
values of the three jointly dependent variables y, 
d, and c for that year, conditional on the chosen 
values of the predetermined variables. 
Medium-scale models. Most of the medium- 
scale aggregate econometric models published so 
far have from 14 to 48 equations and accordingly 
are much more detailed and complex than the 
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simple example just given, although, of course, 
they still involve great simplifications of reality. 
These models differ from each other in significant 
ways, but the following features are typical of most 
of the medium-scale models listed in the bibliog- 
raphy. 

There is an identity, substantially like equation 
(1), above, stating that national product equals 
consumption plus investment plus government 
purchases plus imports minus exports. Consump- 
tion, private investment, and imports (and in some 
cases exports) are endogenous variables, and there 
are behavior equations to explain them or their 
components, Consumption is sometimes divided 
into parts, with one equation for each—such as 
consumer durable goods, nondurable goods, and 
services—and is explained in terms of variables 
such as disposable income, past consumption, 
liquid asset holdings, consumer credit conditions, 
and income distribution. Investment is commonly 
separated into plant and equipment purchases and 
inventory investment, and in some models resi- 
dential construction is treated in one or more sepa- 
rate equations. Plant and equipment investment is 
explained in terms of accelerator variables, such 
as output and capital stock, or profits or both. In- 
ventory investment is usually specified to depend 
on lagged inventory holdings, sales or output, and 
other variables. Most import functions depend on 
income and exogenous import prices, If exports are 
not exogenous, they are a function of exogenous 
variables such as world income and world prices. 
Government purchases are regarded as exogenous. 
[See CONSUMPTION FUNCTION; INVENTORIES; IN- 
VESTMENT, article on THE AGGREGATE INVESTMENT 
FUNCTION. ] 

In the early models tax revenues and govern- 
ment transfer payments (such as social security 
benefits, unemployment compensation, and in- 
terest on the national debt) were specified as exog- 
enous, but in more recent models the tax and 
transfer schedules are specified as exogenous and 
equations are provided to explain tax and transfer 
payments as endogenous variables depending upon 
national income, the number of retired persons, 
unemployment, the national debt, and so on, ac- 
cording to the exogenous schedules. 

There is a production function, to explain the 
output of goods and services in terms of inputs 
of labor and capital. There is a demand-for-labor 
equation, in many cases expressing total real labor 
income in terms of total real output. There is an 
identity expressing property income as the differ- 
ence between total income and labor income, In 
many cases there is an equation explaining the 


allocation of property income between retained in- 
come (which is not part of disposable income) and 
payments to individuals (including interest and 
dividends). 

There is a wage-rate adjustment equation, com- 
monly expressing the change in the money wage 
rate in terms of the unemployment rate and the 
rate of inflation. (But see Netherlands 1961 for a 
model in which the wage rate is exogenous, since 
it has been a policy variable in the Netherlands 
in recent years.) Unemployment of course is the 
difference between the labor force, typically ex- 
ogenous, and employment. - 

The general price level is endogenous in most of 
these models; and in some cases other prices 
appear that, when endogenous, are usually ex- 
pressed as functions of the general price level. 
Typically, real output and the price level can be 
thought of as determined by a pair of equations 
containing only these two variables plus predeter- 
mined variables, the equations being obtained by 
substituting into two important identities all the 
other equations of the model. One of these identi- 
ties is that equating output to the sum of all ex- 
penditure components; the other expresses the 
total real wage bill as employment multiplied by 
the real wage rate. 

Among the more commonly used exogenous 
policy variables are government purchases, govern- 
ment transfer payments, and tax revenues or tax 
rates. In some models there are few or no variables 
or equations describing interest rates and the sup- 
ply and demand for money, and where they do 
appear, they are commonly rather loosely tied to 
the rest of the model. Thus, most models built so 
far are much better suited to an analysis of effects 
of fiscal policy (government purchases, taxes, and 
transfer payments) than the effects of monetary 
policy. 

Among the more commonly used exogenous non- 
policy variables are population and its distribution 
(and sometimes labor force), import prices, €x- 
ports (or world income and world prices), and 
time. In certain more recent models there are vari- 
ables measuring attitudes or anticipations, ob- 
tained from surveys. 

Behavior equations, in most cases, contain vari- 
ables in real (deflated) terms rather than in money 
terms, to reflect the theoretical postulate that real 
economic behavior depends upon real tastes and 
real opportunities, unaffected by price changes that 
leave these things the same. Some consumer be- 
havior equations are stated in per capita terms, 
to allow for the possibility that an increase 
aggregate income may have different effects, 


in 
de- 


ECONOMETRIC MODELS, AGGREGATE 349 


pending upon how it is distributed between in- 
creases in population and in per capita income; 
but most behavior equations are stated in aggre- 
gate terms. 

Nearly all models contain some nonlinearities, 
especially where identities of the form “value 
equals price times quantity” are involved; but non- 
linearities in unknown parameters are rare in sto- 
chastic equations whose parameters have to be 
estimated. Tinbergen’s models (1939, 1951) and 
the model of the Netherlands (1961) have had all 
their nonlinear equations linearized. 

First differences (i.e., year-to-year changes) of 
the data are used occasionally (e.g., Suits 1962), 
but most models use ordinary data, without this 
transformation. Early models used annual data, but 
quarterly models are becoming more common as 
quarterly data become available. Time trends, 
lags, and cumulated variables are the main de- 
vices used for dynamic effects, although occasion- 
ally a ratchet-type variable is used for this purpose. 
An example of a ratchet-type variable is the 
value of disposable income at its previous peak, 
which is sometimes included in the consumption 
function. 

Parameters of structural equations are estimated 
by a variety of methods. Least squares is common, 
in spite of its asymptotic bias in a simultaneous 
equations context. With the advent of electronic 
computers, consistent estimating methods have be- 
come cheap and are increasingly used, especially 
the limited-information and the two-stage least 
Squares methods [see SIMULTANEOUS EQUATION 
ESTIMATION]. 

Large-scale models. Two large-scale models 
have appeared. One, dealing with the United States 
and having 219 equations, is sponsored by the 
Social Science Research Council (SSRC) and the 
Brookings Institution (Duesenberry et al. 1965). 
The other, dealing with Japan and having 164 
equations, is the work of a group at Osaka Uni- 
versity (Ichimura et al. 1964). In essential con- 
ception these models are similar to the medium- 
Scale models discussed above; they innovate chiefly 
In providing a more detailed treatment of certain 
Markets and sectors of the economy. The SSRC— 
Brookings model goes into detail particularly re- 
garding consumption, housing, fixed and inventory 
Investment, new orders, six nonagricultural pro- 
duction sectors, agriculture, government, and 
Population and labor force. The Osaka model is 
Particularly detailed regarding fixed and inventory 
investment, eight production sectors (including 
agriculture), foreign trade, and the monetary and 
financial sector, The SSRC-Brookings model con- 


tains a seven-by-seven input-output model corre- 
sponding to its seven production sectors, and plans 
are under way for a more detailed model to contain 
about 32 production sectors [see INPUT-OUTPUT 
ANALYSIS]. 

Application and evaluation. Econometric model 
forecasts and their comparison with subsequent 
actual events have been all too uncommon but are 
becoming accepted as one important means of 
evaluating models. Some recent results have been 
quite good. The Klein-Goldberger model (1955) 
and its successors under Suits (1962) have been 
used each November since 1952 to make annual 
unconditional forecasts of real United States gross 
national product (GNP) for the following year. 
The percentage errors for the years 1953 to 1962 
have been as follows (where, for example, —0.7 
means that the model's forecast of GNP was too 
low by 0.7 per cent of the subsequently observed 
value, +0.5 means that it was 0.5 per cent too 
high, etc.): —0.7, +0.5, —6.4, +0,2, +0.5, +0.05, 
—4.0, —1.6, +0.7, —0.1. 

Such models may also be used for simulation 
studies of the economy’s stability and long-term 
behavior, as in Adelman and Adelman (1959), or 
of its reaction to policy changes. Simulation studies, 
as well as forecasts, acquire practical value to the 
extent that the models used can be shown to be 
accurate representations of the relevant aspects 
of economic behavior and not merely systems of 
equations that fit past data well. [See SIMULATION, 
article on ECONOMIC PROCESSES. | 

CARL F. CHRIST 


[Directly related are the entries AGGREGATION; BUSI- 
NESS CYCLES, article on MATHEMATICAL MODELS; 
PREDICTION AND FORECASTING, ECONOMIC; INCOME 
AND EMPLOYMENT THEORY.] 
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ECONOMETRICS 


Succinctly defined, econometrics is the study of 
economic theory in its relations to statistics and 
mathematics. The essential premise is that eco- 
nomic theory lends itself to mathematical formula- 
tion, usually as a system of relationships which 
may include random variables, Economic observa- 
tions are generally regarded as a sample drawn 
from a universe described by the theory. Using 
these observations and the methods of statistical 
inference, the econometrician tries to estimate the 
relationships that constitute the theory. Next, these 
estimates may be assessed in terms of their sta- 
tistical properties and their capacity to predict 
further observations. The quality of the estimates 
and the nature of the prediction errors may in turn 
feed back into a revision of the very theory by 
which the observations were organized and on the 
basis of which the numerical characteristics of the 
universe postulated were inferred. Thus, there is a 
reciprocating relationship between the formulation 
of theory and empirical estimation and testing. The 
salient feature is the explicit use of mathematics 
and statistical inference. Nonmathematical theo- 
rizing and purely descriptive statistics are not part 
of econometrics. d 

The union of economic theory, mathematics, an! 
statistics has been more an aspiration of the econo- 
metrician than a daily achievement. Much of what 


is commonly known as econometrics is mathe- 
matical economic theory that stops short of em- 
pirical work; and some of what is known as econo- 
metrics is the statistical estimation of ad hoc 
relationships that have only a frail basis in eco- 
nomic theory. That achievement falls short of as- 
piration, however, ought not to be discouraging. It 
is part of the developmental process of science that 
theories may be advanced untested and that the 
search for empirical regularities may precede the 
systematic development of a theoretical framework, 
A consequence of this, however, is that although 
the word “econometrics” clearly implies measure- 
ment, much abstract mathematical theorizing that 
may or may not ultimately lend itself to empirical 
validation is often referred to as part of econo- 
metrics. The meaning of the word has frequently 
been stretched to apply to mathematical economics 
as well as statistical economics; and in common 
parlance the “econometrician” is the economist 
skilled and interested in the application of mathe- 
matics, be it mathematical statistics or not. In this 
article I shall accept this extended definition and 
consider both econometrics in its narrow sense and 
mathematical economic theory. 


A brief history 


The use of mathematics and statistics in eco- 
nomics is not of recent origin. In the latter part of 
the seventeenth century Sir William Petty wrote 
his essays on “political arithmetik” [see the biog- 
raphy of Perry]. This fledgling work, remarkable 
for its time, was econometric in its methodological 
framework, even from the modern point of view. 
Despite the fact that it was not referred to by Adam 
Smith, it had a discernible influence on later 
writers, In 1711 Giovanni Ceva, an Italian engi- 
neer, urged the adoption of the mathematical 
method in economic theory. Although many statis- 
tical studies appeared during the intervening years, 
the revolutionary impact of the mathematical 
method did not occur until the latter part of the 
nineteenth century. More than any other man, 
Léon Walras, professor at the University of Lau- 
sanne, is acknowledged to be the originator of 
general equilibrium economics, which is the basic 
framework of modern mathematical economics 
[see the biography of Warras]. His work, removed 
from any immediate statistical application, devel- 
oped a comprehensive system of relationships be- 
tween economic variables, including money, in 
order to explain the mutual determination of prices 
and quantities of commodities and capital goods 
produced and exchanged. Walras conceived of the 
economy as operating along the lines of classical 


ECONOMETRICS 351 


mechanics, the state of the economy being de- 
termined by a balancing of forces between all mar- 
ket participants. His general equilibrium system 
was, however, essentially static because the values 
of the economic variables did not themselves de- 
termine their own time rates of change. For that 
reason the term “equilibrium” is something of a 
misnomer because, since Walras’ general system 
was not explicitly dynamic, its solution cannot be 
described as an equilibrium state. Nevertheless, as 
is still true in much of economic theorizing, there 
were side discussions of the adjustment properties 
of the economy, and so, in a wider context, the 
solution can be regarded as the result of a bal- 
ancing of dynamic forces of adjustment, 

The significant combination of mathematical 
theory and statistical estimation first occurred in 
the work of Henry Luddell Moore, a professor at 
Columbia University during the early part of the 
twentieth century [see the biography of Moore, 
Henry L.]. Moore did genuine econometric work 
on business cycles, on the determination of wage 
rates, and on the demand for certain commodities. 
His major publication, culminating some three 
decades of labor, was Synthetic Economics, which 
appeared in 1929, Incredibly, this work, of such 
seminal importance for the later development of a 
significant area of social science, sold only 873 
copies (Stigler 1962). 

Econometrics came to acquire its identity as a 
distinct approach to the study of economics during 
the 1920s, The number of persons dedicated to this 
infant field grew steadily, and, on December 29, 
1930, they established an international association 
called the Econometric Society. This was achieved 
in large measure through the energy and persist- 
ence of Ragnar Frisch of the University of Oslo, 
with the assistance and support of the distin- 
guished American economist, Irving Fisher, a pro- 
fessor at Yale University [see the biography of 
Fisner, Irvine]. To call this small minority of 
economists a cult would impute to them too paro- 
chial and evangelical a view; nevertheless, they 
had a sense of mission “to promote studies that aim 
at a unification of the theoretical-quantitative and 
the empirical-quantitative approach to economic 
problems and that are penetrated by constructive 
and rigorous thinking similar to that which has 
come to dominate in the natural sciences” (Frisch 
1933). 

Their insights and ambitions were well founded. 
During the following years and through many a 
methodological controversy about the role of mathe- 
matics in economics (a topic now rather passé) 
their numbers grew and their influence within the 
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wider profession of economics was steadily extend- 
ed. Today all major university departments of eco- 
nomics in the Western world, including most re- 
cently those in the Soviet-bloc countries, offer 
work in econometrics, and many place considerable 
stress upon it. Specific courses in econometrics 
have been introduced even at the undergraduate 
level; textbooks have been written; the younger 
generation of economists entering graduate schools 
arrive with improved training in mathematics 
and statistical methods, gravitate in what appear 
to be increasing proportions toward specialization 
in econometrics, and soon excel their teachers in 
their command of econometric techniques. Mem- 
bership in the Econometric Society increased from 
163 in 1931 to over 2,500 in 1966. The society's 
journal, Econometrica, has virtually doubled in 
size over these years, and nearly all other scholarly 
journals in economics publish a regular fare of 
articles whose mathematical and statistical sophis- 
tication would have dazzled the movement's found- 
ers in the 1920s and 1930s. 

Areas of application of econometrics within eco- 
nomics have been steadily widened. There is now 
scarcely a field of applied economics into which 
mathematical and statistical theory has not pene- 
trated, including economic history. With the in- 
creasing interest and concentration in econometrics 
on the part of the economics profession, the very 
notion of specialization has become blurred. With 
its success as a major intellectual movement with- 
in economics, econometrics is losing its identity 
and is disappearing as a special branch of the disci- 
pline, becoming now nearly conterminous with the 
entire field of economics, These remarks must not 
be misunderstood, however. There remain many 
problems and much research in economics that is 
neither mathematical nor statistical, and although 
the modern economist’s general level of training 
and interest in mathematics and statistics far ex- 
ceeds that of his predecessors, a quite proper grada- 
tion of these skills and interests inevitably con- 
tinues to exist. Moreover, to repeat, much of what 
is known as econometrics still falls short of the 
interrelating of the mathematical-theoretical and 
the statistical, which is the aspiration contained in 
the field’s definition, 


A survey of econometrics 


Since econometrics is no longer a small enclave 
within economics, a survey of its subject matter 
must cover much of economics itself. 

General equilibrium. Pursuing Walras’ con- 
ception of a general economic equilibrium, mathe- 
matical economists have in recent years been en- 


gaged in a far more thorough analysis of the 
problem than Walras offered [see Economic EQUI- 
LIBRIUM]. In the earlier work a general economic 
equilibrium was described by a system of equalities 
involving an indefinitely large number of economic 
variables, but a number equal to the number of 
independent equations. It was presumed that a sys- 
tem of simultaneous equations with the same num- 
ber of unknowns as independent equations would 
have an “equilibrium” solution. This is loose mathe- 
matics, and in recent times economic theorists have 
been concerned to redevelop the earlier theory with 
greater rigor. Equality of equations and unknowns 
is neither a necessary nor a sufficient condition for 
either the existence or the uniqueness of a solution, 
Consequently, one cannot be sure that the early 
theory is adequate to explain the general equi- 
librium state to which the economy is postulated to 
converge. This might be because the theory does 
not impose conditions necessary to assure the ex- 
istence of a general equilibrium state or because 
the theory might be indeterminate in that several 
different solutions are implied by it. The modern 
equilibrium theorist has therefore tried to nail 
down the necessary and sufficient conditions for 
the existence and uniqueness of the general eco- 
nomic equilibrium. 

The concept of an equilibrium is that of a state 
in which no forces within the model, operating over 
time, tend to unbalance the system. Even if such a 
state can be demonstrated to exist within the 
framework of some general equilibrium model, 
there remains the question of whether it is stable 
or unstable, that is, whether, for any departure of 
the system from it, forces tend to restore the origi- 
nal equilibrium or to move the system further 
away. An analysis of these questions, which are 
rather more involved than suggested here, requires 
the explicit introduction of dynamical adjustment 
relationships. 

Questions of the existence, uniqueness, and sta- 
bility of an equilibrium are, in the present context, 
not questions about the actual economy, but ques- 
tions regarding the properties of a theoretical model 
asserted to describe an actual economy. In this 
sense, their examination is oriented toward an im- 
proved understanding of the implications of alter- 
nate specifications of the theory itself rather ae 
toward an improved empirical understanding © 
how our economy works. y $ 

Most of this work, moreover, has been restricted 
to an examination of the general equilibrium model 
of a competitive economy, which is a special case 
indeed. It is a case of particular interest, however, 
because, under idealized assumptions, welfare 


economists have imputed features to a competitive 
equilibrium that satisfy criteria which are regarded 
as interesting for a social evaluation of economic 
performance [see WELFARE ECONOMICS]. Accord- 
ing to a concept of Pareto’s, a state of the economy 
(not to be thought of as unique) is said to be 
optimal if there is no other state that is techno- 
logically feasible in which some individual would 
be in a position he prefers while no individual 
would be in a position that he finds worse [see the 
biography of PARETO]. The conditions under which 
a general economic equilibrium would be optimal 
in this sense have therefore been subject to rigid 
scrutiny. Thus, Pareto welfare economics is in- 
timately involved in the modern examination of 
general equilibrium systems, but it is not well de- 
veloped as an empirical study. 

The positive economist, concerned with predic- 
tion, has also been concerned with general equi- 
librium systems in principle, but from a different 
point of view. His central question is, How does a 
change in an economic parameter (a coefficient or 
perhaps the value of some autonomous variable not 
itself determined by the system) induce a change 
in the equilibrium value of one or more other varia- 
bles that are determined by the system? In short, 
How does the equilibrium solution depend upon the 
parameters? This is a problem in comparative 
statics which contrasts two different equilibria de- 
fined by a difference in the values of one or more 
parameters. 

Comparative statics—partial equilibrium. It is 
in the problem of comparative statics—the com- 
parison of alternative equilibrium states—that we 
can most ably distinguish between general equi- 
librium economics and partial equilibrium eco- 
nomics, a familiar contrast in the literature. 

Suppose that, in the neighborhood of an equi- 
librium, a general system of simultaneous eco- 
nomic relationships is differentiated totally with 
respect to the change in a particular parameter, so 
that all direct and indirect effects of that change 
are accounted for, One might then hope to ascer- 
tain the direction of change of a particular eco- 
nomic variable with respect to that parameter. For 
example, if a certain tax rate is increased or if con- 
sumer preferences shift in favor of a particular 
commodity, will the quantity demanded of some 
other commodity increase, diminish, or stay the 
same? This question can sometimes be answered 
on the basis of the constellation of signs (plus, 
minus, zero) of many or all of the partial deriva- 
tives of the functions constituting the system (as- 
suming here that they are continuously differen- 
tiable). Theoretical considerations or common 
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sense may enable one to specify a priori the signs 
of these partial derivatives, for example, to assert 
that an elasticity of demand is negative or that a 
cross elasticity of demand is positive. In some 
cases, however, the theorist is not comfortable in 
making such assertions about a derivative, and 
hence some signs may be left unspecified. The 
question is whether the restrictions that the theorist 
is willing to impose a priori suffice to determine 
whether the total derivative of the economic varia- 
ble of interest wtih respect to a given parameter is 
positive, negative, or zero. The formal considera- 
tion of the necessary and sufficient restrictions 
needed to resolve this question unambiguously con- 
stitutes the study of qualitative economics and pre- 
sents a mathematical problem in its own right 
(Samuelson 1947; Lancaster 1965), In some situa- 
tions it may be critical to know not only the signs 
of various partial derivatives but also their relative 
algebraic magnitudes. This points to the need for 
the statistical estimation of these derivatives, a 
task belonging to econometrics in its most narrow 
meaning, 

At times it is also useful to know that certain 
derivatives are sufficiently close to zero that if they 
are assumed equal to zero the conclusion about the 
sign of the total derivative being investigated would 
not be affected. The trick or art of deciding when 
to regard certain partial derivatives as zero, that is, 
of deciding that certain economic variables do not 
enter in any significant way into certain relation- 
ships, is the essence of partial equilibrium analysis, 
so called because it tends to isolate a portion of the 
general system from other portions that have little 
interaction with it. Partial equilibrium analysis is, 
thus, a special case of general equilibrium analy- 
sis, in which more daring a priori restrictions have 
been introduced with the object of deducing more 
specific and meaningful results in comparative 
statics. Just as general equilibrium economics has 
been commonly associated with the name of 
Walras, so partial equilibrium economics has been 
associated with the work of Alfred Marshall [see 
the biography of MARSHALL]. 

In qualitative economics some light is shed upon 
the signs of the partial derivatives of the system by 
considering the dynamic stability of the model. 
With assumptions about the nature of the dynamic 
adjustment relationships, correspondences might 
be found between the conditions necessary for an 
equilibrium to be stable and the signs of the partial 
derivatives, Thus, just as stability depends upon 
assumptions about whether different variables 
enter a given relationship positively or negatively, 
so, also, the way in which those variables enter a 
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given relationship may sometimes be inferred from 
the assumption that an equilibrium is stable. This 
is the famous correspondence principle, due to 
Samuelson. [See STATICS AND DYNAMICS IN ECO- 
NOMICS.] 

Spatial models. Most general equilibrium models 
have conceived of the economy as existing at a 
single point in space, thereby ignoring transporta- 
tion costs, the regional specialization of resources, 
and locational preferences. Some studies, however, 
explicitly introduce the spatial dimension in which 
a general equilibrium occurs. This provides a frame- 
work for the study of interregional location, special- 
ization, and interdependency in exchange. [See 
SPATIAL ECONOMICS, article on THE GENERAL EQUI- 
LIBRIUM APPROACH.] These models, because of 
their greater complexity, generally involve more- 
special assumptions, such as linearity of relation- 
ships and the absence of opportunities for substitu- 
tion among factor services in production. They 
have also, however, lent themselves more directly 
to empirical work. 

In the application of partial equilibrium analysis 
to problems of spatial economics, it is assumed, 
moreover, that the locations of certain economic 
activities are determined independently of the loca- 
tion decisions regarding other economic activities, 
and therefore the former can be regarded as fixed 
in the analysis of the latter, [For a discussion of 
this line of inquiry, see SPATIAL ECONOMICS, article 
OM THE PARTIAL EQUILIBRIUM APPROACH. ] 

Aggregation and aggregative models. Since gen- 
eral equilibrium systems are conceived of as em- 
bracing millions of individual relationships, they 
obviously do not lend themselves to quantitative 
estimation, Much interest, therefore, inheres in re- 
ducing the dimensionality of the system, so that 
there is some possibility of econometric estimation. 
This means that relations of a common type, such 
as those describing the behavior of firms in a given 
industry or households of a certain character, need 
to be aggregated into a single relationship describ- 
ing the behavior of a collectivity of comparable 
economic agents. The conditions needed to make 
such aggregation possible and the methods to be 
used are still in a rather preliminary stage of ex- 
ploration. But a literature is developing on this 
subject. [See AGGREGATION.] 

An older problem is simply that of aggregating 
into a single variable a multiplicity of similar varia- 
bles. This is the familiar problem of “index num- 
bers”—for example, how best to represent the 
prices of a great variety of different commodities 
by a single price index. The index number prob- 

lem, therefore, has its theoretical aspects [see 


INDEX NUMBERS, article on THEORETICAL ASPECTS] 
as well as its statistical aspects [see INDEX NUM- 
BERS, articles on PRACTICAL APPLICATIONS and SAM- 
PLING]. The theory has been useful in guiding the 
interpretation of alternative statistical formulas, 
The major efforts in the empirical study of gen- 
eral equilibrium systems that have to some limited 
degree been aggregated come under the heading 
input-output analysis. This approach, originated 
by Wassily Leontief in the late 1930s, consists es- 
sentially in considering the economy as a system 
of simultaneous linear relationships and regarding 
as constant the relative magnitudes of the inputs 
into a production process that are necessary to pro- 
duce the process’s output. These inputs may, of 
course, be the outputs of other processes, Thus, 
with fixity of coefficients relating the inputs and 
outputs of an integrated production structure, it is 
possible to determine what “bill of goods” can be 
produced, given an itemization of the quantities of 
various “primary” nonproduced inputs that are 
available. Alternatively, the quantities of primary 
inputs necessary to produce a given bill of goods 
can also be determined. The coefficients of such a 
system can be estimated by observing the ratios of 
inputs to outputs for various processes in a given 
year or by averaging these ratios over a sequence 
of years or by using engineering estimates. This 
may be done for an economy divided into a large 
number of different sectors (a hundred or more), 
or it may be done for portions of an economy, such 
as a metropolitan area. Moreover, the sectoring of 
the economy may be by regions as well as by in- 
dustries, and the former makes the method ap- 
plicable to the study of interregional or interna- 
tional trading relations, A great deal of empirical 
research has been done on input-output models, 
tables of coefficients having now been developed 
for over forty countries, The quantitative analysis 
of the workings of these models has, as one may 
readily surmise, required the availability of large- 
scale computers. [See INPUT-OUTPUT ANALYSIS.] 
Aggregative models in economics may be of 
either the partial or the general equilibrium type: 
Those of a partial equilibrium type deal with a 
single sector of the economy in isolation, under the 
assumption that the external economic variables 
that have an important impact on that sector are 
not in turn influenced by its behavior. Thus, for 
example, a market model of demand and a 
for a particular commodity may regard the tot: 
income of consumers and its distribution as deter- 
mined independently of the price and output of the 
particular commodity being studied. Yet, the ian 
ket demand and supply functions are aggregates © 


the demand and supply functions of many individ- 
uals and firms. Aggregative models of the general 
equilibrium type may explain the mutual determi- 
nation of many major economic variables that are 
aggregates of vast numbers of individual variables. 
Examples of the aggregate variables are total em- 
ployment, total imports, total inventory investment, 
etc. These models are generally called macroeco- 
nomic models, in contrast to microeconomic models, 
which deal, in a partial equilibrium sense, with the 
individual household, firm, trade union, etc. Many 
macroeconomic models treat not only so-called real 
variables, which are physical stocks and flows of 
goods and productive services, but also monetary 
variables, such as price levels, the quantity of 
money, the value of total output, and the interest 
rate, Models of this sort have been especially com- 
mon since 1936, having been stimulated by John 
Maynard Keynes's General Theory of Employment, 
Interest and Money and by the literature that de- 
volved therefrom. 

One type of aggregative, macroeconomic model 
is that which distinguishes a few important sectors 
of the economy or which relates macroeconomic 
variables of two or more economies interrelated in 
trade. Much of the theory of international trade 
deals with models of this sort [see INTERNATIONAL 
TRADE, article on MATHEMATICAL THEORY]. In fact, 
since this has been a natural mode for the analysis 
of international economic problems, international 
trade theory has historically been one of the live- 
liest areas for the development of economic theory, 
both mathematical and otherwise. More-narrowly 
econometric studies in this area have focused on 
estimates of import demand elasticities, 

Moreover, macroeconomic models have lent them- 
selves to the study of economic change, and it is 
with these models that the most significant work 
in economic dynamics has occurred. Dynamical 
Systems in economics are those in which the values 
of the economic variables at a given point in time 
determine either their own rates of change (con- 
tinuous, differential equation models) or their 
values at a subsequent point in time (discrete, dif- 
ference equation models). [For a general discussion 
of dynamic models see STATICS AND DYNAMICS IN 
Economics.] Thus, dynamic models involve both 
variables and a measure of their changes over 
time. The former often occur as “stocks,” and the 
latter as “flows.” When both stocks and flows enter 
into a given model, there are complexities in recon- 
ciling the desired quantities of each. These prob- 
lems become especially important when monetary 
variables are introduced, for example, when we 
consider the desire of individuals both to hold a 
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certain value of monetary assets and to save (add 
to assets) at a certain rate. [Specific problems of 
stock—flow models are discussed in StOCK—FLOW 
ANALYSIS. ] 

Dynamic models arise both in the theory of long- 
run economic growth [see ECONOMIC GROWTH, ar- 
ticle on MATHEMATICAL THEORY], where both 
macroeconomic and completely disaggregated gen- 
eral equilibrium models have been employed, and 
in the theory of business fluctuations or business 
cycles [see BUSINESS CYCLES, article on MATHE- 
MATICAL MODELS], where macroeconomic models 
are most common. Not all models intended to ex- 
plain the level of business activity need be cyclical 
in character. The modern emphasis is more on 
macroeconomic models, cyclical or not, that explain 
the level of business activity and its change by a 
dynamical system that responds to external vari- 
ables. These include variables of economic policy 
(government deficit, central bank policy, etc.) and 
other variables that, while having an important 
impact on the economy, have their explanation out- 
side the bounds of the theory, for example, popu- 
lation growth and the rate of technological change. 
Thus, outside variables, known as exogenous or 
autonomous variables, play upon the dynamical 
economic system and generate fluctuations over 
time that need not be periodic, These models lend 
themselves to empirical investigation, and a great 
deal of work has been done in estimating them 
[see ECONOMETRIC MODELS, AGGREGATE]. The struc- 
ture of these models has been refined and devel- 
oped as a consequence of the empirical work. 

The great advantage of aggregative models, of 
course, is that they substantially reduce the vast 
number of variables and equations that appear in 
general equilibrium systems and thereby make esti- 
mation possible. Even so, these models can be quite 
complex, either because they still contain a large 
number of variables and equations or because of 
nonlinearities in their functional forms. The mod- 
ern computer makes it possible to estimate systems 
of this degree of complexity, however. But if one 
is interested in analyzing the dynamical behavior 
of these systems, the difficulties often transcend 
our capabilities in mathematical analysis. The com- 
puter once again comes to the rescue, It is possible, 
with the computer, to simulate complex systems of 
the type being considered, to drive them with ex- 
ogenous variables, and to shock them with random 
disturbances drawn from defined probability dis- 
tributions. In that way the performance of these 
systems under a variety of assumptions regarding 
the behavior of the exogenous variables and for a 
large sample of random variables can be surveyed. 
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[Simulation studies of this sort are discussed in 
SIMULATION, article on ECONOMIC PROCESSES.] 

Variables that commonly arise in macroeconomic 
models are aggregate consumer expenditure, in- 
ventory investment, and plant and equipment 
investment. Aggregate consumer expenditure, or 
consumption, reflects the behavior of households 
in deciding how much to spend on consumer goods, 
which in some studies may be further broken down 
into categories such as consumer durables, non- 
durables, and services. Using regression techniques, 
consumer expenditure is made to depend upon 
other variables, some of which are economic in 
character (consumer income, change in income, 
highest past income, the consumer price level and 
its rate of change, interest rates and terms of con- 
sumer credit, liquid assets, etc. ) and some of which 
are demographic (race, family size, urban-rural 
residence, etc. ). The empirical study of the depend- 
ency of consumer expenditure on variables of these 
kinds has been intensive during the past twenty 
years. [For a survey of this work, see CONSUMPTION 
FUNCTION. ] 

The behavior of inventory investment has like- 
wise been the object of intensive study, both in 
terms of how inventories have varied over time 
relative to the general level of business activity and 
in terms of how inventory investment has responded 

to such variables as the interest rate, sales changes, 
unfilled orders, etc. [This work is reviewed in IN- 
VENTORIES, article on INVENTORY BEHAVIOR.] There 
are some subtle issues involved in formulating an 
inventory investment function, Sometimes inven- 
tories accumulate when firms intend they should, 
and other times they accumulate despite the desire 
of firms to reduce them, for example, when sales 
fall off rapidly relative to the capability of firms to 
alter their rates of output. Theoretical work con- 
cerned with the optimum behavior of firms in mat- 
ters of inventory policy can therefore provide some 
underpinning to the selection and interpretation of 
the role of different variables in an inventory in- 
vestment function, [See INVENTORIES, article on 
INVENTORY CONTROL THEORY.] 

The dependency of plant and equipment invest- 
ment upon such variables as business sales, sales 
changes, business profits, liquidity, etc., may also 
be studied by econometric techniques, and different 
theories have been advanced to support notions 
about the relative importance of these different 
variables. As with the consumption function and 

the determination of inventory investment, the 
plant and equipment investment function has also 
been the subject of intensive empirical research 
over the past couple of decades, [This work is re- 


viewed in INVESTMENT, article on THE AGGREGATE 
INVESTMENT FUNCTION, ] 

Decision making. Though it is methodologically 
proper for the economist to postulate ad hoc rela- 
tionships between macroeconomic variables (Peston 
1959), it is more gratifying, more unifying of eco- 
nomic theory, if the behavior of the macrovariables 
can be derived from elementary propositions re- 
garding the behavior of the microvariables whose 
aggregates they are. This is the aggregation prob- 
lem, referred to earlier, The aspiration is that an 
axiomatic theory of the behavior of the individual 
economic decision maker, most importantly the in- 
dividual (or household) and the firm, can serve as 
a fundament to theories of the interaction of the 
aggregated macrovariables. Most of the behavioral 
theory of firms and households, however, is in the 
context of partial equilibrium analysis, because the 
individual economic agent does not bother to take 
account of the very slight influence that his own 
decisions exert on the market or on the economy 
as a whole. Thus, each household and each competi- 
tive (but not monopolistic) firm regards market 
prices as fixed and unaffected by its own choices. 
But in linking together such partial equilibrium 
models of the behavior of vast multitudes of indi- 
vidual households and firms, one cannot ignore the 
impact of their combined behavior on the very 
market variables they regard as constants. Thus, 
the partial equilibrium micromodels must be re- 
organized into more general models allowing for 
these individually unperceived but collectively im- 
portant interactions, 

Microeconomic theory is largely deductive, pro- 
ceeding systematically from axioms regarding pref- 
erence and choice to theorems regarding economic 
behavior. To proceed carefully through the logical 
intricacies of this deductive theory, formal mathe- 
matics is heavily invoked. Market decisions of eco- 
nomic agents are usually hypothesized to be prudent 
or rational decisions, by which is meant that they 
conform by and large to certain basic criteria of 
decision making that are thought to have wide in- 
tuitive appeal as precepts of prudent or rational 
choice. The situations in which a decision maker 
may be called upon to choose can be formulated in 
a variety of ways. There are “static” situations, 
where the decision is not assumed to have a tem- 
poral or sequential character. There are “dynamic 
situations, in which a sequence of decisions must 
be made and in some consistent way. The decision 
problem may also be categorized according ge 
knowledge the decision maker believes he has al out 
the consequences of his decisions. At one extremi 
is the case of complete certainty, where the conse- 


quences are thought to be completely known in 
advance. Other cases involve risk and arise when 
the decision maker is assumed to know only the 
probability distribution of the various outcomes that 
can result from the decision he makes. Finally, at 
the other extreme, the decision problem may be 
conceived of as involving almost complete uncer- 
tainty, in which case the decision maker knows 
what the possible outcomes are but has no a priori 
information about their probabilities. [For a dis- 
cussion of various criteria proposed for these dif- 
ferent situations, see DECISION MAKING, article on 
ECONOMIC ASPECTS.] Fundamental, however, is the 
notion that the decision maker has preferences and 
that he exercises these within the range of choice 
available to him, An index giving his preferences 
is commonly called utility and is conceived of as a 
function of the objects of his choice. In particular, 
where an individual with a fixed income is choosing 
among various “market baskets” of commodities, 
utility is commonly postulated to be a function of 
the components of the market basket. Axiomatic 
systems that are necessary and sufficient for the 
existence of such a function have been the object 
of intensive study by mathematical economists. 
[This central problem and many subtle aspects of it 
are considered in UTILITY.] Great effort, with per- 
haps little benefit for empirical economics, has gone 
into the refinement of the axiomatics of utility the- 
ory or of the theory of consumer choice; unfortu- 
nately, much less work has been done to strengthen 
the assumptions of the theory so as to increase its 
empirical content. From the theory of consumer 
behavior comes the concept of the demand function 
of the consumer for a particular commodity, de- 
pending characteristically on all prices and on 
income, 

As for the theory of the firm, prudent, purposeful 
behavior is also assumed, and in the theory's most 
common formulation it is supposed that the firm 
wishes to maximize some measure of its preference 
among streams of future profits. This must be done 
subject to the prices that the firm must pay for 
factor services, the market opportunities it con- 
fronts when selling its products, and its internal 
technology of production. From this analysis comes 
the theory of production and of supply. [For the 
theory of production of the firm, see PRODUCTION; 
for econometric studies of production relationships 
and of the cost of production, see PRODUCTION AND 
COST ANALYSIS; and for econometric studies of de- 
mand and supply, see DEMAND AND SUPPLY, article 
On ECONOMETRIC STUDIES.] 

In the derivation of the theory of consumer be- 
havior and the theory of the firm, purposeful and 
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prudent behavior has characteristically been as- 
sociated with the notion that the decision maker 
attempts to maximize some function subject to mar- 
ket and technological constraints. Thus, the mathe- 
matics of constrained maximization has served the 
economist as the most important tool of his trade. In 
an effort to develop models of maximizing behavior 
that would lend themselves better to quantitative 
formulation and solution, interest came to focus on 
problems where the function being maximized is 
linear and the constraints constitute a set of linear 
inequalities. Methods for solving such problems 
became known as linear programming. With fur- 
ther advances, nonlinearities and random elements 
were introduced, and the method came to be ap- 
plied, as well, to problems of sequential decision 
making, The entire area is now known as mathe- 
matical programming [see PROGRAMMING]. Because 
of their practical usefulness, these methods lent 
themselves to the analysis of various specific plan- 
ning and optimization problems, especially to prob- 
lems internal to the operation of the firm. Stimu- 
lated by the availability of these techniques, as well 
as by advances in probability theory and some war- 
time experience in systems analysis, there has come 
to flourish a modern quantitative approach to the 
problems of production and business management, 
This is known as management science or opera- 
tions research [see OPERATIONS RESEARCH]. This 
development is a case of fission, management sci- 
ence now being regarded as distinct from econo- 
metrics, although both fields have much in common 
and share many a professor and practitioner. 

The most complex problems in the area of pru- 
dent decision making are those that involve strate- 
gical considerations. In its essence this means that 
the consequence of a decision or action taken by 
one participant depends upon the actions taken by 
others; but their actions in turn depend upon the 
actions of each of the other participants. Thus, the 
structure of the problem is not that of simple max- 
imizing, even in the face of risk or uncertainty, but 
is that of the strategical game, [See GAME THEORY, 
article on THEORETICAL ASPECTS.] Based upon con- 
siderations of the prudent strategy of the individual 
participant and of the incentives for subsets of 
participants to form coalitions, the theory of games 
can be presented as a general equilibrium problem 
and has become intimately associated with the 
modern work in general equilibrium economics, In 
a more partial context, game theory has appeared 
applicable to the decision problems of firms in 
oligopolistic and bilateral monopoly situations. 
These are characterized by the fact that each firm, 
in choosing its best course of action, must take 
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into account the effect of its action on the actions 
of other firms, which also perform in a prudent 
way. In general, the early enthusiasm for the appli- 
cation of game theory to these problems of industrial 
behavior has thus far been confirmed in only lim- 
ited degree. [For a review of the applications of 
game theory to business behavior, see GAME THE- 
ory, article on ECONOMIC APPLICATIONS.] 

Distribution processes. A concern of long stand- 
ing in economics has been the size distribution of 
economic variables. What determines the distribu- 
tion of family incomes or the distribution of the 
assets or sales of firms in a given industry? In past 
years these problems have been dealt with descrip- 
tively by fitting frequency distributions to the data 
of different countries, different years, or different 
industries. A good fit to data from different sources 
could be declared an empirical “law”; thus, the 
Pareto law of income distribution. In more recent 
years the size distribution problem has been rede- 
fined. Econometricians now regard it as one of for- 
mulating a dynamic process of growth or decay with 
random elements. The task is to estimate the param- 
eters of the process and to determine whether there 
is an equilibrium distribution of the size of units and 
what that distribution is. A good fit can thus have 
a theoretical mechanism behind it, and the param- 
eters can be made to depend upon other economic 
variables that may change or may be controlled. 
[In this connection, see SIZE DISTRIBUTIONS IN ECO- 
NOMICS and MARKOV CHAINS.] 

Statistical methods. In the natural sciences the 
investigator must make his own measurements. In 
economics, however, the economy itself generates 
data in vast quantities. Taxpayers, business firms, 
banks, etc., all record their operations, and in many 
cases these records are available to the economist. 
Unfortunately, these data are not always precisely 
the kind the economist wants, and they must fre- 
quently be adjusted for scientific purposes. In re- 
cent decades the government has been engaged 
increasingly in the accumulation and processing of 
economic data. This has been of tremendous help 
in the development of econometrics. Not only is 
this the case for the United States and western 
European governments but data are also accumu- 
lated in the planned economies, where they are of 
critical importance in the planning operation. [See 
Economic DATA.] The absence of adequate data is 
felt most severely in the study of the underdevel- 
oped economies, although, through the United Na- 
tions and other organizations, an increasing amount 
of data for those parts of the world is being gath- 
ered and collated. 


A major form in which economic data occur is 
that of successive recordings of economic observa- 
tions over time. Thus, there may be many years of 
price data for particular commodities, of employ- 
ment data, etc. The econometrician, therefore, has 
traditionally been heavily concerned with time series 
analysis [see TIME SERIES] and especially with the 
use of regression methods, where the various ob- 
servations are ordered in a temporal sequence. This 
has lent itself to the development of dynamical 
regression equations attempting to explain the ob- 
servation of a particular date as a function not only 
of other variables but also of one or more past 
values of the same variable. Thus, the dynamic 
regression relationship is a difference equation in- 
corporating a random term, When many past values 
of the variable are introduced into the difference 
equation, so that it is one of very high order, it 
becomes difficult to estimate the coefficients of 
these past variables without losing many degrees 
of freedom. As a result, the econometrician has 
tried to impose some pattern of relationship on 
these coefficients, so that they may all be estimated 
as functions of relatively few parameters, This is 
the technique of distributed lag regressions. [See 
DISTRIBUTED LAGS.] 

The techniques just described have largely come 
to replace the older methods of time series decom- 
position, whereby a time series is split up into such 
components as trend, cycles of various lengths, a 
seasonal pattern of variation, and a random com- 
ponent. These methods implied the interaction of 
recurrent influences of regular periodicity and am- 
plitude. With the move toward the difference equa- 
tion and regression approach, exogenous variables 
have been introduced and random disturbances 
made cumulative in their effects. The temporal 
performance of a time series is thereby described 
less in terms of some inherent law of periodicity 
and more in terms of a succession of responses to 
random influences and to the temporal variation 
of other causal variables, Forecasting is not, there- 
fore, the inexorable extrapolation of rhythms but 
is the revised projection, period by period, of an 
incremental relationship depending on present and 
past values, on exogenous variables, and on ran- 
dom elements. [See PREDICTION AND FORECASTING, 
ECONOMIC. ] 

Nevertheless, it has always been sensible to as- 
sume a rather strict periodicity for the seasonal 
component because of the recurrent nature of pens 
sons, holidays, etc. As a result, when studying time 
series where the observations are daily, weekly, or 
monthly, it is customary first to estimate and re- 


move the seasonal influence. [Techniques for doing 
this are discussed in TIME SERIES, article on SEA- 
SONAL ADJUSTMENT. ] 

The other kind of data that the economist uses 
is cross-sectional. For example, he may use a sam- 
ple of observations, all made at approximately the 
same time, of assets, income, and expenditures of 
different households, firms, or industries. [See 
CROSS-SECTION ANALYSIS.] By observing differences 
in the behavior of the individuals in the sample 
and, again usually through regression analysis, 
ascribing these differences to differences in other 
variables beyond the control of these individuals, 
the econometrician attempts to infer how the be- 
havior of similar economic units would change 
over time if the values of the independent variables 
were to alter. There are many pitfalls in this proc- 
ess of inferring change over time for a given firm 
or household on the basis of differences among 
firms and households at a given point of time. What 
becomes especially useful are data that are both 
cross sectional and time series in character, as, for 
example, when the budgets of a sample of house- 
holds are observed, each over a number of suc- 
cessive years. To obtain usable information of a 
cross-section or of a cross-section and time-series 
sort commonly requires the design of a sample 
survey, [The application of survey methods in eco- 
nomics is discussed in SURVEY ANALYSIS, article 
On APPLICATIONS IN ECONOMICS.] 

A very common problem in econometrics arises 
when different variables are related in different 
ways. For example, aggregate investment depends 
on national income, but national income depends, 
in a different way, on aggregate investment. In 
demand and supply analysis, equilibrium quantity 
exchanged and market price must satisfy both a 
demand and a supply function simultaneously, This 
simultaneity of multiple relationships between the 
same variables presents special problems in the 
application of regression methods. These problems 
have been much studied over the past twenty years, 
and various devices for dealing with them are now 
available. These methods are often quite complex, 
but with advances in statistical theory and in the 
availability of data and with the use of the large- 
scale computer they have come into common use 
in estimating both partial equilibrium and macro- 
economic models, sometimes of quite large dimen- 
sion, Although touched upon only briefly here, this 
commanding problem in statistical methodology is 
perhaps the most central feature of econometric 
analysis and is the subject of a number of texts 
and treatises. It is also probably the largest block 
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of material covered in most special courses in econ- 
ometrics, [See SIMULTANEOUS EQUATION ESTIMA- 
TION. ] 


To those engaged in research at the frontiers of 
any science, progress seems always to be exceed- 
ingly slow; but a review of the accomplishments of 
econometricians both in the development of eco- 
nomic theory and in its quantitative estimation and 
testing over the past two or three decades gives one 
the feeling of great achievement. But as old prob- 
lems are solved, new ones are invented. Thus, the 
advance of econometrics continues unabated. 


ROBERT H. STROTZ 
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ECONOMIC ANTHROPOLOGY 


Economic anthropology is the analysis of eco- 
nomic life as a subsystem of society. The facts of 
economic anthropology are arrived at through field 
observations, either as a by-product of the ethnog- 
rapher’s investigation of the total culture or as the 
focus of his attention. The small-scale society was 
the hearth on which the methods and theories of 
economic anthropology were forged, but modern 
interests have expanded as anthropologists have 
begun to study complex, industrialized, and modern 
societies. As anthropologists extend their interests 
they come into contact with the discipline of eco- 
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nomics, which has a highly developed body of the- 
ory. This confrontation between economists, who 
are reaching toward what they have professionally 
considered the “backwashes and underworlds” of 
economics, and anthropologists, who are moving 
toward the complex monetized systems, has gen- 
erated some fruitful interchange, particularly re- 
garding economic change and development. 

The methods of economic anthropology are the 
basic methods of social and cultural anthropology; 
they are designed to discover social regularities in 
an alien social setting and to attribute meaning to 
these regularities, 

The problems of getting the basic facts of 
economic life in a small-scale nonmonetary or par- 
tially monetized economy are often tests of meth- 
odological ingenuity. Taking household censuses, 
plotting landholdings, collecting data on income and 
expenditure patterns from representative social 
units, observing the contexts of exchange and valu- 
ation, and calculating the output of productive 
units are now standard field procedures. Methodo- 
logical innovations like the construction of subsist- 
ence-exchange ratios for the extended household 
in northern Nigeria (Smith 1955), the development 
of annual output indices for artisan communities 
(Nash 1961), and the critical ratio of labor export 
for Rhodesian tribal societies (Watson 1958) are 
some examples of attempts to quantify available 
economic data, 

Peasant and primitive economies appear to have 
some startling and pervasive organizational differ- 
ences from the monetized, market-directed or state- 
directed, and industrialized society of the Western 
world, Interpreting and explaining these differences 
is a major task of economic anthropology. Early 
evolutionary interpretations (Bücher 1893; Buxton 
1924; Hoyt 1926) and those of more recent years 
that are based on some field observation (Thurn- 
wald 1932; Viljoen 1936) view primitive economies 
as polar opposites to market economies. These in- 
terpretations, however, have fallen under the 
weight of modern field work, which has revealed 
a host of variables that the crude classification by 
stages could not contain, and the dynamics of eco- 
nomic change have turned out to be vastly more 
varied and complex than the evolutionary sche- 
matics allowed. 

Attempts to import wholesale the techniques of 
economics (Goodfellow 1939) have proved of as 

little use in handling the anthropological data as 
the programs to build a specific “economics” to 
handle premarket (Polanyi et al. 1957) or non- 
Western societies (Boeke 1947). 

The arguments about the economic rationality 


of peasants and primitives—whether or not they 
respond to economic incentives, whether or not 
they maximize, whether or not they have habits of 
mind about advantage—have turned out to be sham 
encounters or semantic confusions, The real issue 
is not that of the presence or absence of economic 
rationality in economic life, our own or other; 
rather, it is a matter of discovering what patterns 
of rationality, of choice, of action, are appropriate 
to varying social and cultural arrangements. This 
research perspective sees any and all social and 
cultural structures that define and channel the 
patterns of economic choice and activity. There are 
no economic motives, only motives relevant to the 
economic sphere. Debate over concepts of maxi- 
mization and rationality thus dissolves into the 
construction of models of social and economic 
choice in various cultural and social structures, 
Recognition of the social necessity of bringing 
scarce means into relation with competing ends 
along some culturally determined axes of valua- 
tion makes the search for the empirical patterns 
actually governing allocation in varyingly con- 
stituted societies the most fruitful line of analysis. 


Peasant and primitive economies 


Information about peasant and primitive econo- 
mies appears at first to be a heterogeneous mass 
impervious to analysis. A four-dimensional model, 
however, seems to provide the axes along which 
a classification can be built and also to provide 
initial hypotheses for the understanding of the 
form, function, and dynamics of peasant and 
primitive economies. 

Technology and division of labor. In societies 
with a relatively simple technology the number of 
different tasks involved in any productive act are 
few. Usually it is the skill of a single or a few pro- 
ducers that carries production from beginning to 
end, Many primitive and peasant technologies are 
ingenious, marvelously fitted to a particular en- 
vironment, requiring high levels of skill and per- 
formance, but still very simple. The Bemba of Rho- 
desia (Gluckman 1945) wrest a living from poor 
soil with uncertain water supply by an intricate 
method of cultivation requiring tree pollarding. 
They climb the tall trees, cut several branches, burn 
them, and use the ash for fertilizer on hoed ground. 
With good rains and luck they harvest their crop 
of finger millet. The system is one of balance in 4 
precarious ecological niche, but the task structure 
is simple and the tools involved require only human 
energy to operate. The specialized operations a 
volved are not the kind that make an aia ae 
web of occupations. Men do most of the agricultur 


work of the Bemba, and one man is virtually as 
good as another in agricultural skill. The division 
of labor follows lines of sex and age. An occupa- 
tional list in a peasant and primitive society is not 
a long one. Persons tend to learn their productive 
skills in the ordinary business of growing up, and 
within age and sex categories there is high substi- 
tutability of productive workers, Work and tasks are 
apportioned to the appropriate persons without 
much regard to functional differences in skill or 
productivity. The technology also sets limits on the 
size of combined working parties. Except at peak 
periods—planting or harvesting in agricultural 
communities, an organized hunt at the height of 
the animal running season, or gathering clay in a 
pottery-making community—large working parties 
are not found. Effort and work are closely fitted to 
the annual and ceremonial cycle. 

Structure of productive units. Peasant and 
primitive societies do not have organizations whose 
only tasks are those of production, and there are 
no durable social units based solely on productive 
activities. The unit of production—that is, the 
social organization carrying out the making of 
goods—is dependent on and derived from other 
forms of social interaction. The bonds of kinship 
within and between families, clans, and kindreds 
often provide the structure for economic activities. 
The political structure, especially in societies with 
hereditary nobilities, is often a mechanism for 
forming productive units. Territorial bonds also 
serve to create local producing organizations, Eco- 
nomic units are thus dependent on prior kinds of 
Social relations. Productive units in peasant and 
primitive societies are multipurposed; their eco- 
nomic activities are but one aspect of the things 
done by the unit. These productive units are limited 
in the sorts of personnel they are able to recruit, 
the capital they are able to command, and the ways 
in which a product may be distributed. Usually 
there is no labor market, or capital market, or sys- 
tem of distribution to factors of production, A 
Striking example of reduplicative productive units 
based on relations derived from the organization of 
Social groups only partially oriented to economic 
activity is an Indian pottery-making community in 
Southeastern Mexico. This community is composed 
of 278 households. Each household engaged in the 
production of pottery for sale has virtually the 
same technology (Nash 1961). Among the people 
of Tepoztlán, also in Mexico, many make their liv- 
ing by the sale of services at a wage, Yet people 
must be sought out for employment, and hiring a 
fellow member of the community is a delicate social 
job: an individual's social position may not be im- 
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pugned, nor can the transaction appear strictly 
economic (Lewis 1951). Use of the capital accumu- 
lated by a multipurposed productive unit based on 
noneconomic criteria for membership is con- 
strained by the whole task structure of the social 
unit, not just its economic dimension. 

This kind of economy is relatively inflexible. Pos- 
sibilities of technical advance, of innovation in or- 
ganization, or of extended risk taking are not only 
often precluded by poverty but also cut off by the 
social constitution of the productive unit. But all 
economies need some flexibility, and this is often 
provided by resident strangers. The role of the resi- 
dent stranger highlights the ways in which social 
and cultural features channel economic activity. 
Resident strangers come to other societies for busi- 
ness reasons. They lend money, they bring com- 
modities, or they tie communities into larger net- 
works of exchange. The resident strangers are not 
morally bound to the people of the community; 
they stand outside the social structure. The Chinese 
of southeast Asia, the Lebanese of the West Indies, 
the Muslims and Hindus in sub-Saharan Africa, 
and the Jews of medieval Europe are all examples 
of resident strangers who played the role of pro- 
viding flexibility to economies not organized for 
exclusively economic ends, 

The system and media of exchange. In primi- 
tive and peasant economies the close calculation 
of costs is often impossible or merely irrelevant. 
The advantages of a change in the use of time, re- 
sources, and personnel are arrived at through the 
logic of social structure, through a calculus of rela- 
tive values, and not in terms of the increase of a 
single magnitude such as productivity. The inability 
to estimate closely costs and benefits is aggravated 
by the absence of money as the medium of ex- 
change. Most of the world now has some familiarity 
with the use of a monetary medium of exchange. 
In fact, some societies developed complete, all- 
purpose monetary systems prior to contact with the 
industrial and commercial West (Davenport 1961), 
Many societies have other standards of exchange, 
like the Polynesian shell currencies, the tusked 
pigs of Melanesia, the salt currency of the Horn of 
Africa, or the cocoa beans of the Aztecs. But money, 
in the sense of measure of the value of goods and 
services with derived attributes of demand con- 
vertibility, futurity, and anonymity of source and 
indifference to who holds it, is much less fre- 
quent. When quasi money, or special-purpose 
money, exists, it is merely the standard with the 
widest sphere of exchange. Special-purpose money 
is confined to a particular circuit of exchange, and 
the circuits of exchange in the economy are only 
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partially articulated. Among the Siane of New 
Guinea (Salisbury 1962) there are different kinds 
of exchange goods, and each kind of goods is lim- 
ited to its particular circuit of exchange. Some 
goods can be exchanged only for subsistence items, 
others only for luxury items, and others only for 
items that confer status and prestige. The Tiv of 
Nigeria (Bohannan 1955) have a similar multi- 
centered exchange system, with media appropriate 
to each sphere of exchange. Food is exchanged for 
food and can be exchanged for brass rods; brass 
rods exchange for the highest valued goods, women 
and slaves. A reverse or downward movement of 
exchange items is severely resisted and considered 
illogical and unfortunate among the Tiv. 

The media of exchange and the circuits of ex- 
change are set into various kinds of systems of 
exchange. The most common systems of exchange 
are markets, redistribution, reciprocity, and mo- 
bilization. The market system is widespread among 
peasants, and, as in Mesoamerica, tends to be free, 
open, and self-regulating (Tax 1953), Sometimes, 
as in Haiti (Mintz 1961), special bonds of personal 
attachment grow up between some buyers and some 
sellers that cut down some of the risk and uncer- 
tainty involved in small-scale peasant trading. Ro- 
tating market centers, with a central market and 
several subsidiary markets, are a fairly common 
feature in Burma among the Shans and the Chins, 
in several parts of Africa north and south of the 
Sahara, and in many places in the Near East. These 
market systems usually operate without the pres- 
ence of firms and do not have expensive facilities 
for the spread of information or other activities. 
The complex of markets, firms, capital investments, 
entrepreneurs, deliberate technical investment, and 
property rules to facilitate accumulation and ex- 
change is apparently historically peculiar to the 
West. In the ethnographic record it does not appear 
as a necessary bundle or sequence of events. 

Reciprocity is a form of exchange exemplified 
by the practices of gift giving (Mauss 1925) and 

tends to lack much bargaining between trading 
partners. It involves fixed sets of trading partners 
and occurs between equivalent units of the social 
structure. Thus, clans exchange with clans; barrios 
or wards with wards; households with households; 
communities with communities. The exchanges are 
for near equivalences in goods and services and 
approach fixed rates. 

Redistributive trade takes place in societies with 
marked systems of social stratification but lacking 

market exchange. An African paramount chief may 
collect tribute in the form of goods and redistribute 
it down the social hierarchy through his clients and 


kinsmen. Or a political center may administer trade 
at fixed prices, exchanging with its peripheries 
(Polanyi et al. 1957). The effect of redistributive 
exchanges is to keep a differentiated political and 
status system operating without great gaps in 
wealth between the different status levels. A mo- 
bilization system of exchange (Smelser 1959) con- 
centrates goods and services in the hands of an 
elite for the broad political aims of the society. The 
irrigation empires of the early civilizations appar- 
ently had these sorts of exchange systems, and 
some of the new nations of Asia and Africa control 
capital in this manner, in conjunction with some 
aspects of market, redistributive, and reciprocal 
exchange. 

The control of wealth and capital. Generally, 
investment takes the form of using resources and 
services to support or expand existing social ar- 
rangements. The chief capital goods in peasant and 
primitive societies are land and men. Tools, ma- 
chines, terraces, livestock, and other improve- 
ments in productive resources are controlled in a 
manner derived from the conventions of control 
and allocation of land and human beings. Land 
tenure is an expression of the social structure of a 
peasant and primitive society, and the allocation of 
land results from the operation of the system of 
kinship, inheritance, and marriage, rather than 
through contracts or transactions between eco- 
nomic units. Even in those societies where corpo- 
rate kin groups (clans) do not exist as the land- 
holding bodies, special devices like titles obtained 
through kinship bonds or kindred-based land-hold- 
ing corporations, as on Truk (Goodenough 1951), 
may be established. Manpower, like land, is also 
organized to flow in terms of given social forms, 
rather than according to abstract concepts of 
maximization. , 

For peasants and primitives to maintain their so- 
cieties, capital or property rules or economic chance 
may not be permitted to work in ways disruptive 
of the values and norms of the society. A fairly 
ubiquitous device for ensuring that accumulated 
resources are used for social ends is a leveling 
mechanism. The leveling mechanism is a means of 
forcing the expenditure of accumulated resources 
or capital in ways that are not economic or produc- 
tive. The leveling mechanism may take the form 
of forced loans to relatives or coresidents; a large 
feast following economic success; a rivalry of ex- 
penditure, like the potlatch of the Northwest Coast 
Indians, in which large amounts of valuable goods 
were destroyed; the ritual levies consequent on 
office holding in civil and religious hierarchies, as 
in Mesoamerica; or the giveaways of horses and 


goods that characterized Plains Indian culture. 
This institutionalized means of scrambling wealth 
inhibits reinvestment in technical advance and pre- 
vents crystallization of class lines on an economic 
base. 

This schematic presentation of the major fea- 
tures of peasant and primitive economies serves to 
emphasize the range of social contexts for com- 
parative economic analysis. An alternative model 
can be found in Meillassoux (1960). Forde (1956) 
and Firth (1951) also give descriptive summaries 
of the major features of small-scale economic sys- 
tems. But charting the range and diversity of eco- 
nomic systems is only a part of the task of anthro- 
pology. How economic systems relate to the total 
social system is a question of major theoretical 
importance in social science. 


Relations of economy and society 


The value system of a society defines and grades 
the ends actors seek. The ends sought in the eco- 
nomic sphere must be consonant with, or comple- 
mentary to, goals in other spheres. Economic ac- 
tivity derives its meaning from the norms of the 
society, and people engage in economic activity for 
rewards often extrinsic to the economy itself. In 
peasant and primitive societies the norms and 
values used to define a resource, a commodity, con- 
trol over certain goods and services, the distributive 
process, and standards of economic behavior are 
norms governing most social interaction. The 
economy is not so structurally differentiated that 
one set of values holds there and other sets hold 
in other contexts. In the language of social science, 
the economy is “embedded” in other social systems 
and does not exhibit an ethic counterposed to the 
regnant value system. 

The functional interdependence of economy and 
society (Parsons & Smelser 1956) stems from the 
fact that the same persons are actors in the eco- 
nomic, the kinship, the political, and the religious 
spheres. Thus, among the Tallensi (Fortes 1949) 
the role of father must fit in some way with the 
role of farmer, and these must fit with the role of 
believer in the ancestor cult, and these must fit 
with authority position in the lineage. The interde- 
pendence of parts of a society means that there are 
limits to the sorts of economies and societies that 
may coexist in one time-and-space continuum. But 
it is patent that a system of reciprocal exchange 
Tests on social units that are nearly equivalent in 
Status, power, and size. Similarly, an economy for- 
bidding the performance of any economic role by 
its religious specialists, as in some Theravada 
Buddhist countries, will not have funded clerical 
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estates. The marriage and descent system of the 
Nayar (where husbands were warriors who lived 
away from wives and where descent was matri- 
lineal) is an instance of the functional compati- 
bility of an occupational and status system with 
a marriage and descent system. 

The causal interaction of economy and society 
pivots on the provision of facilities. For given forms 
of social structure a given variety and volume of 
goods and services are required, and if there are 
shifts in facilities available there will be shifts in 
the rest of society. The converse is also logically 
true; shifts in the social structure will change the 
volume and variety of goods and services a society 
produces. To discover the causal interactions of 
these processes, we study peasant and primitive 
societies undergoing change. The facts of change 
are the only sure guides to generalizing on the se- 
quences, forms, and processes of economic and 
social interaction. Much of the change in the eco- 
nomic life of peasants and primitives comes from 
the expansion and spread of the Western forms of 
economic activity. This may occur through the ex- 
port of goods and labor services by the peasants and 
primitives or through the introduction of complex 
industrial and commercial organizations in the 
midst of peasant and primitive societies. 

The expansion of the economic frontier can be 
seen in places like Orissa, a hilly tribal region in 
India (Bailey 1957). Here economic opportunity in 
the wake of the spread of the money economy has 
allowed some castes to move quickly up the status 
ladder and forced some traditionally high status 
castes downward. The economic frontier in the 
form of money and new opportunities tends to 
change the role of corporate kin groups and place 
more emphasis on smaller familial units (Worsley 
1956), to introduce a re-evaluation of the goods of 
a society (Bohannan 1959), and to put pressures 
on traditional authority systems. 

The chief way that peasants and primitives get 
involved in the world economy is by entering a 
wage labor force or by producing something that 
can sell in international trade. Entering a wage 
labor force (Watson 1958) often generates con- 
flict between generations, raises problems about 
the control of income, and sometimes depopulates 
the society, so that its social structure collapses. A 
rural proletariat may replace a tribal society. In- 
come from entrepreneurial activity by peasants 
poses larger problems for the social system (Firth 
1954). It may result in greater wealth differences, 
in modifications of the conventional use of capital, 
in loosening the normative integration of the so- 
ciety, and in restructuring the political and au- 
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thority patterns. The boom involved in peasant 
agriculture (Belshaw 1955) often involves a 
change in religious and ethical concepts and an 
increase in the importance of economic activity 
relative to other forms of social activity. 

The introduction of factories to peasant and 
primitive societies (Nash 1958 ) provides, in theory, 
the widest possibilities for transformation. The 
change induced by a factory may be akin to that 
induced by the increased monetization from wage 
labor or the expansion of the economic frontier, 
but it tends to tie the community more closely to a 
national and international economic network, to 
provide the context for political change, to lay the 
basis for new voluntary groupings, and to exert 
great pressures on extended familial networks and 
on traditional patterns of respect and authority, and 
to demand of persons and institutions a type of 
flexibility and mobility usually not found in tra- 
ditional societies (Moore & Feldman 1960). 

Studies of economic change have shown that 
modifications in economic activity set up a series 
of pressures and tensions in the society and cul- 
ture and that there are limited possibilities for their 
resolution. There is no generally agreed upon se- 
quence of change, and hardly more consensus on 
final forms, but the evidence seems to indicate that 
economic systems are among the most dynamic 
parts of a society and that economic activity, in the 
sense of providing facilities for the organization of 
the rest of social life, is one of the most pervasive 
and determinative aspects of social life. It sets the 
limits within which social structures and cultural 
patterns may fall. 

The challenge and potential of economic anthro- 
pology is the fashioning of a theory encompassing 
both economic and noneconomic variables in a 
single explanatory system. It may then provide a 
framework for a truly comparative study of the 
form, function, and dynamics of economic systems. 


MANNING NASH 


[Directly related are the entries ECONOMY AND SOCIETY; 
EXCHANGE AND DISPLAY; PEASANTRY; TECHNOLOGY. 
Other relevant material may be found in ANTHRO- 
POLOGY, article on SOCIAL ANTHROPOLOGY; and in 
the biographies of Mauss and Potanyt.] 
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I 
GENERAL 


During the first half of the twentieth century 
there was considerable evolution in the scope and 
content of economic data. At the turn of the cen- 
tury, statistical abstracts of economic data gener- 
ally contained only information relating to the 
production and prices of specific agricultural and 
mineral commodities, the value of imports and 
exports, and, in some cases, information on tax 
revenues and outlays by central governments. The 
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financial crises of the late nineteenth century and 
early twentieth century stimulated an increase of 
interest in banking and financial statistics, and as 
central banks were set up and systems of reporting 
regularized, data on the operation of the banking 
system emerged as another type of basic economic 
data. 

Economists as such did not play an important 
role in fashioning economic data prior to World 
War 1. In the 1920s, however, the situation changed 
rather dramatically. One of the first areas of data 
in which the activities of economists became impor- 
tant was that of index number construction. Irving 
Fisher and Warren Persons did much to advance 
index number theory and to foster the development 
of price and output measurements. Fisher not only 
developed the measurement of index number for- 
mula (1922) but he also pioneered in the develop- 
ment of what later became known as econometrics 
—the advancement of economic theory in its rela- 
tion to statistics and mathematics. Warren Persons’ 
work and, later, that of Wesley C. Mitchell gave 
impetus to the analysis of economic time series; 
they concentrated on the task of separating trends 
and seasonal and cyclical factors from one another. 
Mitchell was the founder of the National Bureau 
of Economic Research, an institution in the United 
States that has continued to be in the forefront in 
the development and analysis of economic data. 
In England during this period, work on economic 
data was concerned somewhat more with the devel- 
opment of statistical theory. Bowley and Stamp 
were particularly influential in the development of 
statistical techniques and techniques for the anal- 
ysis of economic data. 

In spite of the rapid development during the 
1920s, however, it is fair to state that prior to the 
great depression of the 1930s economic data re- 
mained highly fragmented, and there was no unify- 
ing framework to integrate the large number of 
independent series. Economists had been preoccu- 
pied with the search for a few key economic indi- 
cators that would solve all their problems, revealing 
the true economic cycle and showing the level of 
prices and industrial activity. 

A completely different view of the functioning 
of the economic system was introduced by John 
Maynard Keynes in The General Theory of Employ- 
ment, Interest and Money (1936). By emphasizing 
the basic idea that the functioning of the economy 
depended upon the interaction of its parts, Lord 
Keynes fostered the concept of economic data as 
a systematic body of interrelated information. For 
the first time a comprehensive set of economic data 
covering all production, consumption, saving, and 
investment for the economy as a’whole was intro- 


366 ECONOMIC DATA: General 


duced in the context of an operational economic 
model. The pioneering work in the field of national 
income measurement that had been done by Will- 
ford I, King (1930), Simon Kuznets (1941), and 
Erik Lindahl (see Stockholm, Universitetet 1933— 
1937) was developed into a social accounting 
framework by Richard Stone (1951), Simultane- 
ously, Wassily Leontief (1941) was developing 
another operational economic model involving the 
interrelation of inputs and outputs of specific indus- 
trial sectors of the economy. 

The development of national income accounting 
and input-output analysis thus introduced the idea 
that economic data are a set of systematically inter- 
related pieces of information that can be utilized 
to show the functioning of the various parts of the 
economy, However, this concept of a framework 
for economic data did not supplant interest in the 
growing body of partial data in almost every area 
of economic analysis. Data in such fields as agri- 
culture, labor, manufacturing, trade, ete., contin- 
ued to expand, The significance of the more sys- 
tematic approach to economic data lay partly in 
its ability to digest, integrate, and put order into 
the massive increase in information about specific 
sectors of the economy, 

World War 11 provided a further impetus to the 
expansion of economic data. The mobilization of 
the war economy required vast amounts of infor- 
mation about the productive capabilities of various 
industries and their supply requirements. The sup- 
pression of inflation through taxation and price 
control required the further development of na- 
tional income accounts. In almost all countries 
considerable advances were made in the develop- 
ment of systematic bodies of economic data, and 
in the immediate postwar period these advances 
continued, Programs of economic recovery were 
instituted, and international organizations such as 
the Organization for European Economic Coopera- 
tion and the United Nations stimulated countries 
to make basic improvements in their statistical re- 
porting systems, 

The trend toward the utilization of more compre- 
hensive bodies of economic data in econometric 
models is continuing. Prior to World War 1 econ- 
ometric models were generally quite aggregative, 
but in recent years these models have become 
considerably less so. For example, the model Law- 
rence Klein developed for the United States econ- 

omy in 1950 contained 16 equations; the 1959 
Klein-Goldberger model contained 32 equations; 
and the 1963 Social Science Research Council 
model contains over two hundred equations. Simi- 
larly, the research studies by Jan Tinbergen in the 


Netherlands and by Richard Stone in England have 
been moving toward disaggregated econometric 
models involving complex interrelations among the 
variables, [See ECONOMETRIC MODELS, AGGREGATE. | 
Recently a somewhat different approach has 
been developed, utilizing computer simulation tech 
niques in order to project economic behavior over 
a future period or to analyze how an economic 
system behaves under specific conditions. Basically, 
simulation utilizes economic models in such a way 
that major pieces or segments of the model are 
treated as probability samples for which behavioral 
characteristics are taken from sample surveys or 
masses of detailed data observations at the micro 
economic level. By utilizing a building-block ap- 
proach, simulation techniques can handle many 
different levels of aggregation simultaneously and 
therefore are capable of utilizing both highly de- 
tailed microeconomic data and more aggregated 
macroeconomic data, The recent advances in com- 
puter technology have thus made feasible the utili- 
zation of economic data in a manner that was 
previously not thought possible. [See SIMULATION. | 
Types of economic data available. Itis obvious 
that the content of economic data is undergoing 
profound change and that at the present time we 
are in the middle of these changes. Nevertheless, 
it will be useful to identify what are the main 
types of economic data currently available and to 
discuss the ways these bodies of data are changing. 
National income accounts. Within the past 
decade national income accounts have come to be 
recognized as one of the most basic types of infor- 
mation about a nation’s economic activity. In a 
number of countries the government is required to 
present, along with its budget, a forecast of the 
national income accounts for the coming period, 
showing how the proposed government budget 
relates to anticipated economic developments. 
National income accounts today generally cover 
five sectors of the economy. (1) The gross national 
income and product account shows who purchases 
the final output of the economy and how the in- 
come generated by such economic activity flows to 
individuals as income, to governments as taxes, Or 
to corporations as profits. (2) The personal income 
account shows the income received by individuals, 
whether in their role as participants in the pro- 
ductive process or as recipients of social security 
benefits or other transfers from government. The 
personal income account also shows how people dis- 
pose of their income, i.e., how much of their 
income goes for consumer expenditures and per- 
sonal taxes and how much is saved. (3) The public 
sector account, which has become considerably 


more important in recent years, shows, on the 
receipts side, what the government receives from 
taxes on individuals or on businesses and how 
much it receives as property income, such as the 
surplus of government enterprises. The expenditure 
side of the government account shows the outlays 
of the government in terms of the functions which 
it undertakes, such as education, health, defense, 
general administration, and economic development. 
(4) The balance of payments account shows the 
foreign trade of an economy in terms of imports 
and exports and transfers to and from abroad. This 
account shows whether a nation is running down 
its reserves of foreign exchange or acquiring a 
surplus. (5) The final account, the saving and 
investment account, on one side brings together 
the excess of income over expenditure in each of 
the other sectors, and on the other side shows the 
capital formation by government and by public and 
private enterprises. 

These five sector accounts are so set up that an 
expenditure in one account corresponds to a receipt 
in some other account. Thus, for example, the 
amount of consumer expenditures in the personal 
income account corresponds exactly to the sale of 
goods to consumers appearing in the gross national 
product account. [See NATIONAL INCOME AND 
PRODUCT ACCOUNTS.] 

Flow-of-funds data. The national income ac- 
counts cover only current income and expenditure 
transagtions; they exclude those financial transac- 
tions that merely alter the assets and liabilities of 
the various sectors of the economy. Because this 
type of financial information is important in any 
analysis of fiscal and monetary policy, sets of 
information on the sources and uses of funds by 
the different sectors of the economy have been 
developed in many countries. In the United States 
information on the flow of funds is published by 
the Federal Reserve Board. This information is 
generally consistent with the national income ac- 
counting data, but the two sets of data are not 
fully integrated. In some other countries—for ex- 
ample, Canada, the Netherlands, and France—the 
flow-of-funds data have been integrated with na- 
tional income data to produce one consistent set 
of data interrelating the income accounts with the 
financial transactions of each sector. 

Input-output data. Input-output data showing 
the purchase of inputs and sale of outputs by 
industrial sectors of the economy are also becom- 
ing available for many countries, but unlike na- 
tional income and flow-of-funds data, this informa- 
tion rarely appears on an annual basis. The 
statistical requirements of a complete input-output 
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system are very considerable, and it is usually 
possible to-obtain such a body of data only for 
periods for which economic censuses are available. 
Again, however, there is an increasing tendency 
toward integration of national income accounting 
data and input-output data. For example, the 
gross product originating by industrial sector has 
come to be a common breakdown in the national 
income accounts, and this often matches the indus- 
trial breakdown provided in the input-output 
matrix. [See INPUT-OUTPUT ANALYSIS.) 

National wealth. The development of flow-of- 
funds accounts and input-output matrices has 
fostered increased interest in balance sheets for 
various sectors of the economy. Data on the finan- 
cial transactions of each sector and on capital 
formation make possible, via a perpetual-inventory 
approach, the development of national wealth and 
sector balance sheets. This work, however, is still 
in its early stages, and few countries at present 
publish official current data on national wealth. 
[See NATIONAL WEALTH.] 

Data coverage, collection, and processing. One 
of the major characteristics of economic data sys- 
tems at the present time is increasing detail, both 
in terms of time coverage and in terms of disag- 
gregation. For example, today most of the indus- 
trial economies publish quarterly data on national 
income accounts; and for some components, such 
as personal income, they may even publish monthly 
data. There is also increasing interest in regional 
breakdowns of information within nations. In large 
part this is due to the fact that the different regions 
of a nation develop differently, and economic de- 
velopment must recognize differences in regional 
patterns, for the consideration of problems of loca- 
tion of new industries and for dealing explicitly 
with problems of regional poverty. [See REGIONAL 
SCIENCE.] 

In developing these new bodies of data, govern- 
ments have made use of two new tools which have 
made possible the development of much more reli- 
able data at lower cost: (1) the use of sample 
surveys and (2) the development of electronic data 
processing. Since World War 11 there has been a 
considerable increase in the use of sample surveys 
that are specifically designed to obtain desired in- 
formation. Many of these sample surveys have 
been very successful in providing data which here- 
tofore were completely unavailable. Private groups 
as well as governments have been able to develop 
important bodies of data. For example, in the 
United States the Survey Research Center at the 
University of Michigan has developed an important 
body of information on consumer expenditures, 
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and the McGraw-Hill Publishing Company has de- 
veloped an interesting and useful survey of invest- 
ment plans of industrial firms. 

The development of electronic data processing 
techniques has permitted governments to make 
use of existing bodies of information in ways which 
were previously impossible because of the ex- 
tremely high cost of processing. The increased use 
of high-speed computers to process and edit data 
has meant that major bodies of data that are col- 
lected for other purposes, e.g., income taxes and 
social security, can be utilized for statistical pur- 
poses. Furthermore, bodies of information, such as 
census data, can be made available in a space of a 
few months rather than the several years which 
used to be required before final tabulations were 
available. 

Problems of measurement and reliability. There 
can be little doubt that the quality as well as the 
quantity of economic data has shown substantial 
improvement in recent years. In part this improve- 
ment has been due to increasing knowledge of 
sample design with resulting improvements in the 
ability to measure biases and to take into account 
errors in reporting. It is generally true that careful 
sampling in any given area can produce much- 
higher-quality data at very much lower cost than 
can complete reporting. The flexibility of sampling 
techniques permits the analysis of a large number 
of different factors in a way that would not be 
possible if complete and uniform reporting were 
required of each responding unit. 

Furthermore, the reliability of data is not deter- 
mined purely at the point of collection, Careful 
editing is very important if the final results are to 
be meaningful and consistent, and great strides 
have been made in recent years in this area, The 
computer is especially valuable in this context. It 
is now possible to carry out many internal checks 
within a body of data to reveal inconsistencies and 
to resolve such inconsistencies in an optimal man- 
ner, Perhaps even more important, however, is the 
increasing possibility of bringing together informa- 
tion from a number of different sources for com- 
parisons and consistency checking. For example, 
matching census data with income tax data pro- 
vides some overlapping information which, when 
compared on a case by-case basis, may reveal unex- 
pected weaknesses in one or both sets of data. 
Awareness of problems is the first step toward the 
improvement of the quality of data. 

The problem of reliability of data and the meas- 
urement of error in empirical work in economics 

is complex. In the simplest case the concept of 
sampling error is not too difficult to comprehend 
or measure. One is essentially asking how much 


variance can be expected to occur on repeated sam- 
pling of a given type of information. In point of 
fact, however, a given piece of economic data is 
never used in isolation; it is always related to other 
pieces of economic data, which also have sampling 
or reporting errors. The problem of measurement 
of error, therefore, comes down to the joint error 
distribution of the different pieces of data that are 
used in conjunction with each other. For example, 
data on the volume of consumer expenditures in 
a particular period may have a certain sampling 
error, but it does not follow that the error in the 
change in consumer expenditures from one period 
to another can be calculated on the basis of sam- 
pling errors. The reason for this is rather simple. 
The sampling errors in the consumer expenditure 
data in different periods may be expected to be 
somewhat correlated with each other, and to the 
extent that this is true the error in the change in 
consumer expenditures will be affected. It would 
be quite possible, for example, for the error in the 
level of consumer expenditures to be quite substan- 
tial; but as long as the error is relatively constant, 
the change in consumer expenditures may be quite 
accurate. 

Validity of measurement and operationalism. 
For economic data as a whole, the problem of the 
validity of measurement is often considerably more 
important than that of reliability. It is, of course, 
quite proper that economists approach empirical 
information with a framework that embodies pre- 
conceptions as to the nature of the data, To do 
otherwise would result in a chaotic mass of infor- 
mation without any principles for organizing the 
data. Nevertheless, the economists’ preconceptions 
often represent an oversimplification of the actual 
situation, 

In the field of price measurement, for example, 
economists reasoning from a static model envisage 
an economy where all transactions can be expressed 
as the product of a price and a quantity, and the 
quantity of output of the system as a whole can be 
measured in terms of a properly constructed index. 
This basic concept of price and output underlies 
our measurement of the gross national product in 
constant prices. Unfortunately, there are impedi- 
ments to making this simple theoretical model 
operational. Few products remain unchanged over 
time, and there is no satisfactory way to evaluate 
whether changes in price do or do not correspond 
to changes in quality. Furthermore, one of the most 
common features of economic development is that 
new products are developed to replace old products. 
Unfortunately, the newly developed products sel- 
dom bear a one-to-one relationship with the old 
products. Thus, central heating may replace heavy 


wool clothes, and telephone communications may 
replace travel and mail correspondence. These sub- 
stitutions are very subtle and cannot be perfectly 
measured, 

The statistician must make tremendously over- 
simplifying assumptions in order to make data 
about the real world fit into the theoretical eco- 
nomic models. For example, price statisticians 
usually assume implicitly that new products are 
exactly equivalent in utility and cost to the prod- 
ucts they replace and that old products improve 
in quality only in a few, rigidly specified ways. The 
problem with such a solution, however, is that the 
results of measurement in a given situation are 
often dictated by the assumptions that are made 
rather than by what has actually happened. In 
measuring output in such areas as construction and 
services, for instance, the statistician is often 
forced to substitute the measurement of inputs for 
the measurement of the output, thus, by definition, 
assuming that there is no change in productivity 
in these sectors. 

Similar problems have arisen in many other 
areas of economics where the statistician has at- 
tempted to implement concepts developed by the 
theorist. For example, in the development of the 
concept of capital, theorists have been vague as to 
the definition and meaning of real capital over 
time. Statisticians have dealt with the problem by 
adopting certain conventions, such as formal rates 
of obsolescence and depreciation, but in doing so 
they have in part determined by assumption the 
answer to the question of how much the growth 
in real capital contributes to the expansion of 
output. 

In view of these limitations, economics is still 
far from being able to answer such interesting 
questions as how much better off we are now than 
at some previous time or how much productivity in 
various sectors has advanced, But such questions, 
interesting as they are, may not be the most impor- 
tant or productive questions that might be asked. 
Perhaps it is more important to find out how the 
economy functions, what parts are growing relative 
to other parts, and how the change comes about. 
Questions of this sort are much more likely to be 
answered by the growing body of economic data 
now becoming available. It is much easier to de- 
scribe and analyze economic change than to eval- 
uate the welfare or productivity effects of such 
change, 


RICHARD RUGGLES 


[Directly related are the entries COMPUTATION; GOV- 
ERNMENT STATISTICS; INDEX NUMBERS; SAMPLE 
SURVEYS; SURVEY ANALYSIS.] 
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n 
THE SOVIET UNION AND EASTERN EUROPE 


Adequate collection of economic information, 
rapid transmission and processing of the data, and 
proper utilization of scientific procedures for draw- 
ing conclusions from the data are vital for any 
modern society—a fortiori, for centrally planned 
economies. The Soviet Union:and the communist- 
ruled east European countries are socialist planned 
economies, that is, economies whose “commanding 
heights” (banking, industry, transport, etc.) are in 
the hands of the state. From these commanding 
positions the communist leadership attempts to 
guide the main economic processes of each country 
within the framework of comprehensive plans that 
set specific goals for the economy as a whole and 
for its main sectors and branches. Resources are 
channeled by a variety of means toward fulfilling 
these goals, and performance of each production 
unit is systematically followed up as plan imple- 
mentation unfolds, 

Industrial and nonindustrial data. For plan- 
ning purposes a vast and comprehensive amount 
of information is needed. The statistical organiza- 
tion must reach deeply into the economic and 
social system and must have access to all the rele- 
vant data. Systematic collection of data on many 
facets of economic life, adequate condensation of 
information at various levels, feeding of the appro- 
priate information to planning and management 
organs, methodic follow-up of the operational as- 
pects of plan implementation, and manifold anal- 
yses of the developmental aspects of the economy 
are permanent tasks in continuous planning. These 
tasks raise complicated problems with respect to 
methods of aggregation and disaggregation of the 
data, accuracy in record keeping, comprehensive- 
ness and timeliness in reporting, effectiveness and 
speed in auditing and processing and, above all, 
boldness and freedom in analyzing the results 
obtained. 

In the Soviet Union and in each east European 
country, the backbone of statistical apparatus is 
the Central Statistical Administration (CSA)—in 
Yugoslavia, the Federal Institute of Statistics. The 
CSA handles all the statistical data, that collected 
by itself as well as that collected by all other agen- 
cies involved in statistical operations (banks, min- 
istries, etc.), The main stream of information 
flowing to the statistical offices comes from the 
producing enterprises. The producers report daily, 

weekly, monthly, etc., so-called operating—technical 
data, in which inputs and outputs are expressed 
in physical terms. A second basic stream of infor- 
mation, furnished in value terms, flows to the CSA 


from the producers and from the banking and finan- 
cial system, particularly the national and invest- 
ment banks and, in Yugoslavia, the Social Auditing 
and Accounting Agency. On the basis of the data 
processed by the CSA and the information col- 
lected by their own organs and consolidated at 
local, regional, or ministerial levels, the planning 
organs elaborate the operational (short-term) and 
“perspective” (long-term) plans within the frame- 
work laid down by the directives of the policy 
makers. 

There are avowedly numerous deficiencies in the 
producers’ input-output data used in preparing the 
planning statistics. The deficiencies are due pri- 
marily to the following: (a) performance is judged 
on the basis of reported data; (b) operating—tech- 
nical and value data are not well coordinated; 
(c) far too heavy demands are continuously placed 
on the producers for data on all aspects of their 
activity. 

Since plan fulfillment is evaluated and mana- 
gerial rewards are assigned on the basis of reported 
data, managers are tempted to manipulate and 
misreport statistics in a variety of ways. 

Further, the Soviet and east European enter- 
prises (excepting the Yugoslav) obey numerous, 
often conflicting, commands and have to keep a 
variety of overlapping and poorly integrated ac- 
counts: operating-technical records of physical 
flows; accounting records in value terms; control- 
ling records of plan fulfillment. The flow of mate- 
rials organized on the basis of specified norms of 
uses of inputs and outputs is not well coordinated 
with the value aspects of production either at the 
level of the plant or at higher levels. There are no 
unified balance sheets at the level of the enterprise 
on the basis of which the over-all situation could 
be easily and rapidly appraised, and there is lack 
of integration between recorded material flows and 
value indicators at the level of the economy as a 
whole. 

Finally, the enterprises are called upon to pro- 
duce an enormous volume of statistical reports, 
often overlapping and superfluous, in response to 
the urgent requests of statistical, banking, or plan- 
ning organs at local or regional levels. Swamped by 
innumerable forms which they must complete for 
a variety of purposes, the enterprises deliver a vast 
but low-quality output of data. The fight against 
paperwork (the Kampf gegen die Papierflut as 
the East Germans call it) has long been a con- 
tinuous but losing battle in all the east European 
economies except Yugoslavia. ` 

At the heart of almost all of these problems is 
the limited opportunity for decision making granted 
to the managers of producing enterprises, that is, 
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the centralization of economic management and 
decision making. The greater the volume of com- 
mands from above, the greater the volume of norms 
to be complied with from below; the broader the 
decentralization in decision making, the better the 
incentive system—as in Yugoslavia since the early 
1950s—and the smaller the needs for specifica- 
tions, norms, and continuous checking from above. 

The deficiencies of the nonindustrial statistics 
are even more glaring than the deficiencies of the 
industrial statistics. Agriculture, for instance, fur- 
nishes notoriously poor information because of its 
over-all low level of development, its type of organ- 
ization, its system of statistical reporting, and the 
poorly trained personnel used for record keeping, 
particularly on the collective farms. 

Social accounting. The principal responsibility 
of the CSA is to compile output indexes by branches, 
sectors, or regions, in physical or value terms, and 
feed them to the appropriate planning and man- 
agement organs. (The efforts of these organs de- 
pend in effect upon the completeness, rapidity, and 
accuracy of the data collected by the central statis- 
tical offices. ) Moreover, the CSA is charged with the 
elaboration of social accounts. The accounts deal 
with production, consumption, and investment, 
with transfers and “redistributions” of the social 
product, and with labor resources. Integration and 
aggregation of data are achieved by the CSA fol- 
lowing the Marxian definitions of production. Ac- 
cording to these definitions, production includes 
services directly connected with the processing and 
distribution of goods and excludes all other services. 

Publication. Only a relatively small portion of 
the information available to the CSA is published. 
What is published is still in some cases annoyingly 
incomplete, of questionable quality, or of little rele- 
vance for assessing the true performance of these 
economies. During the Stalin era—1928 to 1952— 
concealment, bizarre manipulations, and outright 
distortions became the trademarks of Soviet statis- 
tics. A cloak of secrecy was cast over most of the 
key domestic economic data. This secrecy has been 
appreciably lifted since the mid-1950s, but even 
now many economic aspects of the Stalin period 
for the Soviet Union, and of the early postwar 
decade—1945 to 1955—for all the area, remain 
unclear; further, some crucial data are still with- 
held. 

The Soviet and east European policy makers 
have a high stake in the performance of their 
planned economies. Hence, they are always sensi- 
tive about some troublesome indicators, such as the 
rate of growth of agricultural output or the rate of 
growth of the economy as a whole in certain years. 
Generally, however, ample statistical materials are 
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currently released on industrial outputs, manpower, 
investment, state budgets, etc. Much of this data is 
not readily comparable to the Western data. The 
differences are due not only to dissimilarities in 
underlying statistical methodologies and frames of 
reference but also to underlying discrepancies be- 
tween East and West concerning taxation and pric- 
ing systems. The Soviet-type tax system blurs the 
line between requited and unrequited factor re- 
turns, and administrative price manipulation raises 
complex problems concerning the valuation of 
output. The severe limitations in the nature of the 
underlying ruble prices and of the definitional 
frameworks used, along with the various other defi- 
ciencies mentioned, hamper the usefulness of Soviet 
social account and of Soviet income indexes. A sys- 
tematic effort has been made in the West to elimi- 
nate the distortions of the available Soviet statisti- 
cal materials and to recompute the Soviet national 
income series for the whole of the planning era 
opened in 1928. A similar effort of recomputation 
has been made for various east European countries. 
During the late 1950s and early 1960s, Soviet 
and east European statistical recording, processing, 
and publishing have increasingly improved. Early 
in 1945, the fledgling east European statistical ad- 
ministrations achieved the necessary shift in focus 
from the prewar type of social statistics to the type 
of economic statistics needed for planning pur- 
poses. After centering their attention on the com- 
pilation of industrial or agricultural inventories, 
statistical administrations made a serious effort to 
progress from the collection of “photographic” in- 
ventory-type data to the collection of “cinematic” 
production-type data that would be valuable for 
planning and forecasting. During the 1950s the 
CSAs solved in part the problems of following up 
the plan performance and of elaborating output 
measurements by branches, sectors, planning con- 
trols, etc. During that period the first complete 
postwar population censuses were published in 
eastern Europe. Since then, the energies of the sta- 
tistical machines have been turned toward the 
complex problems of social accounting. In some 
countries—for example, the Soviet Union, Yugo- 
slavia, Hungary, Poland, and East Germany—rel- 
atively successful steps have been taken toward 
the elaboration of input-output tables. The volume 
and quality of statistical information published 
have notably increased, and the publication of de- 
tailed and informative statistical yearbooks has 
been undertaken by all these countries. Statistical 
and economic journals are published regularly. 
Concepts and methods. All this activity has 
been highly dependent in the area as a whole, 
except Yugoslavia, on statistical concepts and 
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methods elaborated during the Stalin era. Severe 
doctrinaire limitations have existed in regard to the 
use of probability theory and to the use of mathe- 
matical methods in general in the social sciences. 
The key contention that dominated Soviet statis- 
tical thought under Stalin was that socioeconomic 
statistics form a special “social science” based on 
Marxian dialectics; the body of methods known as 
mathematical statistics based on probability and 
the law of large numbers—to which, incidentally, 
many great Soviet scientists contributed—was con- 
sidered applicable only to the natural sciences. 

Since 1960, however, sustained efforts have been 
made in all these countries to break away from the 
specious Stalinist concepts and methods. Mathe- 
matical methods are increasingly being introduced 
into economics and statistics. Throughout the area, 
there is growing interest in Western theories of 
decision making under uncertainty, Western tech- 
niques of sampling, planning, social accounting, 
and activity analysis, and Western concepts of 
efficiency and optimality. These influences will 
play a crucial role in the late 1960s and beyond, 
when Soviet and east European statistics will have 
to tackle new and more complex tasks, Hopefully, 
they will result in better training of statisticians, 
new points of view in statistical elaboration, and 
less high-handedness in the publication of data. 

Improvements in the general quality of the data 
will require several reforms and innovations, Re- 
porting demands on producing enterprises must be 
reduced, and the output indicators now compiled 
should be supplemented with quality indicators. 
Automating the collection and transmission of data 
would significantly increase speed and efficiency 
in the performance of these tasks, 

Developing an optimal system of unified ac- 
counting for economic planning and control is an 
ambitious task, Further progress toward this goal 
will necessitate a vast amount of research, not only 
in mathematics and statistics but also in eco- 
nomics, cybernetics, and sociology, Complex multi- 
staged programming models must be formulated, 
and methods must be developed to solve short-term 
and long-term planning control problems at various 
levels. To implement these models and methods, 
an integrated network of computational centers is 
indispensable. 

The prospects for improvement and progress 
also depend very much on reorganizations of the 
economies themselves. The volume of centralized 
commands will probably decrease as the market 
mechanism is increasingly relied on to determine 
resource allocation. This should lead to an over- 
hauling of incentive systems and should produce 


price systems that indicate relative scarcities and 
production costs of goods more accurately than 
present price systems. If such reorganizations take 
place, serious progress in the elaboration and dis- 
semination of economic data may be expected. 
Statistical theory and experimentation have a long 
tradition in these countries; however, the basic 
challenge now is not so much renewing ties with 
the past but successfully meeting the new tasks 
and problems that a developing economy poses to 
economists and statisticians. 

NICOLAS SPULBER 
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mI 
MAINLAND CHINA 


The communist regime was established on the 
Chinese mainland in October 1949. A substantial 
amount of economic statistics was published by 
the regime during the period 1949-1960. While 
poor in quality, the statistics for 1952-1957 could, 
with proper caution and careful evaluation, serve 
as a reference for understanding the performance 
of the economy. The data for the earlier years, 
1949-1951, are less reliable and those for the 
“great leap forward” years, 1958-1959, are ex- 
tremely untrustworthy. No significant economic 
statistics have been released by the regime since 
1960. 

There is no clear evidence of intentional falsifi- 
cation of statistics at the level of the central gov- 
ernment prior to 1958. The poor quality of the data 
was primarily due to the underdeveloped nature of 
a large sector of the economy. The collection of 
reliable statistics from hundreds of millions of old- 
fashioned producing units was inherently difficult; 
and the numerous drives to fulfill targets, often 
unreasonable even before the “great leap forward” 
in 1958, induced and compelled local units to file 
exaggerated reports of achievement. 

Development of a statistical network. There 
was no effective national statistical reporting sys- 
tem until the establishment of the State Statistical 
Bureau on August 8, 1952. By the end of 1953 
roughly 160 local statistical offices had been es- 
tablished at the provincial level for the relatively 
large cities. During 1955 and 1956 efforts were 
made to extend this network to smaller towns and 
rural units, but even as late as the end of 1957 
the difficult task of establishing perhaps a quarter 
of a million local statistical offices had not been 
completed. Without an effective organization to 
collect them, the data cannot be expected to be 
reliable (Li 1962). 

Major statistical undertakings completed from 
1952 to 1957 include the population census of 
1953, the unification of statistical reports on state- 
managed enterprises and construction in 1954, the 
1954 census of private industry and individual 
handicraftsmen, the 1955 census of private trade 
and restaurants, and the 1955 census of workers 
and employees in the socialist sectors. Reporting 
on agriculture had to be entrusted to the regular 
local government units. “Model surveys” of ques- 
tionable quality were carried out mainly for food 
Crops and cotton in 1953 and 1954, the choice of 
the sample being left to the judgment of the indi- 
vidual investigators. A survey of the income distri- 


ECONOMIC DATA: Mainland China 373 


bution of agricultural producer cooperatives was 
carried out in 1955 and 1956. Sample surveys of 
the incomes of farm households and of urban 
workers and employees were conducted in 1956. 
A study of the age and sex distribution and the 
educational and political background of workers 
and employees of all industrial, construction, and 
transportation enterprises, as of February 1957, 
was reported to have been made. 

The effort toward improving the statistical sys- 
tem was abruptly shattered by the “great leap for- 
ward” in 1958. The collection and reporting of sta- 
tistics were in effect taken over by the Communist 
party mechanism and were integrated into emula- 
tive drives to increase production. This led to the 
unprecedented statistical fiasco of 1958, from 
which the regime learned a sober lesson. A seri- 
ous effort was made in 1960 to rehabilitate and 
strengthen the statistical system. With this effort, 
however, the publication of national economic sta- 
tistics practically stopped. 

Important data for 1952-1957. For 1952-1957, 
a period of relatively good statistics, the best data 
are those on government budgets and such ho- 
mogeneous products as steel and electric power, 
produced by large industrial enterprises. Poor as 
it is, the population count of 1953 must be con- 
sidered better than the reports on agricultural and 
handicraft production. A substantial amount of re- 
tail trade was transacted by small private traders, 
for whom no reliable information was available. 
Estimates of national income had to be made with 
these basic data of varying degrees of unreliability. 
Only a brief review of some of the available data 
will be given. 

Population. Table 1 presents the U.S. Census 
Bureau’s summary of the regime’s 1953 population 
census. This census has been criticized by scholars 
both inside and outside mainland China. The 
census was not obtained by house-to-house can- 
vassing. Instead, heads of households were required 


Table 1 — Population of mainland China, by age and 
sex, June 30, 1953 


Age Male Female Total 
(In millions) 

All ages 302.1 280.5 582.6 
(In per cent) 

0-4 years 15.5 15.7 15.6 

5-14 years 21.0 19.7 20.3 

15-24 years 17.5 17.1 17.3 

25-34 years 15.0 14.2 14.6 

35—44 years 12.0 12.0 12.0 

45 years and over 19.0 21.3 20.2 


Source: Aird 1961, pp. 81, 83. 
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to report to registration stations set up for this 
purpose. Since the census was taken primarily for 
registering voters for the national election of 1953— 
1954, it is doubtful that political undesirables 
would have come to register. Inadequate prepara- 
tion and insufficient pilot surveys, the prolonged 
period of enumeration, the highly inaccurate “pen- 
cil stroke system of tallying,” the troublesome prob- 
lem of temporary residents (supposedly reported by 
families or neighbors at place of permanent resi- 
dence), and the tradition of underreporting of 
females must have resulted in substantial inac- 
curacy. The high degree of social control in Com- 
munist China, the one factor favorable to an 
accurate count, was not sufficient to counter- 
balance all the unfavorable factors. 

Further doubts were cast on the accuracy of the 
census on the basis of demographic analysis. Aird 
(1961) found a marked deficiency of persons 5 to 
14 years of age and an excess of persons 45 and 
over, The former, however, could have been the 
result of the Sino-Japanese and the nationalist— 
communist conflicts from 1937 to 1949, which 
were unprecedentedly violent even for China. The 
latter could have been the result of the relative 
absence of large-scale wars during the last two 
decades of the Ching dynasty. Aird also constructed 
theoretical models of population dynamics for 
mainland China and found the vital rates implicit 
in the age distribution of the reported 1953 popula- 
tion to be incompatible with the demographic 
history of China. Conversely, using the vital rates 
reported for the years near 1953, Aird could not 
derive the reported 1953 population. These anal- 
yses, while interesting, are inconclusive. It is doubt- 
ful that vital rates were constant in China during 
any reasonably long period of time, an assumption 
made in these analyses. The vital rates reported 
for the years near 1953 were not a part of the 1953 
census and are even less reliable than the 1953 
population count. Thus, they should not be used to 
invalidate the census itself. 

The 1953 census is certainly not comparable in 
accuracy to censuses of the more developed coun- 
tries. It was, nevertheless, collected by a systematic 
and comprehensive effort, however poorly designed. 
Comparable efforts have never been made to obtain 
Statistics for the output of agriculture and the small 
producing units in the underdeveloped sectors of 
industry and transportation or for the incomes and 
expenditures in these sectors. These other statistics 
must be considered inferior to the population data 
for 1953. 

Industrial production. Modeled on the Soviet 
concept of gross value of industrial production, the 


official data on this important item are aggregates 
of the gross value of production of the reporting 
enterprises. With some important exceptions, 
double counting was not permitted within an en- 
terprise; raw materials produced and used by the 
same enterprise in making its own output were 
excluded from its value of production. Purchases 
from other enterprises, however, were not excluded. 
It is known that managers of individual enter- 
prises went as far as purchasing one another's 
intermediate products in an attempt to increase 
their respective gross values of production and to 
fulfill or overfulfill quotas. 

Prices for the third quarter of 1952 were used 
as weights in computing the official gross values 
of production for years prior to 1957. Prices of 
consumer goods were depressed in the fall of 1952 
because of the “Five Anti” campaign against private 
enterprises (a movement outwardly directed against 
(1) bribery; (2) tax evasion; (3) theft of public 
assets; (4) fraud in fulfilling government con- 
tracts; and (5) theft of government information), 
but prices of producer goods were little affected. 
Thus, producer goods were overvalued relative to 
consumer goods, Since output of producer goods 
increased faster than output of consumer goods 
after 1952, the official rates of growth of industrial 
production were upward biased. The valuation of 
new products at trial-manufacturing expenses (de- 
flated by 1952 prices) also exaggerated the in- 
creasing trend. 


Table 2 — Indexes of industrial production of mainland 


China 
Year Official I" Official Il? Chao® Liv-Yeh® 
1952 100 100 100 100 
1953 132 134 125 128 
1954 154 157 142 149 
1955 166 169 147 161 
1956 217 243 182 225 
1957 241 268 196 248 


a. Gross volve of factory output. 

b. Net value added in factory production. 

€. Factory production. 

Sources: Official | index from China 1960, p. 16; 

Official I! index from Liu & Yeh 1965, 
p. 150; Chao index from Chao 1965, 
p. 88; Liv-Yeh index from Liv & Yeh 
1965, p. 66. 


Attempts were made by scholars in the United 
States to correct some of the biases. Since the con- 
cepts used in the different studies are not the 
same, only the rates of change are roughly com- 
parable, The different estimates are therefore 
presented as index numbers in Table 2. 

Agricultural production. There is a consensus 
among Western scholars that the official data on 


production of food crops underestimated output in 
earlier years, so the rate of growth for the 1952- 
1957 period was overstated (Eckstein 1962, p. 32; 
Hollister 1958, pp. 19, 29; Li 1959, p. 65; Liu & 
Yeh 1965, pp. 43-46; Wu et al. 1963, p. 185). The 
official data and the other estimates are given in 
Table 3. 


Table 3 — Food crops production of mainland China 
(millions of metric tons) 


Year Official liv-Yeh Wu 

1952 154 177 184 
1953 157 180 180 
1954 161 185 177 
1955 175 186 184 
1956 183 184 183 
1957 185 185 185 


Sources: Official data from China 1960, p. 119; 
Liu-Yeh data from Liu & Yeh 1965, p. 132; 
Wu data from Wu et al. 1963, p. 185. 


National income. The concept used by the 
regime in computing national income is the net 
material product, that is, the net value produced 
by agriculture, industry, construction, freight trans- 
portation, the part of communications serving 
other productive sectors, trade, and restaurants. 
Income originating in government administration 
(including armed forces), cultural and educational 
organizations, and personal services is excluded. 


Table 4 — National income of mainland China (billions 
of 1952 yuan*) 


Year Official Eckstein Hollister ui liv-Yeh Wu 

1952 61.13 71.26 67.86 72.9 71.41 72.4 
1953 70.04 77.06 80.2 75.33 75.4 
1954 73.88 81.92 86.6 79.28 78.3 
1955 78.80 85.41 93.8 82.30 82.0 
1956 88.75 97.21 107.3 92.08 91.7 
1957 93.53 102.42 111.8 95.34 94.8 


* Official rate of exchange: 1 U.S. dollar equals 2.343 yuan in 1952. For well- 
known reasons, the yuan figures should not be converted into U.S. dollars 
ot the official exchange rate for the purpose of comparing them with the 
national income of the United States or of other countries. 


Sources: Official data from Liv & Yeh 1965, p. 220; 
Eckstein data from Eckstein 1962, p. 56; 
Hollister data from Hollister 1958, p. 2; 
Li data from Li 1959, p. 106; Liv-Yeh 
data from Liv & Yeh 1965, p. 66; Wu data 
from Wu et al. 1963, p. 241. 


Scholars in the United States have attempted to 
construct independent estimates following standard 
Western concepts. Eckstein (1962) estimated the 
gross national product of 1952 by using the value- 
added approach for agriculture and the factor 
Shares method for other sectors. Hollister (1958) 
applied the final-expenditure method to estimate 
gross national product. Li (1959) adjusted the 
official estimate of net domestic product to derive 
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an estimate of net national product but cautioned 
that the implied rate of growth in his estimates 
was excessively high. Liu and Yeh (1965) used 
the value-added approach to estimate net domestic 
product. Wu and associates (1963) used the Liu- 
Yeh framework but modified the estimates in 
several respects. 

The official data and the independent estimates 
are presented in Table 4. Because of the different 
concepts of national income used, only the rates 
of growth can be even roughly compared. The 
average annual rates of growth for the 1952-1957 
period are as follows: official, 9.0 per cent; Hol- 
lister, 8.6 per cent; Li, 8.8 per cent (an over- 
estimate, according to Li); Liu and Yeh, 6.0 per 
cent; and Wu, 5.6 per cent. 

The 1958 fiasco and its aftermath. The produc- 
tion targets set for 1958 during the “great leap 
forward” were extremely ambitious and completely 
unrealistic. The statistical reporting system grad- 
ually developed since 1952 was cast aside, and 
the local cadres dutifully reported unprecedented 
but impossible achievements. In April 1959 the 
regime announced 100 per cent increases in the 
output of food crops, cotton, and steel in 1958 
alone. A drastic downward revision of the claims, 
however, was announced in August 1959, reducing 
the estimated production of food crops and cotton 
by one-third. The claimed increase in iron and 
steel production was also scaled down substantially 
but in a more subtle way. Official economic sta- 
tistics published for 1958 and 1959 (China 1960) 
are of much lower quality than those of 1952- 
1957, and the several independent Western esti- 
mates made for 1958 and later years can be 
considered only as conjectures. 

The disruptive economic policies undertaken 
during the “great leap forward,” together with 
severe floods and droughts in 1959 and 1960, re- 
sulted in a serious agricultural crisis and a sharp 
reduction in output in the ensuing years. The rest 
of the economy also suffered setbacks, resulting 
from the shrinking supply of raw materials from 
the agricultural sector, the withdrawal of Soviet 
technicians in 1960, and the generally lower 
morale and efficiency of workers because of food 
shortages. The almost complete blackout of eco- 
nomic statistics from the Chinese mainland since 
1960 makes it exceedingly difficult if not impos- 
sible to give any quantitative evaluation of the 
performance of the economy from 1960 to 1964. 
There are some indications that by 1964 the 1957 
level of production may have been regained in most 


sectors of the economy. 
Ta-CHuNG Liu 
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ECONOMIC EQUILIBRIUM 


There are perhaps two basic, though incompletely 
separable, aspects of the notion of general equi- 
librium as it has been used in economics: (1) the 
simple notion of determinateness—that is, the rela- 
tions that describe the economic system must form 
a system sufficiently complete to determine the 


values of its variables—and (2) the more specific 
notion that each relation represents a balance of 
forces. The last usually, though not always, is taken 
to mean that a violation of any one relation sets in 
motion forces tending to restore the balance (as 
will be seen below, this is not the same as the 
stability of the entire system). In a sense, there- 
fore, almost any attempt to give a theory of the 
whole economic system implies the acceptance of 
the first part of the equilibrium notion, and Adam 
Smith’s “invisible hand” is a poetic expression of 
the most fundamental of economic balance rela- 
tions, the equalization of rates of return, as en- 
forced by the tendency of factors to move from low 
to high returns. 

The notion of equilibrium (“equal weight,” re- 
ferring to the condition for balancing a lever piv- 
oted at its center) and with it the notion that the 
effects of a force may annihilate the force (for 
example, water finding its own level) had been 
familiar in mechanics long before the appearance 
of Smith’s Wealth of Nations in 1776, but there is 
no obvious evidence that Smith drew his ideas from 
any analogy with mechanics. Whatever the source 
of the concept, the notion that through the work- 
ings of an entire system effects may be very dif- 
ferent from, and even opposed to, intentions is 
surely the most important intellectual contribution 
that economic thought has made to the general 
understanding of social processes, 


1. History of the concept 

It can be maintained that Smith was a creator 
of general equilibrium theory, although the co- 
herence and consistency of his work can be ques- 
tioned. A fortiori, later systematic expositors of 
the classical system, such as Ricardo, J. S. Mill, 
and Marx, in whose work some of Smith’s logical 
gaps were filled, can all be regarded as early ex- 
positors of general equilibrium theory. Marx, in 
his scheme of simple reproduction, read in com- 
bination with his development of relative price 
theory in Volumes 1 and 3 of Das Kapital, has 
indeed come in some ways closer in form to mod- 
ern theory than any other classical economist, 
though of course everything is confused by his 
attempt to maintain simultaneously a pure labor 
theory of value and an equalization of rates of 
return on capital. The view that the classical econ- 
omists had a form of the general equilibrium 
principle is further bolstered by modern recon- 
structions (see, e.g., Samuelson 1957; 1959). 

There is, however, a very important sense in 
which none of the classical economists had a true 
general equilibrium theory: none had an explicit 


role for demand conditions. No doubt the more 
systematic thinkers, most particularly Mill and 
Cournot, gave verbal homage to the role of demand 
and the influence of prices on it, but there was no 
genuine integration of demand with the essentially 
supply-oriented nature of classical theory. The 
neglect of demand was facilitated by the special 
simplifying assumptions made about supply. A 
general equilibrium theory, from the modern point 
of view, is a theory about both the quantities and 
the prices of all economic goods and services. How- 
ever, the classical authors found that prices ap- 
peared to be determined by a system of relations 
not involving quantities, derived from the zero- 
profit condition. This is clear enough with fixed 
production coefficients and a single primary factor, 
labor, as in Smith’s famous exchange of deer and 
beaver, and it was the great accomplishment of 
Malthus and Ricardo to show that land could be 
brought into the system. If, finally, Malthusian as- 
sumptions about population implied that the supply 
price of labor was fixed in terms of goods, then 
even the price of capital could be determined 
(although the presence of capital as a productive 
factor and recipient of rewards was clearly an em- 
barrassment to the classical authors, as it remains 
to some extent today). 

Thus, in a certain definite sense the classicists 
had no true theory of resource allocation, since the 
influence of prices on quantities was not studied 
and the reciprocal influence was denied, But the 
classical theory could solve neither the logical prob- 
lem of explaining relative wages of heterogeneous 
types of labor nor the empirical problem of account- 
ing for wages that were rising steadily above the 
subsistence level. It is in this context that the neo- 
classical theories emerged, with all primary re- 
sources having the role that land alone had before. 

(In all fairness to the classical writers, it should 
be remarked that the theory of foreign trade, espe- 
cially in the form that Mill gave it, was a genuine 
general equilibrium theory. But, of course, the as- 
sumptions that were made, particularly factor 
immobility, were very restrictive. ) 

Cournot (1838), and after him Jenkin (1870) 
and the neoclassical economists, employed exten- 
sively the partial equilibrium analysis of a single 
market. In such an analysis, the demand and sup- 
ply of a single commodity are conceived of as func- 
tions of the price of that commodity alone; the 
equilibrium price is that for which demand and 
supply are equal. This form of analysis must be 
viewed either as a pedagogical device that takes 
advantage of the ease of graphical representation 
of one-variable relations or as a first approximation 
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to general equilibrium analysis in which repercus- 
sions through other markets are neglected. 

The contributions of Walras. The full recogni- 
tion of the general equilibrium concept can unmis- 
takably be attributed. to Walras (1874-1877), 
although many of the elements of the neoclassical 
system were worked out independently by W. Stan- 
ley Jevons and by Carl Menger. In Walras’ analysis, 
the economic system is made up of households and 
firms, Each household owns a set of resources, 
commodities useful in production or consumption, 
including different kinds of labor. For any given 
set of prices a household has an income from the 
sale of its resources, and with this income it can 
choose among all alternative bundles of consumers’ 
goods whose cost, at the given prices, does not ex- 
ceed its income. Thus, Walras saw the demand by 
households for any consumers’ good as a function 
of the prices of both consumers’ goods and re- 
sources. The firms were—at least in the earlier 
versions—assumed to be operating under fixed co- 
efficients. Then the demand for consumers’ goods 
determined the demand for resources; and the com- 
bined assumptions of fixed coefficients and zero 
profits for a competitive system implied relations 
between the prices of consumers’ goods and of re- 
sources. An equilibrium set of prices, then, was a 
set such that supply and demand were equated on 
each market; under the assumption of fixed coeffi- 
cients of production, or more generally of constant 
returns to scale, this amounted to equating supply 
and demand on the resource markets, with prices 
constrained to satisfy the zero-profit conditions for 
firms. Subsequent work of Walras, J. B. Clark, 
Wicksteed, and others generalized the assumptions 
about production to include alternative methods of 
production, as expressed in a production function. 
In this context, the prices of resources were deter- 
mined by marginal productivity considerations. 
[See PRODUCTION. ] 

That there existed an equilibrium set of prices 
was argued from the equality of the number of 
prices to be determined with the number of equa- 
tions expressing the equality of supply and demand 
on various markets. Both are equal to the number 
of commodities, say n. In this counting, Walras 
recognized that there were two offsetting compli- 
cations: (1) Only relative prices affected the be- 
havior of households and firms, hence the system 
of equations had only n — 1 variables, a point which 
Walras expressed by selecting one commodity to 
serve as numéraire, with the prices of all commod- 
ities being measured relative to it. (2) The budget- 
ary balance of each household between income and 
the value of consumption and the zero-profit con- 
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dition for firms together imply what has come to 
be known as Walras’ law, that the market value of 
supply equals that of demand for any set of prices, 
not merely the equilibrium set; hence, the supply— 
demand relations are not independent. If supply 
equals demand on n — 1 markets, then the equality 
must hold on the nth. 

Walras wished to go further and discussed the 
stability of equilibrium essentially for the first time 
(that is, apart from some brief discussions by Mill 
in the context of foreign trade) in his famous but 
rather clumsy theory of tétonnements (literally, 
“gropings” or “tentative proceedings”). He starts by 
supposing a set of prices fixed arbitrarily; then sup- 
ply may exceed demand on some markets and fall 
below on others (unless the initial set is in fact the 
equilibrium set, there must, by Walras’ law, be at 
least one case of each). Suppose the markets are 
considered in some definite order. On the first mar- 
ket, adjust the price so that supply and demand 
are equal, given all other prices; this will normally 
require raising the price if demand initially ex- 
ceeded supply, decreasing it in the opposite case. 
The change in the first price will, of course, change 
supply and demand on all other markets. Repeat 
the process with the second and subsequent mar- 
kets. At the end of one round, the last market will 
be in equilibrium, but none of the others need be, 
since the adjustments on subsequent markets will 
destroy the equilibrium achieved on any one. How- 
ever, Walras argues, the supply and demand func- 
tions for any given commodity will be affected more 
by the changes in its own price than by the changes 
in other prices; hence, after one round the markets 
should be more nearly in equilibrium than they 
were to begin with, and with successive rounds the 
supplies and demands on all markets will tend to 
equality. 

It seems clear that Walras did not literally sup- 
pose that the markets came into equilibrium in 
some definite order, Rather, the story was a con- 
venient way of showing how the market system 
could in fact solve the system of equilibrium rela- 
tions. The dynamic system, more properly ex- 
pressed, asserted that the price on any market rose 
when demand exceeded supply and fell in the oppo- 
site case; the price changes on the different markets 
were to be thought of as occurring simultaneously. 

Finally, Walras had a still higher aim for gen- 
eral equilibrium analysis: to study what is now 
called comparative statics, that is, the laws by 
which the equilibrium prices and quantities vary 

with the underlying data (resources, production 
conditions, or utility functions). But little was actu- 


ally done in this direction. [See the biography of 
WALRAS.] 

Important contributions were made by Walras’ 
contemporaries, Edgeworth, Pareto, and Irving 
Fisher. One contribution perhaps calls for special 
mention, since it has again become the subject of 
significant research, the contract curve (Edgeworth 
1881), known in modern terminology as the core 
(see section 3 below and Debreu & Scarf 1963). 

Developments during and after the 1930s. The 
next truly major advances did not come until the 
1930s. There were two distinct streams of thought, 
one beginning in German-language literature and 
dealing primarily with the existence and unique- 
ness of equilibrium, the other primarily in English 
and dealing with stability and comparative statics. 
The former started with a thorough examination 
of Cassel’s simplification (1918) of Walras’ sys- 
tem, an interesting case of work which had no 
significance in itself and yet whose study turned 
out to be extraordinarily fruitful. Cassel assumed 
two kinds of goods: commodities which entered 
into the demand functions of consumers, and fac- 
tors which were used to produce commodities 
(intermediate goods were not considered), Each 
commodity was produced from factors with con- 
stant input-output coefficients, Factor supplies were 
supposed totally inelastic. Let a;; be the amount of 
factor i used in the production of one unit of com- 
modity j; x; the total output of commodity j; v: the 
total initial supply of factor i; p; the price of com- 
modity j; and r; the price of factor i. Then the 
condition that demand equal supply for all factors 
reads 


(1) D auk =U, for all i, 
i 
while the condition that each commodity be pro- 


duced with zero profits reads 


(2) Lair =P, for all j. 
The system is completed by the equations relating 
the demand for commodities to their prices and to 
total income from the sale of factors. In total, there 
are as many equations as unknowns, But three 
virtually simultaneous papers by Zeuthen (1932), 
Neisser (1932), and von Stackelberg (1933) 
showed in different ways that the problem of exist- 
ence of meaningful equilibrium was deeper than 
equality of equations and unknowns. Neisser noted 
that even with perfectly plausible values of the 
input-output coefficients, a;;, the prices or quan- 
tities which satisfied (1) and (2) might well be 
negative, Von Stackelberg noted that (1) consti- 


tuted a complete system of equations in the outputs, 
x), since the factor supplies, v;, were data; but 
nothing had been assumed about the numbers of 
distinct factors or distinct commodities. If, in par- 
ticular, the number of commodities were less than 
that of factors, equations (1) would in general have 
no solutions. 

Zeuthen reconsidered the meaning of equations 
(1). He noted that economists, at least since Carl 
Menger, had recognized that some factors (for 
example, air) were so abundant that there would 
be no price charged for them. These would not 
enter into the list of factors in Cassel’s system. But, 
Zeuthen argued, the division of factors into free 
and scarce should not be taken as given a priori. 
Hence, all that can be said is that the use of a 
factor should not exceed its supply; but if it falls 
short, then the factor is free. In symbols, (1) is 
replaced by 


(GG) Laijx; SU; 
i 


if the strict inequality holds, then r; = 0. To a later 
generation of economists, to whom linear program- 
ming and its generalizations are familiar, the cru- 
cial significance of this step will need no elabora- 
tion; equalities are replaced by inequalities, and the 
vital notion of the complementary slackness of 
quantities and prices is introduced. [See PROGRAM- 
MING. ] 

Independently of Zeuthen, the Viennese banker 
and amateur economist K. Schlesinger came to the 
same conclusion, But he went much further and 
intuitively grasped the essential point that replace- 
ment of equalities by inequalities also resolved the 
problems raised by Neisser and von Stackelberg. 
Schlesinger realized the mathematical complexity 
of a rigorous treatment, and, at his request, Oskar 
Morgenstern put him in touch with a young mathe- 
matician, Abraham Wald. The result was the first 
rigorous analysis of general competitive equilib- 
rium, In a series of papers (see Wald 1936 for a 
summary), Wald demonstrated the existence of 
competitive equilibrium in a series of alternative 
models, including the Cassel model and a model of 
pure exchange. Competitive equilibrium was de- 
fined in the Zeuthen sense, and the essential role 
of that definition in the justification of existence is 
made clear in the mathematics. Wald also initiated 
the study of uniqueness. Indeed, both of his alter- 
native sufficient conditions for the uniqueness of 
competitive equilibrium have since become major 
themes of the literature: (1) that the weak axiom 
of revealed preference hold for the market demand 
functions (the sums of the demand functions of 
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all individuals), or (2) that all commodities be 
gross substitutes (see definitions below). 

Wald’s papers were of forbidding mathematical 
depth, not only in the use of sophisticated tools but 
also in the complexity of the argument. As they 
gradually came to be known among mathematical 
economists, they probably served as much to in- 
hibit further research by their difficulty as to stim- 
ulate it. Help finally come from the development of 
a related line of research, John von Neumann's 
theory of games (first basic paper published in 
1928; see von Neumann & Morgenstern 1944). 
The historical relation between game theory and 
economic equilibrium theory is paradoxical, In 
principle, game theory is a very general notion of 
equilibrium which should either replace the prin- 
ciple of competitive equilibrium or include it as a 
special case. In fact, while game theory has turned 
out to be extraordinarily stimulating to equilibrium 
theory, it has been through the use of mathematical 
tools developed in the former and used in the latter 
with entirely different interpretations. It was von 
Neumann himself who made the first such appli- 
cation in his celebrated paper on balanced eco- 
nomic growth (1937). In this model, there were 
no demand functions, only production. The markets 
had to be in equilibrium in the Zeuthen sense. But 
beyond this, there was equilibrium in a second 
sense, which may be termed stationary equilibrium 
(see section 8 below). To prove the existence of 
equilibrium, von Neumann demonstrated that a 
certain ratio of bilinear forms has a saddle point, 
a generalization of the theorem which shows the 
existence of equilibrium in two-person zero-sum 
games. But in game theory the variables of the 
problem are probabilities (of choosing alternative 
strategies), while in the application to equilibrium 
theory one set of variables is prices and the other 
is the levels at which productive activities are 
carried on. 

Von Neumann deduced his saddle-point theorem 
from a generalization of Brouwer's fixed-point the- 
orem, a famous proposition in the branch of mathe- 
matics known as topology. A simplified version of 
von Neumann’s theorem was presented a few years 
later by the mathematician Shizuo Kakutani, and 
Kakutani’s theorem has been the basic tool in vir- 
tually all subsequent work. With this foundation, 
and the influence of the rapid development of linear 
programming on both the mathematical—again 
closely related to saddle-point theorems—and eco- 
nomic sides (the work of George B. Dantzig, Albert 
W. Tucker, Harold W, Kuhn, Tjalling C. Koopmans, 
and others, collected for the most part in an influ- 
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ential volume [Cowles Commission . . . 1951]) 
and the work of John F. Nash, Jr. (1950), it was 
perceived independently by a number of scholars 
that existence theorems of greater simplicity and 
generality than Wald’s were possible. The first 
papers were those of McKenzie (1954) and Arrow 
and Debreu (1954). Subsequent developments were 
due to Hukukane Nikaid6, Hirofumi Uzawa, Debreu, 
and McKenzie. (The most complete systematic ac- 
count of the existence conditions is in Debreu 
1959; the most general version of the existence 
theorem is in Debreu 1962.) 

Independently of this development of the exist- 
ence conditions for equilibrium, the Anglo—-Amer- 
ican literature contained an intensive study of the 
comparative statics and stability of general com- 
petitive equilibrium. Historically, it was closely 
related to analyses of the second-order conditions 
for maximization of profits by firms and of utility 
by consumers; the most important contributors 
were John R. Hicks, Harold Hotelling, Paul Sam- 
uelson, and R. G. D. Allen. In particular, Hicks 
(1939) introduced the argument that the stability 
of equilibrium carried with it some implications for 
the shapes of the supply and demand functions in 
the neighborhood of equilibrium; hence, the effects 
of small shifts in any one behavior relation may be 
predicted, at least as to sign, Hicks’s definition of 
stability has been replaced in subsequent work by 
Samuelson’s (1941-1942); however, Hicks showed 
that (locally) stability in his sense was equivalent 
to a condition which has played a considerable role 
in subsequent research, Let X; be the excess de- 
mand (demand less supply) for the ith commodity; 
it is in general a function of p, , ++- , Pa, the prices 
of all n commodities. Then Hicks’s definition of 
stability was equivalent to the condition that the 
principal minors of the matrix whose elements 
were 9X;/dp; had determinants which were posi- 
tive or negative according as the number of rows 
or columns included was even or odd. Such matrices 
will be referred to as Hicksian. Hicks also sought 
to derive comparative-statics conclusions about the 
response of prices to changes in demand functions. 
Presently accepted theorems derive the same con- 
clusions but from somewhat different premises 
(see section 6). 

Samuelson formulated the presently accepted 
definition of stability. It must be based, he argued, 
on an explicit dynamic model concerning the be- 
havior of prices when the system is out of equi- 
librium. He formalized the implicit assumption of 
Walras and most of his successors: the price of 
each commodity increases at a rate proportional to 
excess demand for that commodity. This assump- 


tion defined a system of differential equations; if 
every path satisfying the system and starting suffi- 
ciently close to equilibrium converged to it, then 
the system was stable. Samuelson was able to dem- 
onstrate that Hicks’s definition was neither neces- 
sary nor sufficient for his and that the economic 
system was stable if the income effects on con- 
sumption were sufficiently small. He enunciated a 
general correspondence principle: that all meaning- 
ful theorems in comparative statics were derived 
either from the second-order conditions on maxi- 
mization of profits by firms or of utility by con- 
sumers or from the assumption that the observed 
equilibrium was stable. 

The current period of work in comparative statics 
and stability dates from the work of Mosak (1944) 
and Metzler (1945). The emphasis has tended to 
be a little different from Samuelson’s correspond- 
ence principle; rather, the tendency has been to 
formulate hypotheses about the excess demand 
functions which imply both stability and certain 
results in comparative statics. [See the biographies 
of CASSEL; SCHLESINGER; VON NEUMANN; and 
WALD.] 


2. The existence of competitive equilibrium 


Consider a system with n commodities, with 
prices p,,--+ ,p,, respectively. Let us first suppose 
that at each set of prices, each economic agent 
(firm or household) has a single chosen demand 
or supply. If supplies are treated as negative de- 
mands, then for each commodity the net total 
excess demand (excess of demand over supply) by 
all economic agents is obtained by summing the 
excess demands for the individual agents and this 
is a function of all prices; let Xi(p.,+** , Pn) be the 
excess demand function for the ith commodity. At 
an equilibrium, excess demand cannot be positive 
since there is no way of meeting it. Further, if the 
excess demand is negative (that is, there is an 
excess of supply over demand), the good is free 
and should have a zero price. Formally, a set of 
nonnegative prices f,,--+ , Pa constitutes an equt- 


librium if 
(3)  Xi(fr, +++, Pn) <0 for all i, 
(4) pi =0 for all i such that 

Xi(Pr, °°, Bn) <O 


With this definition, the following assumptions are 
sufficient for the existence of equilibrium: 


(H) The functions Xi(p:,:*+, Pn) are (posi- 
tively) homogeneous of degree zero, that 
is, Xi(ApPi, +++, Apa) = Xi (Pis t> Pn) for 
all A > 0 and all pı, +t, Pn- 


(W) SipiXi(p.,+++, Pa) = 0 for all sets of prices. 


(C) The functions X;(p,,---, pn) are continuous. 


(B) The functions X;(p,,---, pa) are bounded 
from below (that is, supply is always 
limited ). 


Assumption (H) is standard in consumers’ de- 
mand theory; (W) is Walras’ law referred to above. 
Assumption (C) is the type usually made in any 
applied work, although, as will be seen later, it is 
closely related to assumptions of convexity of pref- 
erences and production. Assumption (B) is trivially 
valid in a pure exchange economy, since each indi- 
vidual has a finite stock of goods to trade. In an 
economy where production takes place, the matter 
is less clear. At an arbitrarily given set of prices, a 
producer may find it profitable to offer an infinite 
supply; the realization of his plans will, of course, 
require him to demand at the same time an infinite 
amount of some factor of production. Such situa- 
tions are of course incompatible with equilibrium, 
but since the existence of equilibrium is itself in 
question here, the analysis is necessarily delicate. 

The current proofs that the assumptions listed 
above imply the existence of competitive equilib- 
rium require the use of Brouwer’s fixed-point theo- 
rem, a mathematical theorem which asserts that a 
continuous transformation of a triangle or similar 
figure in higher-dimensional spaces into itself must 
leave at least one point unaltered. The argument 
may be sketched as follows: From (H), an equi- 
librium is unaltered if all prices are multiplied by 
the same positive constant; hence, without loss of 
generality we can assume that the sum of the prices 
is one, The set of all price vectors with nonnegative 
components summing to one is clearly a general- 
ized triangle (technically called a simplex). For 
each set of prices, compute the excess demands 
(positive or negative) and then form a new price 
vector in which those components with positive ex- 
cess demands are increased and the others de- 
creased (but not below zero). These new prices are 
then adjusted so that the sum is again one. This 
process defines a continuous transformation of the 
simplex into itself and thus has a fixed point, a 
price vector which remains unaltered under the 
adjustment process. It is easy to see that this price 
vector must be an equilibrium. 

The point of view just sketched is not sufficiently 
general for most purposes, We have already seen 
that the boundedness assumption appears artificial 
in the case where production is possible. Closely 
related to this is a second issue; the assumption 
that supplies and demands are single-valued ap- 
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pears unduly restrictive, Consider the simplest case 
of production: one input and one output which is 
proportional to the input. The behavior of the profit- 
maximizing entrepreneur depends on the ratio of 
the input price to the output price. If the price ratio 
is greater than the output—input ratio, then the firm 
will lose money if it engages in any production; 
hence, the profit-maximizing point is zero output 
and zero input, which is indeed single-valued. If 
the two ratios are equal, however, all output levels 
make zero profit; hence, the profit-maximizing entre- 
preneur is indifferent among them, and the supply 
function of the output and the demand function for 
the input must be taken to be multivalued. If the 
price ratio is less than the output—input ratio, then 
the entrepreneur will make increasing profits as he 
increases the scale of activity. There is no finite 
level which could be described as profit-maximizing. 

To state a general definition of competitive equi- 
librium more precisely, the following model can be 
formulated: there are presumed to exist a set of 
households and a set of firms; all production is car- 
ried on in the latter. Each household has a collec- 
tion of initial assets (here assumed to include the 
ability to supply different kinds of labor) and also 
a claim to a given portion (possibly zero, of course) 
of the profits of each firm; it is assumed that for 
each firm there are claims for exactly the entire 
profits (these claims are interpretable as equities 
or partnership shares). For given prices and given 
production decisions of the firms, the profits of the 
firms and the value of each individual's initial 
assets are determined and, hence, so is the indi- 
vidual's total income. The commodity bundles avail- 
able are those whose value, at the given prices, 
does not exceed income and whose individual com- 
ponents are nonnegative (or satisfy some still 
stronger condition independent of prices). It is 
further assumed that the household can express 
preferences among commodity bundles and that 
these preferences have suitable continuity proper- 
ties. The aim of the household is taken to be selec- 
tion of the most preferred bundle among those 
available. 

The behavior of the firms is more simply de- 
scribed. Each firm has available to it a set of pos- 
sible production bundles; conventionally, the com- 
ponents are taken to be positive for outputs and 
negative for inputs. For a fixed set of prices, the 
profits for each possible production bundle are de- 
termined; then the firm chooses the (or a) bundle 
which maximizes profits. Notice that the profit- 
maximizing bundle need not be unique, Indeed, 
under constant returns to scale, it is never unique 
unless the firm’s best policy is to shut down com- 
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pletely. (Under constant returns, if any bundle is 
possible, the bundle obtained by doubling all com- 
ponents, inputs and outputs alike, is also possible. 
Then if any bundle makes positive profits, doubling 
the bundle will double the profits; hence, there can 
be no profit-maximizing bundle. The existence of a 
profit-maximizing bundle thus entails that maxi- 
mum profits be nonpositive. Since zero profits are 
always possible by zero activity level, we must have 
zero profits at the maximum. Either profits are nega- 
tive for all nonzero bundles, in which case shutting 
down is the unique optimal policy, or profits are 
zero for some nonzero bundle, in which case any 
nonnegative multiple also achieves zero and there- 
fore maximum profits. ) 

In this terminology, a “firm” may actually be 
producing zero output and therefore may not be 
statistically observable; the firm is potentially capa- 
ble of producing outputs and would find it profit- 
able to do so at different sets of prices than those 
prevailing. 

A competitive equilibrium, then, is a designation 
of nonnegative prices for all commodities, of a 
bundle for consumption for each household, and 
of a production bundle for each firm satisfying the 
following conditions: 

(a) for each household, the designated bundle 
maximizes utility among all available bundles; 

(b) for each firm, the designated bundle maxi- 
mizes profit among all technically possible bundles; 

(c) for each commodity, the total consumed by 
all households does not exceed the total initially 
available plus the net total produced by all firms 
(“net” here means that input uses by some firms 
are subtracted from outputs of others); 

(d) for those commodities for which total con- 
sumed is strictly less than total initially available 
plus total produced, the price is zero. 

The adjective “competitive” refers to the assump- 
tion that each household or firm takes prices as 
given and independent of its decision. 

The following assumptions are sufficient to en- 
sure the existence of competitive equilibrium: 

(1) The preference ordering of each household 
is continuous (a strict preference between two 
bundles continues to hold if either is slightly 
altered), admits of no saturation (for each bundle, 
there is another preferred to it), and is convex (if 
a bundle is varied along a line segment in the com- 
modity space, one of the end points is least pre- 
ferred). 

(11) The set of possible production bundles for 
each firm is convex (any weighted combination of 
two possible production bundles is possible) and 
closed (any bundle that can be approximated by 


possible bundles is itself possible); further, it is 
always possible to produce no outputs and use no 
inputs. 

(m) No production bundle possible to society 
as a whole (a bundle is possible for society as a 
whole if it is the algebraic sum of production bun- 
dles, one chosen among those possible for each 
firm) can contain outputs but not inputs; there is 
at least one bundle possible for society that con- 
tains a positive net output of all commodities not 
possessed initially by any household, 

(1v) The economy is irreducible—a concept de- 
veloped by Gale (1960) and McKenzie (1959 )—in 
the sense that no matter how the households are di- 
vided into two groups, an increase in the initial 
assets held by the members of one group can be 
used to make feasible an allocation which will 
make no one worse off and at least one individual 
in the second group better off. 

It is perhaps interesting to observe that “atom- 
istic” assumptions concerning individual house- 
holds and firms are not sufficient to establish the 
existence of equilibrium; “global” assumptions m 
and Iv are also needed (though they are surely 
unexceptionable). Thus, a limit is set to the tend- 
ency implicit in price theory, particularly in its 
mathematical versions, to deduce all properties of 
aggregate behavior from assumptions about indi- 
vidual economic agents. 

The hypotheses of convexity in household pref- 
erences and in production are the empirically most 
vulnerable parts of the above assumptions. In pro- 
duction, convexity excludes indivisibilities or in- 
creasing returns to scale. In consumption, convexity 
excludes cases in which mixed bundles are inferior 
to extremes; for example, in the very short run, a 
mixture of whiskey and gin is regarded by many 
as inferior to either alone, or living part time in 
each of two distant cities may be inferior to living 
in either one alone. It is of interest to know how 
far these assumptions may be relaxed. 

Convexity does play an essential role in the 
proof, This may be illustrated by considering the 
simplest case—one input and one output propor- 
tional to it. As noted earlier, there will be one ratio 
of output to input prices at which the entrepreneur 
will be indifferent among all levels of output. If the 
supply of the input is given, then the equilibrium 
levels of input and output, as well as price, are de- 
termined. Now suppose that production is possible 
only at integer-valued levels of input, so that the 
production possibility set is not convex. If the sup- 
ply of the input is not an integer, there is no way 
of equating demand and supply for it. It should be 
noted, though, that the input (and output) level 


can be so chosen that the difference between sup- 
ply and demand does not exceed one half. In effect, 
convexity ensures that supply and demand are, in 
a suitable sense, continuous and thus can be ad- 
justed to varying levels of initial assets. 

The assumption of convexity cannot be dis- 
pensed with in general theorems concerning the 
existence of equilibrium strictly defined. However, 
aline of thought begun by Farrell (1959) and de- 
veloped by Rothenberg (1960) and Aumann (1966) 
suggests that the gap between supply and demand 
does not tend to increase with the size of the 
economy. More explicitly, if each agent (household 
or firm) is small compared with the total economy, 
then by suitable choice of prices and of consump- 
tion and production bundles, the discrepancy be- 
tween supply and demand can be made small rela- 
tive to the economy. Each household is certainly 
small relative to the economy, so that nonconvexi- 
ties in individual preferences have no significant 
effect on the existence of equilibrium. However, in- 
creasing returns to scale over a sufficiently wide 
range may mean that a competitive profit-maximiz- 
ing firm will be large at a given set of prices if it 
produces at all; hence, there is a real possibility 
that equilibrium may not exist. 


3. Optimality and the core 


Although the view that competitive equilibria 
have some special optimality properties is at least 
as old as Adam Smith's invisible hand, a clarifica- 
tion of the relation is fairly recent. Since the sub- 
ject belongs to the field of welfare economics, only 
a brief statement is given here. An allocation 
(designation of bundles for all households and all 
firms) is feasible if each bundle is possible for the 
corresponding agent and if, in the aggregate, the 
net output of each commodity (including quan- 
tities initially available) is at least as great as the 
demand by consumers. Each allocation, then, de- 
termines the utility level of the consumption of 
each household, One allocation is dominated by a 
Second if the latter is feasible and if each indi- 
Vidual has a higher utility under the second than 
Under the first (more frequently, in the literature, 
the condition is put in the more complicated form 
of having each individual at least as well off and 
one individual better off, but the difference is 
trifling). Then an allocation is said to be optimal 
if it is feasible but not dominated by any other (a 

lefinition due to Pareto). 

‘Here are two theorems relating competitive 
®quilibrium and optimal allocations, one concern- 
Re Sufficiency and the other concerning necessity; 

e two have not always been distinguished in the 
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literature. They are stated here without some minor 
qualifications. 

Sufficiency. Any competitive equilibrium is nec- 
essarily optimal, 

Necessity. Given any optimal allocation, there 
is some assignment of society's initial assets among 
individuals such that the optimal allocation is a 
competitive equilibrium corresponding to that dis- 
tribution, provided that the assumptions of section 
2 which ensure the existence of equilibrium hold. 

It is useful to note that the sufficiency theorem 
is valid even if the assumptions of section 2 do 
not hold. 

The concept of optimality is defined without re- 
gard to a price system or any prescribed set of 
markets. The optimality theorems assert that even 
though prices do not enter into the definition of 
optimality, there happens to be an identity between 
optima and competitive equilibria (under suitable 
conditions). This relation has been brought into 
still sharper relief with the modern theory of the 
core (Debreu & Scarf 1963), which also, however, 
serves to emphasize the special role of large num- 
bers of economic agents in the theory of perfect 
competition. 

We start with essentially the same model of the 
production and consumption structure as in section 
2, deleting, of course, all references to prices and 
to income. However, the analyses of the core have 
so far made one significant restriction on the rela- 
tion between producers and consumers. It is as- 
sumed that any coalition of households has access 
to the same set of possible production vectors, 
which is further assumed to display constant re- 
turns to scale. Consider now any feasible allocation. 
It is said to be blocked by a coalition S (a set of 
households) if there is another allocation among 
the members of S feasible for them (using only the 
assets they collectively started with) which makes 
each of them better off. Notice that the coalition 
might consist of all households in the society; for 
that coalition, blocking reduces to domination in 
the sense used earlier, A coalition might also con- 
sist of one individual; then he can block an alloca- 
tion if, with only his own resources and the uni- 
versally accessible technological knowledge, he can 
produce a bundle whose utility is higher than that 
of the bundle allocated to him. 

The core, then, consists of all allocations that 
are not blocked by any coalition. The first theorem 
concerning the core generalizes the sufficiency 
theorem for optimality: Any competitive equilib- 
rium belongs to the core. More interesting is a sort 
of converse proposition, which may be loosely 
stated as follows: If the hypotheses of section 2 
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which ensure the existence of equilibrium hold, and 
if each individual is small compared with the 
economy, then the allocations in the core are all 
approximately competitive equilibria. (The words 
“small” and “approximately” are rigorously inter- 
preted as referring to suitably chosen limiting 
processes. ) 

Some interpretive remarks are in order here. First, 
the natural interpretation of the core is that if 
any sort of bargains are permitted by the rules of 
the economic game, the allocation finally arrived 
at should be in the core, since otherwise some 
coalition would have both the power and the desire 
to prevent it. Hence, it would follow, very strikingly, 
that for large numbers of participants the outcome 
would be the competitive equilibrium, provided the 
assumptions of section 2 were satisfied. Even under 
nonconvexity some scattered results of L. S. Shap- 
ley and M. Shubik (unpublished) suggest that the 
same holds approximately (that is, there may be 
no core in the precise definition, but there is a set 
of allocations that can be blocked but only with 
very small preference on the part of the blocking 
coalition). Hence, the existence of monopoly must 
depend on one of the three factors: (1) specialized 
abilities that are scarce relative to the economy; 
(2) increasing returns on a scale comparable to 
that of the economy; or (3) costs of coalition 
formation that are relatively low among producers 
of the same good and high for coalitions involving 
consumers and producers. 

Second, the assumption that the production pos- 
sibilities are the same for all coalitions is one that 
has been used by McKenzie and, with suitable in- 
terpretation, is not as drastic as it seems, We can 
assume that some or all productive processes re- 
quire as inputs “entrepreneurial skills” or special 
talents of some kind. The commodity space is en- 
larged to include these skills, which may be dis- 
tributed very unequally in the population. It can 
then be argued that diminishing returns to scale in 
the observable variables really results from a com- 
bination of constant returns in all variables, includ- 
ing entrepreneurial skills and a fixed supply of the 
latter. Further, different coalitions will really have 
very different access to production possibilities be- 
cause of their very different endowments of skills. 
[See WELFARE Economics.] 


4. Uniqueness of competitive equilibrium 
Results concerning the uniqueness and stability 
of competitive equilibrium have been stated only 
for the case where the excess demand functions are 
single-valued, as at the beginning of section 2. It 
will then be assumed that assumptions (H), (W), 


and (B) hold; in fact, (C) will be strengthened to 
require differentiability of the excess demand func- 
tions. 

Without further assumptions, there is no need 
that equilibrium be unique, and examples of non- 
uniqueness have been known since Marshall. The 
mathematical basis for a fairly general uniqueness 
theorem has only recently been worked out by Gale 
and Nikaidô (1965), and the most appropriate 
economic theorem has not been fully explored. 
However, one theorem along these lines can be 
stated. Suppose there is one commodity for which 
the excess demand is positive whenever its price is 
zero, regardless of the prices of all other com- 
modities. Such a commodity is an eminent candi- 
date for Walras’ role of numéraire, and we may 
choose its price to be 1, since it cannot be zero (and 
therefore will be positive) in any equilibrium, and, 
from the homogeneity of excess demand functions, 
multiplying any equilibrium set of prices by a posi- 
tive number leads to a new equilibrium (unique- 
ness of price equilibrium is, of course, defined only 
up to positive multiples). Call this the nth com- 
modity, and consider the excess demands for 
commodities 1, ---, n — 1 as functions of p,, =, 
Pn+, With p, held constant at 1, The Jacobian of 
this set of functions is defined in mathematics as 
the matrix with components (@X,/dp;), where 
i and j range from 1 to n — 1. As noted in section 1, 
a matrix is termed Hicksian if the determinant of 
a principal minor is positive when it has an even 
number of rows and negative otherwise. 

Uniqueness theorem 1. If the Jacobian of the 
excess demand functions, omitting a numéraire 
and holding its price constant, is Hicksian for all 
sets of prices, then the equilibrium is unique. 

A special case of this theorem originated in effect 
with Wald (1936). Commodity i will be said to be 
a gross substitute for commodity j if an increase in 
P; , holding all other prices constant, increases Xi. 
It follows from (H) that if all commodities are 
gross substitutes, the Jacobian of the excess de- 
mand functions, omitting a numéraire, is Hicksian. 
Then a consequence of uniqueness theorem 1 is: 
If all commodities are gross substitutes, then equi- 
librium is unique. ka 

Finally, an entirely different sufficient condition 
was also stated by Wald: 

Uniqueness theorem 2. If the weak axiom of 
revealed preference holds for consumers as a whole, 
then equilibrium is unique. 

What is now termed the weak axiom of revealed 
preference—developed independently by Samuel- 
son (1938)—states that if the commodity bundle 
chosen at prices prevailing in situation 1 are no 


more expensive at the prices prevailing in situa- 
tion 0 than the bundle chosen in situation 0, then 
the latter bundle must be more expensive than the 
former at the prices in situation 1. For an indi- 
vidual, this axiom is a simple consequence of 
rational behavior; both bundles were available to 
him in situation 0, and the bundle he chose in that 
situation must therefore have been preferred. If 
both bundles had also been available in situation 1, 
then the less preferred bundle could never have 
been chosen; hence, the bundle chosen in situation 
0 must have been unavailable because of cost in 
situation 1. However, the statement that the axiom 
holds for the demand functions of the market as a 
whole is not a consequence of rational behavior but 
is an additional assumption that is valid, in effect, 
when income’ effects are not too large in the 


aggregate. 


5. Stability 

The stability problem, as formalized by Samuel- 
son, can be stated as follows: Suppose that an 
arbitrary (in general, nonequilibrium ) set of prices 
is given so that there are nonzero excess demands, 
some of which are positive. It is assumed that 
prices adjust under the influence of the excess de- 
mands, specifically rising when excess demand is 
positive and falling in the opposite case. This sug- 
gests the following dynamic system: 


(5) dpi/dt = B:Xi(pi, +++» Pn), 


that is, the changes in prices (t denoting time) are 
proportional to the excess demands. Notice that 
allowing for “speeds of adjustment,” k,, that are 
different from 1 and from each other is not merely 
due to a desire for generality but is virtually a 
logical necessity, for a careful dimensional analysis 
shows that k; will change with changes in the 
units of measurement of commodity i. More gen- 
eral (nonlinear) adjustment models have been 
studied, for example, a model in which dp;/dt has 
merely the same sign as X;. 

A variation of this system, which has often been 
studied, distinguishes one commodity as numeéraire 
and assumes that its price is held fixed, say at 1: 


(6) dpi/dt = kiXi(Pr» ***> Pa), 
Pr=1. 


i=1, n, 


Ea EO E 


One difficulty arises in either system when the 
rules call for a price to become negative, which can 
happen if, for some i, X:(pı, "> Pa) <0 with 
pi =0. Since the excess demand functions are not 
even defined for negative prices, the rules must be 
altered, It has become customary to modify (5) 
and (6) by requiring that a price remain at zero 


ECONOMIC EQUILIBRIUM 385 


under these conditions; but whether the rules re- 
main consistent is a difficult mathematical ques- 
tion, which has been studied only by Uzawa (1958) 
and Morishima (1960a) and then only for very 
special cases. 

The systems (5) or (6) are systems of differ- 
ential equations; their solutions are time paths of 
prices, which are determined by the specification 
of initial conditions as well as by the system. The 
stability question is whether or not the resulting 
time path converges to an equilibrium. Global sta- 
bility means that convergence occurs for any initial 
conditions; local stability that the path converges 
for initial conditions sufficiently close to an equi- 
librium. However, at the present time the most 
interesting results are sufficient for global as well 
as local stability, so we need not distinguish the two. 

It should first be noted that neither of the sys- 
tems is necessarily stable even if all the hypothe- 
ses which ensure the existence of equilibrium are 
satisfied; examples have been supplied by Scarf 
(1960) and Gale (1963). Gale's example is par- 
ticularly simple: Suppose there are two individuals 
and three commodities, and there is no production. 
Individual 1 starts with a supply of good 1 and 
individual 2 with supplies of goods 2 and 3. Indi- 
vidual 1 has a utility function involving only goods 
2 and 3, while individual 2 wishes only good 1. It 
is easy to see that there is a unique equilibrium. 
Now suppose that Giffen’s paradox holds with re- 
gard to good 2 for individual 1 (that is, a rise in 
the price of good 2, holding other prices constant, 
raises individual 1’s demand). Then it is possible 
to show that, for suitably chosen adjustment 
speeds, the solution of system (5) or (6) remains 
away from the equilibrium. 

There are three different conditions, due to Ar- 
row, Block, and Hurwicz (1959), any one of which 
is sufficient for the stability of the system. 

Stability theorem 1. If all commodities are 
gross substitutes, then systems (5) and (6) are 
both stable. 

Stability theorem 2. If the market satisfies the 
weak axiom of revealed preference, then systems 
(5) and (6) are both stable. 

To state the third theorem we have to introduce 
the mathematical concept of a matrix with domi- 
nant diagonal. A matrix (aij) is said to have a 
dominant diagonal if |aii| > Zy- |a| for all i. (The 
diagonal element is more important than all others 
in the same row.) 

Stability theorem 3. If the Jacobian of the ex- 
cess demand functions (excluding a numéraire ) 
has a dominant diagonal, all elements of which are 
negative, then system (6) is stable. 
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Whether the Jacobian has a dominant diagonal 
may depend on the units in which the commodities 
are measured; stability theorem 3 asserts stability 
if there is any way of choosing the units so as to 
achieve diagonal dominance. 

Stability theorem 2 can be interpreted as mean- 
ing that the transfers of income which take place 
during the course of the time path produce broadly 
offsetting results on demand; income effects are 
not too asymmetrical. Stability theorem 3 is per- 
haps closest to Walras’ initial concepts; stability 
holds when the excess demand for a commodity is 
much more affected by a change in its own price 
than by any other price change (holding the price 
of a numéraire constant). 

In all discussion of stability so far it has been 
implicitly assumed that no transactions take place 
at nonequilibrium prices, for if they did the excess 
demand functions would shift. (An alternative in- 
terpretation is that all commodities are completely 
perishable and that utilities and production are 
independent as between time periods. Then any 
transactions occurring in one period will have no 
effect on transactions occurring in the next period. ) 
This assumption is the classical one of “recontract- 
ing,” in Edgeworth’s terminology, and was made by 
him and by Walras. The problem raised by trans- 
actions at nonequilibrium prices was immediately 
recognized, but little analysis took place, Several 
recent writers, particularly Negishi (1961), Hahn 
and Negishi (1962), and Uzawa (1962), have 
considered this problem under the rather awkward 
title of “non-tdtonnement stability.” The system, to 
be complete, has to specify the nature of the trans- 
actions; since the system is not at equilibrium 
there will have to be rationing of sellers or of 
buyers. If it is simply assumed that any transac- 
tions that do take place cannot change the value of 
any individual’s holdings, then gross substituta- 
bility is again a sufficient condition for stability. 
Under more specific assumptions about transac- 
tions, stability can be shown in much wider classes 
of cases, 


6. Comparative statics 


The question raised in comparative statics is, 
What can be said of the effect of a shift in the 
excess demand functions on the price system? As 
might be supposed from the nature of the ques- 
tion, answers can only be given in a limited range 
of cases, Suppose that a binary shift has occurred 
—that is, the excess demand for one commodity, 
say 1, has decreased at every set of prices; the 
excess demand for another commodity, say 2, has 


increased correspondingly (the money value of the 
increased demand for 2 exactly equals the decrease 
in money value of demand for 1 at any given set 
of prices); and the excess demand functions for all 
other commodities have remained unchanged. 
Then the only general result in the literature is the 
following (Morishima 1960b): If all commodities 
are gross substitutes and there has occurred a 
binary shift in demand from commodity 1 to com- 
modity 2, then all prices of commodities other than 
1 rise relative to the price of 1 or do not fall, and no 
relative price rise is greater than that of com- 
modity 2. 


7. Partial equilibrium 


A great deal of economic analysis has been con- 
cerned with equilibrium of a single market. De- 
mand and supply for a commodity are regarded as 
functions of a single price, and equilibrium is inter- 
preted as equality of supply and demand or as an 
excess of supply over demand at zero price. 

Partial equilibrium analysis is to be regarded as 
a special case of general equilibrium analysis, The 
existence of one market presupposes that there 
must be at least one commodity beyond that traded 
on that market, for a price must be stated as the 
rate at which an individual gives up something else 
for the commodity in question. If there were really 
only one commodity in the world, there would be 
no exchange and no market. 

Suppose for the moment that there are only two 
commodities, say 1 and 2. Because of homogeneity, 
demand and supply (which may, as before, be 
multivalued) are determined by the ratio of the 
price of commodity 1 to that of commodity 2, that 
is, the price of commodity 1 with commodity 2 as 
numéraire. From Walras’ law, equilibrium on mar- 
ket 1 ensures equilibrium on market 2. Partial equi- 
librium analysis of market 1 is, in the case of two 
commodities, fully equivalent to general equilib- 
rium analysis, All the previous theorems hold, and 
one more theorem on stability may be stated (Ar- 
row & Hurwicz 1958): For two commodities, equi- 
librium is always globally stable in the sense that 
for any initial price or prices systems (5) and (6) 
of section 5 always converge to some set of equi- 
librium prices (if the equilibrium price set is not 
unique up to positive multiples, the prices to which 
the system converges will depend on the starting 
point). This theorem has not always been recog- 
nized in discussions of stability of equilibrium in 
foreign trade, where the analysis has frequently 
been confined to two commodities. 

Analysis of a two-commodity world may have 


considerable didactic usefulness as a way of study- 
ing general equilibrium through a special case ad- 
mitting of simple diagrammatic representation, but 
it may be asked if partial equilibrium analysis has 
any empirical interest for a world of many com- 
modities. An answer is provided by the following 
theorem, due independently to Hicks (1939) and 
Leontief (1936): If the relative prices of some set 
of commodities remain constant, then for all ana- 
lytical purposes the set can be regarded as a single 
composite commodity, the price of which can be 
regarded as proportional to the price of any mem- 
ber of the set and the quantity of which is then 
defined so that expenditure (price times quantity ) 
on the composite commodity is equal to the sum of 
the expenditures on the individual commodities in 
the set. In symbols, if the prices pı, ''*, Pm of a 
set of commodities 1, «++, m satisfy the conditions 
Pi = pp: (Pı a constant for each i while p may 
vary), then we can take p to be the price of the 
composite commodity and £ '-ı p,q, to be the quan- 
tity, where q; is the quantity of the ith good. 

The Hicks—Leontief aggregation theorem can be 
used to justify partial equilibrium analysis. Sup- 
pose that a change in the price of commodity 1 
leaves the relative prices of all others unchanged. 
Then insofar as we are considering only disturb- 
ances to equilibrium from causes peculiar to the 
market for commodity 1, the remaining commodi- 
ties can be regarded as a single composite com- 
modity, and partial equilibrium analysis is valid. 

The assumption of strict constancy of relative 
prices of the other commodities will not usually be 
valid, of course, but it may hold approximately in 
many cases of practical interest. It is sufficient for 
the purpose that the changes in the relative prices 
of other commodities induced by a change in the 
price of the commodity being studied do not in 
turn induce a significant shift in supply or demand 
conditions on the market for that commodity. 


8. Equilibrium over time 

While the above summarizes the central part of 
the literature on general economic equilibrium, 
there is a related conceptual question that deserves 
brief mention. Consider an economy extending 
over time, with dated inputs and outputs and house- 
hold plans that run into the future. What can be 
said about the equilibrium of such an economy and, 
indeed, what is meant by the term? 

One straightforward answer is that originally 
due to Hicks, We may simply date all commodity 
trdnsactions and regard the same physically de- 
fined commodity at two different times as being 
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two different commodities. Then the formal model 
of section 2 remains, with reinterpretation, and we 
can still argue that there is an equilibrium over 
time. Planned supplies and demands are equated 
in the usual way. 

This is a legitimate and indeed important inter- 
pretation. Problems of optimality, the core, unique- 
ness, and comparative statics (perhaps to be re- 
named “comparative dynamics”) are restatable 
with no difficulty, Stability theory faces a more 
serious challenge, since time now enters in two 
different ways—in the underlying model and in the 
adjustment process, If all markets are taken to be 
futures markets, so that all adjustments take place 
simultaneously, there is no difficulty, but otherwise 
there is a new range of problems which have been 
approached only in the most fragmentary way. 

However, the simple redating has the important 
implication of perfect foresight (possibly achieved 
through having all future economic transactions 
determined in currently existing futures markets ). 
This seems empirically most unsatisfactory and in- 
volves significant logical problems of knowledge 
(Morgenstern 1935). An alternative is to assume 
that each individual has expectations about the 
future that are continuous functions of present 
observed variables. Then in each period there will 
be an equilibrium, although each individual's plans 
for the future generally will not in fact be carried 
out, 

From either point of view, considerable interest 
has attached to another meaning of equilibrium, 
which we may term stationary equilibrium. The 
equilibrium over time in the case of perfect fore- 
sight defines a set of prices and quantities for each 
period, The same is true of the succession of short- 
run equilibria defined by individuals acting under 
expectations. The question is, Does this time se- 
quence have a stationary or equilibrium value? Is 
there a set of prices and quantities such that, if 
they governed in the initial period, they would re- 
main equilibrium values for all subsequent periods? 
Or, in a growing economy, would at least the rel- 
ative prices and quantities remain constant if the 
appropriate values held in the initial period? This 
might be termed the question of existence of sta- 
tionary equilibrium, frequently termed the balanced 
growth path. The distinction between equilibrium 
in general and the stationary state, implicit in clas- 
sical economics, was first clarified by Frisch (1935- 
1936). 

The stability of stationary equilibrium is a dif- 
ferent problem from that of stability in the sense 
used in section 5. Suppose we have an arbitrary 
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initial quantity configuration. The question then 
arises, Will the equilibrium values of the successive 
periods tend to converge to the stationary equi- 
librium? The study of this problem has been a 
major preoccupation of modern growth and capital 
theory, and it will not be enlarged on here. 
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ECONOMIC EXPECTATIONS 


Economic actions are chosen with a view to 
imagined consequences assigned to some more or 
less distant future date or stretch of time. Such 
imagined and temporally projected consequences 
are what we mean by economic expectations. The 
decision maker, choosing among the rival schemes 
he has in mind for using his resources, must find 
some principle and basis for assigning to each such 
scheme a statement of its meaning to him in pos- 
sible outcomes. The outcomes that he connects 
with any one scheme may be plural and mutually 
exclusive. In this case, he is uncertain what will 
be the outcome of the scheme. He must then adopt 
some procedure for valuing (in a general and not 
necessarily monetary sense) as a whole the array 
of rival conceived outcomes. This valuation has as 
its purpose, which governs the choice of the basis 
and method of valuation, a comparison of the rival 
schemes in order to choose one of them and act 
upon it. 

The principal matters that have to be considered 
by a theory of economic expectations are the fol- 
lowing: 

(1) What kinds of evidence or data play a part 
in the decision maker's task of conceiving the out- 
comes he connects with any specified scheme of 
action? 
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(2) Can a set of rules be found that are, or that 
ought to be, invariably used by any decision maker 
in this task? 

(3) Is this task a process of pure reasoning, 
capable, if correctly performed, of logical justifica- 
tion; or do elements of judgment, of invention or 
inspiration, or of ex nihilo creative origination 
enter into it? 

(4) When the outcomes imagined for any ac- 
tion scheme are plural and mutually exclusive, 
what procedure of valuation of the scheme and its 
array of rival conjectured outcomes as a whole is 
employed for the purpose of ordering the schemes 
or of selecting the best? 

(5) Is the choice of such a procedure made in- 
dependently of the evidence available for forming 
the expectations, or does this evidence influence or 
govern it? 

(6) The array of imagined outcomes that the 
decision maker connects with any contemplated 
scheme of action will influence his valuation of 
that action in two ways: by the character of each 
such outcome individually and by the structure 
composed by all these hypotheses taken together. 
What are the relative strengths and respective 
effects of these influences, and what is the nature 
of their interaction? 

(7) If any such expectation structure is the out- 
growth of fragmentary, enigmatic, elusive, or self- 
contradictory evidence; if, as must be the case 
when several rival hypotheses are entertained con- 
cerning the result of any one action, the evidence 
is insufficient to determine a unique outcome; and 
if, in consequence, the evidence possessed by the 
decision maker at any moment is liable to be sup- 
plemented, canceled, differently illuminated, partly 
contradicted, or reinterpreted, what effect upon his 
valuations of different action schemes will this in- 
herent and recognized fragility of his expectations 
have? 

Logical teeth, however sharp, do nothing toward 
the nourishment and development of the body of 
practical knowledge until they have bitten upon 
fact. Some classification and cross classification of 
impressions reaching the mind from outside, some 
sorting of records and measurements, some emer- 
gence of repetitive associations and configurations 
are indispensable to the formation of tools for 
coping with the environment. The knowledge of 
“fact” that is appropriate for expectation forming 
is of two sorts. First, there must be knowledge of 
pervasive, repeated stereotypes of structure. These 
are answers to the following questions: How, by 
the ineluctable testimony of nature (including hu- 
man nature), do things hang together? What 
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things do we constantly find in association, either 
at one moment or in temporal sequence? How do 
things work? What can happen? These are state- 
ments of the principles of the world we live in, not 
the logical but the factual principles. Given such 
facts, logic, especially mathematics, may reveal a 
vast body of entailments that then assume the rank 
of fact without having been actually observed. Sec- 
ond, when we have some idea of where the state 
of affairs can go from a given situation, we need 
to know what situation, at the moment of seek- 
ing to form expectations, is given, This is the other 
sort of fact. 

Historical determinism. The most fundamental 
question, on which the expectation-former de- 
termined to start from first principles would need 
some opinion, is whether the course of world his- 
tory, in detail as well as in the large, is at each 
moment the uniquely inevitable sequel of the past. 
There are at least two hypotheses, either of which 
if true would require a negative answer to the fore- 
going question. One is an essential randomness in 
the origin of events at subatomic levels, or at the 
level of genetic mutation. The other is the non- 
existence of any necessitating link between circum- 
stance, past and present, and the thoughts that a 
man may have. If a thought can be in some sense, 
as to some part of its character, “uncaused,” we 
have plainly the possibility that history from time 
to time and from place to place makes a fresh 
start, One further feature would need to be true of 
the world if randomness in nature, or essential 
novelty of thought in man, were to defeat that sta- 
tistical determinacy, which, for example, makes 
possible the laws of thermodynamics. This extra 
feature is the principle of self-reinforcing proc- 
esses, by which a “small” event (such as a “favor- 
able” genetic mutation ) releases larger forces, such 
as the rise of a species adapted to a hitherto barren 
environment, and so leads to biological events on 
the largest terrestrial scale. Having, for the sake of 
completeness, tried to suggest the nature of these 
ultimate questions, we have to admit their virtual 
irrelevance for the practical business of forming 
expectations, This irrelevance arises from the fact 
that, whether or not it is in the nature of things 
(and, in principle, possible) to predict the future 
from the past, it is plainly inconceivable that the 
requisite information about the past could be pos- 
sessed and processed by any individual, more espe- 
cially since the time available for such processing, 
if it were to give any useful answer, would be 
strictly limited. 

Expectations that reach beyond the immediate 
threshold of the future or that concern themselves 


with any but the most direct physical effects of the 
proposed action must accordingly always be uncer- 
tain. This uncertainty is responsible for the whole 
difficulty that expectations bring into economic 
analysis; and it is the source of much the greater 
part of the difficulty arising in economics from con- 
siderations about time, under any of its aspects, 
Uncertainty of expectation gives rise to agreeable 
or disagreeable states of mind, and can thus have 
an economic valuation and enter directly into the 
play of economic influences. Indeed, it is itself one 
of the most powerful of such influences and ex- 
plains the possibility of large-scale unemployment 
in a world of omnipresent scarcity. It is the chief 
explanation of the existence of rates of interest. A 
service is rendered to society by those who, by em- 
barking their resources in enterprises looking to a 
distant future, expose themselves to uncertainty to 
an especial degree; and if, on the whole, the render- 
ing of this service entails discomfort, some com- 
pensating hope of profit will be needed to elicit 
this service. Such a hope might not sustain itself 
in the minds of those concerned unless there were 
a record of some degree of fulfillment of these 
hopes. 

The stereotype. The venture of forming eco- 
nomic expectations depends upon an extension of 
the stereotype from the plane of the physical and 
technical to that of the psychical, social, and his- 
torical, By “stereotype,” we mean a fixed and stable 
pattern or configuration, composed of specific an- 
tecedent circumstances, or kinds of circumstance, 
and a specific sequel or end situation, or a specific 
train of consequences. By calling it stable, we mean 
that it occurs repetitively and can be frequently 
observed, or may even appear to be a universal and 
infrangible aspect of the natural order, It is upon 
such stereotypes (whether or not they have been 
combined into a larger, more general, and compre- 
hensive system of knowledge) that all applied sci- 
ence or technology depends. When such stereo- 
types are used in isolation, they are rules of thumb. 
When combined into a great structure of interlock- 
ing ideas, they constitute a science or an art. Ina 
bridge, the conformation, scantlings, and materials 
are together the source of strength; in a chemical 
process, the reaction is the consequence of juxta- 
posing certain substances under certain conditions 
of temperature and pressure; in biology a crossing 
of certain breeds may yield a predictable kind of 
offspring. When we look at events on a cosmical 
scale, the idea of the stereotype takes on a some- 
what different aspect. If astronomers agree that 
the outer galaxies are receding from us, what is re- 
peated is not their positions, as of planets in their 


orbits, but the recession at a given acceleration, 
that is, at a speed proportional to distance at- 
tained, Thus even an evolutionary, irreversible 
transformation can exhibit repetitiveness and the 
essence of the notion of the stereotype. It is when 
we come to events in the human and social sphere 
that the application of this tool, although still in- 
escapable, becomes altogether more difficult and 
hazardous. Although Spengler (1918-1922), Toyn- 
bee (1934-1961), and Rostow (1960) have sought 
to demonstrate repetitiveness in history (as the 
only hope of making history a predictive science), 
still there are immense difficulties, dangers, and a 
widespread skepticism even (or perhaps espe- 
cially) among historians. 

These difficulties lead the businessman to dis- 
tinguish two types of questions (Keirstead 1953): 
First, how will the general posture of affairs, de- 
scribable by large aggregative statistical measure- 
ments such as the general level of prices, the 
monthly or annual expenditure of the community 
on all retail goods together or on goods of particu- 
lar types, the percentage of the labor force unem- 
ployed, the rate of income tax or profits tax, and 
so on develop over the coming months or years? 
Second, what are the prospects for a particular 
enterprise—exactly specified as to the nature of 
its product; the type, design, and scale of its equip- 
ment; its location, marketing policy, research 
policy, management recruitment policy, and so on 
—given the particular industry’s record of fast or 
slow innovation, its composition of many small or 
a few large firms, etc.? For his answer to the first 
of these questions, he may rely on a particularly 
simple form of that permanence or stability in the 
world of nature and of man upon which all possi- 
bility of prognostication depends—namely, on 
what we can alternatively call inertia or momen- 
tum in affairs, Cataclysmic change can occur: a 
crucial battle can be lost; a revolution can break 
out. But, save for such upheavals, some of the great 
public aggregates cannot change very abruptly. 
Governments and political parties are committed to 
certain policies and programs and could not aban- 
don them without electoral disaster; thus, the 
budget may leave little room for choice. To a lesser 
degree, the same may apply, although for differ- 
ent reasons, to the investment policies of large 
firms and publicly owned industries. Government 
expenditure and the investment expenditure of 
business enterprises together account for a large 
part of the Keynesian multiplicand, which, with a 
Kefnesian multiplier deriving its numerical value 
and its stability from the habitual or conventional 
behavior of large masses of people in electing what 
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proportions of their incomes to spend and what to 
save, largely determines national prosperity or de- 
pression. Alongside such reliance on the present as 
an image of the near future, businessmen pay great 
heed to each other's opinions (Keynes 1937) with- 
out caring to explain to themselves how these can 
be better founded than their own. 

Keynes (1936) has suggested the extreme muta- 
bility, fragility, and instability, with possible explo- 
sive or cumulative self-reinforcing consequences, 
that characterize such “conventional” opinions or 
assumptions about the future, Indeed, he paints a 
picture of the businessman’s suppressed and latent 
desperation, his urge to clutch at straws, and thus, 
finally, to swing in behind any apparent coales- 
cence of opinion or hint of leadership and turn it 
into a mass movement. The bear and bull markets 
of the stock exchanges have their rationale but can 
partake also of hysteria. Since it matters nothing to 
the speculator whether or not a rise in share prices 
has any basis in the genuine situation of the firm 
concerned, he needs only to know that bulls of the 
shares are more numerous, wealthy, or convinced 
than the bears; then he can safely join the bulls in 
their self-justifying operation, which itself drives 
up the price. The game of trying to guess what the 
other man is guessing, what he is guessing that 
you are guessing, and so on to the nth degree, is 
plainly unrelated to accounting facts or the real 
state of the economy and leads therefore to inher- 
ently restless markets, which generate movements 
out of nothing or next to nothing. Moreover, no 
speculative profit can be made in any market un- 
less there is a change of opinion, bringing about a 
change of price; and there exists at all times, there- 
fore, a motive for some speculators to set afloat 
ideas that will cause such changes of opinion. 
Genuine investment decisions in the economist’s 
sense, that is, decisions by firms to order plant and 
equipment, are somewhat insulated from such 
speculative markets. But it is still true, as Keynes 
(1936, p. 151) indicates, that a firm that can ac- 
quire another firm as a going concern at a de- 
pressed stock exchange price may find this a more 
profitable use of its reserve funds than establishing 
a mint-new plant of its own. Such a choice tends, 
of course, to depress employment, unless the sellers 
of the shares use the proceeds themselves to order 
newly made equipment. 

It was, at one time, common to speak of the 
“business cycle,” meaning by this a repetition of 
alternate phases of increasing and decreasing pros- 
perity, as measured by national general output or 
income, employment, the general level of prices, 
and so on. The record of the nineteenth century and 
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early twentieth century seemed to show a length, 
from crest to crest or from trough to trough, of from 
7 to 11 years. The amplitude of these fluctuations 
seemed to increase greatly after World War 1 and 
to disappear completely after World War 1, being 
then replaced in the United States and Great 
Britain by a much shorter and milder cycle, At no 
time, however, could any such pattern have offered 
a sure guide to general business conditions for a 
number of years ahead. If there is a cyclical mech- 
anism, its effect can be transformed or obliterated 
by the kind of extreme political and social up- 
heavals that have occurred in our time. It has not 
been the experience of our generation that general 
history, in the large, can be foreseen. 

It is in seeking an answer to the second type of 
question, concerning the prospects of a closely 
specified projected enterprise, that the businessman 
can more plausibly have recourse to stereotypes of 
business and investment situations. Such stereo- 
types will be quite distinct from case studies, al- 
though possibly founded on them. The stereotype 
will need to select and combine features from many 
such studies. The information a stereotype affords 
will be like that derived by a yachtsman about the 
likely performance of a sailing boat in a race, from 
a knowledge of the boats design. This knowledge 
may tell him how the boat will perform in this set 
of conditions or in that set, but can, of course, tell 
him nothing as to which set will in fact prevail. 
Such stereotypes may be derived by what we may 
call a clinical method, that is, by so close an ex- 
amination of a few particular instances, recorded 
or observed, that insight is gained into the precise 
interplay of policy and circumstance and a grasp 
of the reasons for success or failure in each case 
obtained; or they may be derived by a statistical 
method, where only the probabilities of success, in 
this and that degree, of each given design of enter- 
prise is studied and no attempt is made to pene- 
trate the mechanism or rationale of this success. 
Naturally, these methods can be combined. 

Probability. For an enterprise of given design, 
it will be natural to suppose that each distinct hypo- 
thetical degree of success will be associated with 
a different degree of probability. The pattern of 
association of these two variables, if statistically 
derived, will be expressed in a frequency table. If 
derived by some procedure of intuitive judgment, 
it will be expressed as a subjective probability dis- 
tribution. We have to ask whether these two kinds 
of expression can, in practice, be so sharply dis- 
tinguished. Many meanings have been proposed 
for the word “probability”; but these, for the most 
part, converge on the idea of recorded, or else po- 


tential, relative frequency, that is to say, on the 
notion of the counting of cases. To say that a 
given outcome Q has a probability of } is to say 
that if a trial or performance, the nature and cir- 
cumstances of which. lie within strictly stated 
bounds, is many times repeated, the result will be 
Q in approximately 4 of the instances. Now when 
this trial or performance consists in tossing a coin 
or throwing dice, it may be easy to specify that the 
coin or dice must preserve their physical character 
exactly throughout the series of tosses and that 
each of these must be made by hand, in still air, 
etc. But what of the proposed setting-up of a busi- 
ness enterprise? The nature of the product; the 
type, design, and scale of the physical equipment; 
the location; and even the particular persons com- 
posing the initial cadre of directors and managers 
can no doubt be specified. The policy of the enter- 
prise can be laid down in broad terms. But who 
can guarantee that the political, fiscal, social, com- 
mercial, fashionable, technological, and epistemic 
circumstances within which the enterprise will 
have to operate will remain unchanged? And if 
they change, what becomes of the meaning of 
probability? The relevance and applicability of any 
given statistical record of the past to the future 
experience of a new firm can be only a matter of 
judgment, that is, of conjecture, on the part of the 
enterpriser, since this relevance involves a double 
stretch of time in which the circumstances have 
evolved and will evolve further. Thus, in our con- 
text, there is no clear dividing line between statis- 
tical and subjective probability. But there are 
deeper questions. 

Frequency ratios give us knowledge, namely, 
knowledge of the proportions in which the total list 
of individual results—one result for each of many 
repetitions of some kind of trial, a kind of trial 
exactly specified as to the range of permitted varia- 
tion of its circumstances—have fallen under this 
or that head, or into this or that range of the varia- 
ble whose values are the form taken by the results. 
This knowledge is applicable also to contemplated 
further repetitions of the same kind of trial, speci- 
fied precisely as before, Let us call the complete 
list of contemplated further trials, considered as 
one whole, a divisible experiment. Then, a properly 
derived frequency table gives us knowledge of what 
will be the outcome of a divisible experiment, pro- 
vided that experiment can in fact be performed. 
With a tossed coin or thrown dice, with any game 
sufficiently constrained by explicit rules, there iS 
no question but that it can in fact be performed. 
But what of the establishment of a new business 
or a new plant? Here the tide of history itself is, 


willy nilly, changing some highly relevant circum- 
stances all the time, rapidly or slowly, The ques- 
tion is whether the establishment of precisely this 
kind of business, in a given location and facing 
given conditions in the market for its product and 
in the market for its factors of production, can con- 
ceivably be repeated enough times before the whole 
scene has changed. If not, the statistical frequency 
table ceases to provide knowledge of what will hap- 
pen and merely offers a tipster’s suggestion of 
which horse is the most promising, on past form. 

Potential surprise. The enterpriser who tries to 
go beyond statistics and to gain insight into the 
conditions of success still needs a means of ex- 
pressing a degree of solidity, of seriousness, which 
he adjudges to this and to that imagined (hypo- 
thetical) degree of success or of misfortune of 
some given project. The rival (mutually exclusive) 
suppositions that he makes concerning the degree 
of success of some project, when these suppositions 
are treated as values of one variable, need to be 
associated with values of a second variable that 
measures the strength of the claim of each sup- 
position to be taken seriously. Such a measure is 
afforded by the notion of subjective probability, 
which is the formal husk of statistical probability 
filled with quite different content; or it is afforded, 
alternatively, by a concept differing essentially 
from that of probability not only in meaning but 
in form, namely, a measure of possibility. The 
actual occurrence of something that we had sup- 
posed impossible causes us surprise. Some surprise 
is caused by an occurrence that we had only with 
difficulty been able to think of as feasible, Thus, a 
measure of adjudged possibility, or rather of de- 
gree of difficulty in supposing a given outcome 
possible, is provided by the potential surprise 
(Shackle 1949; 1955; 1958; 1961) to which the 
individual is exposed by his attitude to that out- 
come. Subjective probability, on one hand, and pos- 
sibility or potential surprise, on the other, may each 
be called an uncertainty variable. They differ 
radically in nature. Probabilitiés must be assigned, 
in effect, to a list of contingencies looked on as ex- 
haustive and complete; or, if probability is ex- 
pressed as probability density, the function con- 
necting these densities with values of the outcome 
variable must be supposed to rest on exhaustive 
and stable analysis of the underlying conceivable 
outcome situations. Since the exhaustiveness of 
the list implies that the factual outcome, when it 
shall emerge, will necessarily be found in one or 
another of the contingencies of the list, the proba- 
bility to be assigned to the list as a unified whole 
is the equivalent of certainty, represented by the 
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number 1. This probability is distributed over the 
listed contingencies in fractions that, therefore, 
sum to unity; and so the present writer calls sub- 
jective probability a distributional uncertainty vari- 
able. Possibility, or potential surprise, by contrast, 
can be assigned in mutually independent degrees 
to an indeterminate number of contingencies, the 
list of which, therefore, has no need to be looked 
on at any moment as complete. Potential surprise, 
measuring adjudged imperfection of possibility, is 
a nondistributional uncertainty variable. Methods 
of applying the two sorts of uncertainty variables 
must, like the nature of the variables themselves, 
differ radically. 

Whereas a statistical frequency table can be 
applied directly, as it stands, to express the result 
of a divisible experiment, subjective probability, on 
the other hand, being concerned essentially with 
the outcome of a single, special, and nondivisible 
experiment, needs to be summarized to yield any 
usable message. This summary will take the form 
of a mathematical expectation, the result of multi- 
plying each contingency (for example, each hypo- 
thetical amount of monetary profit) by its adjudged 
probability and adding together the results. Other 
moments of the distribution may also be used. The 
second moment, for example, indicates the dis- 
persion of those “frequencies” that, although mean- 
ing little in the application of subjective probability 
as a guide to policy, are nonetheless the form in 
which the subjective probability judgments are ex- 
pressed. The dispersion may be looked upon as a 
measure of the relevance, or lack of it, of the 
mathematical expectation. The doubtful relevance 
of any measure that seeks to average a collection 
of rival and mutually exclusive (that is, mutually 
contradictory) hypotheses is a chief reason for 
adopting the other type of uncertainty variable and 
using it as a basis of focus values, These are the 
two constrained maxima of a function that in- 
creases with the numerical size of supposed gain 
or loss and decreases with increase of the potential 
surprise associated with the various hypotheses. 
Such a function, or surface, may be visualized as 
follows. Imagine a seacoast where the straight 
shore line represents that degree of potential sur- 
prise corresponding to absolute rejection of hy- 
potheses as impossible, while somewhat inland a 
line parallel to the shore represents zero potential 
surprise, the judgment that hypotheses are per- 
fectly possible. A straight arm of the sea runs in- 
land at right angles to the shore; and on either side 
of it near the sea, the sloping hillsides swing away 
to merge with those that front the sea. Sea level, 
along the shore edge and the central fiord, now 
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stands for zero ascendancy, a zero degree of the 
“interestingness” of hypotheses, the power of any 
one imagined or hypothetical outcome of a con- 
templated action to arrest attention. Ascendancy 
will be nil for hypotheses adjudged impossible, and 
it will be nil, we are supposing, for those that 
mean no change in the decision maker's situation, 
these being the hypotheses represented by points 
along the central fiord. The greater the improve- 
ment or deterioration represented by any hypo- 
thetical outcome (the greater the distance from the 
central fiord), the more interesting, the more 
ascendant, or attention commanding, the hypothe- 
sis will be, But the more nearly impossible (the 
nearer the sea) it is, the less will be its ascendancy 
over the decision maker’s mind. Now upon this 
landscape is superimposed a hill path, crossing the 
central fiord at its inland end, where potential sur- 
prise is nil, climbing on either hand across the 
bluffs where they swing round from the fiord to the 
sea, and descending again to sea level some little 
way along the coast. This path, in its shape in the 
horizontal plane, represents the adjudgment of po- 
tential surprise to diverse hypotheses of the out- 
come of some specified action. The highest altitudes 
attainable without quitting this path are the two 
constrained maxima of the ascendancy function, 
the focus points or, looked upon as purely scalar 
quantities of gain or loss, the focus values referred 
to above. A pair of focus values may be held to 
reconcile, as far as may be, the conflicting needs 
of the enterprisers’ situation: to settle upon one 
policy or course of action in face of an essentially 
plural or uncertain vision of the consequences. 
Uncertainty of expectation is not a contingent, 
curable disability from which human beings will 
some day be rescued by the advance of science. To 
suppose it would be like supposing that the phys- 
ical horizon of the ocean can be brought nearer by 
sailing toward it. Frank Knight has said that con- 
sciousness itself involves uncertainty, This surely 
must be so. Consciousness is the continual appre- 
hension of subjectively new things, circumstances, 
and conjunctures that were hitherto not known to 
exist or to be imminent. To know the future would 
destroy the possibility of this stream of continually 
fresh perceptions. But at a somewhat less basic 
level, we may say that uncertainty is the price of 
hope, and hope is the sustaining force of human 
endeavor. Who would exert himself if the result 
were a foregone conclusion? How could the result 
be a foregone conclusion if no one exerted him- 
self? Uncertainty is the price of hope, for only by 
exposing ourselves to possible loss can we expose 
ourselves to possible gain. The practical question 


is, How can this price be most explicitly, incisively, 
unmistakably expressed? How can it be made easy 
to judge and to grasp, so that a comparison can be 
made between what is paid in anxiety and what is 
gained in imagination of success? Can this best be 
done by throwing into the cauldron all the mutually 
contradictory hypotheses that the individual can 
think of, each multiplied by some subjective proba- 
bility that apes the gestures but altogether lacks the 
meaning of statistical probability? Or can it be 
done in a more clear-cut fashion by setting clearly 
in view what the decision maker stands to lose by 
each given available choice of action? Both meth- 
ods involve a judgment, a judgment, moreover, 
that by its nature quite escapes the categories, in 
P. W. Bridgman’s sense, of “operational” or “non- 
operational.” If measurement were possible, there 
would be no need for judgment; moreover, each 
method involves giving some weight to two or 
more possibilities, only one of which can come 
true. We are here concerned with a once-for-all, 
nondivisible, experiment. 

G. L, S. SHACKLE 


[See also DECISION MAKING, article on ECONOMIC AS- 
PECTS; PROBABILITY, article on INTERPRETATIONS.] 
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A Study of History. 


ECONOMIC FORECASTING 
See PREDICTION AND FORECASTING, ECONOMIC. 
Related material may be found under TIME SERIES. 


ECONOMIC GROWTH 


The articles under this heading give several 
views of the process of economic growth as a 
whole. For the role of specific factors in economic 
growth, see CAPITAL; CAPITAL, HUMAN; CAPITAL, 
SOCIAL OVERHEAD; and INNOVATION. On the role 
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of productivity in economic growth, see PRODUC- 
TIVITY and AGRICULTURE, article on PRODUCTIV- 
ITY AND TECHNOLOGY. Related material may be 
found under DEVELOPMENT BANKS; ECONOMY, 
DUAL; FOREIGN AID, article on ECONOMIC ASPECTS; 
INDUSTRIALIZATION; INDUSTRY, SMALL; MODERN- 
IZATION; and PLANNING, ECONOMIC, article on 
DEVELOPMENT PLANNING. 
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ur, THEORY 
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I 
OVERVIEW 

As a distinctive epoch in economic organization, 
modern economic growth, or development, dates 
from the eighteenth century, when its beginnings 
in western Europe can first clearly be discerned. 
It may be defined as a rapid and sustained rise in 
real output per head and attendant shifts in the 
technological, economic, and demographic char- 
acteristics of a society, Together with the more 
recent concepts of social development and political 
development, it forms the phenomenon historians 
designate “modernization,” which embraces innova- 
tion in numerous aspects of individual behavior 
and social organization. Although not a revolution 
in the literal sense of the word, modern economic 
growth merits recognition as a distinctive epoch 
by virtue of the scope of the changes associated 
with it, the irreversible nature of many of them, 
and the unprecedented rate at which they have 
occurred. It seems safe to say that in those econ- 
omies now characterized as “developed,” most of 
the population has experienced in the last one 
hundred years a greater advance in material well- 
being and a more sweeping change in way of life 
than occurred in any previous century of human 
history. 

This transformation encompasses wide-ranging 
changes in techniques of producing, transporting, 
and distributing goods, in the scale and organiza- 
tion of productive activities, and in types of outputs 
and inputs. It embraces major shifts in the in- 
dustrial, occupational, and spatial distribution of 
productive resources and in the degree of exchange 
basis and monetization of the economy. On the 
social and demographic side it involves significant 
alterations in fertility, mortality, and migration, in 
place of residence, in family size and structure, in 
the educational system, and in provision for public 
health. Its influence extends into the areas of in- 
come distribution, class structure, government or- 
ganization, and political structure. 
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Compared with previous epochs, modern eco- 
nomic growth has involved marked acceleration in 
the rate of social change generally—so great that 
it frequently creates a significant difference in the 
experience of only two successive generations. The 
rate of growth has varied substantially, however, 
among different economies undergoing develop- 
ment at a given time and within a given economy 
over time. Attempts to distinguish phases of 
modern economic growth, to classify types, and to 
identify sequences between economic and non- 
economic characteristics or among different eco- 
nomic aspects are still at at an early stage, but 
some suggestive generalizations have been ad- 
vanced. 

From the viewpoint of the world as a whole, the 
international spread of modern economic growth 
has so far been limited, although few parts of the 
world have remained untouched. While the pattern 
of diffusion in time and space has yet to be estab- 
lished with quantitative precision and partly de- 
pends on how widely the process is conceived, the 
picture in broad outline is as follows. Elements of 
the transformation become increasingly apparent 
when one examines parts of northwestern Europe 
in the eighteenth century, especially, on a fairly 
extensive scale, in Great Britain. In the course of 
the nineteenth century, as the process gathered 
increasing headway in the area of origin, its grad- 
ual diffusion southward and eastward throughout 
Europe occurred. By the end of the century, aspects 
of its beginnings could be identified in easternmost 
Europe, including Russia, and also in Japan. A 
somewhat similar development had been taking 
place concurrently in overseas areas settled by 
Europeans, mirroring to some extent the diffusion 
pattern within Europe, It was apparent first in 
areas initially settled chiefly by migrants from 
northwestern Europe—the United States in the first 
part of the nineteenth century, followed by Canada, 
Australia, and New Zealand—and subsequently in 
parts of Latin America, where migration from 
southern and eastern Europe was especially im- 
portant. In the twentieth century, and increasingly 
since World War u, the initial signs have become 
more widespread in parts of Asia and, to a smaller 
extent, Africa—perhaps most noticeably in areas 
with the longest and fullest contact with some of 
the original centers. However, by the middle of the 
twentieth century, for only about a fourth of the 
world’s population has the transformation pro- 
ceeded far enough for them to qualify as “de- 
veloped” by most of the usual indicators. 

The appearance and spread of modern economic 
development sharply accelerated the trend toward 


wider international contacts, which had been set 
in motion several centuries earlier by the great 
voyages of discovery. The peoples of the world have 
come increasingly to be linked together in a mu- 
tually interdependent system of economic and 
political relations and through new communica- 
tions networks furthering the flow of information. 
At the same time, the piecemeal character of the 
diffusion process has sharply aggravated interna- 
tional differences in economic, social, and political 
conditions, has repeatedly disturbed the balance of 
political power both between developing and under- 
developed areas and within the developing group, 
and has become a major factor in modern wars. 
In a fundamental sense, modern economic growth 
is a creature of the scientific revolution, for it 
reflects man’s growing knowledge of natural phe- 
nomena and the power thereby attained to mold his 
environment to his needs. Whether it will prove 
possible to contain the disruptive effects of its 
spread within a framework of nation-states would 
seem ultimately to depend on whether progress in 
social science can match that in natural science. 


Characteristics of modern economic growth 


It is possible to identify a number of recurrent 
features of modern economic growth if one draws 
on both the historical experience of now-developed 
nations and comparisons of countries at different 
levels of development at a given time. However, the 
latter comparisons, which are frequently resorted 
to in lieu of sufficient historical data or research, 
are not always trustworthy guides to changes over 
time. The “nation” is adopted here as the unit of 
economic growth because although there are sig- 
nificant growth differentials among regions within 
a nation, they are substantially less than those 
among nations. Moreover, the central government 
plays an important role in decisions affecting 
secular growth. The present summary emphasizes 
similarities, both between countries with different 
institutional conditions, such as the degree of 
central planning, and between the current experi- 
ence of today’s less-developed nations and the 
earlier experience of the now-developed countries. 
It emphasizes further the contrast between pre- 
modern conditions and those characterizing mod- 
ern economic growth. Only brief reference will be 
possible to differences in the experience of indi- 
vidual nations and variations in trends from one 
period to another within a given country, although 
such differences are sometimes substantial. _ 

Production. From the viewpoint of the eco- 
nomic system as a whole, the central feature of 
modern economic growth is an immense and con- 


tinuous rise in the yield from human economic 
activity. This is reflected in the long-term growth 
rates of output per head of the total population 
shown in Table 1. For most of the countries listed, 
the rate has averaged from 13 to 19 per cent per 
decade. Such rates, continued throughout a century, 
imply an expansion of product per capita between 
3.4-fold and 5.7-fold. For Sweden, Japan, and the 
Soviet Union, which have sustained average rates 
on the order of 26 to 28 per cent per decade for 
five decades or more, the multiplication in a cen- 
tury would exceed 10-fold. 


Table 1 — Average rate of growth of real product per 
capita in developed countries for long peri- 
ods up to about 1960 


Per cent change in real 


Period product per capita 


(in years)* per decade 
England and Wales (later U.K.) 

1780 to 1881 101 13.4 

1855-1859 to 1957-1959 101 14.1 
France 

1841—1850 to 1960—1962 105.5 17.9 
Germany (later F.R.G.) 

1871—1875 to 1960-1962 88 17.9 
Netherlands 

1900-1904 to 1960-1962 59 13.5 
Belgium 

1880 to 1960-1962 81 14.8 
Switzerland 

1890-1899 to 1957-1959 63.5 16.1 
Denmork 

1870-1874 to 1960-1962 89 19.4 
Norway 

1865-1874 to 1960-1962 91.5 19.0 
Sweden 

1861-1865 to 1960-1962 98 28.3 
Italy 

1898-1902 to 1960-1962 61 18.7 
United States 

1839 to 1960-1962 122 17.2 
Canada 

1870-1874 to 1960-1962 89 18.1 
Australia 

1861-1865 to 

1959/1960-1961/1962 97.5 8.0 

Japan 

1879-1881 to 1959-1961 80 26.4 
European Russia (later U.S.S.R.) 

1860-1913 53 14.4 

1913-1958 45 27.4 


* Calculated from midpoints. 
Source: Kuznets 1966. 


It should be recognized that any summary meas- 
ure, such as real national product per capita, 
reflects certain value judgments and that, conse- 
quently, levels or rates of development cannot be 
measured with the scientific precision of, say, steel 
output. For example, centrally planned economies 
would confine such a measure of aggregate output 
to material production. The concept of economic 
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growth is itself a subjective one to some extent. 
However, almost any reasonably comprehensive 
measure would show comparably immense produc- 
tivity growth, largely because requirements for 
such elementary goods as food, clothing, and shel- 
ter continue to take up such a large proportion of 
economic endeavor. 

The rates shown are averages for half a century 
or more. For shorter periods much higher and also 
much lower rates have prevailed; this reflects a 
second characteristic of modern economic growth, 
significant variability in the growth rate over time. 
Wide fluctuations in output growth also occur in 
premodern economies, primarily in connection with 
the vagaries of agricultural conditions. In develop- 
ing economies, however, the nonagricultural sector 
becomes increasingly responsible for fluctuations. 
The movements in centrally planned economies 
have so far received relatively little study. In non- 
centrally planned economies the most fully ana- 
lyzed movement is the “business cycle,” which has 
typically averaged 3 to 4 years in duration. Longer- 
term movements for which some regularity is said 
to exist are “major cycles” (Juglar cycles), of ap- 
proximately 7 years’ duration; “long swings” 
(Kuznets cycles), ranging from approximately 10 
to 30 years in duration; and “long waves” (Kondrat- 
ieff cycles), of approximately 50 years in duration. 
Although these longer-term movements have not 
found general acceptance among economists, they 
serve to emphasize the high variability that has 
been observed in growth rates over periods of a 
decade or two, making inferences regarding secular 
trends from data confined to such limited periods 
quite hazardous, 

The trend in output per head cannot be at- 
tributed to a corresponding rise in labor input per 
head of the population. It is generally true that in 
now-developed nations the proportion of the popula- 
tion in the labor force is higher than in the pre- 
modern period, chiefly because the proportion of 
the population of working age has risen as fertility 
has declined. But this has been more than offset 
by a decline in the average hours worked per year, 
so that, on balance, the man-hour input per head 
of the total population has usually declined some- 
what. 

More plausible factors in the rise of output per 
head are capital input and average education per 
worker. Certainly the rises in these are among the 
most striking trends in modern economic growth. 
Although precise evidence is lacking, the expansion 
in the per-worker stock of reproducible capital 
(primarily structures and equipment) has involved 
orders of magnitude comparable to that of expan- 
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sion in output per worker. With regard to education, 
modern economic growth has been accompanied 
by the virtual elimination of illiteracy, a trend 
toward universal primary and, subsequently, sec- 
ondary school education, and a rise in college en- 
rollments. The quality of the labor input has been 
further enhanced by substantial advances in the 
nutrition and health of the labor force. 

Nevertheless, when one considers the nature of 
premodern methods of production, it seems doubt- 
ful that greater capital, education, and other quality 
inputs per worker, within the framework of that 
rudimentary technology, would have yielded the 
immense and continuing rates of productivity im- 
provement characterizing modern growth. 

Technological innovation. The fundamental 
basis for this productivity growth has been tech- 
nological innovation on a widespread and continu- 
ous scale. Indeed, the concrete nature of the 
technological developments helps explain the 
changing nature of the economic inputs, as well as 
a number of other trends accompanying modern 
economic growth. 

Although the great variety of these innovations 
makes a summary description difficult, it is possible, 
nevertheless, to identify a few general features that 
usually distinguish modern from premodern meth- 
ods. Specifically, the use of methods involving 
mechanization and high-energy inputs per worker 
and the associated shift toward a mineral-based 
economy especially set off many modern tech- 
niques. Large and carefully controlled volumes of 
energy are applied to productive processes, drawing 
on mineral sources such as coal, petroleum, and 
natural gas; in contrast, premodern energy inputs, 
deriving largely from human, animal, wood, wind, 
and water sources, are much smaller and more 
erratic. Intimately associated with the energy 
changes are the shift from hand tools to machine 
processes and wider use of physical-structure ma- 
terials derived from ferrous and nonferrous min- 
erals rather than forests or agriculture. With the 
resort to machine production, final products be- 
come more standardized; labor becomes much 
more specialized, both by industry and, within 
enterprises in an industry, by process. A corre- 
sponding proliferation of detailed industrial and 
occupational classifications occurs. 

Effective use of modern techniques entails a 
number of corollaries, Because of technological in- 
divisibilities, optimum scales of production become 
greater. In manufacturing, this means that produc- 
tion tends to be organized in factory establish- 
ments, employing large numbers of workers and 
producing far in excess of local requirements, 


rather than in small shops involving few hands and 
producing largely for local consumption. 

Optimal locations too are affected by the more 
advanced techniques. With regard to raw materials, 
minerals are substantially less ubiquitously dis- 
tributed than forest and agricultural materials, and 
differentials with regard to the efficiency of dif- 
ferent locations, particularly with regard to energy 
materials, are thereby heightened. On the side of 
markets, because production so greatly exceeds the 
needs of those immediately engaged, existing pop- 
ulation centers offering concentrated markets be- 
come increasingly attractive. Transport junctions 
providing ready access to markets and/or materials 
also become more advantageous. The resulting dif- 
ferential advantage of certain locations, where one 
or more such factors is particularly favorable, leads 
to geographic concentration of production and 
urbanization. This tendency is reinforced up to a 
point by external, or “agglomeration,” economies. 

Even these limited observations clarify some of 
the trends in the industrial and occupational char- 
acteristics of the labor force during modern eco- 
nomic growth. Associated with the growth in 
specialization of labor by industry, task, and loca- 
tion in the production of commodities is an 
increasing need for transportation of these com- 
modities and for internal trade generally—and, in 
turn, for increasing monetization of the economy 
to facilitate the manifold growth of transactions. A 
rise is observed in the proportion of the labor force 
specialized, not in the direct production of tangible 
commodities or final services but in ancillary ac- 
tivities, such as moving, storing, and distributing 
goods. Associated with the growth of scale is a 
growing need for labor in control functions—in 
administrative and clerical jobs—as problems of 
coordination of processes and of plant layout mul- 
tiply. (Even in direct production itself, the worker's 
function increasingly becomes one of control, the 
tending of a machine.) A premodern economy 
could not spare such a large proportion of spe- 
cialized workers for continuous employment out- 
side direct production; a developing economy not 
only can, but apparently must, at least to a sub- 
stantial extent. Thus the trends in industrial and 
occupational structure of a developing economy 
testify both to the distinctiveness and immense 
productivity of the underlying technological innova- 
tions, for they show that the methods employed 
both require and permit a substantial proportion 
of labor to be engaged in activities other than direct 
production. [See INDUSTRIALIZATION.] 

The nature of the underlying technological de- 
velopments also helps explain the previously men- 


tioned trends in economic inputs. The growth of 
machine production and scale calls for much more 
capital investment per worker, both in equipment 
and (particularly) structures to house production. 
Transportation and power needs have major capital 
requirements, as does the need for workers’ ac- 
commodations where new locations are developed. 
Similarly, the much greater complexity of produc- 
tion requires a better educated labor force to learn 
and execute the new techniques. The continuous 
change characterizing modern economic growth 
(in techniques, locations, types of goods and jobs) 
calls for qualities of adaptability and adjustment 
on the part of labor which education facilitates. 

These brief remarks on energy and mechaniza- 
tion as leading technological features of a de- 
veloped economy do not do justice to the immense 
number of detailed changes implied by such char- 
acteristics or to the variety of other technological 
advances that have taken place, such as scientific 
hybridization, chemicals and plastics, electronic 
communications and controls, scientific manage- 
ment, and the supermarket, to mention only a few. 
Nevertheless, they do suggest the underlying logic 
of some of the principal trends in economic or- 
ganization characterizing modern economic de- 
velopment. 

Consumption. The rise in output per man-hour 
implies a similar trend in real income per member 
of the population, although in part, as was noted, 
the benefits of higher productivity are taken in the 
form of more leisure per worker. The specific im- 
plications of this rise in real income for the con- 
dition of the population would best be revealed by 
level of living indicators, if they were available, 
such as per capita calorie consumption, the starchy- 
staple ratio, persons per room, per cent of dwellings 
with piped water and flush toilets, various house- 
hold electrical appliances per capita, physicians 
and hospital beds per capita, school enrollment 
ratios, and newspaper circulation and cinema at- 
tendance per capita. Suffice it to say that modern 
economic development in general has meant an 
unprecedented advance for most of the population 
in food, clothing, shelter, household furnishings, 
and health, education, and recreational services— 
to the extent that dire conditions such as malnu- 
trition have virtually been eliminated. 

The foregoing statement refers to absolute levels. 
When one turns to relative magnitudes, to the 
proportionate distribution of the end product of 
the economy, the trends are less clear-cut. This is 
partly because variations among countries in the 
institutional conditions governing resource alloca- 
tion cause differences in the uses to which the 
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higher productivity accompanying modern eco- 
nomic growth is put. One well-established alloca- 
tive trend is a rise in the share of the gross national 
product devoted to gross capital formation, attrib- 
utable to the capital requirements of modern 
growth. A second is a decline in the proportion of 
consumption expenditure devoted to food. This de- 
velopment, rooted in the structure of consumer 
preferences, plays an important part in the decline 
in the relative importance of agriculture during 
economic growth, one of the most characteristic 
and pervasive trends. A third is a rise in the propor- 
tionate importance of new or modern goods rela- 
tive to that of traditional items of consumption, 
reflecting the impact of technological progress in 
the consumption sphere. It is an interesting specu- 
lation that whereas much of the substantive in- 
crease in the early phases of modern economic 
growth takes the form of food, clothing, and shelter, 
at a more advanced phase much of the change is 
connected with the automobile and the way of life 
associated with it. 

Income distribution. It is obvious that such 
massive changes in industrial, occupational, and 
spatial distribution, and in the relative proportions 
of factor inputs must leave their imprint on the 
income distribution of a society, for not all seg- 
ments of the population would be equally motivated 
or equipped to pursue the new opportunities. Gen- 
eralizations about trends, however, are frustrated 
by a serious lack of data and the fact that institu- 
tional differences intervene in this area to such an 
extent that even the concept of income shares 
sometimes differs among countries. Two statements, 
based almost entirely on fragmentary data for a 
few noncentrally planned economies in this cen- 
tury, may be ventured. One is that the return per 
unit of labor has risen relative to that of capital, 
offsetting a contrary movement in the relative 
factor quantities. As a result, changes in the rela- 
tive income shares of labor and property have been 
confined within fairly small limits. The second is 
that income inequality has on balance tended to 
lessen. This is suggested both by direct observa- 
tions on the size distribution of income and by 
scattered data on income-per-worker differences by 
industry, occupation, and/or region. There is some 
indication that this trend has chiefly involved an 
increase in the share of middle-income groups at 
the expense of upper and thus is partly bound up 
with the growing importance of a “middle class” 
during modern economic growth. [See INCOME DIS- 
TRIBUTION. ] 

External trade. Modern economic growth has 
brought with it a vast expansion in the absolute 
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volume of a nation’s international trade. Indeed, 
for most developed nations it has involved a greater 
relative rise in international trade than of GNP, 
despite the unprecedented growth rate of the latter 
(although this trend has been somewhat reversed 
in the twentieth century). Economies not experi- 
encing a relative rise in international trade are 
those centrally planned economies which have 
sharply restricted participation in international 
trade through autarkic policies and some of the 
overseas countries settled by Europe where trade 
played a major role even in the premodern period. 

Social and political changes. The movement 
from relatively high to low levels in mortality and 
fertility associated with modern economic growth 
has been so pronounced that it merits the distinc- 
tive title “demographic transition.” The mortality 
decline, which typically precedes that in fertility, 
has been most marked among (although not con- 
fined to) younger age groups, particularly infants. 
Life expectancy at birth in developed nations is 
now in the vicinity of seventy years, compared with 
a premodern figure of thirty to forty years. The 
major pestilential diseases (smallpox, the plague, 
cholera, yellow fever, typhoid, and typhus) have 
largely been eliminated, and, more recently, mortal- 
ity from infectious and parasitic diseases, such as 
tuberculosis, pneumonia, and dysentery, has been 
drastically reduced. The principal causes of death 
in developed economies today are chronic and de- 
generative illnesses (heart disease, cancer, acci- 
dents, and so on). The mortality movement is 
bound up in part with the improved living levels 
of the population, for example, improved nutrition, 
which strengthens resistance to disease. It is due 
also and increasingly, however, to important ad- 
vances in medical knowledge and significant in- 
novations in the sphere of public health. 

In a developed economy women usually marry 
later and bear fewer children—typically those sur- 
viving to the end of childbearing age will have 
averaged two to three births versus four to six for 
women in the premodern period. The reduction in 
the birth rate appears partly attributable to the 
decline in infant mortality, since to achieve a given 
number of surviving children, a smaller number of 
births is needed. From the viewpoint of females’ 
time required in childbearing and care, reproduc- 
tion, in the significant sense of surviving descend- 
ants, becomes, like economic production, a more 
efficient process during modern economic growth. 

The “dependency burden,” the ratio of dependent 
children and elderly persons to those in prime work- 
ing ages (say, from 15 to 59 years) usually de- 
clines in the course of development. This is due to 


a reduction in the proportion of children in the 
population as fertility declines; the proportion of 
elderly persons actually rises somewhat. The num- 
ber of persons per household also declines. In part 
this reflects the lower ratio of children to adults; 
in part it is due to the progressive decline of two- 
generation or three-generation family units during 
economic growth. 

As has already been noted, economic growth is 
marked by a progressive shift of population from 
rural to urban areas. The basic impetus for this 
arises from the impact of technological change on 
the spatial distribution of given industries, but it 
is significantly reinforced by the shift in final con- 
sumption patterns which favors the growth of non- 
agricultural industry relative to agriculture. The 
growth of urban centers necessitates a substantial 
redistribution of population through internal migra- 
tion. [See LABOR FORCE, article on MARKETS AND 
MOBILITY.] 

These changes in population characteristics 
have, in turn, influenced the trends in final prod- 
ucts, The reduction in household size and the aging 
of the population have altered final demands. The 
effect of the shift to urban locations is apparent in 
the upward trend in consumer expenditures on a 
number of predominantly urban services—trans- 
portation, communication, personal services, and 
recreation. 

In addition, the trend toward urbanization, and 
the consequent need for public municipal services, 
is one of the factors responsible for an increase in 
the relative size of government during modern eco- 
nomic growth, even in noncentrally planned econ- 
omies. More generally, in these economies an 
expansion tends to occur in public expenditure on 
“social” purposes (education, health, social security, 
housing, recreation) and “economic” purposes 
(agriculture, mineral resources, fuel and power, 
transport, roads, and so on). The size of the central 
government usually tends to rise relative to that of 
local governments, partly because of the growing 
complexity and integration of the economy. With 
regard to revenue sources, direct taxes, particularly 
individual income taxes, usually grow in impor- 
tance relative to indirect levies, such as excise taxes 
and foreign trade duties. [See TAXATION.] ? 

The trends characterizing modern economic 
growth significantly alter the economic bases of 
class differences and political power. In noncen- 
trally planned economies, for example, there has 
been a marked rise in the proportion of employees 
in the labor force and a diminution of self-em- 
ployed workers. In part this is associated with the 
shift in the industrial structure of the labor force 


toward nonagricultural activity, since the latter is 
characterized by a substantially higher proportion 
of employees to self-employed workers than is agri- 
culture. In part, however, it reflects a trend within 
the nonagricultural sector in favor of employees, 
due to the growth of factory relative to handicraft 
operations. Another pertinent change has been the 
rise in white-collar employment relative to manual 
labor, an influence strengthening “middle class” 
tendencies. Still another has been the drastic de- 
cline in the relative economic importance of 
agriculture, which has weakened the political im- 
portance of this sector. Such “noneconomic” reper- 
cussions of modern economic growth have, in turn, 
had feedback effects on the growth process through 
the types of social attitudes and political measures 
that have ensued. 

Patterns and types of economic growth. The 
time shape of these various trends is by no means 
uniform among the different characteristics. More- 
over, shorter-run changes through time may show 
significant departures from the underlying trend 
over lengthy intervals, because of fluctuations such 
as those mentioned earlier or other factors. 

A number of the changes are, however, subject 
to a distinct upper or lower limit, such as the rise 
in urbanization, literacy, and nonagricultural ac- 
tivity. As might be expected, for such character- 
istics the basic secular trend exhibits a logistic 
pattern, that is, initially change occurs at a slow 
rate, then at an accelerating one, and finally, as it 
approaches the limit more closely, at a decelerating 
one. Even with regard to narrowly defined cate- 
gories of commodities or industries, where no com- 
parable logical limit exists, evidence has been 
advanced to show the existence of a secular pattern 
of acceleration followed by retardation. Among 
other factors, the nature of technological change 
has been accorded an important role in accounting 
for this—in part because technological progress 
within a narrowly defined industry appears to slow 
down after a time, in part because technological 
developments in new commodities and industries 
create opportunities competing with the older ones. 
The latter point helps to reconcile the seemingly 
contradictory observation that growth in the econ- 
omy-wide level of output per man-hour does not 
show retardation. 

It has already been suggested that noticeable dif- 
ferences between countries sometimes exist in the 
pattern for a given variable or in the relationships 
among two or more variables. This may be illus- 
trdted by reference to demographic phenomena, 
where the currently available documentation is in 
some respects fullest, especially for the premodern 
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period. A shift from early to late marriage occurred 
in many western European areas well before the 
eighteenth century, and in a number of these a 
mild reversal, a decline in age at marriage, occurred 
in conjunction with economic advance in the nine- 
teenth and twentieth centuries. In eastern Europe, 
on the other hand, early marriage persisted in 
many areas through the 1930s despite other devel- 
opmental changes. With reference to the demo- 
graphic transition, it appears that fertility began 
to decline in some European areas prior to or si- 
multaneously with the mortality decline, Recent 
declines in mortality appear typically to be sharper 
and more rapid than earlier ones. An important 
issue, which can only be noted here, is to what ex- 
tent differences such as these tend to vary system- 
atically between early comers and late comers to 
the growth process. 

To turn from individual aspects of modern eco- 
nomic growth to the process viewed as a whole, we 
must examine whether it is possible to distinguish 
regular phases or periods in the process and 
whether the experience of various countries can be 
classified in terms of different growth “types.” For 
example, one stage-scheme that has been proposed 
identifies five phases: traditional society, precon- 
ditions for take-off, take-off, drive to maturity, and 
age of high mass consumption (Rostow 1963). 
Another recent typology, proposed by Hoselitz, em- 
ploys a threefold classification of growth: (1) ex- 
pansionist versus intrinsic, depending on the de- 
gree to which unused land and other resources are 
available; (2) dominant versus satellitic, depend- 
ing on the importance of foreign trade in the econ- 
omy; and (3) autonomous versus induced, based 
on the degree of central planning (Conference . . . 
1959). One advantage of the proposed stage con- 
ception is that it highlights the fact that modern 
economic growth is characterized by a substantial 
acceleration in the pace of social change. Moreover, 
its emphasis on the take-off as a point of transition 
to “self-sustaining” growth underscores a funda- 
mental aspect of some of the changes, namely, that 
they are largely irreversible. Similarly, the sug- 
gested typology brings to the fore certain factors— 
the absolute size of the economy, the nature of the 
decision-making processes—influencing the growth 
process. On the other hand, attempts to apply such 
schemes in their specific form to interpretation of 
actual experience have encountered important dif- 
ficulties. Although they are suggestive with regard 
to some of the causal mechanisms or factors un- 
derlying modern economic growth, the use of such 
concepts as valid generalizations about stages or 
types of growth seems premature in view of the 
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Table 2 — Crude death rate per thousand of total population, selected areas, 


since 1750 or earliest known date thereafter 


England United Germany Russia 
and Woles States (later F.R.G.) Italy (later U.S.S.R.) Japan Chile India Africa 
About 1750 32 
About 1800 25 25 
About 1850 23 23" 27 31° 40 35 
1900-1909 15 16 19 22 30 214 32 43 
1930-1939 12 “n n 14 18° 18 24 31 30-35" 
1955-1959 12 9 n 10 8 8 14 20 27 
a. 1870-1880. 
b. 1861-1870. 
e 1937. 


d. 1905-1909; probably an underestimate, 


limited extent to which the facts of growth have 
so far been established and runs the danger of 
tending to strait-jacket thinking on the subject. 


International spread of economic growth 


Ideally, a description of the world-wide diffusion 
process would be built on a number of time series 
of growth characteristics for each of the countries 
of the world since around 1750. Lacking this, prin- 
cipal reliance is placed here on a few characteris- 
tics of what are taken as fairly representative sit- 
uations in different parts of the world—although 
availability of data has played a part too in the 
choice of series and areas. In keeping with the ap- 
proach adopted here, economic growth is viewed 
as one facet of modernization more generally con- 
ceived. Two indicators of social development are 
used, the crude death rate and school enrollment 
rate, and two of economic development, nonagri- 
cultural share of the labor force and the growth 
rate of GNP per capita. Although the measures are 
crude—for example, variations in the age distribu- 
tion of population in time and space impair the 
comparability of death or school enrollment rates— 
as a whole they produce a reasonably consistent 
picture. 


Principal sources: Demographic Yearbook; Population Bulletin 
of the United Nations; Kuznets 1966. 


Typically, economic historians trace the origin 
of modern economic development to eighteenth- 
century England, and in particular to the industrial 
revolution. By the latter is meant, in part, the com- 
mercial application of Watt’s steam engine and 
certain innovations in the iron and textile indus- 
tries over the period from around 1730 to 1820, 
especially the latter half. From England, the in- 
dustrial revolution is viewed as having diffused 
gradually, first to neighboring European nations 
and England’s overseas descendants and then to 
some further nations, although the latter aspect of 
the picture becomes increasingly blurred. The de- 
scription presented is qualitative in nature, al- 
though approximate periods of an industrial revo- 
lution are identified for some of the now-developed 
nations. 

Indicators of modernization provide a broader 
picture, of which the economic forms a reasonably 
consistent part. Tables 2, 3, and 4 reflect, respec- 
tively, the diffusion of modern mortality conditions, 
education, and industrialization; each tells a fairly 
similar story. For this purpose, a useful way of 
forming impressions is to compare the dates at 
which countries reach a crude death rate of 30 per 
thousand, a school enrollment rate of 10 per cent 


Table 3 — Estimated percentage of total population of all ages enrolled in school, selected areas, 
since 1818 or earliest known date thereafter 


England United Germany Russia 

and Wales States (later F.R.G.) Italy (later U.S.S.R.) Japan Chile India Nigeria 
1818 6 
1830 9 15 17 
1850 12 18 16 b 2 “a 
1887 16 22 18 n 3 7 4 2 
1928" 16 24 17 i 12 13 154 4 1 
1954* 15 22 13 13 15 23 18 7 5 


a. Values for 1954 (and, to some extent, for 1928) for the most-developed countries are biased downward relative to those for other times and places, because of 


decline in ratio of school age to total population. 
b. Not available. 
e. Around 1870. 
d. 1938. 


Source: Easterlin 1965. 
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Table 4 — Percentage of labor force in nonagricultural activity, selected areas, 
since 1700 or earliest known date thereafter 


England United Germany 

end Wales States (later F.R.G.) 
1700-1775 55-60" 
1801 65 (68°) 28° 
1841 7 31 
About 1870-1880 85 50 58 
About 1900 91 63 65 
About 1925 93 76 70 
About 1935 94 80 72 
About 1950-1960 95 7 7 


a. Share of nonagricultural activity in national product. 
b. 1820. 
c. Not available. 


of the population, and a percentage of labor force 
in nonagricultural industry of around 30. England 
tended to be the leader, with major changes usually 
discernible in the eighteenth century. Within Eu- 
rope, Germany, a representative of northern and 
western Europe, was well on the road by the middle 
of the nineteenth century, followed with some lag 
by Italy, a representative of southern Europe, and, 
somewhat later still, Russia, a representative of 
eastern Europe. 

In the overseas areas settled by Europe, moderni- 
zation was well under way in the United States in 
the first half of the nineteenth century. Canada, 
Australia, and New Zealand, if data were shown, 
would probably lag somewhat, although not a great 
deal. The series for Chile show signs of moderniza- 
tion in the early twentieth century. As a represen- 
tative of Latin America, Chile has been in the van- 
guard, rather than near the average, and has been 
at least several decades ahead of Latin American 
nations with substantial Indian populations. (Also, 
Chile's position with regard to the share of the 
labor force in nonagricultural activity has been dis- 
proportionately high compared to most countries 
in this area.) In general, modernization in overseas 
countries populated chiefly by settlers from north- 


Russia 


Italy (later U.S.S.R.) Japan Chile India Africa 
38 17 
41 25 30 28 
48 29 49 61 25 
52 e 54 63 e 
59 60 67 70 27 26 


Principal sources: Kuznets 1957; 1966. 


ern and western Europe preceded that in Latin 
America, whose settlers originated primarily in 
southern Europe, Thus the New World pattern re- 
flects to some extent the diffusion pattern within 
Europe itself. 

In Asia and Africa, indications of modernization 
were apparent in Japan in the latter part of the 
nineteenth century and in the Union of South 
Africa in the early twentieth; but these are notice- 
able exceptions, at least as far as indexes of eco- 
nomic development are concerned. Typically, these 
indexes show relatively little evidence of modern 
economic growth in these continents. With regard 
to social development, however, the indicators for 
India, a country reasonably representative of 
changes in Asia, do suggest that elements of mod- 
ernization have been occurring in the course of the 
twentieth century, and even the fragmentary data 
for Africa show some signs of this since the 1930s, 
Nations of northern Africa would probably tend to 
be somewhat ahead of the sub-Saharan countries 
and more like India in timing. 

A table in a form comparable to the preceding 
ones, showing levels of real product per capita at 
various dates, would shed valuable light on trends 
in the relative standing of nations and the extent 


Table 5 — Per cent per decade growth rate of real gross national product per capita, selected areas, 
since 1700 or earliest known period thereafter 


England United Germany 

and Wales States (later F.R.G.) 
1700-1780 2.0 
1780-1841 13.5 
1841—1881 13.3 16.2 9.2 
1881-1913 17.4 16.9 18.1 
1913-1960 15.5 18.0 17.4 
1927-1960 14.1 17.3 23.7 


. 1851-1855 to 1871-1875. 
- 1862-1868 to 1874-1883. 
. 1860 to 1913. 

» Not available, 


aarce 


Russia Latin 
Italy (later U.S.S.R.) Japan America Indio* 
—0.8" 
9.1 14.4% 24.3 4 
17.4 27.4 27.9 5 
22.1 e 26.0 18.8 5 


. Periods are, in order, 1857-1863 to 1896-1904, 1896-1904 to 1946-1954, and 1937-1940 to 1959-1962. 


Principal sources: Kuznets 1964a; 1966, 
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of the gaps between them. Although sufficient data 
are not presently available for this, enough infor- 
mation can be secured on rates of growth to at 
least check some of the impressions so far obtained 
(Table 5). For those developed nations where rates 
are available over a long enough period, a notice- 
able rise to modern growth-rate levels is usually 
apparent at a time consistent with the previous 
tables. On the other hand, for India, the one less- 
developed area shown here with a reasonably long 
time series, the growth rate remains at a relatively 
low level, a picture consistent with that relating to 
the nonagricultural share of the labor force. 
Since the social development indicators do sug- 
gest the gradual spread in the twentieth century 
of elements of modernization to less-developed 
areas in Asia and Africa, it is of interest to look at 
their most recent experience. Estimates of the rates 
of change, converted to a per decade basis, of prod- 
uct per capita in these areas (available only for 
noncommunist countries) for 1957/1958 to 1963/ 
1964 are given in Table 6. There is perhaps a sug- 
gestion here of an increase in the rates for these 
areas compared to experience prior to World War 
1, although the earlier observation regarding vari- 
ability of growth rates over time cautions against 
placing excessive confidence in interpretation of 
such rates as secular ones. Some support for this 
inference, however, is provided by data on capital 
formation, which show a noticeable rise since the 
pre-World War 1 period in the proportion of re- 
sources devoted to this purpose in underdeveloped 
countries. 


Table 6 
Per cent 
All noncommunist countries 25 
Less-developed areas only 23 
India 21 
Africa* n 


* Not including Algeria, Congo (Leopoldville), and Republic of South Africa. 


Source: U.S. Agency for International Development. 


Effect on international disparities. One conse- 
quence of the uneven spread of modernization has 
been the creation of wide differences in various 
economic and social conditions among nations at 
any given time. Tables 2 through 5 shed light on 
the size and trend of such differences. The pattern 
for mortality and school enrollment rates is rather 
similar. In the century or so up to, say, World War 
1, international differences widened markedly as 
rapid improvement occurred in Europe, northern 
America, and Oceania, and relatively little in Asia, 
Africa, and Latin America. Since then, and particu- 


larly after World War u, as advances in the former 
group slowed down and those in the latter accel- 
erated, differences have narrowed noticeably, most 
markedly in mortality conditions. 

There is little indication in the economic devel- 
opment indicators, however, of a lessening of dif- 
ferences. On the contrary, Table 4 suggests that 
the gap in the percentage of labor force in nonagri- 
cultural activity has continued to widen. With re- 
gard to GNP per capita, it can also be inferred, 
despite the limited data, that relative levels have 
continued to diverge. Today’s developed countries, 
considered as a bloc, account for approximately 
one-quarter of the world’s population; four less- 
developed nations (mainland China, India, Indo- 
nesia, and Pakistan) account for about four-tenths. 
To judge from the indicators for the latter part of 
the nineteenth century, the developed group was 
well ahead of these four less-developed countries 
at that time. Since then, the developed group grew 
at a rate on the order of 15 per cent per decade, 
enough to double income in five decades. On the 
other hand, the four less-developed countries had 
such low income levels as of 1950 that to assume 
a growth rate over this preceding period of even 
the same magnitude would imply improbably low 
income levels for them in the late nineteenth cen- 
tury. This is supported by the figures for the only 
one of the four for which an estimate is available, 
India, where the estimated growth rate averaged 
only around 5 per cent per decade. 

Table 7 provides a summary view of the current 
state of international differences in GNP per capita 
produced by these historical trends. Although de- 
fects of the measure tend on balance to exaggerate 
differences, there is little question that to date the 
spread of modern economic development has cre- 
ated an unprecedented disparity in economic con- 
dition between, on the one hand, a small though 
sizable segment of the world’s population, chiefly 
resident in Europe and its overseas descendants, 
and, on the other, the vast majority, primarily con- 
centrated in Asia and Africa. (Table 7, in which 
populations are classified according to the national 
average GNP per capita, may exaggerate somewhat 
the relative position of the mass of the people in 
Latin America, since there is some evidence that 
income inequality there may be disproportionately 
high.) The disparity portrayed, which reinforces 
(and in part has arisen from) an already existing 
cultural division, itself becomes, via its political and 
other repercussions, a factor bearing on the pros- 
pect for continued long-term growth of now-devel- 
oped nations and the further spread of modern 
economic growth to less-developed ones. 
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Table 7 — Per cent of world population distributed by national average GNP 
per capita and geographic area, about 1958 


Southern Northern 

GNP per capita World Northwestern and eastern America Latin 
(1958 U.S. dollars) total Europe* Europe* and Oceania America Asia Africa 
Totol 100.0 6.7 15.0 71 7.0 57.1 7.1 
1,000 and over 13.3 6.2 0 7.1 1) 0 (9 
575-999 8.9 0.4 8.1 0 0.3 0.1 0 
350-574 6.1 0.1 3.9 0 1.4 0.1 0.6 
200-349 13.3 0 2.9 0 4.3 5.7 0.5 
100-199 6.6 0 0.1 0 0.8 3.8 1.9 
Below 100 51.7 0 0 0 0.1 47.4 4.1 


* Northwestern Europe comprises the U.K., France, F.R.G., Eire, the Low Countries, Scandinavia, and Switzerland; southern and eastern Europe, all the rest of Europo 


including all of Russia. 


Consequences for world population. The spread 
of modernization has brought major consequences 
for the size, distribution, and condition of the 
population of the world, viewed as a whole. The 
progressive diffusion of improved mortality con- 
ditions has led to a noticeable rise in the rate of 
increase of the world’s population. Between 1000 
and 1750 a.D. growth occurred at a rate that on 
the average would not quite have doubled popula- 
tion every 500 years. (This is, of course, an aver- 
age; there were wide variations over time and in 
space.) In the modern period world population 
doubled between 1750 and 1900, and in the twen- 
tieth century it has almost doubled again in the 
first 60 years. 

The uneven spread of modern mortality con- 
ditions has noticeably altered the international 
distribution of population, particularly between 
Europe, northern America, and Oceania, on the 
one hand, and Asia (excluding Asiatic Russia) 
and Africa, on the other. The lead of the former 
area in the mortality decline raised population 
growth rates there relative to Asia and Africa, so 
that by 1930 its share in the world total had 
risen from a fifth to a third. Since then, with the 
spread of reduced mortality conditions to Asia and 
Africa and with the continued transition to lower 
fertility in the West, this trend has been somewhat 
reversed. The share in the world total of Latin 
America, although small, has risen, at first grad- 
ually and then in the twentieth century with in- 
creasing rapidity as mortality declined sharply. The 
first line of Table 7 shows the approximate distri- 
bution of world population by region in the late 
1950s. 

By 1960 the world population exceeded three 
bilion—more than four times the 1750 level. 
Despite this great and accelerating growth, it is 
probably true that if one considers absolute rather 


Sources: Demographic Yearbook for 1963; United Nations 1952-1963; 
Kuznets 1964b. 


than relative levels, most persons throughout the 
world at that date were better off with regard to 
health conditions and formal education than was 
true in the past, although much of this change has 
been fairly recent. Whether economic conditions, 
in absolute terms, have generally improved is more 
debatable. Recent estimates of world per capita 
food production by region show, in general, fairly 
little difference for better or worse in the situation 
of those in less-developed areas between the late 
1930s and early 1960s. 

Effect on international relations. In 1750 the 
fortunes of those in any one area of the world were 
usually but tenuously connected with those of per- 
sons in other parts; in many respects individuals 
and nations were isolated from one another. A 
major consequence of the appearance and spread 
of modern economic growth has been a vast ex- 
pansion in international contacts and an increas- 
ing degree of interdependence among the peoples 
of the world. The obstacles imposed by physical 
distance have been drastically reduced by techno- 
logical developments in transport and communica- 
tion. The speed of international travel has been 
cut to less than one per cent of what it was at the 
beginning of the nineteenth century, and the in- 
ternational transmission of messages is virtually 
instantaneous. At the same time, the uneven 
spread of economic development has altered the 
international configuration of economic costs and 
political power in a way that contributes to the 
rapid expansion of contacts. 

Economic relations. In the area of economic 
relations, the physical volume of international 
trade rose almost sixfold from 1750 to 1850 and 
more than twentyfold in the next hundred years. 
This meant a growth in trade per capita of almost 
fourfold in the first period and tenfold in the 
second, Even though world GNP per capita rose 
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over this period it was in a substantially smaller 
proportion; hence it is clear that world trade rose 
greatly relative to world production. Moreover, all 
major areas of the world participated. A rough 
estimate for the share in world trade of Asia, 
Africa, and Latin America would run on the order 
of a fifth to a quarter both in the early nineteenth 
century and in the middle of the twentieth century. 
External indebtedness and international migration 
rose too at an unprecedented rate. At the peak of 
their growth, roughly between 1870 and 1914, the 
constant-price volume of foreign debts outstanding 
expanded around sevenfold and the volume of in- 
tercontinental migration was three times that in 
the preceding half-century. The international dis- 
tribution of investment and migration differed 
noticeably, however. In 1914, Asia and Africa ac- 
counted for perhaps one-quarter of the debt out- 
standing, but almost all intercontinental migration 
was confined to the European culture area and 
involved the settlement of overseas areas. The ex- 
pansion of all three economic flows—trade, invest- 
ment, and migration—was sharply curtailed in the 
interwar period, but since World War 11 it has re- 
sumed, although at a more moderate rate than 
before World War 1. 

The international flow of information, partly 
abetted by development of the international econ- 
omy, has expanded immensely too. Indications of 
this would be provided by such data as interna- 
tional mail flows per capita, exports and imports 
of foreign publications, and the number of inter- 
national periodicals and scholarly organizations. 

Political relations. Measures relating to inter- 
national political relations are scarce, but again 
there can be no doubt that the trend has been 
toward vastly expanded contacts—simple series on 
the number of personnel in consulates and diplo- 
matic organizations would reflect this. 

Since productive capacity is more closely related 
to military potential than is population, the spread 
of modern economic development has had major 
implications for the distribution of political power. 
The impression created by the population distri- 
bution as to the relative power of areas is virtually 
reversed when one looks at production. Although 
Asia, Africa, and Latin America outnumber the 
rest of the world by almost three to one, the rest 
of the world outproduces the former areas by more 
than three to one. Such differences are obviously 
pertinent to understanding why in 1914 Great 
Britain, France, and Germany, which together ac- 
counted for only about one-twelfth of the world’s 

population, counted as political dependencies areas 
embracing another one-quarter. But the spread of 


modern economic development altered the power 
balance not only between developed and less- 
developed areas but also among the various de- 
veloped nations themselves. The hegemony of 
Great Britain in much of the nineteenth century 
and the subsequent appearance of rivals such as 
Germany, the United States, Japan, and the Soviet 
Union show a timing pattern corresponding closely 
to the spread of modern economic development. 
The eruption in the twentieth century of the dead- 
liest wars in the history of mankind, in which the 
developed nations were the leading antagonists, 
is also clearly related. Rough estimates of the 
world-wide total of deaths from all wars involving 
500 or more deaths show the following trend: 
1820-1863, 1.5 million; 1863-1907, 4.0 million; 
1907-1950, 41.0 million. 

In the international political realm, then, as 
in the case of economic relations, the long-term 
record, broadly conceived, shows an immense ex- 
pansion of contacts, marred by serious disruption 
in this century. As this has occurred, so too have 
efforts at international cooperation on economic 
and political matters, as might be shown by data 
on the growing number of international confer- 
ences, administrative agencies, and arbitral tri- 
bunals. The establishment of world economic and 
political organizations can itself be conceived in 
part as a response to the pressing problems caused 
by the international diffusion of modern economic 
development within a world political framework 
of nation-states. 

Origins of modern economic development. The 
reasons for treating modern economic development 
as a distinctive epoch should be clear. In those 
nations now most developed there has occurred 
a radical change in the way of life of the average 
individual. At the same time, the diffusion of this 
new form of economic organization throughout the 
world has drastically altered international relations 
and created interdependence on an unprecedented 
scale. Although it is true that the majority of the 
world’s population has so far only begun to be 
touched, the rate of diffusion, compared with previ- 
ous epochs, such as that of settled agriculture, has 
been very rapid. 

How can one account for the appearance on 
the world scene of this phenomenon? The answer 
would appear to lie in the virtual explosion in the 
stock of knowledge, and particularly in scientifi- 
cally established knowledge, which has occurred in 
the past three centuries. Efforts to conceptualize 
this development in quantitative terms are only 
now in their beginning stages. One indication (but 
no more than that) is provided by the number of 


scientific periodicals at various dates: 1 in 1665; 
10 in 1750; 1,000 in 1850; and 80,000 in 1950. 
These figures, which represent only approximate 
orders of magnitude, suggest around a tenfold ex- 
pansion in each of the two successive centuries 
after 1650 and only a slightly lower expansion in 
the most recent century. 

In the developments of the seventeenth century 
now described as the scientific revolution, the di- 
viding line between “pure” science and technology 
was vague, As man mastered procedures for estab- 
lishing the “laws of nature,” it is understandable 
that among the problems which both stimulated 
such inquiry and attracted applications of the new 
knowledge, those of production of commodities 
and services should rank high. Thus, progress in 
the scientific comprehension of natural phenomena 
went hand in hand with the use of such knowledge 
to direct and control man’s physical environment 
in line with material needs. [See SCIENCE, article 
On THE HISTORY OF SCIENCE.] 

The scientific revolution helps account not only 
for the appearance of modern economic develop- 
ment as a distinctive epoch but also for the broad 
geographic pattern of its spread. Modern economic 
development makes its appearance in the Western 
world where the scientific revolution is occurring 
and spreads most rapidly to those areas where edu- 
cational development has made the transfer of new 
knowledge most feasible. The gains to be had from 
modern technology are not costless, however. As 
has been shown, major changes are involved which 
often require painful adjustments on the part of 
individuals or groups—in family life, economic 
activity, social status, or political power—and, 
therefore, call forth resistances to adoption of 
modern technology and more generally to change. 
It is understandable that typically the societies 
having so far found such adjustments most feas- 
ible (although nevertheless difficult) are those in 
which the same intellectual movement that had 
fomented the scientific revolution was promoting 
rational examination and discussion of social in- 
stitutions as well. [See TECHNOLOGY.] 

Viewed in these terms, modern economic growth 
is the manifestation in production of the growth 
and diffusion of the stock of knowledge stemming 
from the scientific revolution or, more generally, 
from the intellectual revolution, which began with 
the Renaissance. The increasingly urgent problems 
to which the spread of modern economic develop- 
ment has given rise are, at a remove, a reflection 
of the uneven development of this knowledge itself. 
However slow the diffusion of modern economic 
growth may appear to those interested in the wel- 
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fare of mankind as a whole, it has been rapid 
enough to produce serious social disruption within 
countries and major crises in international affairs. 
The capacity to cope with these problems has been 
limited by man’s inadequate grasp of social phe- 
nomena—indeed, by the limited awareness that 
such phenomena are even subject to scientific 
study and generalization. Moreover, the spread of 
economic development within a framework of 
nation-states may itself have impeded the growth 
of social science by promoting national ideologies 
that pose as science. If, therefore, one turns from 
the question of the origins of modern economic de- 
velopment, which lie in the growth of natural 
science, to that of the future outlook, much would 
seem to turn on the prospect for comparable 
progress in social science. 

RICHARD A. EASTERLIN 
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m 
THEORY 


Perhaps no subject has more fully engaged 
the minds and energies of economists from the 
very beginning than the search for a satisfactory 
explanation of economic growth. This was, in 
fact, the main focus of the work of Adam Smith 
and the classicists endeavoring to explain indus- 
trialization in eighteenth- and nineteenth-century 
England, of Marx and Schumpeter in the long neo- 
classical eclipse of interest, and, most recently, of 
the post-World War 1 revival by those concerned 
with growth both in the mature industrial and in 
the less developed nonindustrial societies. 

While there is by no means unanimity on how 
to define growth, it is convenient to adhere to the 


convention that real per capita national income or 
output represents the most reliable indicator of a 
system’s economic achievement at any point in 
time and that any change in real per capita in- 
come over time connotes economic growth. States- 
men and philosophers have joined economists in 
recognizing that economic growth defined in this 
way represents the most objective indicator of a 
society's welfare. It reflects changes in its ability 
to attain any socially agreed-upon set of goals, 
whether consumption, capital formation, national 
defense or, for that matter, leisure. 

Any viable theory of economic growth must thus 
be able to explain both the level of per capita in- 
come of a given economic system at a point in 
time and the determinants of changes in that per 
capita income from one period to the next. With 
respect to the former task, it is generally agreed 
that the determination of the level of per capita 
income at any particular historical moment de- 
pends in part on the quantity and quality of the 
economy's human resources; in part on the avail- 
able stock of material resources; in part on how effi- 
ciently people organize themselves for productive 
activity; and, finally, in part on the institutional 
environment in which the society finds itself. Simi- 
larly, changes in income per head from one period 
to the next must be explained in terms of changes 
in the above four dimensions. All growth theories 
must thus take into account, in one fashion or 
another, at least the following: the initial charac- 
teristics of and changes in the stock of material 
agents, i.e., the capital stock and changes in it; 
the initial characteristics of and changes in the 
society’s stock of human agents, i.e., population 
or labor force and population growth; the initial 
characteristics of and changes in the quality or 
efficiency of the production process, i.e., techno- 
logical change; and, finally, the evolution of the 
society's organizational or institutional milieu. 

The first of the above dimensions relates to the 
initial material wealth or capital stock per head 
and to the size of the economy's saving, i.e., that 
portion of output not consumed in the current 
period. Under the standard full employment as- 
sumptions we shall adhere to, all such savings are 
automatically invested and thus come to represent 
additions to the economy's capital stock in the next 
period. Since the size of the capital stock per head 
of the population is a principal determinant of 
output per head (or per capita income), the annual 
magnitude of the surplus per head is of the great- 
est importance. The second dimension, population 
and population growth, must be viewed as a two- 
edged sword. On the one side it provides the 
economy with the labor force necessary for pro- 


ductive activity, but on the other it requires con- 
sumption for its maintenance, thus diminishing 
the surplus available to enhance the per capita 
capital stock in the next period. The third dimen- 
sion, technological change, really is a shorthand 
way of referring to changes in the quality of the 
economy’s human agents, in the quality of the 
economy’s material agents, and in the over-all effi- 
ciency with which these factors are combined to 
produce output. Finally, changes in the way a so- 
ciety organizes itself with respect to its dominant 
social and political institutions, e.g., between mar- 
kets and direct controls as organizational devices, 
may have a profound impact on the more narrowly 
conceived economic performance of the society in 
question. 

Virtually all growth theory may thus be viewed 
as an elaboration of one or another of the above 
four essential facets, with each particular theory 
emphasizing what is considered most essential 
given the inductive evidence under consideration 
and the deductive reasoning being employed. Keep- 
ing this in mind, we shall first present the basic 
outlines of some of the more relevant growth theo- 
ries of an earlier day and then examine, again in 
necessarily broad terms, the status of the present- 
day theory of growth. 

The classical theory. The classicists—though 
there exist considerable differences as between 
Smith, Ricardo, Malthus, and Mill—thought in 
terms of the three factors of production—land, 
labor, and capital—cooperating to produce na- 
tional output; and growth, consequently, was seen 
as determined by what happens to land, labor, and 
capital over time. While in the classical theory 
land is viewed as fixed in supply, the population, 
following Malthusian notions, is seen to be a func- 
tion of the real wage (more exactly, of the gap be- 
tween the market wage and a long-run subsistence 
norm). Capital formation, made possible by the 
society's saving, provides the main engine of 
growth. More precisely, the total output generated 
at any given time is distributed to the various 
agents who exercise control over the three factors, 
i.e., the society's workers, landlords, and capi- 
talists, according to certain rules. Each class of 
owners then makes its own decision as to what 
proportion of income to consume and what pro- 
portion to save. To understand what happens to 
the stock of material wealth available for produc- 
tion in the next period, we must thus first under- 
stand the underlying classical theory of distribu- 
tioh and the classical theory of saving. 

The classical theory of distribution is inextri- 
cably intertwined with the concept of diminishing 
returns to increased dosages of labor (and capital ) 
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on the fixed land. As acreage that is less and less 
productive is brought under cultivation, the incre- 
mental output declines and the incremental return 
on the least productive land determines the return 
to land, or rent. The portion of total output left 
after rents have been paid is then distributed be- 
tween workers and capitalists, with workers paid 
at or near the institutionally determined subsist- 
ence wage level. Over time, as more and more 
labor (and associated capital) is expended on the 
land and diminishing returns become more pro- 
nounced, the landlord’s return (rent) increases 
and the capitalist’s return (profit) declines, with 
the worker's wage retaining its long-run constancy 
(as population growth erases any temporary gaps 
between the market wage and the subsistence 
wage). Since the classicists assume that workers, 
being inherently poor, are not able to save and that 
landlords, being inherently wastrel, do not choose 
to save, the entire burden of saving and investing 
falls on capitalists and their profits. With declin- 
ing profits, capital ac..mulation is increasingly 
squeezed, finally culminating in the cessation of 
further growth. Implicitly, if not explicitly, this 
prediction is based on the assumption that dimin- 
ishing returns in the preponderant agricultural 
sector will swamp constant or possibly even in- 
creasing returns in industry—and that improve- 
ments in the quality of the factors of production, 
or in the efficiency with which they can be or- 
ganized, are unlikely to be sufficiently important 
to change the outcome. [See the biographies of 
Ma.ruus; MILL; RICARDO; SMITH, ADAM.] 

Marx’s theory. Marx, in Capital: ‘A Critique of 
Political Economy (1867-1879), sees capital and 
labor combining in fixed proportions to produce 
output, with the accumulation of new capital re- 
sulting from the economy's surplus over consump- 
tion requirements in one period providing the mo- 
tive force for carrying the system forward to higher 
levels of per capita output in the next. The size of 
the saving fund automatically converted into in- 
vestment is once again determined by a distribu- 
tion theory and a saving theory. Marx’s distribu- 
tion theory is based on the inequality of bargaining 
power between capitalists and laborers in the 
market place, forcing wages down near subsistence 
levels. In his world, as in the classical, workers do 
not save and capitalists do not consume; thus, once 
again, investment is equal to total profits. 

Unlike his predecessors, however, Marx is much 
concerned with changes in technology altering the 
proportions in which capital and labor are com- 
bined to produce output, It is his view that all in- 
novations are by nature very laborsaving, in the 
sense that they raise the capital—labor ratio and 
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displace employed labor. It is this, plus exogenously 
determined population growth (Marx rejected Mal- 
thusian population theory), which contributes to 
the creation of the “reserve army of the unem- 
ployed,” which, in turn, deprives workers of all 
bargaining power. Meanwhile, as the capital stock 
per employed worker continues to increase, capital 
begins to lose its scarcity value, and its rate of 
return begins to fall. Capitalists will try to rescue 
the profit rate by using their dominant bargaining 
strength in the labor market to make employees 
work harder and further to reduce the wage, but 
the drive for capital accumulation finally reduces 
the rate of profit to a level which brings the capi- 
talist system to a halt. [See the guide to related 
articles under MARXISM. ] 

Schumpeter's theory. Perhaps Marx’s greatest 
contribution to growth theory was his emphasis 
on the importance of technological change in the 
production process. This emphasis was carried for- 
ward by Joseph Schumpeter in The Theory of Eco- 
nomic Development (1912). Schumpeter expanded 
the concept of innovation to include not only 
changes in technique but also changes in quality, 
in markets, and in supply sources. He was, more- 
over, the first really to focus on the human agent, 
the entrepreneur, who was capable of perceiving 
the potentiality of such changes and of doing 
something about them. 

Schumpeter’s theory attempted to analyze the 
economy's long-run progress by way of an ex- 
planation of its cyclical behavior as the system 
continuously departs from various quasi-equili- 
brium stationary states, only to return to others 
at ever higher levels of per capita income. A 
Schumpeterian stationary state is characterized by 
the fact that only enough net investment is being 
made to equip additions to the labor force at the 
capital—labor ratio applying to the already existing 
labor force. The economy is thus moving sideways: 
labor, capital, and output are all growing at the 
same rate and per capita income remains constant. 
This quasi-equilibrium state is then upset by the 
appearance—for reasons unclear but related to 
changes in the “climate’—of numbers of gifted 
individuals capable of perceiving additional at- 
tractive investment opportunities. These entre- 
preneurs appeal to the banking system for the 
creation of credit, which forces additional savings 
out of the public’s hands (through inflation) and 
into their own. Subsequently, pioneering entre- 
preneurs are followed by large numbers of less 
gifted imitators who tend to excess, and the sys- 
tem experiences overproduction and recession. But 

the new Schumpeterian stationary state is reached 


at a higher level of material wealth per person 
and a consequently higher level of per capita in- 
come. The growth process thus occurs as the 
consequence of discontinuous bursts of creative 
entrepreneurial activity. It finally comes to a halt 
only because of a change in the institutional set- 
ting as the “climate” for entrepreneurship dete- 
riorates secularly with the increasing bureaucratiza- 
tion of corporate enterprise. [See the biography of 
SCHUMPETER.] 


Modern theory 


Where contemporary growth theory also is 
concerned with the performance of the mature 
industrial society, it differs from the earlier at- 
tempts in that it is both less ambitious and more 
precise, Postwar growth theorists have been con- 
tent to relegate theories of population growth, of 
changes in the organizational or institutional 
framework, sometimes even of technological 
change, to other disciplines or to accept changes 
in these rather crucial dimensions as given, 
rather than to be explained within the framework 
of the growth theory itself. At the same time there 
is a conscious attempt to be more rigorous, to 
lean more heavily on modern methods of model 
construction, i.e., structural relationships supple- 
mented by behavioristic relationships. 

Harrod—Domar model. Modern growth theory, 
moreover, is not content to hypothesize certain 
reasonable behavioristic relationships based on 
casual observation but insists increasingly on 
statistical verification, including the estimation of 
the key parameters involved. It is thus greatly in- 
debted both to the development of the national 
income accounting system during the 1930s and 
to Keynes (1936), who, while himself primarily 
concerned with other, short-run stability matters, 
employed this system for more general theory con- 
struction and statistical implementation, [See Na- 
TIONAL INCOME AND PRODUCT ACCOUNTS.) This can 
perhaps best be demonstrated by examining a par- 
ticular view of growth, the so-called Harrod-Domar 
model (Harrod 1939; Domar 1946), as a prototype 
of post-Keynesian growth theory. While it repre- 
sents a rather early and relatively simple formula- 
tion of the problem, much of contemporary growth 
theory which follows can be said to represent de- 
partures from the basic theme presented here, €.g., 
using more realistic behavioristic assumptions. 

The basic ingredients of the Harrod—Domar view 
of long-run growth are as follows: 

(1) A production function which relates ‘the 
generation of total output, Q, to the available cap- 
ital stock, K, via the capital—output ratio, v, that 


is, Q = K/v. The theory assumes the constancy of 
the capital—output ratio, i.e., the number of units 
of capital it takes to produce a unit of output. 

(2) A theory of saving based on the Keynesian 
propensity to save, which stipulates that total sav- 
ings, S, in any one period is a given fraction, s, of 
total income or output, Q, of that period: S = sQ. 

(3) Adherence to the full employment of cap- 
ital hypothesis, i.e., all savings are automatically 
invested and become additions to the capital stock: 


Capital is the only factor of production explicitly 
considered in the Harrod—Domar system. Labor 
combines with capital in fixed proportions, but 
there is no attempt to reconcile exogenously deter- 
mined increases in the population or labor force 
with changes in the derived demand for labor as 
a factor input as capital accumulation proceeds. 


Table 1 — Harrod—Domar growth model 


t K Q Sel c L Q/L 
1 30 10 2 8 100 -1000 
2 32 10.67 2.13 8.54 102 1046 
3 34.13 11.38 2.28 9.10 104 1094 
4 36.41 12.14 2.43 9.71 106 1145 
5 38.84 12.95 2.59 10,36 108 1199 


Table 1 may help give a clearer picture of how 
the Harrod—Domar system operates. Let us assume 
that it requires 3 units of capital to produce each 
unit of output, i.e., the capital—output ratio is 3, 
and that the economy's saving propensity is .2, i.e., 
the various income recipients decide to consume 80 
per cent of their income in each period and to put 
away 20 per cent in the form of savings. If such an 
economy starts with, say, a capital stock (K) of 30 
in period (t) 1, it produces an output (Q) of 10. 
Out of this income 8 units are consumed (C) and 
2 are saved (S = I), leading to an increase of 2 in 
the capital stock of the next period. The new cap- 
ital stock of 32 units is capable of producing 10.67 
units of output in period 2. This new and higher 
level of income is once again allocated in propor- 
tions of 80 and 20 between consumption and sav- 
ings; the latter yields a further increment in the 
capital stock, and thus the economy continues to 
grow from one period to the next. The exogenous 
growth of the population or labor force (L), say at 
the rate of approximately 2 per cent a year, has no 
consequence for the productive capacity of the 
system, but affects its measured performance only 
in terms of per capita income growth as indicated 
in the last column of the table (Q/L). 

The rate of growth of capital can then be quickly 
determined by dividing each year’s increment, or 
the investment during that period, I = S = sQ, by 
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the already existing capital stock: I/K = sQ/K = 
s/v. In our example with s = .2 and v = 3, capital 
will be growing at an annual rate of 6.7 per cent. 
Moreover, since v = K/Q (the capital—output ratio) 
is a constant, the rate of growth of output or in- 
come over time must equal the rate of growth of 
capital. Whether or not per capita income increases 
or decreases in the course of the growth process 
will then depend on the relative speed with which 
the population is growing. If the rate of growth of 
population, g, is less than the rate of growth of in- 
come, s/v, per capita income will be rising at the 
rate (s/v) — g, etc. In the example, in other words, 
with population growing at a rate of approximately 
2 per cent, per capita income is rising, i.e., growth 
is occurring, at a rate of about 4.7 per cent an- 
nually, 

While we may raise a series of questions about 
its realism, this Harrod-Domar view of the world 
serves as a point of departure for much modern 
growth theory, not only as applied to the developed 
but also to the less developed world. The departures 
take the form of a series of modifications—some- 
times quite sweeping—of the stipulated rules gov- 
erning the way in which factors are combined to 
produce output, of how output is distributed, and 
of how technological and institutional change oc- 
curs. What follows will first trace some of the 
major modifications in modern growth theory as 
applied to the mature economy and then summar- 
ize the state of contemporary growth theory as 
applied to the underdeveloped world. 

Substitutable inputs models. A first and major 
modification of the Harrod—Domar world results 
from the explicit recognition (or better, a return to 
the recognition) that there exists more than one 
factor of production and that there is a strong pos- 
sibility not only of joint inputs of labor and capital 
but of their substitutability in producing a given 
output. This is the view typical of the so-called 
neoclassical growth theory represented by Solow 
(1956) and Swan (1956). Like Harrod—Domar, 
Solow accepts a proportional saving function, S = 
sQ, but unlike his predecessors, he conceives of a 
production process in which both capital and labor 
appear as substitutable inputs. The production 
function meets the usual conditions of constant 
returns to scale, i.e., if we double the amount of 
both labor and capital inputs, we double output. 
With the growth of the capital stock determined by 
the constant saving propensity and the now var- 
iable capital-output ratio [AK/K = (I/Q)/(K/Q) 
=s/v] and with the growth of the other input, 
labor, determined exogenously at the rate g [AL/L 
= g] we have all the necessary ingredients to trace 
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the performance of the system as a whole from one 
period to the next. 

If, for example, the rate of growth of capital, 
s/v, exceeds the rate of growth of population or 
labor, g, the economy's capital—labor ratio rises, 
i.e., each worker is equipped with a larger stock of 
material resources. According to the law of dimin- 
ishing returns, as each worker cooperates with 
more and more capital, the extra output due to 
each unit of capital declines, i.e., the capital—out- 
put ratio rises. As the capital—output ratio, v, rises, 
the rate of growth of capital, s/v, declines, and this 
process will continue in the long run until the rate 
of growth of capital falls to the level g, the rate of 
growth of labor. At this point the rate of growth 
of human and material resources is the same, and 
no further increases in the capital—labor ratio re- 
sult. Consequently, the capital—-output ratio also 
ceases to rise, and so we have arrived at a so-called 
“steady state” growth, in which capital, labor, and 
output are all growing at the same rate. Alterna- 
tively, if initially the rate of growth of labor, g, 
exceeds the rate of growth of capital, s/v, as the 
capital—labor ratio falls, the economy reaps more 
and more extra output per unit of extra capital: 
the capital—output ratio falls, and, by an argument 
entirely symmetrical, the rate of growth of capital 
rises until it reaches the level of the rate of growth 
of population. We again achieve a steady state, if 
now from the opposite “direction.” Output per per- 
son, our index of growth, is, of course, constant in 
the steady state; but it will have reached that par- 
ticular level by increasing toward it continuously 
in the case of the increasing capital-labor ratio 
regime and by falling toward it continuously in the 
case of the decreasing capital—labor ratio regime. 
Moreover, the permanently maintainable level of 
per capita income toward which the economy is 
tending in this very long run will differ, depending 
on the production function, the saving rate, and the 
rate of population growth. 

Prevalence of distribution theories. A second 
major modification of the Harrod—Domar world 
questions the implied irrelevance for aggregate 
savings of how income is distributed among the 
various economic agents. Robinson (1956) and 
Kaldor (1961), for example, point out that profit 
recipients and wage earners are likely to save dif- 
ferent proportions of their income, and since their 
relative shares in the total are bound to change 
over time, they challenge the constancy of the ag- 
gregate savings ratio, s, in the course of the growth 
process. In the extreme case they assume, like 
Marx and the classicists, that workers do not save 
and that profit recipients do not consume. In that 


case the savings rate, s, is equal to the profit share, 
aK/Q, and the rate of growth of capital, I/K = 
sQ/K, is equal to the profit rate, m. 

In this general context two major types of dis- 
tribution theory have been advanced. One—often 
called the neoclassical theory of distribution, since 
it depends on the possibility of continuous factor 
substitution—stipulates that market forces ensure 
that all human and material agents participating 
in the production process are compensated accord- 
ing to their marginal contribution to output. The 
smooth neoclassical production function, with con- 
stant returns to scale, permits us to “attribute” the 
total output of the economy, Q, to the quantities of 
the factors of production, say K and L, weighted by 
their productive contribution at the margin, i.e., 
their marginal productivities, MPP; and MPP,, 
respectively. Thus Q=MPP,K+MPP,L, with 
MPP,K/Q representing the portion of total output 
attributable to capital and MPP,L/Q the portion 
of total output attributable to labor. Alternatively 
put, a given percentage rate of growth of output 
between any two periods can be decomposed into 
two portions: (1) a “capital contribution,” which 
is the percentage increase in the capital stock mul- 
tiplied by the capital elasticity of output (i.e., the 
measure of how much output increased due to each 
percentage point increase in the capital stock), and 
(2) a “labor contribution,” which is the percentage 
rate of growth of the labor force multiplied by the 
labor elasticity of output (i.e., the measure of how 
much output increased due to each percentage 
point increase in the labor force). Hence AQ/Q = 
(AK/K)(MPP,K/Q)+(AL/L)(MPP,L/Q). The neo- 
classical theory of income distribution then states 
that competitive market conditions force the wage 
rate, w, to equality with the marginal product of 
labor, MPP,, and the profit rate, 7, into equality 
with the marginal product of capital, MPP. Con- 
sequently labor’s relative share in total income, 
wL/Q, is equal to its relative contribution to total 
output, MPP,L/Q; and similarly for the equality 
between 7K/Q and MPPxK/Q. 

A second theory, sometimes called the Marxian 
theory of distribution, insists, on the other hand, 
on the noncompetitive nature of markets and shifts 
from a functional or impersonal determination of 
rates of return to labor and capital to a determina- 
tion through the interplay of personal forces re- 
lated to the relative bargaining strengths of con- 
tending groups. Since there is no need to assume 
equality between various factors’ contributions and 
their income shares, this view is perfectly consist- 
ent with a belief in smooth neoclassical production 
functions. It is, however, true that many of its 


modern adherents, e.g., Kaldor and Robinson, view 
production in terms of fixed proportion processes 
and thus have no choice but to abandon marginal 
productivity. Unless the extreme case of subsist- 
ence wages is accepted, the distribution of income 
—and thus the aggregate savings ratio—in such 
models can be determined only in the sense that 
they must somehow be brought into line with an 
otherwise determined investment demand. We 
shall not, however, concern ourselves here with 
views of growth which abandon the notion that the 
system is “pushed forward” by the automatic in- 
vestment of full employment savings and is instead 
“pulled ahead” by investment opportunities, animal 
spirits, and the like. 

Technological change theories. The discussion 
of contemporary growth theory has thus far been 
concerned only with the consequences of larger 
quantities of inputs participating, perhaps in differ- 
ent combinations, in the production process. As 
was pointed out earlier, however, it is not merely 
the stock of material agents per capita but also 
changes in the efficiency with which they are de- 
ployed which may matter a great deal—not to 
speak of changes in the quality of the inputs or 
in the institutional framework within which the 
system is operating. Thus, a third and major modi- 
fication of the simple Harrod—Domar formulation 
must be the reintroduction of technological change. 
Kaldor (1957), for example, has introduced a so- 
called technical progress function intended to sum- 
marize the twin impact of a larger stock of capital 
per capita (capital deepening) and favorable 
changes in productive techniques (technological 
change) on the index of growth, per capita income. 
In the same year, Solow (1957), using a three- 
factor production function with constant returns 
to scale, the third factor being “time” or techno- 
logical change, found that only 15 per cent of 
recent per capita output growth in the United 
States can be explained by increases in the capital 
stock; 85 per cent must be ascribed to technologi- 
cal change. Whether or not we accept these sta- 
tistical findings matters less than the growing 
realization that growth cannot be explained simply 
in terms of larger quantities of constant quality 
inputs combining in known ways to produce 
output, Economists are reaching a real consen- 
sus that the as yet unexplained residual which 
Abramovitz has aptly labeled “a measure of our 
ignorance” (1956) is large; consequently, much 
recent growth-theoretic effort has been directed at 
this problem. This effort has been twofold: on the 
one hand, to define more rigorously what is really 
meant by technological change, and, on the other, 
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to work toward meaningful behavioristic relation- 
ships which may tell how and why such changes 
in production functions occur. [See AGRICULTURE, 
article on PRODUCTIVITY AND TECHNOLOGY; PRO- 
DUCTIVITY. ] 

As has been noted, the growth performance of 
any system in terms of the rate of increase of its 
per capita income invariably depends on the 
amount of material agents that collaborate with 
labor, i.e., the capital stock per capita, and on the 
quality of the economic agents and the efficiency 
with which they are used in the production proc- 
ess. “Technological change” is a semantic umbrella 
extended over all the latter; that is to say, all in- 
creases in output while the quantity of capital and 
labor is kept constant. At any point in time, then, 
technological change will raise the productivity of 
the existing quantities of capital and labor. Just 
how the introduction of a particular new method 
affects the productivity of the existing human and 
material agents provides the basis for Hicks’s at- 
tempt to classify innovations (1932). Hicks de- 
fines neutral technological change as one which 
raises the marginal productivity of labor and of 
capital at the same rate; a labor-saving techno- 
logical change as one which increases the marginal 
productivity of capital more than that of labor; 
and symmetrically for a capital-saving technologi- 
cal change. Harrod (1948) provides a somewhat 
different definition of technological change. Here 
there is neutrality if, after the innovation has 
occurred, the capital—output ratio remains un- 
changed at a constant rate of profit. If the capital— 
output ratio rises, it is a labor-saving innovation; 
if it falls, capital-saving. For reasons that cannot 
be dwelt on here, a Harrod-neutral innovation is 
equivalent to a labor-augmenting innovation, i.e., 
one which converts natural units of labor into 
better or more efficient units over time. It is the 
only type of technological change which still 
permits the system to move toward steady-state 
growth in the long run. Finally, and quite sym- 
metrically to Harrod, Solow (1963) and Fei and 
Ranis (1965) propose a third definition of neu- 
trality: constancy of the labor—output ratio, or per 
capita income, in the presence of a constant wage 
rate. Such an innovation may also be called capital- 
augmenting. 

‘An equally important classification of technical 
change for purposes of advancing growth theory 
proper is the differentiation between “embodied” 
and “disembodied” technological change—between 
innovations which have to be incorporated or em- 
bedded in new capital goods or in the labor force 
and those which do not and can, rather, be viewed 
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as an independent or third factor of production. 
Another question revolves around whether innova- 
tions should be viewed as exogenous and costless, 
falling in unpredictable quantity, like manna, from 
heaven, or endogenously related, perhaps to some 
specific economic activity, e.g., expenditures on 
education or on research and development, in the 
past. 

The early Solow and Abramovitz view of inno- 
vations as an unexplained residual really implies 
that technological change is disembodied and ex- 
ogenous. But this must be viewed as only a first 
step on the road to probing deeper into the nature 
of the changes which actually occur in the real 
world and into how they are produced. His prima 
facie implausible statistical results led Solow him- 
self, as well as others, to search for a so-called 
new view of investment (1960), in which tech- 
nological change is embodied in the latest vintage 
of capital, i.e., the current year’s investment. One 
mode] is essentially neoclassical, with labor allo- 
cated to the various heterogeneous qualities of 
capital stock (the younger the vintage, the more 
productive) according to the usual competitive 
assumptions, Other so-called putty-to-clay models 
(Johansen 1959; Phelps 1963) accept the neo- 
classical smooth production assumptions for the 
latest vintage of capital but assume it becomes 
“frozen” thereafter, i.e., it is transformed into a 
Harrod—Domar fixed-proportions type of produc- 
tion process. Arrow (1962) uses a somewhat 
different version of the vintage approach by link- 
ing a continuous reduction in the labor require- 
ments per unit of output on new machines to 
“experience,” which is in turn related to cumula- 
tive past investment activity in the economy. 

Finally, other models focus attention on changes 
in the quality of the labor force affecting either 
all vintages or only the latest entrants. Denison 
(1962) includes elements of both approaches. 
Moreover, there remains the question whether 
technological change, embodied or disembodied, 
is to be treated exogenously—perhaps as a free 
good—or endogenously, i.e., springing from some 
action of the system in the previous period. Im- 
provements in the quality of machinery and of the 
labor force can, for example, be attributed either 
to the sheer passage of time or experience, or to 
previous expenditures, e.g., on research and de- 
velopment and on health and education. Empha- 
sis on the latter type of theory has been growing 
of late [see, for example, CAPITAL, HUMAN and 
Schultz 1961]. 

It should be noted that there exists a large range 
for choice among the various innovation-theoretic 
hypotheses sketched in here. It is possible, for 


example, that all kinds of technological change are 
occurring simultaneously, from fully embodied (or 
factor quality-enhancing) innovations to fully dis- 
embodied (or organizational efficiency-raising) 
technological change, in one dimension; and from 
fully endogenous to fully exogenous, in the other. 
Unfortunately, the ability to advance reasonable 
hypotheses has run considerably ahead of the 
ability to test them rigorously in a statistical or 
econometric sense. Moreover, even if we could pre- 
cisely relate improvements in physical capital to 
research and development expenditures, and im- 
provements in human capital to educational ex- 
penditures, a really satisfactory theory of innova- 
tions must also be able to incorporate a plausible 
inducement or motivational mechanism at the 
level of the individual entrepreneur. In spite of 
some attempts in this direction (Fellner 1961), 
we still do not have a satisfactory explanation of 
just how the profit-maximizing entrepreneur is 
motivated to select one available type of innova- 
tion over another. In summary, the as yet inade- 
quate understanding of the manifold possible 
interactions between the economy's surplus over 
consumption, technological change, and the pas- 
sage of time provides the biggest obstacle at this 
time to a really satisfactory theory of economic 
growth, i.e., one which is not only sufficient to 
explain the historical record but is also necessary, 
and thus is ready for the ultimate predictive test. 

But if contemporary growth theory as applied to 
the mature economy has not yet given a completely 
satisfactory understanding of the sources of eco- 
nomic growth, a consensus seems to be emerging 
to the effect that conventional growth in the quan- 
tity of human and physical resources is of less im- 
portance than changes in the quality of the inputs 
and in the efficiency with which they are combined 
in the production process. By experimenting with 
different types of production functions and apply- 
ing theoretical as well as econometric ingenuity 
to the problem of “explaining” the residual growth 
in productivity, we are making steady, if slow, 
progress in the attempt to recapture some of the 
magnificence of the explanatory apparatus of an 
earlier day—while keeping our feet on firm, i.e., 
empirically testable, ground. 


Underdeveloped economies 

Thus far we have dealt mainly with the theory 
of growth in the mature or industrial economy. 
A second, and no Jess important, component of 
the post-World War 1 revival of interest in growth 
is the concern with economic progress, or the ab- 
sence of such progress, in the so-called underde- 
veloped or nonindustrial economies. The attempt 


to formulate a theory of growth applicable to such 
countries essentially may be viewed as another 
strand from the same basic framework of inquiry 
identified here as the post-Keynesian theory of 
growth. Patently there must exist a large measure 
of transferability in the way economic progress 
is viewed among contexts. Everywhere it requires, 
at a minimum, abstention from consumption, i.e., 
capital accumulation; and everywhere it is gov- 
erned by certain technological constraints, i.e., the 
state of the arts. Thus the basic Harrod—Domar 
formulation can again be viewed as a point of 
departure; but now in modifying the simplified 
view of how the world operates, we must take 
into account not only more realistic notions of the 
role of labor, factor substitutability, and techno- 
logical change but also of the heretofore neglected 
changes in the institutional milieu within which 
an economy operates. No longer is it possible to 
view organizational change as de minimis and to 
be neglected; in fact, changes in the environment 
and in the way a society chooses to organize itself 
in various markets may have a major impact on 
the growth performance of the system as a whole. 
For example, an understanding of the changing 
legal framework required to initiate and sustain 
growth, of the changing role of markets and ad- 
ministrative controls as allocative devices, and of 
the transition from extended to nuclear family 
concepts may be required along with an under- 
standing of the more narrowly conceived interac- 
tions between the society's human and material 
resources. Any approach to a really useful theory 
of economic growth is thus likely to require a 
broadening of the customary analytical framework. 
The typical less-developed economy is charac- 
terized by an extremely low ratio of material to 
human resources and a backward technology, to- 
gether yielding extremely low levels of per capita 
income, frequently barely above subsistence. Often 
a substantial proportion of the labor force is 
openly or disguisedly unemployed. Substantial 
growth is likely to be ruled out by the low level 
of saving possible at extremely low levels of 
per capita income, plus the customary stagnant 
levels of domestic entrepreneurship and technol- 
ogy. Often even the most elementary preconditions 
for sustained growth, e.g., geographic or political 
cohesiveness, absence of internal market barriers, 
a unified currency and postal system—all essen- 
tial for welding a conglomeration of resources into 
a national economy in the first place—are absent. 
And even where the environment is more con- 
ducive, attempts to move such an economy off 
dead center must overcome the formidable gravi- 
tational pull of accelerating population growth. 
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One difficulty standing in the way of a viable 
theory of growth for the less developed world is 
that most of the considerable post-World War 11 
efforts—by practitioner and academic economist 
alike—have been directed toward meeting the 
pressing problems of improved decision making in 
the immediate present and heavily tinged with the 
flavor of a country’s, or a region’s, particular eco- 
nomic and institutional characteristics. Few at- 
tempts to generalize from such specific country 
experience have been forthcoming. One of the no- 
table exceptions is Rostow’s stages theory (1960), 
which moves directly from an interpretation of 
the history of the now advanced countries to a 
general theory of growth in terms of sequential 
stages, including the famous “take-off” into self- 
sustained growth. While it may provide insights 
and direct the attention of others to more analyti- 
cal inquiry, it also does some disservice in terms 
of its casual marshaling of evidence and its dis- 
regard of the need to isolate key behavioristic 
relationships which determine the path of the sys- 
tem. However, most of the body of what may be 
called the theory of development evolving over the 
past several decades is of a more modest char- 
acter and may, in fact, be characterized as a series 
of useful theoretical insights, as yet falling short 
of being fitted together into any more general theo- 
retical framework. 

Most of these insights start with the premise 
that the underdeveloped system initially finds 
itself in some sort of “trap,” escape from which 
requires a major effort or considerable structural 
change. Rosenstein-Rodan (1961), for instance, 
emphasizes the importance of externalities and 
economies of scale, and thus the need for a large 
nonincremental development effort where only a 
small industrial base exists. Leibenstein (1957) 
points to the need to overcome the gravitational 
pull of stagnation, mainly the forces of accelerated 
population growth, as the system departs from 
quasi equilibrium. Nurkse (1953) sees the need 
for balanced growth between industries and sectors 
to ensure the simultaneous creation of markets 
and sources of supply for a sufficient number of 
hesitant entrepreneurs to move forward together. 
Hirschman (1958), on the other hand, hopes to 
solve the same decision-making bottleneck via 
imbalance between direct private and social over- 
head activity, a strategy which forces expansion 
on the system. 

While such ideas are essentially policy-focused, 
they patently cannot be divorced from some un- 
derlying view of how the typical underdeveloped 
system behaves. Formulations which recognize, 
implicitly or explicitly, the prevalence of a struc- 
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tural condition called dualism in many of these 
economies are especially helpful in this respect. 
Dualism in this context means that the economy is 
composed of at least two sectors: one relatively 
small, capitalistic, and mainly industrial, behav- 
ing like the mature competitive economy; the 
other relatively largq,traditional, and mainly agri- 
cultural, in which We material resources avail- 
able, land in particular, are insufficient to employ 
productively the existing supply of labor at pre- 
vailing rates of remuneration, i.e., in which there 
exists substantial overt and disguised unemploy- 
ment. [See ECONOMY, DUAL, ] 

Quite early Rosenstein-Rodan (1943) and Nurkse 
(1953) recognized that the disguised unemployed 
in the subsistence or agricultural sector, sharing 
equally in the extended family’s produce but con- 
tributing little or nothing to it, represent a form 
of “hidden saving,” or surplus, which can be 
mobilized through its reallocation to productive 
activity elsewhere. W. Arthur Lewis (1954) ad- 
vances the notion that growth in this sort of 
dualistic economy is a function of the rate at 
which the “unlimited supply” of low (or zero) 
productivity agricultural workers in one sector can 
be reallocated to higher productivity, commercial- 
ized activities in the other. With real wages in 
agriculture institutionally anchored, Lewis’ atten- 
tion is focused largely on the creation of employ- 
ment opportunities in the industrial sector, which 
would exercise a demand for the released agricul- 
tural workers at the same (or a slightly higher) 
constant level of real wages. Jorgenson (1961) 
and Fei and Ranis (1964) follow this up by exam- 
ining the developmental process in terms of the 
interaction between the two sectors over time. The 
latter, for example, emphasize the requirements 
for balanced intersectoral growth as increases in 
agricultural labor productivity—a function mainly 
of technological improvements—release agricul- 
tural workers and generate agricultural surpluses. 
These surpluses, together with the reinvestment 
of industrial profits, increase the quantity and 
quality of the industrial sector’s capital stock and 
thus its demand for the simultaneous absorption 
of such workers. As long as this balanced reallo- 
cation process pulls people out of the noncommer- 
cialized and into the commercialized sector at a 
rate in excess of population growth (population 
growth may be viewed as simultaneously adding 
to the pool of the underemployed) the economy 
is gradually divesting itself of its labor surplus 
characteristics, Once the twin forces of capital 
accumulation and technological change, raising 
productivity in both sectors in a balanced fashion, 


have achieved this objective, labor, like capital, 
becomes a scarce factor, bid for by landlords and 
industrial entrepreneurs alike, and the entire econ- 
omy has become commercialized. The economic 
calculus in the familiar one-sector world context 
then comes into operation, participation of all 
economic agents in the system’s productive and 
innovative processes becomes possible, and the 
transition to the mature economy is achieved. 

It should, of course, be recognized that the 
labor surplus economy, while representing a com- 
mon species of underdevelopment, is by no means 
typical of all such systems. Some underdeveloped 
economies, for instance, still have exploitable free 
land and experience year-round labor shortages. 
Others have no large traditional sectors but suffer 
from a colonial pattern of trade. There can, in 
short, be little doubt that the search for any really 
general theory of development has been severely 
handicapped by the heterogeneity in initial re- 
source endowment, in the rules of behavior, and 
in the institutional framework among the countries 
loosely bearing the underdeveloped label. In the 
face of this, economists presumably will continue 
to seek elements of transferability in the experi- 
ence of particular types of economic systems. The 
evolution of a meaningful typology of less-devel- 
oped economies may thus constitute a first step in 
the direction of a more generally applicable theory 
of development. 

Gustav RANIS 
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Im 
MATHEMATICAL THEORY 


Long-run economic growth, like short-run eco- 
nomic equilibrium, is studied from both the micro- 
economic and macroeconomic points of view. The 
microeconomic or multisector theory of growth, 
which can be thought of as a dynamic extension 
of the general equilibrium theory of exchange and 
production, has received considerable mathemati- 
cal attention in recent decades. Golden equilibrium 
growth (defined below) is the theory's central con- 
cept, and the mathematical analyses deal with the 
existence, optimality, and stability of golden equi- 
librium growth paths. 

The roles of monetary phenomena, international 
trade, and technological change have so far re- 
ceived little attention in the analyses. These factors 
may have important influences on actual processes 
of economic growth, and neglect of them may be 
the source of serious shortcomings in the micro- 
theory. But the microtheory is still in its early 
stages of development, and the study of these 
factors must as yet be left to macroeconomic 
analysis. 

The Walrasian model. The prototype of con- 
temporary growth models was formulated by Wal- 
ras (1874-1877). A description of the Walrasian 
model in its simplest form follows. 

It is assumed that there are two groups of cit- 
izens—workers and capitalists—and two industries 
—one producing a consumption good and the other 
producing a capital good. Assume for the moment 
that the capital good is not subject to depreciation. 
A number of basic manufacturing processes are 
available to firms in both industries, and the cap- 
ital-input and labor-input coefficients of each proc- 
ess are assumed constant. If perfect competition 
prevails among firms, no firm will gain supernor- 
mal profits. Thus, in an equilibrium state, 


(1) the price of the consumption good is less 
than or equal to the cost per unit of out- 
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put of any process of the consumption 
good industry, 


and 


(2) the price of the capital good is less than 
or equal to the cost per unit of output of 
any process of the capital good industry. 


Cost per unit of output is the sum of the capital- 
input coefficient multiplied by the price of the cap- 
ital service and the labor-input coefficient multi- 
plied by the wage rate. Conditions (1) and (2) 
must hold with equality for some processes, and 
those processes for which the conditions hold with 
strict inequality are not used in equilibrium. 

Demand for the consumption good is assumed 
to depend on the price of the consumption good, 
the price of the capital good, and total money in- 
come. If prices and money income change by the 
same proportion, demand is assumed to remain 
unchanged. Furthermore, the elasticity of demand 
with respect to total money income is assumed to 
be unity. In view of the definition of savings as the 
excess of income over consumption, 


(3) total income is identically equal to con- 
sumption plus savings. 


The amount of capital used in the economy is 
the sum over all processes of the output produced 
by each process multiplied by the corresponding 
capital-input coefficient. The range of summation 
covers the entire economy, that is, the consumption 
good industry as well as the capital good industry. 
Since the amount of capital utilized at any point 
of time cannot exceed the amount available, 


(4) the amount of capital utilized must be less 
than or equal to the amount of capital in 
existence, 


Similar conditions must hold for labor, the con- 
sumption good, and the capital good, namely, 


(5) the amount of labor utilized must be less 
than or equal to the amount of labor avail- 
able, 


(6) demand for the consumption good must 
be less than or equal to total output of 
the consumption good, 


(7) demand for the capital good must be less 
than or equal to total output of the capital 
good, 


The rule of free goods prevails in the markets, so 
that if one of the conditions (4) to (7) holds with 
strict inequality, then the corresponding price, say 
the price of the capital service in the case of (4), 


is set at zero. Conditions (1) to (7) are then found 
to imply the equality of savings and investment. 

Existence of a golden growth equilibrium. We 
now examine the Walrasian model for the exist- 
ence of a golden growth equilibrium, defined as a 
state in which outputs of all goods grow at a rate 
equal to the rate of growth of the labor force and 
in which all prices remain unchanged forever. The 
argument proceeds in terms of the warranted and 
natural rates of growth, the most fundamental 
concepts of growth economics originally due to 
Harrod (1948). 

Harrod defined the warranted rate of growth as 
“that over-all rate of advance which, if executed, 
will leave entrepreneurs in a state of mind in which 
they are prepared to carry on a similar advance” 
(1948, p. 82). Some preliminary analysis is re- 
quired to establish the warranted rate, for it is a 
concept that results from a combination of the so- 
called factor—price frontiers (Morishima 1964, pp. 
76-83; Samuelson 1962) and Kahn's interindus- 
trial multiplier (Kahn 1931). 

Suppose that prices are normalized so that the 
price of the consumption good is unity. Then the 
normalized wage rate gives the real wage. Condi- 
tions (1) and (2) now contain three variables— 
the price of the capital good, the price of the cap- 
ital service, and the real wage rate. Therefore, two 
of these prices, say the first two, can be expressed 
as functions of the third. We can then obtain a 
relation between the rate of return on capital, that 
is, the ratio of the price of the capital service to the 
price of the capital good, and the real wage rate. 
This relation is referred to as the envelope of the 
factor—price frontiers, or simply the factor—price 
frontier. The rate of return on capital can be shown 
to decrease when the real wage rate increases in an 
interval permitted by technology. 

It follows from (1) and (2) that there exist 
positive prices corresponding to any preassigned 
nonnegative value of the real wage rate, provided 
the value of the real wage does not exceed the 
maximum permitted by technology. This implies 
that (4), (6), and (7) hold as equalities. Since 
demand for the consumption good equals total in- 
come multiplied by the average propensity to con- 
sume (denoted by c), 


(8) consumption equals c times the sum of 
consumption and savings. 


If € denotes the output of the consumption good, 
x the output of the capital good, and p the price of 
the capital good in terms of the consumption good, 
then conditions (6) and (7), together with other 
equilibrium conditions, imply that consumption 
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equals & and that savings equals px. Substituting 
(6) and (7) into (8), we obtain the Kahn multi- 
plier, 


(9) &/x = pe/(1-c). 


From the factor—price frontier, prices depend on 
the real wage rate. Since c is assumed to depend 
only on prices, it is seen that the real wage rate is 
the ultimate determinant of the relative outputs of 
the two industries. 

The warranted rate of growth can easily be es- 
tablished now. Dividing condition (4) by the out- 
put of the capital good, we obtain to the right of 
the equality the reciprocal of the rate of growth of 
the capital stock. Since the processes for which (1) 
or (2) hold with strict inequality are not adopted, 
and since the choice of other processes is subject 
to (9), the rate of growth of the capital stock de- 
pends on the real wage rate. Hence, there is a 
relation between the growth rate of the capital 
stock and the real wage rate in the entire interval 
permitted by technology. This relation is called 
the warranted growth rate curve, Suppose w is the 
real wage rate arbitrarily set in the technologically 
permitted interval. Entrepreneurs will increase the 
stock of capital at the rate corresponding to the 
given value of w, and all the equilibrium conditions 
except (5) will be fulfilled. Entrepreneurs will wish 
to maintain this growth rate, and it will persist if 
the supply of labor adapts itself to the demand for 
labor. 

In the absence of technological change, the nat- 
ural rate of growth is equal to the rate of growth 
of the labor force. The rate of growth of the labor 
force is often assumed to be constant (independ- 
ent of the values of economic variables) but may 
more realistically be considered a function of the 
real wage rate. Suppose that the growth rate of the 
labor force is positive, zero, or negative depending 
on whether the real wage is above, equal to, or be- 
low the subsistence level. This relation between 
the natural growth rate (the growth rate of the 
labor force) and the real wage rate is called the 
natural growth rate curve. 

The warranted and natural growth rate curves 
will generally have at least one point of intersec- 
tion, and at that point the stock of capital and the 
labor force grow at a common rate. Suppose @ is a 
real wage rate at which the stock of capital, K, and 
the labor force, L, increase at the same rate g. Let 
the values of the other variables that satisfy the 
equilibrium conditions (1) to (7) be denoted by cor- 
responding letters with bars above them. A propor- 
tional increase in K and L gives rise to an increase 
of the same proportion in total income, provided 


prices remain constant. Since the income elasticity 
of demand for the consumption good is unity, the 
increase in total income gives rise to an increase 
of the same proportion in the demand for the con- 
sumption good. Therefore, if prices remain con- 
stant over time, equilibrium at any point of time t 
is established when 


& =e, 
iseli, 
R, =e R,, 
L, =eL,, 


where subscripts indicate time; that is to say, when 
the real wage rate w prevails, the stock of capital, 
the labor force, and the outputs of the two indus- 
tries grow forever at the common rate g, and prices 
remain unchanged. Such a state is called a state 
of golden growth equilibrium (Robinson 1956, 
p. 9; Morishima 1964, p. 78). 

The von Neumann model. The two-sector model 
examined so far assumes, among other things, 
(a) that firms can be classified into two industries 
producing distinct outputs, (b) that factors of pro- 
duction are instantaneously transformed into prod- 
ucts, (c) that the capital good does not suffer wear 
and tear, and (d) that the capital stock and the 
supply of labor are freely transferable from one 
firm to another. All these assumptions are un- 
realistic, and they crucially affect the analytical 
properties of the model. 

First, it is clear from actual input-output tables 
of various countries that many goods are both con- 
sumed and used as inputs in many industries in 
the economy; there is no clear demarcation be- 
tween consumption goods and capital goods. Sec- 
ond, a certain length of time generally must elapse 
between the original input of factors and the final 
output of a product. The period of production usu- 
ally differs from one good to another, but we may 
justify the assumption of uniform production lags 
by introducing as many fictitious intermediate 
products at each point of time as required. 

Third, according to the neoclassical treatment 
of depreciation, capital goods that were produced 
several years ago and have been subject to wear 
and tear are considered to be physically equivalent 
to some smaller amounts of new capital goods of 
the same kind. But it is generally impossible to find 
quantitative equivalents, in terms of new capital 
goods, of capital goods damaged in various degrees 
from past use. In fact, if an entrepreneur has a 
certain amount of a capital good that is in its final 
stage of wear and tear, he will have no capital 
equipment at the beginning of the next year; while 
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if he has a certain amount, however small, of a 
new capital good, he may use it for production 
throughout its whole lifetime. Only by treating cap- 
ital goods at different stages of depreciation as 
qualitatively different goods can we adequately deal 
with the age structure of the capital stock. Von 
Neumann (1937) suggested that used capital 
goods appearing simultaneously with products at 
the end of the production period be treated as by- 
products of the manufacturing process. A process 
that uses capital equipment is regarded as a proc- 
ess that converts a bundle of “inputs” into a bundle 
of “outputs,” where inputs are defined to include 
capital goods left over from the preceding period 
and outputs are defined to include qualitatively dif- 
ferent capital goods left over at the end of the cur- 
rent period. 

This treatment of capital goods enables us to 
discard the final assumption of perfect transfer- 
ability of capital equipment. In addition to being 
more productive, a new capital good is generally 
more transferable than a used one. A new machine 
will be sold to any factory that demands it, while 
a machine that has already been set up in a fac- 
tory usually will not be transferred to another fac- 
tory, even if the factory that owns the machine is 
overequipped and some other factory is under- 
equipped. This asymmetry in the transferability of 
capital goods can readily be incorporated in the 
growth model if we permit joint production in the 
sense von Neumann suggested and treat capital 
goods at different dates as different goods. 

From these considerations, it is clear that the 
growth model, if it is not to be trivial, must not 
only be multisectoral but must also be capable of 
dealing successfully with joint production. A model 
fulfilling these requirements was first proposed 
by von Neumann, He assumed the following: 
(a) there are constant returns to scale in produc- 
tion of all goods; (b) the supply of labor can be 
expanded indefinitely; (c) the wage rate is fixed 
at a level at which workers can only purchase the 
minimum amounts of goods biologically required 
to subsist; and (d) the whole of capitalists’ income 
is invested in new capital goods. It is evident that 
the model ignores the role played by consumers’ 
choice in the determination of the rate of growth; 
workers are like farm animals, and capitalists are 
simply self-service stands of capital. 

Consumers’ choice has been introduced into the 
model by relaxing the classical assumption that 
workers only consume and capitalists only save. 
Morishima (1964) studied models where (1) al- 
though workers still consume their entire income, 
their demand for consumption goods allows substi- 


tution in response to price changes, and (2) capi- 
talists spend a portion of their income on con- 
sumption goods in such a way that their demand 
for each good depends not only on relative prices 
but also on their income—the income elasticity of 
demand being unity. Morishima also replaced von 
Neumann’s unrealistic assumption of an indefi- 
nitely expandable labor force with a more plausible 
assumption of a labor force growing at a rate that 
depends on the real wage rate. 

Perfect transferability of capital goods is not a 
necessary condition for the existence of golden 
growth equilibrium; but the assumption of unitary 
income elasticities of demand, which is very strin- 
gent and unrealistic, is an indispensable condition. 
Aside from uneven effects of technological change 
on various industries, the only obstacle to golden 
growth is the existence of different income elastici- 
ties of demand for different consumption goods. 

In an economy where wages are paid at the end 
of each period, the golden equilibrium growth rate 
equals the product of the capitalists’ average pro- 
pensity to save and the rate of profit on fixed cap- 
ital. On the other hand, if wages are advanced at 
the beginning of each period, they earn profit dur- 
ing the period, and the golden equilibrium growth 
rate equals the capitalists’ propensity to save mul- 
tiplied by the rate of profit on total capital, which 
is the sum of fixed capital and the wages fund 
(Morishima 1964, pp. 145-151). 

Stability of a golden growth equilibrium. In 
examining the stability of a golden growth equi- 
librium, a problem posed is whether the mainte- 
nance of full employment of labor and capital 
along a path starting from arbitrarily or histor- 
ically given endowments of capital and labor will 
eventually lead to a state of golden equilibrium 
growth. Various writers have studied this problem 
and have found that a large class of two-sector 
models have a stable golden equilibrium if certain 
plausible restrictions are imposed on technology or 
on the consumption function. For example, in the 
case of the Walrasian two-sector model, stability is 
ensured if a constant proportion of income is de- 
voted to consumption. 

However, in more realistic multisector models, 
the tendencies toward a golden growth equilibrium 
may not exist. It has been argued that the stability 
of outputs in a multisector model implies the in- 
stability of prices, and vice versa. Jorgenson (1960) 
proved that this is the case for a closed dynamic 
Leontief input-output system. However, for an 
open system, he showed that stability of outputs 
implies stability of prices, except under rather im- 
plausible circumstances. Since examples of both 
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stability and instability of golden equilibrium 
growth can be found, the maintenance of full em- 
ployment of labor and capital is not a sufficient 
condition for the attainment of a golden growth 
state. 

A growth path is called efficient if there is no 
other path along which more of some goods can 
be produced without reducing the output of an- 
other good. A growth path is called optimal if there 
is no other path that can make the society better 
off, Full employment of labor and capital is a char- 
acteristic of an efficient or optimal growth path if 
labor and capital are always scarce. Whether or 
not these factors are scarce, however, depends on 
the amounts of goods to be produced. In a planned 
economy, a certain amount of the capital stock 
may not be used at some point of time if the exist- 
ing capital stock exceeds the amount required by 
a long-run growth plan designed to attain either 
efficiency in production or maximum satisfaction 
of the citizenry. Similarly, a portion of the labor 
force may not be used. Hence, rather than examin- 
ing the convergence of a full employment growth 
path to the golden equilibrium path, we might ex- 
amine the convergence of efficient growth paths 
to the golden equilibrium path. 

The turnpike theorems. The turnpike growth 
path (or, more briefly, the turnpike) is defined as 
the golden equilibrium growth path (there may be 
many such paths) that gives the maximum rate of 
growth, The study of convergence of efficient 
growth paths to the turnpike has produced the so- 
called turnpike theorems. 

Adopting von Neumann's assumptions presented 
above, the basic turnpike theorem may be stated 
as follows: Let the stocks of goods that the economy 
intends to have at the end of a planning period be 
specified in their proportions (but not in their ab- 
solute values) by the planning authorities. Con- 
sider efficient paths in the set of all feasible growth 
paths, In this instance, a feasible growth path is 
one leading from the historically given present 
state to the prescribed end. There is a long-run 
tendency for such efficient paths to approximate 
the turnpike as the planning period becomes longer 
and longer. In other words, the long-run efficient 
paths run along the turnpike for most of the plan- 
ning period, although they will have to leave the 
turnpike in the final part of the planning period 
in order to reach the objective (Dorfman et al. 
1958, pp. 326-345). 

This theorem would be especially useful in for- 
mulating a seven-year or ten-year growth plan for 
underdeveloped or socialist economies. Mathemat- 
ical economists have provided rigorous statements 


and proofs of the theorem (Radner 1961; McKen- 
zie 1963) and have found that there are exceptions 
to the theorem. Unless the technology available to 
the economy satisfies some special conditions, effi- 
cient paths will trace out undamped oscillations 
around the turnpike (Morishima 1964, pp. 171- 
173; Hicks 1965, pp. 224-225, 327-331). 

The theorem can be modified or extended in sev- 
eral directions. For example, it has been shown 
that the stocks of goods discounted by one plus the 
turnpike rate of growth and averaged along effi- 
cient paths over the whole planning period are very 
close to the turnpike stock vector if the length of 
the planning period is sufficiently long. Such a 
convergence theorem may be termed the mean 
turnpike theorem. There is no cyclic exception to 
the mean turnpike theorem; even though efficient 
paths may fluctuate around the turnpike regularly 
or irregularly, the average of the discounted values 
of deviations from the turnpike always approaches 
zero in the limit (Morishima 1965). 

The turnpike theorems discussed so far are 
based on the assumption that the labor force can 
be expanded at any rate. This assumption can be 
replaced by the more general assumption that the 
production and reproduction of workers is part of 
the over-all process of production. Workers cannot 
choose goods according to their own tastes but are 
fed by the dictator or planning authorities just as 
farm animals are. When there is a shortage of 
workers, they will be fed well to increase the birth 
rate; in a period of excess supply of workers, they 
will be underfed. Obviously, the model describes a 
shameless society such as a slave economy, al- 
though this is not very far from the state of affairs 
in some countries. The turnpike (mean turnpike) 
theorem for such an economy asserts that all effi- 
cient paths converge to the turnpike in the sense 
that output of each good per worker (the average 
discounted output per worker) approximates the 
turnpike output per worker when the planning 
period is very long. 

In the models just described, the planning 
authorities specified only terminal stocks of goods. 
Outputs of goods at any intermediate date were 
simply inputs for the immediately succeeding date, 
and there was no consumption at intermediate 
dates. (Feeding the workers was necessary to 
produce workers; the food they ate was an input 
for the process that produced workers who, in turn, 
were inputs for other production processes. ) Hence, 
a growth path was feasible if it merely satisfied the 
final-state conditions, and the turnpike theorems 
stated may be called final-state turnpike theorems. 
They hold when the sole objective is to bequeath 
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certain stocks of goods to a remote future genera- 
tion, for example, to our great-grandchildren, But 
if the welfare of children and grandchildren is to 
be taken into account, their consumption should 
also be part of the desiderata in the growth plan. 
Intermediate states as well as the final state must 
be evaluated according to some measure of welfare. 

This can be accomplished by introducing a 
utility function that depicts the preference order- 
ings (by the workers or by the ministry of wel- 
fare) of different streams of consumption and of 
amounts of work over the planning period. In par- 
ticular, suppose we discount consumption and the 
amount of work at each future date by a subjec- 
tive time-preference factor that does not exceed 
the turnpike growth rate. It is assumed that utility 
depends on the averages over all dates in the plan- 
ning period of these discounted values. An optimal 
growth path is one that maximizes this utility func- 
tion. 

With the aid of some additional assumptions, 
we can establish the following consumption turn- 
pike theorem: As the length of the planning period 
tends to infinity, any optimal path converges to the 
turnpike. As in the case of the final-state turnpike 
theorem, there will be cyclic exceptions to this 
consumption turnpike theorem if technology does 
not satisfy some restrictive conditions. Optimal 
paths may oscillate around the turnpike forever. 
However, a consumption mean turnpike theorem 
that is similar to the final-state mean turnpike 
theorem discussed above can be established, and 
there is no cyclic exception to the mean theorem. 

The consumption turnpike theorem, unlike the 
final-state turnpike theorems, could avoid cyclic ex- 
ceptions if it were assumed that society has an 
aversion to fluctuations. Of two consumption 
streams having the same average discounted values 
but having different fluctuations over the planning 
period, society may prefer the stream with the 
lesser fluctuations. An optimal growth path would 
then be one that maximizes a utility function that 
depends upon the average discounted values of 
consumption and of amounts of work and upon 
some measure of the variances of these streams. 
When the society's aversion to fluctuations is very 
strong, its optimal growth path will be a feasible 
path that has no fluctuations in consumption per 
capita, so that the convergence of consumption 
per capita to the turnpike is secured. 

Even in this case, however, it is still possible to 
have fluctuations of outputs around the turnpike; 
in fact, people would be prepared to accept fluc- 
tuations of outputs if fluctuations are required for 
an optimum steady stream of consumption. Hence, 


the introduction of very strong aversion to fluctua- 
tions will eliminate cycles in the per capita con- 
sumption of goods but not in per capita outputs. 


Micuto MORISHIMA 


[See also ECONOMIC EQUILIBRIUM; INTEREST; STATICS 
AND DYNAMICS IN ECONOMICS; and the biographies 
of von NEUMANN; WALRAS.] 
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Iv 
NONECONOMIC ASPECTS 


The concept of underdevelopment which has be- 
come quite popular since the end of World War n 
applies primarily to societies which are inferior to 
the principal Western nations in terms of economic 
performance and technological sophistication. In 
previous periods these societies were customarily 
referred to as “backward” or “arrested,” but these 
terms have fallen into disuse, and in their stead 
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“underdeveloped” or “developing” is used, In even 
more recent times, and with special regard to coun- 
tries emerging from colonialism, the concept of 
“new nations” is being employed, 

Although the condition of “underdevelopment” 
therefore is primarily an economic and technolog- 
ical feature of certain societies, it appears to have 
important sociological concomitants, and it is these 
social and cultural aspects of the condition of un- 
derdevelopment, as well as the variables which ap- 
pear important in analyzing the gradual emergence 
of these societies from a condition of economic 
and technological underdevelopment to a level of 
higher economic and technical performance, which 
will be stressed in the subsequent paragraphs. 

The description of underdevelopment and the 
identification of processes and policies which will 
lead out of it to higher levels of performance is 
primarily undertaken by economists. But econo- 
mists realize that in performing this task they have 
to take account of noneconomic, i.e., primarily so- 
cial and cultural, factors which strongly influence 
patterns of development. In fact, it may be argued 
that on the purely analytical level the economic 
problems of economic development are relatively 
simple, whereas the social and political aspects of 
the process are much more complex and elusive. 

It is generally acknowledged that the main path 
to higher levels of economic performance is 
through the rational organization of production, 
and this rational organization in turn depends 
primarily on the introduction of industry. Thus, in 
terms of occupational structure, underdevelopment 
is characterized by the absolute prevalence of agri- 
culture as the main source of livelihood; and eco- 
nomically more advanced societies usually show 
an increasing proportion of their labor force in 
secondary (manufacturing and mining) and ter- 
tiary (service) occupations. All societies which are 
in a state of underdevelopment, therefore, are non- 
industrialized or little industrialized, and the proc- 
ess of economic development may in a rough way 
be equated with the progress of industry and asso- 
ciated forms of economic activity. 

Classical approaches. We do not lack descrip- 
tions of nonindustrial societies, beginning with the 
ethnographic reports of “savages,” “barbarians,” or 
“exotic peoples.” But these accounts are of little 
analytical value. The first useful attempts to come 
to grips with the analysis of the social system of 
nonindustrial societies begin with the work of 
scholars who sought to describe entire social sys- 
tems associated with different levels of economic 
performance. Among the most famous attempts at 
holistic descriptions of differentiated social systems 
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are the well-known dichotomies of Ferdinand 
Tönnies (1887) between Gemeinschaft and Gesell- 
schaft, of Robert Redfield (1941; 1950) between 
folk and urban society, and of authors following 
Max Weber (1919-1920) between traditional and 
rational patterns of social action. This last contrast 
has received a great deal of attention, and we find 
not infrequently that action and behavior patterns 
in technologically and economically little developed 
societies are described as “traditional” and that this 
concept implies that these action patterns are in- 
efficient, technologically noncomplex, and strongly 
resistant to innovation, 

Redfield’s “folk society.” Before entering into a 
more detailed analysis of the concept of tradition- 
oriented behavior, it would be useful to draw atten- 
tion to two features of nonindustrialized societies 
which have received extensive treatment in the 
work of Redfield and Ténnies and their followers. 
One point often reiterated by Redfield is the fact 
that social acts in a folk society typically are not 
“single-interest” but “multiple-interest” actions. 
Productive activity, for example, not only has an 
economic purpose but also is conceived by the 
members of folk societies as containing ritual ele- 
ments, elements pertaining to social cohesion or 
structure, “political goals,” and others. This very 
“multidimensionality” of all social behavior in folk 
societies is at the bottom of some of the difficulties 
in bringing about changes in behavior. If social be- 
havior were unidimensional, change would be rela- 
tively easy. Since in addition to meeting one spe- 
cific objective, a given action is conceived of as 
simultaneously meeting other objectives, change is 
possible only if the new way of acting can be in- 
terpreted as comprising all these associated objec- 
tives also, In brief, some such behavior as planting, 
or harvesting, or engaging in exchange is con- 
ceived of not merely as productive activity but also 
as behavior maintaining the stability and relational 
adequacy of a person's position in his culture. 
Hence, if different forms of productive activity are 
proposed, they will prove acceptable (without 
strong external compulsion ) only if they also meet 
in some form or another all or most of the other 
objectives which were met by the activity to be 
replaced. 

Tönnies “small community.” The emphasis 
placed by Ténnies and his disciples on the signifi- 
cance of community also has played an important 
role in the study of conditions surrounding techno- 
logical change and economic innovation. The point 
to be emphasized in this context is that for many 
nonindustrialized societies, the small group is the 
relevant unit of social cohesion. This small com- 
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munity often has its origin in tribal or kinship re- 
lations, but what is important about it is that its 
membership is usually strictly circumscribed, lim- 
ited to persons who have either long-standing face- 
to-face relationships or some other form of close 
common identification. All outsiders, i.e., all per- 
sons who do not belong to the small community, 
are strangers and are often regarded with suspicion. 

In many underdeveloped societies these highly 
particularistic groups still exist, and in some in- 
stances they have considerable strength, They often 
appear as tribal groups, but also may constitute 
village communities, castes, or other associations 
based on kinship or quasi-kinship ties or joint occu- 
pancy of a small area, Usually the group lives in a 
compact area. Geography and familiarity thus rein- 
force one another, and the small community may 
be considered in some cultural contexts as a hard- 
shelled unit whose main forms of social interaction 
occur only within the community and whose rela- 
tions to the outside are tenuous and often associ- 
ated with suspicion and fear. In the development 
process, strong tendencies are set in motion to 
break up this isolation of the community and to 
enmesh its members in different forms of social 
relations with the outside. Moreover, it is often 
those persons who have low status within the com- 
munity who most easily penetrate the wall built 
around it and who tend to interact most frequently 
with the remainder of the society. 

Thus the significance of the small community is 
its resistance against absorption into the “great 
society” and the conflicts which arise on the social 
and personal level in the absorption process, in 
which primary loyalties to the small group grad- 
ually tend to be replaced by loyalties to the larger 
society. Moreover, many of the institutions which 
have meaning within the context of the small com- 
munity lose this meaning in the framework of the 
larger society, The family as a productive unit that 
yields economic security loses its place in a society 
in which industrialization has occurred and in 
which the economic ties are with persons outside 
the kinship group and economic security is ob- 
tained through governmental or other insurance 
schemes. Similarly, the intimate relation with tri- 
bal or village deities loses its full significance for 
those who enter the larger society, and the patterns 
of deference and authority within the small com- 
munity have no force outside it. All this tends to 
produce conflicts within and beyond the small com- 
munity. One may notice them in India, where they 
manifest themselves in conflicts within and be- 
tween castes; in Africa, where they appear as strug- 
gles between centralized authorities and tribal 


chieftains; and elsewhere, where they take on still 
different forms. 

The strong nationalism which permeates many 
industrializing countries is the chief ideological 
underpinning for a process of social change which 
leads from the ubiquity of small, particularistic 
communities to a more uniform, structurally di- 
versified but more highly interdependent society. 
Other processes associated with industrial develop- 
ment and less subject to manipulation by agitators 
and intellectuals support this trend. Among these 
are urbanization and bureaucratization of govern- 
mental and productive procedures. The onslaught 
on the small community thus comes from all sides, 
and it is not likely that it can withstand the com- 
bined impact of these forces. 

From the description of the small community 
and the folk society, with its multifaceted features 
of social interaction, we can see that these two con- 
cepts are mainly different descriptions of the same 
social type. A third, and on the whole more pop- 
ular, way of describing this type of society is to 
point to its traditionalism or, perhaps more cor- 
rectly, to describe it as a society in which tradi- 
tional forms of action predominate. As we have 
seen, the concept of traditional social action ulti- 
mately goes back to Max Weber (1922), who jux- 
taposed it to different forms of rational action. 
Since the concept of traditional or tradition-oriented 
action has gained such wide popularity, it must be 
considered in somewhat more detail. 

Weber's “traditional behavior.” Although Max 
Weber did not explicitly make the important dis- 
tinction between traditional action and traditional- 
istic action, it is implicit in his work. Traditional or 
tradition-oriented social action is found in all so- 
cieties, Weber described it as action based upon 
the psychic attitude set for the habitual workaday 
and the belief in the everyday routine as an invio- 
lable norm of conduct (1922). But beyond this, the 
concept of traditional action may be widened by 
including forms of social behavior which have been 
taken over from ancestors and forebears, because 
all populations have a sense of their historical past 
and a need for continuity of behavioral norms and 
because these forms of social action can appropri- 
ately be fitted into the action schema of a society. 
This does not mean that traditional action excludes 
change. Many practices in modern, highly indus- 
trial societies are based on tradition and traditional 
norms. This is true of such everyday behavior pat- 
terns as forms of greeting and rules of personal 
conduct, but it is also true of more complex be- 
havior in the political and economic spheres. It 
would be difficult to separate societies at different 
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levels of economic performance by the relative 
“quantity” of tradition-oriented behavior that their 
members show. 

Traditionalistic action, on the other hand, may 
be defined as action based upon the self-conscious, 
deliberate affirmation of traditional norms in full 
awareness of their traditional nature and alleging 
that their merit derives from that traditional trans- 
mission from a sacred orientation (Weber 1919- 
1920). In other words, traditionalistic action is a 
conscious revival of a past glorious age or past 
sacred lore, an ideology which looks to the past as 
providing a set of norms whose revival would again 
lead to splendor and greatness. In the nationalisms 
of many industrializing countries, especially those 
whose roots go deep into a great past—such as 
India and the Islamic countries—we can find strong 
traditionalistic admixtures. These ideologies consist 
not merely in the rejection of Western values; the 
revival and revitalization of old values, often long 
dead, is demanded, and external behavior is ap- 
proved which is thought to be in conformance with 
these norms, 

Traditional action, though it may sometimes con- 
flict with the demands of modernization and tech- 
nological change, usually is an important reinforc- 
ing element in the preservation of stability in a 
period of rapid change. On the whole, the perma- 
nence of manifold traditions in social behavior may 
be an important factor mitigating the many disloca- 
tions and disorganizations which tend to accom- 
pany rapid industrialization and technical change. 
Traditionalistic action, on the other hand, since 
it tries to elevate outdated practices and values to 
the level of current behavioral norms, usually has 
a reactionary character and tends to retard eco- 
nomic change. [See the biographies of REDFIELD; 
Tönnies; and WEBER, MAX.] 

Parsons’ pattern variables approach. One at- 
tempt to elaborate further the characteristics of 
tradition-oriented societies and, in fact, to break 
into its components the rationalism-tradition di- 
chotomy, is the description of societies at different 
levels of economic performance by means of the 
pattern variables developed by Talcott Parsons 
(1951). 

Of the five pairs of pattern alternatives stated by 
Parsons, three are immediately applicable: the 
choice between modalities of the social object 
(achievement versus ascription), the choice be- 
tween types of value orientation (universalism 
versus particularism), and the definition of scope 
of interest in the object (specificity versus diffuse- 
ness). In applying these three pattern variables to 
the distinctions between industrialized and pre- 
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dominantly nonindustrial societies, we find that 
industrialized societies are characterized by the 
predominance of achievement standards in the dis- 
tribution of economic roles and objects, that they 
employ universalistic criteria in this distribution 
process, and that economic roles in these societies 
are typically functionally specific. Underdeveloped 
societies, on the other hand, predominantly exhibit 
features of ascription, particularism, and func- 
tional diffuseness in the corresponding fields of 
social action. It should be emphasized that we are 
considering norms of social behavior. In other 
words, the complex of pattern variables present in 
any society constitutes an ideal type. 

Achievement—ascription. The achievement—as- 
cription dichotomy is closely related to, though 
not identical with, the contrast between status- 
oriented and contract-oriented societies first stressed 
by Henry Sumner Maine (1861). If we apply this 
dichotomy to economic objects, we find that in a so- 
ciety in which ascription is the norm, economic roles 
are distributed ideally on the basis of who a person 
is rather than what he can do. A practical example 
of a society based on ascription would be an “ideal” 
caste system, in which each caste is associated with 
a certain occupation and in which only members of 
a given caste are admitted to that occupation. The 
caste system, however, although it may have come 
close to this ideal in some localities at certain 
times, never, as a whole, exhibited fully ascriptive 
features [see CasTE]. But it is quite clear, from the 
example given, that in a society in which economic 
roles are assigned on the basis of status or ascrip- 
tion, social mobility is made difficult and social 
change, to the extent to which it depends upon 
mobility, is severely impeded. 

In contrast, the predominance of an achievement 
norm with respect to the distribution of economic 
roles means that the primary criterion for attain- 
ing a certain occupation is based on a person's ca- 
pacity to perform the required tasks. In practice, 
an actual test may be involved in the process of 
allocating economic roles, or, lacking this, certain 
objective criteria, such as successful completion 
of a certain number of years of school, or the ob- 
taining of a degree, may be prescribed. Again, it is 
well known that pure performance criteria are not 
applied everywhere in industrialized countries, but 
the ideal of an achievement norm is strong enough 
so that even where economic roles are actually 
allocated on the basis of ascription, the pretense 
is made that the performance requirement has 
been met. 

Universalism—particularism. The next pair of 
pattern alternatives, universalism—particularism, is 
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related to the first pair. They do not prescribe 
norms concerning who is to perform a given role 
but stipulate whether the same rules apply to 
everyone. A good example of the application of 
particularistic norms with respect to economic ac- 
tion is the case of European medieval society, 
in which specific rules applied to peasants and 
burghers, to nobles and commoners. In many un- 
derdeveloped societies, certain markets and certain 
transactions are reserved to certain groups, and 
only the admission of an outsider to an otherwise 
closed group permits him to perform the functions 
reserved for this group. The principle of universal- 
ism, on the contrary, makes no such distinctions. 
The same rules apply to all, the principle of formal 
equality being elevated to a general norm of social 
behavior, 

Specificity—diffuseness. It almost follows log- 
ically that in a society in which economic roles are 
distributed on the basis of universalistic perform- 
ance criteria, the roles themselves are functionally 
highly specific. This requirement is an outflow of 
the rigorous application of the principle of achieve- 
ment, which is of little value unless a role can be 
clearly defined and circumscribed. Functional spec- 
ificity is, moreover, an outflow of the increasing 
division of labor. Adam Smith, in his famous ex- 
ample of the manufacture of pins, clearly demon- 
strates its economic advantage. For, as he points 
out, “the improvement of the dexterity of the work- 
man necessarily increases the quantity of work he 
can perform; and the division of labour, by reduc- 
ing every man’s business to some one simple op- 
eration, and by making this operation the sole em- 
ployment of his life, necessarily increases very 
much the dexterity of the workman” (Smith [1776] 
1937, p. 7). There are other advantages, in Adam 
Smith’s view, to the division of labor, but he quite 
accurately places the development of a high degree 
of functional specificity first. 

Functional diffuseness stands in direct contrast 
with specificity. The simple peasant in a nonindus- 
trial society is a characteristic representative of the 
functionally diffuse. He not only performs all work 
connected with producing a crop but he also builds 
his house, makes his implements, and often pro- 
duces his own clothes and other final consumption 
goods. As in the cases of the ascription—achieve- 
ment duality and the particularism—universalism 
duality, we find noncharacteristic cases which seem 
to contradict the generalization that diffuseness is 
normally associated with underdeveloped societies. 
For example, in India the system of social division 
of labor under the predominance of caste has led, 

even in a nonindustrial society, to a high degree of 


functional specificity. On the other hand, certain 
occupations, especially on the highest managerial 
level, are functionally diffuse, even in highly indus- 
trialized societies. In general, functional specificity 
has been instituted more widely for the simpler and 
less complex tasks, but the progressive specializa- 
tion in business management, and even in scien- 
tific pursuits, is a sign that this process of occupa- 
tional differentiation is ubiquitous and strong in 
modern advanced societies. 

The use of pattern variables has the advantage 
of bringing into sharper focus some of the strategic 
mechanisms of social change associated with in- 
dustrialization and technical progress. Although 
universalistic norms need not generally replace 
particularistic ones, the transition from a system of 
ascription to one based on achievement in the allo- 
cation of economic roles and the replacement of 
functionally diffuse by functionally specific norms 
for the definition of economic tasks appear to have 
taken place in all cases of successful economic de- 
velopment. It is, therefore, useful to relate the de- 
scription of industrial and nonindustrial societies 
which uses the pattern variables to the earlier de- 
scriptions which used the folk—-urban continuum 
and the community-society dichotomy. 

Integration of approaches. We have seen earlier 
that the folk society is characterized by a multi- 
dimensional meaning of economic acts, i.e., their 
relevance not merely as acts of production or ex- 
change but also as acts of ritual, assertion of asso- 
ciative values, and so on. Alternatively, this fact 
can be described by pointing to the high degree of 
functional diffuseness of economic acts in the folk 
society. If a particular form of social behavior has 
meaning in several spheres of social action, it 
must, by necessity, be diffuse. It has many mean- 
ings, and although the actual manipulations de- 
manded in its performance may be rigidly pre- 
scribed, its “multidimensionality” gives it the 
character of functional diffuseness, which, inci- 
dentally, also makes it so resistant to change. 

Similarly, we may conceive of the small commu- 
nity as a set of institutionalized interpersonal rela- 
tionships based primarily on ascriptive character- 
istics. The cohesion and compactness of the small 
community are enhanced because economic roles 
are tied to ascriptive status and because even 
where there is a considerable degree of specificity 
in different economic roles, as in the Indian village, 
ascriptive norms provide a stability and internal 
rigidity which makes change from within exceed- 
ingly difficult. It is only the breakup of the small 
community or its infiltration from the outside 
which tends to reduce the significance of ascrip- 
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tion in the distribution of economic (and other, 
e.g., political or deference) roles. 

The gradual destruction of the traditional folk- 
like small community thus is accompanied in the 
economic and technological spheres by a process 
of differentiation of economic roles and a relaxa- 
tion of the rules assigning those roles to particular 
actors. But the process is not a smooth one; it pro- 
ceeds in spurts and jolts. In this process new insti- 
tutions develop, and as each new institution be- 
comes established, it provides a pivotal point 
around which further changes gather momentum. 
If the juxtaposition of societies on different levels 
of economic modernization provides a classic case 
of the comparative analysis of social institutions, 
the analysis of the transformation from one to the 
next level of economic performance may be re- 
garded as a study of the dynamics of institutional 
change. 

Social deviance. This is not the place to dis- 
cuss in an exhaustive manner the process of insti- 
tutional change, which is a much broader process 
than merely overcoming a state of economic un- 
derdevelopment. But it may be useful to discuss 
the phenomena of social deviance and social mar- 
ginality, because they have been applied quite 
commonly in the analysis of the social concomi- 
tants of industrialization and economic develop- 
ment, 

Deviance. Let us first turn to a brief considera- 
tion of social deviance. As already stated, we are 
concerned here primarily with those forms of de- 
viant behavior which are relevant for economic 
activity and organization. If the concept of de- 
viance is to have operational meaning, it cannot be 
interpreted simply as signifying behavior which is 
new; it must imply that this set of innovating acts 
is opposed in some way to existing social norms or 
approved forms of behavior. In other words, a de- 
viant always engages in behavior which constitutes 
a breach of the existing order and which is either 
contrary to, or at least not positively weighted in, 
the hierarchy of existing social values. If we apply 
this concept to the behavior displayed by business- 
men and merchants in the course of the economic 
history of western Europe, we find that we can 
speak of genuine deviance in those periods and so- 
cieties in which entrepreneurial behavior did not 
belong to the category of social actions which were 
considered as constituting the “good life.” As late 
as the fifteenth century, this was true of certain 
kinds of financial entrepreneurship, which was 
always tainted by the official opposition of the 
church against usury. And later, when financial 
entrepreneurship became fully respectable, indus- 


trial entrepreneurship came to be regarded with 
some disdain because it “dirtied one’s hands.” These 
sentiments toward business or industrial activity as 
not quite proper for a gentleman are familiar in 
many underdeveloped countries today. 

If deviance implies some breach with existing 
social norms, it is interesting to investigate further 
from what social classes or groups persons come 
who engage in various forms of deviant behavior. 
Clearly, the expected rewards of this behavior must 
be attractive, and persons engaged in this behavior 
are likely to feel a strong urge to rise in the social 
scale (perhaps a strong motivation for achieve- 
ment) or to have resentments against some aspects 
of the existing order. It is in these terms that the 
rise of the western European bourgeoisie or even 
of the lower samurai groups in Japan have been 
explained. In the view of some, this reshuffling of 
social positions is the result of class struggles; in 
the view of others, it is a gradual evolutionary 
process. 

Marginality. An alternative hypothesis is that 
persons engaging in deviant behavior are at the 
margin of a given culture or are in a social or cul- 
tural position in which they straddle more than 
one culture. We may identify cases in which de- 
viance coincides with social marginality. For ex- 
ample, in medieval Europe the earliest money- 
lenders were often foreigners. In Italy at the time 
of Gothic and Langobard rule, they were Syrians, 
Byzantines, and Jews. Later, when Italians turned 
to financial entrepreneurship on a large scale, the 
Genoese and Pisans, Sienese and Florentines, 
who were all lumped together under the name 
“Lombards,” became the financial entrepreneurs 
north of the Alps. 

The role of marginal individuals in various 
economic pursuits in many economically under- 
developed countries is eminently manifest today. 
One could cite the Chinese in various southeast 
Asian countries, the Indians in east Africa, and 
the widely scattered Lebanese and Syrians who 
make their appearance as businessmen in west 
Africa, Latin America, and elsewhere in poor 
societies. 

What is the mechanism which allows marginal 
individuals to perform the roles they apparently 
have so widely accepted? As Robert E. Park 
(1913-1944), the inventor of the concept and of 
the significance of social marginality, has stressed, 
marginal men are—precisely because of their am- 
biguous position from a cultural, ethnic, linguistic, 
or sociostructural standpoint—strongly motivated 
to make creative adjustments in situations of 
change and, in the course of this adjustment 
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process, to develop innovations in social behavior. 
Although many of Park’s very general propositions 
have been refined by subsequent researchers, the 
theory of social marginality has not advanced 
sufficiently to supply convincing evidence for the 
role marginal individuals may play in all episodes 
of social change. Even if it is admitted that mar- 
ginal persons tend to make creative adjustments 
more often than to relapse into old orthodoxies or 
to embrace new ones, the record is not at all clear, 
and there are some students who warn us that 
marginal individuals may be more prone than 
others to succumb to anomie and thus to become 
carriers of trends leading toward social disorgan- 
ization rather than to creative innovations. [See the 
biography of PARK.] 

Institutionalization of deviance. In circum- 
stances in which a certain amount of deviant be- 
havior has been displayed, the anchoring of this 
behavior in a new institution is of strategic sig- 
nificance. Originally a form of deviance, it becomes 
routinized and may display all the characteristics 
of some highly approved form of social behavior. 
Thus the institutions in which deviant action is 
anchored form an advance post from which further 
deviance becomes possible. For example, the insti- 
tutions which arose in western Europe before the 
industrial revolution and in Japan before the Meiji 
period were already the end products of a process 
of social change which had begun with deviant 
behavior; these institutions, in turn, by their very 
existence made possible further economic and 
technological change. 

Sanctions. Whether or not any given form of 
deviance will lead to the elaboration of new social 
institutions and ultimate routinization of this 
pattern of social action will depend upon several 
factors, among which the system of sanctions 
existing in a society may be the most important. 
These sanctions may be internalized, i.e., they 
may reside in the values and beliefs of a popula- 
tion; or they may be externalized, i.e., they may be 
imposed by persons in power, by the elite, against 
actual or would-be deviants. It appears that in 
some societies, e.g., imperial China, both these 
types of sanctions were very strong. In pre-Meiji 
Japan, internal sanctions had partially broken 
down, and the power of the shogunate had become 
increasingly weak, so as to soften external sanc- 
tions to a point at which they were inadequate to 
prevent the formation of new institutions, or at 
least the beginnings of these innovations. 

Strategic groups. Thus the analysis of social 
change may be couched largely in terms of con- 
sidering the impact of deviance, whether exercised 


by marginal men or not, the gradual institution- 
alization and routinization of deviance, and the 
range of sanctions opposed to deviant behavior. 
This analysis may be carried out, initially, on the 
“aggregate level,” i.e., it may take into account an 
entire society at once. But our insights into social 
change may be sharpened if we disaggregate the 
variables in our analysis, i.e., if in a complex so- 
ciety, we take account not of changes affecting the 
society as a whole but of those affecting special- 
ized sections or classes in the society. For deviance, 
sanctions, and the process of institutionalization 
have a different place and impact among different 
groups. Take sanctions as an example. Clearly, 
in societies in which ascriptive norms are strong, 
different individuals, depending upon their status 
positions, will be subject to different internalized 
sanctions; and in a society with extensive particu- 
larism, even external sanctions will be imposed 
and enforced to a very different degree on persons 
belonging to different groups or classes. 

Elites. Thus, we are likely to discover in any 
society certain strategic groups which become the 
carriers of innovations. In some instances these 
groups may be composed of marginal individuals, 
especially if the innovations are transmitted from 
the outside. The role of marginal individuals in 
the acculturation or culture contact process has as 
yet been insufficiently explored, but their promi- 
nent participation in this process follows almost 
as a matter of definition. Another group which 
often plays a strategic role is the elite of a society. 
Although considerable attention has been given to 
the role of the elite in preserving a status quo, its 
impact on the introduction of organizational and 
technological innovations has perhaps been under- 
estimated. In general, social change has been seen 
as being accompanied by a “circulation of elites,” 
rather than as a process in which existing elites 
are capable of reorienting the systemic goals which 
they attempt to implement. Yet in the present eco- 
nomically underdeveloped countries, in which so 
much economic change is managed by the holders 
of political power, the role of the existing elites 
as innovators must be acknowledged. The entre- 
preneurial functions, which in Western countries 
were displayed predominantly by independent 
businessmen often belonging to a not fully en- 
franchised and politically impotent bourgeoisie, 
have been taken over by bureaucrats who operate 
with the blessing and under the protection of the 
political power apparatus. [See ELITES; ENTRE- 
PRENEURSHIP.] 

Role of the state. In part, the intervention of 
the state in the industrialization process is Cer- 


tainly an outcome of the greater pressures, of the 
greater distance between reality and aspirations, 
which exist in the present. We may list several 
general factors which tend to enhance the role of 
the state in the process of economic growth. The 
urge for massive state intervention in the process 
of economic and industrial growth will be the 
stronger: (1) the greater the range of ends and 
the higher the level of attainment sought; (2) the 
shorter the time horizon within which the ends 
are to be attained, that is, the more rapid the rate 
of economic growth desired; (3) the more unfavor- 
able the factor and resource endowments; (4) the 
greater the institutional barriers to economic 
change and industrialization; and (5) the more 
backward the economy in relative terms. As time 
progresses without development, the fifth condition 
is bound to obtain, simply because the later the 
onset of industrialization, the more backward a 
country will be in relative terms. But if this con- 
dition holds, it is also likely that the first, second, 
and fourth conditions will obtain, and thus we 
may conclude from this empirically derived set 
of conditions that in the course of time incentives 
and urges for state intervention in the industri- 
alization process are constantly on the increase. 
But this means that industrialization as a goal 
progressively becomes an objective of over-all 
social policy and that existing elites, whatever their 
primary ends may have been in the past, have 
reoriented the hierarchy of the systemic goals by 
assigning an increasingly important place to eco- 
nomic development. But the increased interest of 
governments in economic growth and industriali- 
zation means not merely that they are capable of 
exercising control over the total resources of a 
society to be applied to its economic buildup, but 
also that, more effectively than any other agency, 
they can influence the forms of social behavior by 
altering patterns of rewards and sanctions and 
otherwise intervening in the social structure. This 
shows again that, in spite of the strong impact of 
ideological considerations in present-day develop- 
ing nations, the conditions of social structure and 
the state of relative retardation and underdevelop- 
ment exert, in turn, an important influence upon 
the ideological forms as well as the socioeconomic 
relations under which the development process 


takes place. 
Bert F. HOSELITZ 
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I 
ANCIENT AND MEDIEVAL THOUGHT 
Ancient period 
Greek thought. In ancient Greece, “economics” 
did not mean what it means today; rather, it em- 
braced everything pertaining to household manage- 
ment. (It kept this meaning until the eighteenth 
century.) Political economy, or economics in the 
present sense of the word, was not regarded as an 
independent discipline but was an integral part of 
ethics or politics. Socioeconomic questions were 
the province of ethics insofar as they dealt with 
private business contracts and of politics insofar 
as they touched upon public policy and affected 
social arrangements. It is not surprising, therefore, 
that Aristotle discusses economics in his books 
Nicomachean Ethics and Politics. 
The Greek philosophers did not go beyond funda- 
mentals and trite observations, Yet, their contribu- 


tions are important because, as Joseph A. Schum- 
peter (1954) remarks, their economics is the 
fountainhead of practically all further work. They 
raised all the crucial questions, from value and 
price to economic organization, with which econ- 
omists are still concerned at the present time. 

Plato. Plato (427-347 B.C.) started in his 
Dialogues from the premise that no individual is 
self-sufficient and stated that cooperation and 
mutual intercourse are therefore the basis of the 
state as well as of the economy. Division of labor 
creates efficiency because, according to Plato, diver- 
sity of innate talents prompts individuals to spe- 
cialize in what they are best fitted for, a fact that 
is sometimes overlooked or even denied—for exam- 
ple, by Adam Smith. States also are not self- 
sufficient, and rarely do they possess such a variety 
of resources that they can get along without com- 
merce. The existence of an exchange economy is 
taken for granted, since Plato stated that each 
community will need a market place and a token 
money for purposes of exchange. Contrary to what 
is often asserted, Plato was not really an advocate 
of communism. In his scheme, communism was 
limited to the warrior class, whose members were 
forbidden to own property and were expected to 
share common meals and to live together like sol- 
diers in a camp. Nevertheless, Plato was taunted 
by his pupil Aristotle for making this impractical 
proposal. 

Aristotle. Like Plato, Aristotle (384-322 B.C.) 
assumed from the outset the existence of an ex- 
change economy based on the division of labor and 
the institution of private property. He justified the 
latter on grounds of efficiency—people take better 
care of what is their own than of what is held in 
common. Money is necessary to obviate the incon- 
veniences of barter. Besides being a medium of 
exchange, money is a measure of all things and 
“a guarantee of exchange in the future,” since it 
can be stored until needed. Although money, Aris- 
totle recognized, varies in purchasing power, its 
value tends to be rather constant and more stable 
than that of other commodities. Therefore, Aris- 
totle, unlike Plato, did not favor a token currency, 
but required money to be made of a substance, 
such as silver, that is “really valuable itself” as well 
as easy to carry. 

According to Aristotle, the source of value is 
need, for no exchange will take place without need; 
and the basis of exchange will differ as needs 
differ. Although in his Topica Aristotle referred to 
the marginal principle, there is no evidence that 
he ever thought of applying it to demand theory. 
Instead, he developed the concept of value in use 
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and value in exchange, which much later involved 
Adam Smith in inextricable difficulties. Medieval 
and later commentators have tried in vain to eluci- 
date the obscure passages that Aristotle devoted to 
price determination in Book V of his Nicomachean 
Ethics. While remaining silent about the virtues of 
competition, he mentioned monopoly as a device 
for exploiting the public. 

Aristotle was definitely of the opinion that eco- 
nomic relations ought to be ruled by justice, of 
which there were two kinds: corrective, or commu- 
tative, justice, which applied to such private trans- 
actions as buying and selling; and distributive 
justice, which regulated the distribution of wealth. 
Corrective justice rested on the principle of equiva- 
lence; distributive justice was based on merit, the 
criteria of which might vary from one society to 
another, This whole concept of justice was later 
taken over by Thomas Aquinas and the Scholastics, 
almost without modification, and is still accepted 
by Roman Catholic social scientists, 

Aristotle cited the story of Midas to emphasize 
the point that money is not identical with wealth. 
Agriculture and household management were hon- 
orable, but he frowned upon “krematistics,” or such 
wealth-accumulating activities as trade. Above all, 
he deplored moneylending because it involved 
usury—“money bred of money” is unnatural be- 
cause money was invented to serve as a medium 
of exchange. These views, too, were adopted by 
the Scholastics. 

Roman thought. In contrast with the Greeks, 
the Romans were not strong in speculative philos- 
ophy and made no significant contributions to eco- 
nomics, with the possible exception of treatises on 
agriculture which were practical rather than the- 
oretical. The Romans, being administrators, ex- 
celled in legislation; and their great contribution 
is the body of law codified by Emperor Justinian, 
who reigned from 527 to 565. To be sure, Roman 
law is a compilation of legal texts containing noth- 
ing that even remotely approaches economic analy- 
sis; however, because of the use made of it by the 
Scholastics, it is very important to the evolution of 
economic thought. 


Medieval or scholastic thought 


Scope and method. Scholastic economics is 
often regarded as a medieval doctrine; but this, 
strictly speaking, is incorrect. Although it had its 
roots in the Middle Ages, it outlived this period by 
more than two centuries. Far from dying around 
1500, scholastic economics continued to flourish 
throughout the sixteenth century with the famous 
school of Salamanca, founded by the great jurist 
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Francisco de Vitoria, o.P. (c. 1480-1546). It 
reached its zenith of refinement and elaboration 
in the great works of synthesis and vulgarization 
of the seventeenth century and still had some vigor 
left in the eighteenth century, when Pietro Ballerini 
(1698-1769) and Daniel Concina, o.P. (1687— 
1756) took a last stand to defend the church's 
traditional doctrine on usury against the insidious 
attacks of Marchese Francesco Scipione Maffei 
(1675-1755) and Nicolas Broedersen (c. 1690- 
1772). 

Even then scholastic economics did not die, It 
left its imprint, although unacknowledged and 
even disavowed, on the works of Abbé Ferdinando 
Galiani (1728-1787) and Abbé Antonio Genovesi 
(1712-1769), both Neapolitan forerunners of the 
classical school. Adam Smith (1723-1790), if 
Schumpeter is correct, owed more to Scholasticism 
than to mercantilism or physiocracy. In any case, 
scholastic doctrines were transmitted to him by 
Hugo Grotius (1583-1645) and Samuel von Pufen- 
dorf (1632-1694), whose works were used as text- 
books when Smith attended the course in moral 
philosophy taught by Francis Hutcheson (1694— 
1746) at Glasgow College. This is a fact not to be 
denied, Regardless of controversies about who in- 
fluenced whom and to what extent, there was 
continuity—not a sudden break—a point that 
should be stressed. 

Like the Greek philosophers, the scholastic Doc- 
tors did not consider economics as an autonomous 
subject, but as a branch of moral philosophy (or 
moral theology). This is significantly the subject 
taught by both Smith and Genovesi, the former at 
Glasgow College and the latter at the University 
of Naples. However, Smith made economics into 
an independent science governed by expediency 
rather than by ethics. 

In building their philosophical system, of which 
economics was a part, the scholastic Doctors com- 
bined elements from five different sources: the 
Bible, patristic literature, Greek philosophy, canon 
law, and Roman law. Greek philosophy, especially 
that of Aristotle, and Roman law were perhaps the 
most important for their economics. Canon law 
supplied only the canons outlawing usury and 
those denouncing trade as a sinful occupation. The 
scholastic writers took from Roman law the classi- 
fication of contracts, which provided the frame- 
work for their whole doctrine. For example, the 
just price was discussed in connection with the 
emptio-venditio, or sales contract; and usury, in 
connection with the mutuum, or straight loan. 

The medieval mind was legalistic. The question 
asked was not how the economic system func- 
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tioned, but whether this or that was licit or illicit, 
just or unjust. In other words, the scholastic ap- 
proach to economic problems was legal and ethical 
rather than mechanistic. The overemphasis on 
usury by many scholastic authors may have given 
the impression that this one question was the core 
of their doctrine, but this is not so. According to 
the Schoolmen, the scope of economics was to de- 
termine the rules of justice that applied to the 
exchange of goods or services (commutative jus- 
tice) and to the distribution of income and wealth 
(distributive justice), This distinction between 
commutative and distributive justice was, of course, 
borrowed from Aristotle. Social justice is a new 
concept added by neoscholastic writers in recent 
times. 

Like Marxian dialectics, the scholastic method 
follows a set pattern. One is certain to find eco- 
nomic questions discussed in any scholastic treatise 
dealing with moral theology or entitled De con- 
tractibus (“Concerning Contracts”) or De justitia 
et jure (“Concerning Justice and Law”). Often it 
will be unnecessary to go beyond the title page or 
the table of contents in order to identify a treatise 
as belonging to the scholastic school. 

The scholastic Doctors, following Aristotle, as- 
sumed that man was unable to minister to his own 
needs without assistance from his fellow men. 
According to Thomas Aquinas, 0.P. (1225?—1274), 
the division of labor was ordained by a divine 
providence that endowed men with greater inclina- 
tions for one profession than for another (Summa 
contra gentiles 111, 134). 

In accordance with canon law, the community 
of goods was regarded as utopian, except when 
practiced on a small scale in monasteries or con- 
vents. While not an institution of natural law, pri- 
vate property, Thomas Aquinas declared, was an 
addition thereto devised by human reason (Summa 
theologica 11-11, 66, 2, ad 1). He justified its exist- 
ence on the same grounds that Aristotle did: first, 
because common property is apt to be neglected 
and, second, because public ownership only engen- 
ders confusion and discord, In the absence of plan- 
ners, a planned economy was still inconceivable. 
Although property was privately owned, the use of 
it was common, and superfluities ought to be given 
to the poor. However, people were entitled to live 
as befitted their station in life. A knight, for exam- 
ple, was not required to give away his horses, 
because he might need them to fulfill his feudal 
obligations. Only in case of extreme necessity did 
all things revert to common ownership. Thus, a 
poor man on the point of starvation did not steal 
when he took a piece of bread without the permis- 
sion of its owner. 


The scholastic Doctors extolled agriculture as an 
occupation leading to virtue, but shared all the 
prejudices of Aristotle and of the Church Fathers 
against trade. Did not the canon law proclaim that 
merchants could scarcely, if ever, please God 
(Decretum Dist. 88, c. 11)? The attitude of the 
theologians mellowed only gradually, Thomas 
Aquinas approved of manufacturers and of im- 
porters who brought needed commodities from 
abroad. Later, the warehousing function was also 
recognized as legitimate, but retailing had to wait 
until the sixteenth century before receiving ap- 
proval. 

The scholastic doctrine centered on two main 
problems: the theory of the just price, and usury. 
Both have given rise to a great deal of misunder- 
standing. 

Value theory. The scholastic Doctors were al- 
most unanimous in recognizing utility as the source 
of value. In the words of Schumpeter, their analysis 
“Jacked nothing but the marginal apparatus” (1954, 
p. 1054). Value was not regarded as an intrinsic 
quality, but as something dependent upon the men- 
tal process of valuation. To illustrate this point, 
the Schoolmen often cited St. Augustine (354— 
430), who stated in The City of God that, human 
superiority notwithstanding, a horse or a gem was 
often worth more than a slave, Some even turned 
this argument into a paradox by insisting that, if 
value were a matter of natural dignity, a living 
creature, such as a fly, would be more valuable 
than all the gold in the world. Value, in other 
words, was a function of utility. The other element 
of value—scarcity—was not overlooked. Perhaps 
one of the best scholastic expositions of value 
theory is found in the sermons of St. Bernardino 
of Siena, 0.F.M. (1380-1444). However, he is not 
the original author of the passage, which he ap- 
propriated, without acknowledgment, from a manu- 
script—still unpublished—of Pierre Olivi, 0.F.M. 
(1248-1298 ). He felt free to do this because Olivi 
had been accused of heresy. According to Olivi and 
Bernardino of Siena, there are three sources of 
value: scarcity (raritas), utility (virtuositas), and 
desirability (complacibilitas). Scarcity does not call 
for comment; utility is an objective quality: want- 
satisfying power; complacibilitas can only have one 
connotation: a subjective desire to gratify a want. 
This interpretation agrees with that of Schumpeter 
(1954, p. 98) who, however, wrongly regards St. 
Antoninus of Florence, 0.P. (1389-1459) as the 
originator of this conception. It is regrettable that 
this line of thought was not pursued further. The 
Scholastics were certainly on the right track, al- 
though they did not quite succeed in solving the 
riddle of value. 
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The just price. Next to value comes price de- 
termination, Roman law had left this matter to the 
higgling and haggling between contracting parties. 
The medieval glossators added the phrase sed com- 
muniter (“but it must be commonly”) to the prin- 
ciple that goods are worth as much as they sell for 
(res tantum valet, quantum vendi potest). Thus, 
price became a social phenomenon to be deter- 
mined by the community. How can a community 
set a price? There are two possibilities: either 
spontaneously, by the chaffering of the market, or 
authoritatively, by public regulation. The first later 
became known as the “natural” or “vulgar” price; 
the second, as the “legal” price. In the absence of 
regulation, the market price was presumed to be 
just. This was the theory of both the civil law and 
canon law jurists. Among the theologians, only the 
Thomists acquiesced and arrived in their own way 
at the same conclusion; the Scotists and the nom- 
inalists dissented. Thus the Scholastics were 
divided and supported three rival, and partly con- 
flicting, theories of the just price. 

Albertus Magnus (1193-1280) stated unambig- 
uously that the just price is set by the estimation 
of the market at the time of the sale. His pupil 
Thomas Aquinas was less specific and precise. 
Aquinas told a story (borrowed from Cicero) of a 
merchant who brought wheat to a country where 
there was a dearth. The question raised is whether 
this merchant may sell his grain at the prevailing 
price (pretium quod invenit) or whether he is 
bound to disclose that additional provisions are on 
the way. Thomas Aquinas answered his own ques- 
tion by stating that this merchant was not bound 
to do so by the rules of justice, but would act more 
virtuously if he did tell, or if he lowered his price. 
This answer, it seems, leaves no doubt as to the 
position of Thomas Aquinas. 

The same point of view was taken by St. Bernar- 
dino of Siena, John Nider (1380-1438), and the 
majority of the theologians: the just price is set 
by “common estimation,” which means the ap- 
praisal of the market place—with the reservation 
that they never questioned the right or even the 
duty of the public authorities to set and regulate 
prices in an emergency. 

Nothing has been said thus far about the cost 
of production as a price determinant. Albertus 
Magnus and Thomas Aquinas, however, did not 
overlook this point entirely, and stated in their 
comments on Aristotle that arts and crafts would 
be doomed to destruction if the producer failed to 
recover his outlays in the selling price of his prod- 
uct, Albertus Magnus gave the example of a car- 
penter who will cease making beds if the price he 
receives does not compensate him for his expense 
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and workmanship. In other words, according to 
Albertus Magnus and Aquinas, the market price 
cannot fall permanently below cost. Unfortunately 
this idea was lost; and the later Thomists focused 
all their attention on the market price, disregarding 
cost of production as if the two concepts were 
antithetic, 

The champion of the importance of cost of pro- 
duction was John Duns Scotus (1265-1308 ). Start- 
ing with the observation that the merchant per- 
forms a useful function, he drew the conclusion 
that a just price should cover all the merchant’s 
costs, including a normal profit and a compensa- 
tion for risk, The weakness of this theory is, of 
course, that Scotus did not ask himself whether 
the merchant would be able to sell his wares above 
the market rate if his costs were too high. Scotus 
had very few followers; the best-known are another 
Scot, John Mayor (1469-1550), and a Portuguese, 
Johannes Consobrinus, also known as Joao Sobrinho 
(d. 1486), who taught for some time at Oxford. 

Price regulation found strongest support among 
the nominalists. Jean de Gerson (1363-1429), 
for a while chancellor of the Sorbonne, even sug- 
gested entrusting to the public authorities the fix- 
ing of all prices—under the pretext that no one 
should presume to be wiser than the lawmaker. 
Since this scheme proved impractical, it found little 
support. Another nominalist, Henry of Langen- 
stein the Elder (1325-1397), formulated the rule 
that if the authorities failed to set a fair price, the 
producer was allowed to set it himself; however, 
he ought not to charge more than would maintain 
him and his family suitably in his social condition. 
If he should overprice his wares in order to enrich 
himself or to better his status, he committed the 
sin of covetousness. The question whether such a 
producer would be able to obtain a price above that 
of his competitors was not raised by Langenstein. 
First published in 1874 by Wilhelm Roscher, Lan- 
genstein’s rule received a great deal of publicity 
and was hailed by many, including Max Weber, 
Werner Sombart, R. H. Tawney, Heinrich Pesch, 
and Amintore Fanfani, as a typical formulation of 
the theory of the just price. There is, however, not 
the slightest justification for this enthusiasm, Far 
from being representative, Langenstein was a rel- 
atively minor figure; and his views were those of 
a minority group that exerted little influence in his 
time except perhaps in the German and Polish 
universities, which in the fifteenth century were 
bastions of nominalism. 

Although the Schoolmen were unable to agree 
on a criterion of the just price, they were unani- 
mous in their condemnation of monopoly, which 
was defined broadly as any collusion to manipulate 
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prices. First of all, monopolies were branded as 
“conspiracies” against liberty; next, they were 
deemed injurious to the commonweal because they 
created artificial scarcity; and finally, they were 
blamed for raising prices above the competitive 
level—that is, above the level that would prevail if 
there were no monopoly. Profits derived from mo- 
nopolistic exploitation were stigmatized as turpe 
lucrum, or illicit gain, which, like usury, was sub- 
ject to restitution. 

Since the Doctors favored market price, it should 
cause no surprise that they also were opposed to 
price discrimination, According to St. Bernardino 
of Siena, a seller ought not to take advantage of a 
buyer's ignorance, rusticity, or special need. In 
other words, prices ought to be the same to all, rich 
and poor alike. This was strictly in accordance 
with commutative ‘justice, which, it is recalled, was 
based on equality and reciprocity. 

The theory of the just price was applied to wages 
also, since wages were defined almost as if they 
were the price of labor (pretium laboris). Conse- 
quently, wages were determined by common esti- 
mation—that is, by the forces of supply and 
demand—excluding, of course, all attempts at 
exploitation, The author who gave the fullest treat- 
ment to this question was St. Antoninus, archbishop 
of Florence, to whom the purpose of wages was to 
enable the worker to support himself and his de- 
pendents on his social level. To achieve this pur- 
pose, St. Antoninus insisted upon punctual pay- 
ment of the wage agreed upon and reproved 
employers who paid their workers in truck or in 
debased coin, He did not carry his criticism further, 
although he must have known that the wool and 
silk guilds of Florence attempted to keep wages 
down by using the antimonopoly legislation of the 
commune to prevent the formation of “brother- 
hoods” among the workers. In general, the School- 
men were less favorable to the guilds than is gen- 
erally assumed by Fabian socialists and Catholic 
historians who idealize the Middle Ages, Concern- 
ing wage differentials, Bernardino of Siena made 
the pertinent remark that skilled workers are better 
paid than unskilled because skill is scarce, for it is 
not acquired without toil and expensive training. 

Monetary theory and usury. Monetary theory 
made little progress during the Middle Ages. The 
main author on this topic was Nicole Oresme (c. 
1325-1382), who did not go much further than 
Aristotle. While neglecting monetary theory, most 
of the scholastic Doctors devoted an inordinate 
amount of space to usury, which they apparently 

regarded as a major social problem. 

A great deal of misunderstanding exists about 

usury. According to modern concepts, usury is an 


exorbitant, oppressive interest rate; but the defini- 
tion given by the Schoolmen was quite different. 
Usury was any increment, whether excessive or 
moderate, beyond the principal of a loan, or 
mutuum. Consequently, according to all the Doc- 
tors, usury occurred only in a loan, If it could be 
shown that a contract was neither explicitly nor 
implicitly a loan, there was no usury involved. Of 
course, a loan -could be concealed under the color 
of another contract, which then became a contract 
in fraudem usurarum, or deceptively usurious. 

With this approach to the problem, it is easy to 
see how the usury question became a hotbed of 
elusive discussion. The scholastic definition of 
usury allowed the merchants to make the most 
of legal technicalities and the Doctors to display 
their talents for casuistry and subtle distinctions, 

Banking is as good an example as any. Since 
lending at interest was forbidden, the bankers 
found another way of making profits—by dealing in 
foreign exchange. The purchase of a foreign draft, 
because of the slowness of communications, always 
involved granting credit as well as dealing in ex- 
change. Interest was, of course, concealed in the 
rate, or price, of exchange. Nevertheless, the bank- 
ers argued that a cambium, or foreign-exchange, 
contract was not a loan, but a legitimate business 
transaction. Although the argument rested on sheer 
sophistry, it was accepted by the theologians unless 
the cambium contract was too patently a disguised 
loan, as in the case of dry, or fictitious, exchange. 
The practical result of this tolerance was to outlaw 
discounting but to tie banking to exchange. 

No charge could be made for lending, but the 
lender was sometimes entitled to compensation for 
reasons extraneous to the loan. Thus there emerged 
the theory of extrinsic titles: the three principal 
ones were poena conventionalis, damnum emer- 
gens, and lucrum cessans. Poena conventionalis was 
a penalty for tardy payment; damnum emergens, 
compensation for damages suffered by the lender. 
These two were readily admitted as valid; but not 
so lucrum cessans, or cessant gain, which meant 
that the lender might claim the same return as that 
yielded by rival or competing investments, Thus 
defined, lucrum cessans is in fact the same thing 
as the modern concept of opportunity cost. To 
admit this title, too, would have jeopardized the 
whole usury doctrine. Lucrum cessans was, there- 
fore, rejected by Thomas Aquinas and most of the 
theologians. Later, in the sixteenth century, it was 
permitted by some latitudinarians, but only be- 
tween merchants. 

The usury doctrine was the Achilles heel of 
scholastic economics. It involved the Schoolmen 
and their sixteenth-century and seventeenth-cen- 
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tury successors in insuperable difficulties that con- 
tributed greatly to bringing their whole doctrine 
into disrepute. 

Later Scholasticism. As already mentioned, 
Scholasticism continued to prosper throughout the 
sixteenth century and afterward. The late Scholas- 
tics of the school of Salamanca made some new 
contributions, mainly in refining the old doctrines. 
The quantity theory of money was- accepted as a 
matter of course. More than ever the adherents of 
the new school insisted upon the fairness of market 
price in the absence of public regulation. Some of 
them, such as Martin Azpilcueta (1493-1586), 
better known as Dr. Navarrus, were very skeptical 
about the merits of price fixing because it was un- 
necessary in normal times and ineffectual in times 
of dearth. Luis de Molina, s.J. (1535-1601), more 
famous for his views on grace than for his eco- 
nomic theories, formulated the law of supply and 
demand by stating that “a concourse [concurren- 
tium] of buyers, more considerable at one time 
than at another, and their greater eagerness to buy 
will drive prices up, whereas paucity of purchasers 
will bring them down” (De justitia et jure un, disp. 
348, no. 4). He also insisted that value depended 
upon consumers’ preferences rather than upon 
qualities inherent in commodities. A Belgian Jesuit, 
Leonardus Lessius (1554-1623), made two minor 
contributions: allowing monopolies with regulated 
rates for the common good and giving an accurate 
description of the Antwerp money market, implic- 
itly recognizing the presence of interest concealed 
in the exchange rates. 

In the seventeenth century, the economic teach- 
ings of the scholastic school were systematically 
presented in the great synthetic works of cardinals 
Juan de Lugo, s.J. (1583-1660) and Giambattista 
de Luca (1613-1683), but they made no new con- 
tributions. Scholastic economics had reached ma- 
turity. By failing to renew its methods of analysis, 
scholastic thought fell into discredit and entered 
into a precipitous decline that involved other sci- 
ences and philosophy as well as political economy. 

RAYMOND DE ROOVER 


[See also the biographies of Aquinas and OrRESME.] 
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Il 
MERCANTILIST THOUGHT 


“Mercantilism” is the label commonly given to- 
day to the doctrine and practices of nation-states in 
the period roughly from the fifteenth to the eight- 
eenth centuries with respect to the nature and the 
appropriate regulation of international economic 
relations. In this doctrine great emphasis is put 
on the importance of maintaining an excess of ex- 
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ports of goods and services over imports as the 
sole means whereby a country without gold or sil- 
ver mines can obtain a continuous net inflow of 
the precious metals, regarded as essential to na- 
tional wealth and strength. In the eighteenth cen- 
tury the elder Mirabeau and Adam Smith applied 
to this doctrine the terms “mercantile system” and 
“commercial system” to emphasize its contrast with 
the doctrine of the physiocrats, which minimized the 
importance of foreign trade and put its emphasis 
instead on the importance of agricultural produc- 
tion. In the 1860s German writers introduced the 
term Merkantilismus; corresponding terms, such as 
“mercantilism” in English, thereafter gradually be- 
came standard in all the languages of the Western 
world. The term is sometimes objected to because 
it is often used in a pejorative sense, or because it 
is held, justly, that although often so used, it inade- 
quately represents the varied content of the eco- 
nomic thought of some four centuries, Similar ob- 
jections can, of course, be made against most 
general abstract terms ending in “ism,” but it does 
not seem possible to do without them and it does 
seem possible to use them with disciplined re- 
straint. In this article an attempt is made to limit 
the application of the term to the special and dom- 
inant aspects of thought and practice with respect 
to international economic relations during the fif- 
teenth to eighteenth centuries. 

The doctrine. The essentials of the doctrine 
can be summarized in terms of five propositions or 
attitudes: (1) policy should be framed and exe- 
cuted in strictly nationalistic terms, that is, na- 
tional advantage alone is to be given weight; (2) 
in appraising any relevant element of national pol- 
icy or of foreign trade, great weight is always to be 
put on its effect, direct or indirect, on the national 
stock of the precious metals; (3) in the absence of 
domestic gold or silver mines, a primary national 
goal should be the attainment of as large an excess 
of exports over imports as is practicable, as the sole 
means whereby the national stock of the precious 
metals can be augmented; (4) a balance of trade 
“in favor” of one’s country is to be sought through 
direct promotion by the authorities of exports and 
restriction of imports or by other measures which 
will operate indirectly in these directions; (5) eco- 
nomic foreign policy and political foreign policy 
are to be pursued with constant attention to both 
plenty and “power” (including security under this 
latter term) as coordinate and generally mutually 
supporting national objectives, each capable of 
being used as a means to the attainment of the 
other. 

This constituted the solid core of mercantilist 


doctrine, from which there was little dissent before 
the 1750s by writers on economic matters, but it 
left room for extensive debate within the ranks of 
adherents of the doctrine. There could be major 
differences in the reasoning presented for adher- 
ence to the respective propositions here listed, and 
there could be sharp differences of opinion as to 
the choice of means by which the accepted objec- 
tives could best be pursued. 

Mercantilism was essentially a folk doctrine, 
evolved in the light of the prevailing historical cir- 
cumstances and values by simple inference from 
the apparent facts. It was a doctrine of practical 
men not given to subtle economic analysis, which 
was in fact sparsely available in the age of mercan- 
tilism. The philosophers before the 1750s, the the- 
ologians, and the universities neither challenged 
it nor made any important contributions to it. It 
was not an area in which disciplined scholarship 
showed any deep interest. 

Differences within the doctrine. The most strik- 
ing difference of doctrine within the ranks of the 
mercantilists turned on why the indefinite accumu- 
lation of the precious metals should be regarded 
as an important national objective. The terms 
“wealth,” “treasure,” and “riches” were used with 
considerable ambiguity, sometimes in a broad sense 
to cover stocks of valuable goods of any kind which 
could command a price, but more often in a nar- 
row sense to signify only the precious metals. The 
narrow usage was occasionally extended to com- 
modities (other than the precious metals ) which had 
great durability and high value per unit of bulk, 
such as precious stones and even tin and copper. 
The emphasis with respect to enrichment, to eco- 
nomic improvement, was never in terms of the 
level of consumption, and when it was in terms of 
the level of production or output, it was usually 
with reference to the contribution such production 
could make, directly or indirectly, to the acquisi- 
tion and retention of “wealth” or “riches” or “treas- 
ure” in the narrow sense. It was on accumulation 
that the emphasis was put, and there was a wide- 
spread assumption, tacit or explicit, on the part of 
the mercantilist writers of the period that accumu- 
lation over long stretches of time could be achieved 
only or predominantly by the piling up of stocks 
of durable and high-value-per-unit commodities, 
especially of the precious metals. Often the only 
link with consumption as an economic objective 
was the recognition of their ready convertibility 
through exchange into essential consumers’ goods 
as a reason why a limited number of specified dur- 
able goods were to be regarded as pre-eminently 
constituting items of national “wealth” or “riches.” 
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The emphasis on the “store-of-wealth” function 
of the precious metals competed, however, with the 
emphasis of other mercantilist writers on the “cir- 
culation” function of the precious metals in their 
role as money, an emphasis which led to hostility 
to the use of the precious metals as hoards or in 
plate and jewelry. These writers believed that pro- 
duction and employment varied in physical volume 
in close proportion with the variations in the 
amount of money in circulation, They thus over- 
looked or denied that the main consequence of an 
increase in the amount of money might be a gen- 
eral rise of prices; they perhaps were taking for 
granted that there normally existed large amounts 
of unemployed labor and natural resources. At 
least for the later mercantilists emphasis tended 
to shift from the store-of-wealth to the circulation 
function of the precious metals. But when paper 
money was introduced, it became more difficult to 
reconcile emphasis on circulation with continuing 
stress on the importance of the precious metals and 
on a favorable balance of trade as the means of 
acquiring them. 

One method used was to deny the advantages of 
paper money, or to support limitations on the issue 
of paper money which would prevent it from act- 
ing as a stimulus to the export and a deterrent to 
the import of the precious metals. But as long as 
the emphasis continued to be on the circulation 
function of the precious metals, the absence of any 
obvious answer to the question why paper money 
could not perform this function adequately and 
more cheaply tended to lead either to a return to 
the older emphasis on the store-of-wealth function 
of the precious metals or to the substitution—for 
stress on the monetary and balance-of-trade aspects 
of mercantilism—either of protectionist ideas rest- 
ing in large part on nonmercantilist arguments or 
of a new receptivity toward free trade ideas. 

An additional and widely expounded ground for 
emphasis on the desirability of indefinite national 
accumulation of the precious metals was based on 
the observation that the rate of interest and the 
availability of credit varied with the quantity of 
money, the former in the inverse direction and the 
latter in the same direction. It was argued that 
cheapness and abundance of credit would promote 
enterprise, employment, and production and would 
increase the ability to compete in foreign trade by 
lowering the interest element in the costs of pro- 
duction of domestic goods. Before the 1750s no one 
appears expressly to have pointed out that a given 
increase in the national stock of money, to the ex- 
tent that it caused a rise in the price level, would 
either leave the rate of interest unchanged or 
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cause it to rise rather than to fall and would leave 
unchanged the real availability of credit, as dis- 
tinguished from its amount measured in monetary 
terms. With the advent of paper money, moreover, 
it was no longer necessary to have a net inflow of 
precious metals from abroad in order to have an 
increase in the national stock of money. 

Writers who saw an inflow of the precious 
metals as increasing the rate of employment of 
human and other productive resources presumably 
perceived that this would result in increased con- 
sumption on the part of the owners of these re- 
sources, There was also widespread—but by no 
means universal—acceptance of the general desir- 
ability of an increase in the population and the 
number of potential workers and no doubt a rec- 
ognition that such an increase would involve 
increased consumption. But the emphasis on in- 
creased production was made to rest much more 
on the contribution it could make to a favorable 
balance of trade, on the support it could give to 
an increase in population, and on its role in allevi- 
ating the moral and other evils of involuntary un- 
employment, of vagrancy, and of pauperism than 
on an acceptance in principle of the desirability 
of a higher level of per capita consumption for the 
general public. “Luxury” expenditures, for exam- 
ple, on the part of the working classes were almost 
universally deprecated, and even for the well-off 
were much more often disapproved than approved, 
except as they were believed to be necessary means 
to the employment of otherwise idle resources or to 
the maintenance of appropriate status and dignity 
for the upper classes. Increase in production was 
sought primarily for the contribution it could make 
to the accumulation of wealth in the form of du- 
rable, valuable commodities, at least if one judges 
by what the writers of the period expressly said. 

Implementation. In the early stages of mercan- 
tilism it was often the practice to seek the general 
objectives by more or less direct and particular 
regulation of the details of individual commercial 
transactions involving trade with foreigners. Thus, 
in England there was for a time regulation by the 
Royal Exchanger of foreign exchange transactions 
and by other official agencies of transactions in 
commodity markets, so as to make sure that each 
individual transaction should as far as possible 
make a net contribution to the national stock of 
the precious metals. Later commentators on mer- 
cantilism labeled such practices as “pullionism” or 
“palance-of- (individual )-bargain system.” It was 
believed that attention should be given not only to 
the aggregate balance of trade but also to the 
separate balances in the trade with particular 
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countries or in particular sectors of commercial 
activity. There was considerable suspicion, for in- 
stance, that the trade with the East Indies was for 
all of Europe a “losing” trade involving a chronic 
drain of the precious metals to the East. Thomas 
Mun first formulated a persuasive defense of the 
English trade with the East Indies even if it did 
involve, in its first effect, a net drain from England 
of the precious metals. This drain, he claimed, was 
not an ultimate one, since by re-export at higher 
prices of its imports of Indian commodities, Eng- 
land more than regained the precious metals which 
had initially been sent to India. Mercantilist liter- 
ature, however, long continued to support discrim- 
ination between countries in the regulation of 
imports according to the usual state of the trade 
balances with such countries, or as an incident to 
tariff bargaining, or as an instrument of power 
politics, 

The mercantilists gave priority status with re- 
spect to eligibility for export to goods with a high 
labor content in relation to their value. Export of 
manufactures was favored over export of agricul- 
tural products ready for consumption; exports of 
raw materials such as raw wool, or minerals, was 
regarded as injurious or wasteful. The export of 
machines and tools and the emigration of skilled 
workers were regarded as specially injurious. Un- 
derlying these positions were the beliefs that labor 
was in such abundant supply that it was permis- 
sible to treat it as nearly equivalent nationally to 
a free good, and that restriction of export of raw 
materials or of machines would not substantially 
diminish their domestic rate of production and 
would result in their retention for domestic process- 
ing or use. Such restrictions would thus work to 
make the balance of trade favorable and to increase 
domestic employment. 

The general mercantilist position was that im- 
ports of goods and services were in principle de- 
sirable only if (a) they were essentials which could 
not be produced, at whatever cost, at home, or 
(b) they were raw materials which could not be pro- 
duced at home in the needed quantities except by 
the withdrawal of scarce resources from the pro- 
duction for domestic use or for export of goods with 
a higher labor content in proportion to their value, 
or (c) they needed to be imported as a quid pro quo 
for other countries’ allowing their nationals to im- 
port from the country in question. The implicit 
mercantilist ideal was zero import, and export only 
in exchange for the precious metals. In France, 
Colbert and others gave this ideal express formula- 
tion in replying to objections raised by Frenchmen 
that the severity of French import restrictions 


would result in other countries’ prohibiting entry 
of French products, Colbert claimed that France 
alone had the potentiality to produce at home the 
whole range of commodities essential to national 
prosperity, whereas none of its neighbors could dis- 
pense with France’s commodities, 

A variant of mercantilist doctrine, expounded 
mostly but not exclusively by English writers, sub- 
stituted for a favorable balance of trade in terms 
of monetary values a “balance of labor” in terms 
of the relative labor content of the exports and the 
imports—with an aggregate excess of the labor 
content in the exports over that in the imports 
treated as “favorable.” This has been regarded by 
some modern commentators as a “refinement” or 
improvement of the balance of trade doctrine. It 
would, however, be easily possible for a given trade 
situation highly unfavorable by the trade balance 
criterion to be highly favorable by the balance of 
labor criterion, and vice versa. Under the balance 
of labor criterion, moreover, the fewer units of im- 
port commodities obtained on the average per unit 
of export commodity, other things being equal, the 
more “favorable” would be the balance of labor. 

Political objectives. Mercantilism had political 
as well as strictly economic objectives in view. The 
minimum objectives were an even balance of trade 
and an even balance of power. But as large an ex- 
cess of exports over imports as possible was an as- 
Piration of all countries, and the great powers 
sought more than an even balance of power. They 
sought enough superiority of power to “give the 
law” to other countries, to enable conquest of ad- 
joining territory or overseas colonies, or to defeat 
their enemies in war. It was general doctrine that 
strength was necessary as a means of protecting 
wealth and of augmenting it, while wealth was a 
strategic resource, necessary to produce strength 
and to support its exercise. With wealth one could 
finance and equip armies and navies, hire foreign 
mercenaries, bribe potential enemies, and subsidize 
allies. Power could be exercised to acquire colonies, 
to win access to new markets and to shut foreigners 
out of one’s own markets, and to monopolize trade 
routes, high-seas fisheries, and the slave trade with 
Africa. “Power” was clearly and obviously a relative 
matter; what mattered was the ratio of power, not 
the terms of the ratio. It was also true of power 
that geography had great importance in determin- 
ing what comparisons were relevant; landlocked 
countries had little occasion to concern themselves 
with their power relative to a distant maritime 
power, and being a neighbor to a strong country 
could mean being under constant threat. It was 
also a distinctive feature of political relations that 
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comparisons of strength were relevant not only be- 
tween pairs of countries but also between groups of 
actual or potential allies. The emphasis on inter- 
national comparisons, on ratios, which was highly 
relevant in the political sphere in a world of power 
politics, whether the power was expected to serve 
national aggression or national security, was often 
carried over to the economic sphere, where it had 
little relevance. It could and did lead to gross con- 
fusion about the nature and significance of na- 
tional wealth and national economic well-being. 

When great emphasis was placed, in the eco- 
nomic sphere, as it logically was in the sphere of 
power rivalries, on an inherent conflict of interests, 
this had grave consequences both for economic 
policy and for international politics. If it was rela- 
tive status that solely or mainly mattered, economic 
damage to a rival country could logically be treated 
as equivalent to economic benefit to one’s own 
country, and famine abroad, to bountiful harvests 
at home. Such reasoning abounds in the mercan- 
tilist literature, and it was moral or sentimental 
revulsion against it more than superior economic 
analysis which brought much of the late eight- 
eenth-century Enlightenment to the support of 
free trade ideas, Even among writers who were 
primarily interested in economic matters, the mer- 
cantilist “jealousy of trade” fostered, as overcom- 
pensation, an exaggerated belief in the harmony 
and mutuality of economic interests between coun- 
tries. 

The doctrine that low real wages (per hour or 
per day or per piece) were in the national interest 
was widely prevalent in England in the seventeenth 
and eighteenth centuries and has sometimes been 
labeled by modern writers as “the mercantilist la- 
bor doctrine.” Many English writers did expound 
this doctrine, with favorable balance of trade con- 
siderations obviously in mind. But a substantial 
number of writers denied the proposition on which 
the doctrine was based, namely, that English la- 
borers, once their minimum needs were taken care 
of, preferred idleness to more (or superior) com- 
modities, or, as one eighteenth-century writer 
phrased it, that for workers in general “the luxury 
of indolence tends always to swamp the luxury of 
goods.” Or, if they accepted the proposition as true 
to fact and regarded voluntary idleness as an evil, 
they proposed, on humanitarian and other grounds, 
the search for remedies less oppressive for the 
poor than low rates of real wages. It seems difficult 
to find on the Continent any trace of a special af- 
finity between mercantilist thought in general and 
the low-wage doctrine, perhaps because it was gen- 
erally impossible for the poor there to attain a basic 
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minimum of subsistence without working to nearly 
the limits of their endurance, perhaps because in 
Catholic countries the frequency of religious holi- 
days when work was prohibited satisfied their crav- 
ings for rest, leisure, and time-consuming dissipa- 
tion. 

Distinctive aspects of mercantilism. Mercantil- 
ism was a doctrine of extensive state regulation of 
economic activity in the interest of the national 
economy. It took for granted that man was inher- 
ently self-regarding and would pursue his own in- 
terests without concern about the consequences of 
such behavior for the interests of the community. 
It accepted as axiomatic that if individuals were in 
their economic behavior left free from tight regula- 
tion, the consequences for the community would 
be disastrous. But this had been practically uni- 
versal doctrine from classical antiquity on, and 
therefore did not distinguish mercantilist from pre- 
mercantilist thought. 

Substantially new in mercantilist thought, how- 
ever, was its systematic adjustment to the concen- 
tration of power and the monopolization of loyalty 
by nation-states, which in their relations with other 
states followed a “Machiavellian” or amoral code, 
and were more extensive in area of jurisdiction 
than the earlier city-states and feudal barons but 
less extensive than the empires of classical antiq- 
uity and than the universal Catholic church of the 
Middle Ages. Also substantially new in mercantil- 
ism were its greater concern with economic matters 
as one phase of the then prevalent secularization 
of thought and practice, the change in the specific 
character of the economic objectives of the polit- 
ical authorities, and the new administrative pat- 
terns of regulation of communal life. These new 
features were the product of the growth of com- 
merce and of the changes in political organization 
associated with the breakdown of the Holy Roman 
Empire and of feudalism and the absorption of the 
hitherto substantially autonomous city-states by 
the new nation-states. Mercantilism was a doctrine 
of state intervention in economic life, but of state 
interventionism of a special pattern and with some 
special objectives. It was thus in sharp contrast 
with the later laissez-faire doctrine. It was also, 
however, in sharp contrast with some present-day 
systems of state interventionism, such as socialism, 
Russian communism, and the welfare state, for in 
principle at least these do not have the accumula- 
tion of the precious metals, favorable balances of 
trade, and national limits to moral obligations as 
central and ultimate objectives. 

Differences in practice. Agreement on the gen- 
eral mercantilist objectives left abundant room for 
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major differences both among periods and among 
countries in the choice of methods used in pursu- 
ing these objectives and in the degree of vigor of 
their pursuit. Practice was conditioned by limita- 
tions of administrative capacity; pressure of con- 
flicting national objectives; domestic resistance 
arising out of regional, class, and occupational spe- 
cial interests; military weakness; and the idiosyn- 
crasies, the apathy or enthusiasm, and the dynastic 
loyalties of monarchs. 

The techniques adopted could be monetary ones, 
involving control of exchange markets and of the 
movement of the precious metals across national 
boundaries. They could take the form of regula- 
tion of individual commercial transactions; of reg- 
ulation by general tariffs, prohibitions, or quanti- 
tative restrictions; or of subsidies to exports or 
to exporting or import-competing industries. The 
governments could themselves set up and operate 
factories producing for export or replacing imports; 
they could set up and operate companies engaged 
in foreign trade; they could grant monopoly priv- 
ileges to privately owned chartered companies to 
produce and sell specified products in the domestic 
market, to engage in foreign trade on the basis of 
special privileges, and to administer overseas col- 
onies. Governments could encourage immigration, 
restrict emigration, or promote early marriages in 
the belief that growth of population would serve 
the general mercantilist objectives. Wages and in- 
terest rates could be subjected to legal maxima in 
the belief that this would improve the national 
competitive position in foreign trade. Wars could 
be embarked upon for mercantilist reasons. On all 
of these matters, while ultimate objectives could be 
static within countries and uniform as between 
countries, the selection of means to serve these ob- 
jectives could differ between countries and could 
undergo constant change through time within 
countries because of change of circumstances and 
of opinions. 

Mercantilism in practice always in some meas- 
ure fell short of what doctrine called for. Perhaps 
the most important deviations of practice from 
doctrine were those resulting from the fiscal neces- 
sities of government. All governments in the age 
of mercantilism found it difficult to finance their 
general activities. To adhere to mercantilist objec- 
tives without regard to fiscal considerations would 
often involve the exemption of important catego- 
ries of exports from customs duties, the substitution 
of outright prohibition of specific exports and im- 
ports for customs duties—with a consequent loss 
of revenues—or the grant of subsidies to favored 
industries, shipping, fisheries, or colonies, all of 


these being measures which would involve an in- 
crease of government expenditures or decrease of 
government revenues. 

Restraints on importation carried beyond some 
uncertain point could lead foreign countries in- 
jured thereby to adopt retaliatory or defensive 
measures, with the possible result that the gross 
contribution to a “favorable” balance of trade made 
by the import restrictions might even in the short 
run be more than offset by the adverse effect on 
exports. 

Most mercantilist measures involved a burden 
on some occupational or regional sectors of the 
population. Such sectors, without challenging the 
general objectives of mercantilism, would com- 
monly resort to all the forms of pressure and per- 
suasion available to them to obtain a relaxation 
of the measures or a revision of them which would 
shift the burdens elsewhere. Thus, in England the 
graziers would press for a relaxation of the restric- 
tions on the export of raw wool, and the independ- 
ent merchants would protest vigorously against 
the special privileges granted to the trading com- 
panies. Even where absolute monarchy prevailed, 
governments found it necessary to make conces- 
sions to such dissenting groups. 

Every measure restrictive of trade established a 
possibility of private profit from its evasion or vio- 
lation, and no country was able to prevent exten- 
sive violation of the regulatory measures by smug- 
glers, tax evaders, merchants operating illegally in 
restricted trades (“interlopers”), and bribed en- 
forcement agents. Public resistance to particular 
restrictive measures and to the personnel endeav- 
oring to enforce them, and lax administration at 
the top levels often led to apathy in enforcement. 
When Adam Smith, in 1778, entered into his duties 
as a commissioner of customs, he was astonished 
to find, expert though he already was, how much 
of his own personal effects consisted of articles of 
foreign manufacture which it was illegal not only 
to import but also to possess, and he warned a 
friend that the latter’s wife, upon investigation, 
would probably find that she was even a worse 
offender. 

In Britain in particular, although there was gen- 
eral approval in principle of mercantilism, there 
was almost equally general dislike of the adminis- 
trative institutions and practices essential to its 
effective execution. The British public was jealous 
of the exercise of power by the executive branch of 
the national government, of administration con- 
ducted by the central authorities in London instead 
of locally, and of agents of the central government 
with powers of inspection and arrest. Legislation 
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was more centralized than in most countries, but 
enforcement was highly decentralized and was 
largely left to unpaid local magistrates with con- 
siderable autonomy and to suits brought on their 
own initiative by interested parties or by voluntary 
informers who were remunerated from the mone- 
tary penalties imposed by the magistrates as a re- 
sult of such suits. The higher the customs duties 
and the more burdensome the regulations and pro- 
hibitions, the greater was the incentive to evade or 
violate them, so that in many cases difficulty of 
enforcement led to restraint in the severity of leg- 
islation or to partial or complete abandonment of 
serious attempts at enforcement. It seems quite 
plausible, therefore, that at least in England mer- 
cantilist measures were in practice not nearly as 
severe a restraint on foreign trade in the eighteenth 
century as were, say, the transportation costs of 
the time or than are the ordinary tariffs of present- 
day protectionism. 

While there was a substantial unity of doctrine 
throughout the Western world with respect to the 
proper objectives of commercial policy, the differ- 
ences between countries in political organization, 
administrative structure, and geographical circum- 
stances led to very substantial differences in the 
intensity with which, and the selection of devices 
whereby, they pursued these objectives. In the 
smaller Germanic states, for instance, mercantil- 
ism was little more than a vague general doctrine. 
The major interests of German intellectuals re- 
lating to economic and political matters, as repre- 
sented by the contents of cameralist writings and 
university courses, were directed to the principles 
of management of the absolute rulers’ finances, of 
organization and conduct of professionalized public 
administration, and of management of official 
property, including mints, mines, forests, and an 
occasional factory. In France, although public ad- 
ministration was on the whole centralized to an 
extent without parallel in England, taxation (in- 
cluding customs duties), property law, and guild 
regulations were largely under autonomous local 
administration following traditional and regionally 
diverse patterns and principles. 

The decline of mercantilism. Criticism of the 
prevalent methods of pursuing mercantilist objec- 
tives was always fairly common in countries where 
some free discussion was tolerated. Much of this 
criticism, however, whatever its analytical merit, 
was special pleading by spokesmen for a political 
faction, an industry, a region, a particular port or 
town, or a particular privileged company. 

In the 1750s there first began to appear compre- 
hensive criticism of the basic principles of mer- 
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cantilism by persons of stature with no visible pri- 
vate axes to grind. One major source of criticism 
was from exponents of an essentially new gospel 
of individualism which extolled the merits, on 
ethical and political as well as on economic 
grounds, of freedom of the individual from detailed 
regulation by the state. Here important voices were 
those of Adam Smith in Britain and of the marquis 
d’Argenson, the physiocrats, and Turgot in France. 
Important also was the widespread revulsion 
among intellectuals against the past record of al- 
most continuous war and preparation for war, for 
which mercantilism was largely blamed. It was, 
in fact, much more the pacific and cosmopolitan 
views of the philosophes and the Illuminati on the 
Continent and of men like David Hume and Adam 
Smith in Britain than the more strictly economic 
argument of these and other writers which first put 
mercantilism seriously on the defensive among in- 
tellectuals. 

In the early years of the nineteenth century the 
English classical school of economists rejected 
mercantilism on the basis of economic analysis, of 
which a part was substantially original with them. 
The school claimed that trade conducted under in- 
dividual initiative and free from official regulation 
was inherently of mutual profit both to the individ- 
uals directly involved and to the community as a 
whole, and they applied this to domestic and inter- 
national trade alike. Here they had had since the 
1750s a number of important predecessors. They 
added, however, an analytical justification of this 
position which was essentially new, the principle 
that allocation of resources to production in ac- 
cordance with comparative costs would maximize 
aggregate output and that the operations of individ- 
uals acting in their own interest in a free and com- 
petitive market would conform with this principle. 
They did not deny that this was subject to the 
qualification that producers knew both what their 
relevant costs were and what were the prices at 
which their products could be sold. But they 
claimed, as an obvious proposition, that business- 
men were better informed on these matters than 
government could be. From this reasoning, they 
proceeded to the policy conclusion that the deter- 
mination of what commodities and in what quan- 
tities a country could export and import to its 
greatest advantage should be left to the outcome of 
the decisions of individual businessmen operating 
to maximize their own incomes. This was a sharp 
break with mercantilism’s insistence on the neces- 
sity of regulation of economic behavior and its 
ranking of the desirability of export and import of 
particular commodities according to whether they 
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were manufactures or agricultural products or raw 
materials and according to their labor content. To 
the classical school these were more or less arbi- 
trary classifications, whose correspondence, if any, 
in practice with classification according to the com- 
parative cost principle would be fortuitous. 

The classical school also rejected the mercan- 
tilist stress on the balance of trade and on the na- 
tional supply of the precious metals. They claimed 
that in the absence of government regulation an 
international automatic equilibrating mechanism 
would bring each country the amount of specie ap- 
propriate to its needs and circumstances and would 
prevent trade balances from getting into serious 
disorder. Here they had predecessors in the eight- 
eenth century, most notably, perhaps, Isaac Ger- 
vaise and David Hume, 

It is to be noted that neither the mercantilists 
nor the classical school distinguished clearly and 
systematically between short-run and long-run ef- 
fects, and that insofar as one can judge from the 
historical context and the implications of their 
writing, the mercantilists were as a rule thinking 
in terms of short-run effects and the classical 
school in terms of long-run effects. Appraisal by 
economists of the comparative analytical merits of 
the mercantilists and their classical school critics 
should therefore give careful consideration to the 
distinction between short-run and long-run analy- 
sis, if they regard this, as does the writer, as a 
crucial distinction. 

The classical school doctrine with respect to in- 
ternational trade became dominant for a while in 
England and obtained a wide degree of qualified 
acceptance elsewhere. There followed in England 
over half a century of approximately complete free 
trade, and elsewhere it led to a substantial liberali- 
zation of foreign trade policy. The restrictions on 
foreign trade which continued to be imposed were 
supported on grounds which were largely nonmer- 
cantilist in character; they were often designed, 
in fact, to protect agriculture rather than manufac- 
tures. Survival into the present of mercantilist doc- 
trine and practice is by no means rare. There are, 
however, important differences between mercan- 
tilist doctrine and the doctrines by which present- 
day state regulation of foreign trade are mainly 
supported, and major differences also in the respec- 
tive patterns and techniques of regulation. As far 
as scholars are concerned, support of mercantilism 
as it operated in its prime, based on express accept- 
ance of its objectives, its doctrine, and the appro- 
priateness of its practices to its doctrine, seems to- 
day to be confined to a minority, mostly economic 
historians. The analytical grounds on which mer- 
cantilism has been supported or criticized are in 


both cases sometimes of very disputable merit. 
Where policy is concerned, final appraisal, here, 
as generally in the social sciences, needs to deal 
expressly, as it often fails to do, with political, ethi- 
cal, and socioeconomic values, as well as with the 
abstract logic of relations between actual and sup- 
posed matters of fact. It needs to call, therefore, 
on the resources of all the major social disciplines. 
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PHYSIOCRATIC THOUGHT 


“Physiocrats” denotes the économistes, those who 
subscribed to the tenets of physiocratie (derived 
from physeikratia) and advocated governance of 
economic and political activity “in keeping with the 
laws implanted in Nature by Providence.” Consti- 
tuting the first “school” of political economy to 
emerge in the history of that science, they exer- 
cised considerable influence, mainly in France, 
where most lived, in the 1760s and early 1770s. 

The climate of opinion in France in the 1750s 
was favorable to the ascendancy of a promise-laden 
set of principles such as the physiocrats put for- 
ward. The country’s main industry, agriculture, 
was poorly carried on and generally in a bad way. 
Its system of taxation was vexatious, burdensome, 
and wasteful, Mercantilist policy and practice inter- 
fered with internal and external trade, retarded 
agriculture, and sustained privilege. It was be- 
lieved in some quarters that adverse economic poli- 
cies had brought about a diminution in population. 
Early in the century and again in the 1740s a num- 
ber of works had appeared in which French agri- 
cultural and commercial policies were criticized 
and the country’s economic experience was con- 
trasted unfavorably with that of a progressive Eng- 
land. The merits of economic freedom and com- 
petition, though still poorly understood, had come 
to be appreciated by some administrators. Eco- 
nomic analysis was improving under the stimulus 
of such authors as Hume and Cantillon. 

The beginning of the school may be dated as 
1756, when Francois Quesnay published in the 
Encyclopédie his first economic article, “Fermiers,” 
to be followed in 1757 by “Grains,” in which the 
partially transitory influence of Cantillon’s “rent” 
theory is present; these essays foreshadowed much 
of what was to become the school’s body of princi- 
ples. The formation of the school was facilitated 
by Quesnay’s enjoyment of the patronage of Mme. 
de Pompadour and the sinecure of physician-in- 
ordinary to Louis xv. In 1757 the already famous 
Victor Riqueti, marquis de Mirabeau, author of the 
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popular Lami des hommes, ou traité de la popula- 
tion (1756-1760), after an interview with Ques- 
nay at his entresol at the Palace of Versailles, 
became his first disciple and shortly his most en- 
thusiastic and energetic co-worker. 

The fundamental principles of the school were 
first set down in the Tableau économique, of which 
three “editions” seem to have been issued in 1758— 
1759, the last accompanied by explanations and 
maxims. Most of the later editions of the Tableau 
illustrated the advantages of compliance with phys- 
iocratic principles. By the middle 1760s Quesnay 
had acquired a number of faithful disciples, at- 
tracted by his and Mirabeau’s writings, by discus- 
sions at their lodgings, and by the belief that in 
their principles might be found solutions for some 
of the nation’s economic ills. Outstanding among 
these were Nicolas Baudeau, Pierre Paul Mercier 
de la Riviére, Guillaume Francois Le Trosne, Pierre 
Samuel Du Pont de Nemours, and Anne Robert 
Jacques Turgot, who subscribed to many, but not 
all, the tenets of the school. They, along with lesser 
members of the school, produced a stream of pub- 
lications that popularized and clarified Quesnay’s 
economic views, at first without modifying their 
fundamental content significantly. In time, how- 
ever, some of Quesnay’s disciples found themselves 
under pressure to make certain physiocratic eco- 
nomic tenets more compatible with current reality 
and to develop a doctrine of judicial control to bal- 
ance that of legal despotism and thereby preserve 
the social order. 

The intellectual influence of the physiocrats, 
great in the 1760s, underwent a rapid decline after 
1770. A combination of circumstances generated 
political and intellectual opposition, Many special 
interest groups believed that they would be dis- 
advantaged economically by physiocratic policies, 
among them tax farmers, financiers, manufac- 
turers, landowners, possessors of exclusive. privi- 
leges, and those who feared an increase in the price 
of bread. Turgot, who had risen to the office of 
comptroller-general, was relieved of his office in 
1776, not long after he had introduced a number 
of physiocratic reforms; nearly all of these were 
suspended, and the remaining physiocratic journal 
was suppressed. The tenets and recommendations 
of the physiocrats were subjected to attack by both 
economic writers (e.g., Forbonnais, Galiani, Gras- 
lin, Mably) and noneconomists (e.g, Linguet, 
Necker, Voltaire). The residual authority of these 
tenets was gradually undermined by Smith's 
Wealth of Nations, which appeared in 1776, two 
years after Quesnay’s death and eight years after 
he stopped writing on economic matters. 

Interest in physiocracy did not completely dis- 
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appear. Traces of it remain in early nineteenth- 
century economic literature. Physiocratic concerns 
continued to stimulate discussion, and some phys- 
iocratic reforms were finally introduced during the 
French Revolution. Physiocracy exercised some in- 
fluence throughout Europe, particularly in Austria, 
Baden, Poland, Sweden, and Tuscany. 

Presuppositions and theories. The physiocratic 
presuppositions and theories were largely im- 
bedded in physiocratic models, the famous Tab- 
leaux économiques. Underlying the self-sustaining 
interclass flow of money, goods, and services in- 
corporated in the idealized tableaux was a class 
structure more advanced than that then found in 
France. In the structure of the tableaux, the largest 
of the three classes, the productive, comprising 
one-half the population, engaged in agriculture, 
fishing, and mining. The proprietary class, made 
up of landed proprietors and those supported by 
proprietary income, comprised one-fourth the popu- 
lation, as did the sterile, or artisan, class, which 
included the balance of the population. In time, 
France’s class structure presumably would corre- 
spond to this ideal; for the physiocrats, especially 
Turgot, were alert to evolutionary forces which, 
having generated the existing structure, could also 
generate its successor. 

Aggregate national interclass flow of product and 
income was summarized in the Tableau. The pro- 
ductive class produced 5,000 million livres of out- 
put; it did this with the aid of land rented from 
the proprietors, an investment of 2,000 million 
livres avances annuelles in seeding, cultivation, 
etc., and a longer-period past investment of 10,000 
million livres avances primitives in durable instru- 
ments, animals, etc., of which one-tenth was an- 
nually replaced. For expositive convenience, con- 
stant returns were assumed, with an increment of 
two units avances annuelles always resulting in an 
increment of five units in output. It was recognized 
that the response of output to increased avances 
annuelles was conditioned by the state both of 
avances primitives and of avances foncières (i.e., 
relatively permanent investments in buildings, 
clearing, drainage, etc.) made by proprietors in 
the past. Only Turgot discussed increasing and de- 
creasing returns to inputs, but in a quite different 
connection. Of the 5,000 million livres of annual 
reproduction, or output, 3,000 million constituted 
reprises, or the return of expenses incurred by land- 
renting fermiers, 2,000 million in the form of 
avances annuelles and 1,000 million as outlay 
upon the maintenance of the avances primitives. 
The remaining 2,000 million represented the net 
product, the approximate monetary equivalent of 
which passed to the proprietors from the farmers 


who cultivated the land under a competitively de- 
termined lease that normally stipulated an annual 
rent of approximately the anticipated net product; 
this money was in effect used by the proprietors to 
purchase 1,000 million livres of produce from the 
productive class and of output from the sterile 
class. The sterile class got from the productive 
class 2,000 million livres of produce, which it con- 
sumed or worked up into wrought goods and 
services for the other two classes and its own 
needs. It utilized little or no fixed capital and re- 
placed its working capital in the current production 
period; for this reason Turgot compared its ad- 
vances to the avances annuelles of the productive 
class. 

Underlying the Tableau is the school’s key postu- 
late, that only the productive class cultivating the 
land produced a net product, whereas the sterile 
class merely recovered its outlay and the proprie- 
tary class, being disposable (i.e., not being actively 
involved in the continuation of the circular eco- 
nomic flow), could serve essentially public pur- 
poses. Expansion of the economy and the popula- 
tion therefore depended upon expansion of the 
expenditure of the productive class and the result- 
ant expansion of the net product; hence the com- 
position of expenditure was as important as its 
growth and stability. The physiocrats therefore 
condemned luxe de décoration, or excessive ex- 
penditure with the sterile class, but not luxe de 
subsistance, or heavy expenditure with the produc- 
tive class; for usually diversion of expenditure from 
the productive class to the sterile class reduced the 
net expenditure of the former and thereby reduced 
the net product, and conversely. They also con- 
demned man-made restrictions upon the sale of 
produce (among them duties and other barriers to 
internal and external trade in farm products); 
hoarding; the outflow of funds to foreign lands; 
and diversion of money into circuits that did not 
enter into agricultural markets. Some of these re- 
strictions, they believed, had their origin in tax 
farming, flotation of government loans, monopolies 
and special privileges, and the absence of a regime 
of economic liberty. They noted that the demand 
for produce tended to rise as its quality improved 
and as the incomes of the masses rose above bare 
subsistence; and they sought to show that if prod- 
uce commanded a bon prix, agricultural income, 
net product, and the economic condition of the 
population inclusive of the masses would improve. 

The physiocrats’ theory of taxation and such 
theory as they had of factor pricing and distribu- 
tion were corollary to their postulate that only the 
product of land included a surplus. All taxes there- 
fore were ultimately incident upon the net product. 
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From this it followed that this surplus should be 
taxed immediately; then the cost of collecting rev- 
enue for the support of the state, together with the 
adverse side effects of taxation, would be mini- 
mized. Not more than the completely disposable 
one-third of the net product should be devoted to 
state and church functions, among them avances 
souveraines for public capital; otherwise, invest- 
ment in agriculture would be too little. The pro- 
prietors needed to invest about one-third of their 
rent (i.e., net product) in the repair and improve- 
ment of their property and to retain another third 
as compensation for avances fonciéres and the 
bearing of the risks and cares associated with 
ownership. Only the gross income of the productive 
class included a surplus. While the current price 
of labor might temporarily exceed its fundamental 
(or long-run supply) price, much as the current 
price of a product might temporarily exceed its 
fundamental price, the price of labor did not nor- 
mally include a surplus; it corresponded to the 
conceivably upward-elastic subsistence and other 
costs of population replacement. Although Ques- 
nay’s scheme could not with consistency allow the 
farmer’s income (or that of other entrepreneurs ) 
to include a profit or surplus, his followers tried 
inadequately to explain away what appeared to be 
such a surplus, and Turgot went so far as to in- 
clude a normal return on capital in “fundamental” 
price. Quesnay and some of his followers tried to 
treat interest as a surplus-free capital-replacement 
cost, but others finally had to conceive of interest 
as including more than such cost. 

The physiocrats put great stress upon the role 
of investment. Quesnay and most of his disciples 
confined this role to agriculture. The stress on in- 
vestment in discussions of net product appeared also 
in Quesnay’s comparison of grande and petite agri- 
culture, The former, involving large-scale and tech- 
nologically advanced methods, required heavy capi- 
tal investment; the latter, based upon traditional 
methods and backward owner-cultivator relations, 
utilized little capital. Grande agriculture was much 
more productive than petite; substitution of grande 
for petite might for a time permit increasing re- 
turns to capital. Turgot went beyond Quesnay to 
declare investment to be important in industry and 
commerce as well as in agriculture and to suggest 
how it gets allocated among alternative uses in 
keeping with prospective returns. In this and other 
respects, indeed, Turgot, together with Baudeau 
and Du Pont, among others, modernized the feudal 
framework within which Quesnay and Mirabeau 
had set their discussion. P 

The physiocrats sought to integrate their polit- 
ical and (not entirely homogeneous) philosophical 
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views with their conceptions of actual and ideal 
economies. Although they appreciated, in some- 
what varying measure, the power of self-interest 
and the workings of a system of interdependent 
prices, together with the roles of private property 
and a regime of economic liberty and competition, 
they also believed that the state might play an im- 
portant part in making physiocracy function and 
realize individual-transcending purposes as well, 
For there existed a discoverable natural order, com- 
pliance with whose principles was essential to 
man’s prosperity and happiness. Positive laws must 
reflect this order and a limited monarch, or “tute- 
lary authority,” must express and support it, sub- 
ject to the ultimate approval of a judiciary com- 
petent to determine if natural law was being 
correctly interpreted. 

The physiocratic contribution. Physiocracy con- 
tributed significantly to the development of eco- 
nomic science, and some of its concerns have 
continued to interest economists and others. In the 
first half of the nineteenth century physiocratic 
influence was reflected in discussions of such mat- 
ters as underconsumption and the origin, form, 
and role of economic surplus, though these were 
examined within a quite different socioeconomic 
context than that envisaged by Quesnay. Detailed 
analysis of physiocracy was facilitated by the pub- 
lication of a number of physiocratic works in 1846 
under Daire’s auspices. As early as the 1860s the 
circular flow model present in the Tableau com- 
manded the’ attention of Marx, whose reproduction 
schemes probably were inspired by it; and not long 
after, Walras developed a tableau he considered 
“analogous” to Quesnay’s. During the ensuing 
forty to fifty years much able scholarship was de- 
voted to the physiocrats. With the emergence after 
1930 of modern equilibrium and macroeconomic 
theory, together with national income accounting 
and input-output models, interest in relevant com- 
ponents of the physiocratic system increased, in 
Japan as well as in the West. This renewed atten- 
tion was also stimulated by revival of interest in a 
major concern of the physiocrats, economic devel- 
opment. During the past half-century, moreover, 
the political and philosophical conceptions of the 
physiocrats have been subjected to renewed ap- 


praisal. 
JosEPH J. SPENGLER 


[See also the biographies of QUESNAY; TurRGOT.] 
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Iv 
SOCIALIST THOUGHT 


In the half-century prior to the Russian Revolu- 
tion of 1917 the dominant doctrine inspiring the 
major socialist parties of continental Europe was 
Marxism (or was directly derived from Marxism). 
Since 1917 Marxism has become the official doc- 
trine of the socialist sector of the world (i.e., of the 


Soviet Union and China and of the other countries 
of Europe and Asia associated with them). Treated 
historically, therefore, description and analysis of 
socialist thought must run predominantly in terms 
of Marxian doctrine. This is not to say that there 
have been no other different and rival socialist 
creeds that have been influential and continue to 
find an echo today. Marx spoke of the so-called 
“utopian socialists,” who had preceded him and in 
contrast with whom he called his own doctrine 
“scientific socialism.” Merging with these, there 
have been various brands of “ethical socialists,” 
including the Christian socialists, basing them- 
selves on this or that ethical principle as the pre- 
eminent one, such as “equality” or “community 
values,” and on social motives, as against pursuit 
of “selfish” individual values and motives. Still 
others, such as the Fabians in England and the so- 
called Kathedersozialisten and their imitators on 
the Continent, advocated purely on grounds of ex- 
pediency an extension of the economic functions 
and responsibilities of the state, thus identifying 
their “socialism” (and its consequential critique of 
individualism) with étatisme. Before coming to 
Marxism as a social philosophy something must 
accordingly be said about the historical origins and 
the varieties of these non-Marxian theories. 

Utopian socialism. The author of one work on 
the socialist tradition (Gray 1946) starts with 
Moses, Lycurgus, and Plato, passing from them to 
the Essenes and the early Christian Fathers and 
thence to St. Thomas Aquinas and Sir Thomas 
More. Indeed, Plato and More have been cited as 
forerunners in many a work on the subject. But 
this article will not go so far back as this. It must 
be sufficient to distinguish those writers of the 
eighteenth or early nineteenth century who, in the 
shadows of the emerging modern world, sought to 
paint a picture of a perfect society of the future, 
deducible from first principles either of rationality 
or of morality and attainable only if mankind were 
sufficiently reasonable or good. Among these was 
Mably, a French contemporary of Adam Smith, 
who in a series of quasi-Platonic dialogues devel- 
oped a critique of the institution of private prop- 
erty and who believed that nature had destined all 
men to be equal. He argued that the institution of 
private property both annihilates the primitive and 
natural equality of man and enables the indolent 
and unworthy to live at the expense of the active 
and industrious. Another eighteenth-century figure 
who both attacked the irrationality of existing so- 
ciety and went into considerable detail about the 
structure of an ideal society was Morelly (Code de 
la nature 1755). 
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The most quoted and influential of the architects 
of a utopian future were Saint-Simon and Fourier. 
The former, a count descended from an old and 
honored family who renounced his title during the 
French Revolution, became the founder of some- 
thing of a school (which included the positivist 
philosopher Auguste Comte). After his death there 
was even established a Saint-Simonian church. 
Among other proposals for the reorganization of 
society on new principles he propounded a scheme 
for productive associations and a projet de travaux 
under the aegis of government and advocated the 
principle that the rights of property ought to be 
rooted solely in its contribution to the production 
of social wealth. Here his disciples, who developed 
his doctrines in notable respects, went further and 
preached the end of inheritance of property and 
its eventual transfer to the state. It was they who, 
incidentally, coined the formula “from each accord- 
ing to his ability, to each according to his needs.” 
In his final work, Nouveau christianisme, 1825, 
Saint-Simon sought to expound a new religion ded- 
icated to “the great aim of the most rapid improve- 
ment in the lot of the poorest class . . . the most 
numerous class.” Persecuted and divided, the Saint- 
Simonian school disintegrated in the course of the 
1830s. 

Fourier is best known as the author of a scheme 
for the organization of phalansteres, communities 
in which both production and social life were to be 
organized on a cooperative or communal basis. 
This would allow the natural, inborn “harmony” 
of man to be realized—a harmony that existing 
commercial civilization had destroyed. In this new 
society work, instead of being a burden, would be 
enjoyed. 

‘Another sketch of a communist utopia was 
Cabet’s Voyage en Icarie of 1838. A more direct 
influence on French socialism in the middle and 
later nineteenth century was Proudhon, author of 
Qu’est-ce que la propriété?, 1840, and coiner of 
the aphorism “Property is theft.” This aphorism 
was for him the answer to the Lockean right to 
property by labor. Yet, regarding property, he could 
be called a “distributivist” as much as, or even more 
than, a socialist. His influence has been more in 
the direction of anarchism than of socialism, since 
two of his central ideas were equality and individ- 
ual freedom and he preached against communism 
and the authoritarian state. His remedy for the 
evil of taking (and living on) interest was a sys- 
tem of universal and interest-free credit to be or- 
ganized through a mutual credit bank (his system 
of mutualité)—a proposal that not surprisingly 
drew the fire of Marx's criticism in the latter’s 
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Misère de la philosophie. [See the biographies of 
FOURIER; PROUDHON; SAINT-SIMON.] 

The Ricardian socialists in England. The germ 
of socialist ideas in England before Marx lay in a 
critique of classical political economy by a group 
of writers and pamphleteers who have come to be 
loosely described as the Ricardian socialists. A cen- 
terpiece of this critique for the main figures of this 
group was a concept of exploitation couched in 
traditional eighteenth-century terms of “natural 
right.” They were Ricardian in the sense that they 
sought to use Ricardo’s theory of value in such a 
way as to turn it, with the aid of natural-right no- 
tions, against the main precepts of the Ricardian 
school. 

By the end of the eighteenth century Spence and 
Ogilvie had derived from the principle of natural 
right the conclusion that ownership of land should 
be shared equally and that no man should have 
more than he could cultivate. Nature or God had 
given the land “in common to all men,” and equal 
sharing of land by all was the basic guarantee and 
sine qua non of human freedom. By analogy, in 
the year after Ricardo’s death William Thompson 
(in An Inquiry Into the Principles of the Distribu- 
tion of Wealth 1824) deduced the right of labor to 
the whole produce of labor from the postulate that 
labor is the sole (active) creator of wealth. In ex- 
isting society this was prevented by a system of 
“unequal exchanges” that resulted in part of labor’s 
product being filched by the possessors of economic 
advantage. Apart from its injustice and its offense 
against the Benthamite maxim of “greatest hap- 
piness,” this system deprived labor of much of its 
necessary incentive (substituting want as the spur 
to labor) and hence was inimical to national wealth. 
Such a notion could be held to have been implicit 
to some extent in Adam Smith’s treatment of profit 
and rent as “deductions” and Ricardo’s treatment 
of them as alternative and rival forms of surplus. 
But in Thompson’s notion of appropriation, or €x- 
ploitation, what was implicit in his forebears is 
given an explicit extension that those forebears 
would probably have disowned. Thompson, inci- 
dentally, also attempted a reply to Malthusian pes- 
simism by stressing the historical relativity of pop- 
ulation trends. 

The year following Thompson's Inquiry there 
appeared Thomas Hodgskin’s Labour Defended 
Against the Claims of Capital, which opens with 
the statement, “Throughout this country at present 
there exists a serious contest between capital and 
labour.” (Two years later his lectures at the London 
Mechanics Institution were published as Popular 
Political Economy.) Hodgskin similarly distin- 
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guished property associated with one’s own labor, 
which is a natural right, from property as the 
power to appropriate the product of the labor of 
others—that is, Lockean “natural right” from the 
“legal or artificial” right of ownership by conquest 
or appropriation. In a famous passage he declares, 
“I am certain that till the triumph of labour be 
complete; till productive industry alone be opulent, 
and till idleness alone be poor . . . till the right of 
property shall be founded on principles of justice 
and not those of slavery . . . there cannot and there 
ought not to be either peace on earth or goodwill 
amongst men.” Halévy says of his ideas that, while 
they “have their starting point in the philosophy 
of Bentham, it is in the philosophy of Karl Marx 
that they find their resting place.” Contempora- 
neously with Hodgskin, in 1825, John Gray pub- 
lished his Lecture on Human Happiness. Fourteen 
years later there appeared J. F. Bray’s Labours 
Wrongs and Labour's Remedy, which also contrasts 
“unequal exchanges” with equal and speaks of the 
exchange between capital and labor as “legalised 
robbery.” Both writers ended by advocating some- 
what vaguely a kind of Owenite cooperation. 

These writers apparently had in common the 
a priori derivation of ideal precepts for rebuilding 
society from postulated first principles of “justice” 
or of “natural right.” But what links them as fore- 
runners of Marx is their common championship 
of productive labor against the appropriation of 
labor's product over and above a subsistence wage, 
in consequence of the concentration of property 
ownership in comparatively few hands. 

Apart from the French utopians and English 
Ricardians, the German economic writer Rodbertus 
is sometimes included in the category of pre- 
Marxian socialists and with his generalized con- 
cept of rent has been called an anticipator of 
Marx’s theory of surplus value. Certainly his theory 
at first sight has a good deal in common with that 
of the English Ricardian socialists. But the main 
concern of his theory was to provide an explana- 
tion of crises of overproduction (in terms of under- 
consumption) and of how these could be prevented. 
His critique of existing society must be classed as 
“conservative socialism,” and the social reforms he 
advocated as a forerunner of “Bismarckian social- 
ism” rather than of the popular socialist movement 
as we know it. Again, Lassalle (in some respects 
influenced by Rodbertus) was a popularizer and 
propagandist of socialist ideas rather than a theorist 
in his own right. [See the biographies of OWEN; 
RODBERTUS; THOMPSON.] 

The Fabians and guild socialism, By the end of 
the century, when Fabian socialism arose in Eng- 


land as a rival both of nineteenth-century economic 
liberalism and of Marxism, the climate of thinking 
had changed. Gone was the influence of eighteenth- 
century rationalism and of the metaphysic of nat- 
ural right, and gone with them was the habit of 
deriving ideal models for a future society from 
some mythical “natural” state of society in the past. 
The end of the Victorian era, the time of transition 
from the age of steam to that of electricity and 
from free trade to imperialism, had a more prac- 
tical, more mundane, and more circumscribed cast 
of thought. The Fabians were not alone in their 
preoccupation with the inadequacies of laissez- 
faire and the propriety of extending the economic 
functions of the state. Certain academic econo- 
mists, notably Sidgwick, had already opened this 
question, as earlier Jevons himself had done much 
more cautiously and as afterward Marshall and his 
disciple and successor Pigou were to do. 

Among the authors of the Fabian Essays of 1889 
were some famous names, such as Bernard Shaw, 
Sidney Webb, Graham Wallas, and Sidney Olivier, 
who, although sharing a common platform, spoke 
each with an individual accent. Bernard Shaw had 
been weaned from Marxism to the economic 
theories of Jevons (under the economist Wick- 
steed’s influence) and from early revolutionary 
faith to a belief in evolutionary “gradualism,” which 
was the hallmark of the group as a whole. Webb 
was the patient empiricist, versed in the literature 
of royal commissions and acts of Parliament, who 
could report voluminously and in detail on social 
ills and inefficiencies needing remedy and the prac- 
tical steps by which governmental action could re- 
move them. In his Fabian essay he remarks that 
“history shews us no example of the sudden sub- 
stitution of Utopian and revolutionary romance,” 
attacks the age of individualism as the age of an- 
archy, and advances a radical program of specific 
reforms as the necessary complement to political 
democracy. As a group, the Fabians were concerned 
with particular evils and remedial measures, rather 
than with any general philosophy of society or 
even (Shaw excepted) with the denunciation of 
private property and the receipt of rent, interest, 
and profit. Much emphasis was laid on efficiency, 
and their essential method would probably be called 
today “social engineering.” Some have even denied 
them the name “socialists,” owing to their lack of 
interest in any radical reconstitution of the property 
basis of society. Perhaps it is in Bernard Shaw, and 
in him only, that are found traces of continuity 
with earlier brands of socialism, whether of the 
English or the Continental variety, since he makes 
polemical use (in the Fabian Essays and in others 
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of his works) of a generalized concept of rent as 
an “unearned surplus” reminiscent of Marxian 
surplus value—a socially created surplus, which 
ought to be appropriated by society and not by in- 
dividuals. 

Close on the heels of the Fabians—and to a large 
extent as a reaction against the strong element of 
étatisme in their outlook—came the comparatively 
short-lived but luminous movement known as 
“guild socialism.” Originating in a group of writers 
connected with the journal New Age (edited by 
A. R. Orage) in the first decade of the present cen- 
tury, it was soon reinforced by recruits from the 
contemporary university generation (mainly Ox- 
ford Fabians and most notably G. D. H. Cole). It 
drew largely upon the ideas of the French syndi- 
calists, with their emphasis on industrial direct 
action and the “industrial democracy” of direct 
workers’ control, to correct the centralizing and 
bureaucratizing bias traditional to state socialism. 
(Cole's early work, The World of Labour of 1913, 
is eloquent of this French inspiration.) Their tar- 
get of attack was less the particular inefficiencies 
of capitalist individualism than the evils and the 
hateful human degradation of “wage slavery,” with 
labor treated as a commodity, the abolition of 
which required that the social ownership of indus- 
trial capital be combined with the organization of 
industry under the control of democratic guilds 
composed of the actual producers (i.e., workers by 
hand and brain in the industries in question). In- 
dustrial democracy in this form was necessary not 
only to emancipate the workers but also to comple- 
ment, indeed to realize, political democracy. In 
their theory of the state, guild socialists tended to 
be pluralist and to reject the notion of state sov- 
ereignty. In its denunciation of wage slavery guild 
socialism had more affinity with earlier and with 
Continental socialist thought than had the more in- 
sular English Fabianism. [See the biographies of 
CoLE; WALLAS; WEBB, SIDNEY AND BEATRICE.] 

Marxian socialism. Not surprisingly, in view of 
its Hegelian roots, Marxian socialism started with 
a philosophy of history and a methodology. In a 
much-quoted phrase, Marx spoke of finding Hegel 
standing on his head and of proceeding to set him 
on his feet. This he claimed to have done by enun- 
ciating his materialist interpretation of history. Ac- 
cording to this, it was the mode of production of 
any given epoch that was the key to the interpreta- 
tion of that epoch, including its “superstructure” of 
ideas and moral sentiments and its legal and polit- 
ical institutions. This mode of production was con- 
ceived of as embracing not only its productive tech- 
nology but also the prevailing “social relations of 
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production”—namely, relations between men that 
turned upon their relations to the process of pro- 
duction and in particular to ownership of the 
means of production. In effect, these were class 
relations, and the contradictions inherent in such 
relations were the basis of class struggle, the prime 
mover of historical change to date. 

History since the end of tribal society had wit- 
nessed three main modes of production: slavery, 
feudal serfdom, and modern capitalism based on 
wage labor. All of these were forms of class society 
—each marked by class antagonism in its specific 
way—in which the producer was in a position of 
subjection to a ruling class whose power rested on 
ownership. In consequence of this subjection the 
surplus product, over and above what the producer 
himself retained for subsistence, was appropriated 
by the ruling and owning class, whether slave- 
owners, feudal seigneurs, or capitalists. In the first 
of these socioeconomic forms, the ruling class 
owned the person of the laboring producer as well 
as the impersonal, material means of production. 
In the second, it had the legal right to annex a cer- 
tain portion of the labor time of the producer, 
whether in the form of direct labor services or of 
tribute in kind. In the third, the laborer was in legal 
status a free agent, the relation between him and 
the capitalist being that of a contractual market re- 
lationship, yet the economic compulsion of his 
propertyless status obliged the proletarian wage 
earner to sell his labor power for little more than 
a subsistence wage (or for even less in conditions 
of acute unemployment). Thus the wage-labor— 
capital relationship under capitalism bore a major 
analogy with earlier and more patently servile 
forms of class relationship; property right per se 
was able to draw to its possessor, independently of 
any productive activity, a share of the total product. 

This, in brief, was Marx’s concept of exploitation 
(and as fruit of exploitation, class struggle). His 
economic analysis, as expounded in Das Kapital, 
was designed to enlarge on this analogy with pre- 
vious modes of appropriating surplus product and 
to show how the persistence of a difference between 
the value of labor power (sold for wages) and the 
value of its product was consistent with the “law 
of value”—that is, with conditions of a free market 
and of perfect competition. Unlike earlier socialist 
writers, he did not deduce the existence of surplus 
value or exploitation from some principle of natural 
right of labor to its product (all too often supposed 
by commentators and critics to be inherent in the 
labor theory of value). The analogy with earlier 
modes of appropriating a surplus product was for 
him a historical datum, which he sought to ex- 
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plain in terms of economic theory—doing so by 
penetrating below the market “appearance” of 
things to the “essence” of social relations under 
capitalism (the relationship between capitalist and 
proletarian as that of owner and propertyless ). For 
this reason, the boundaries of economic analysis 
were drawn more widely than in the narrower 
market-equilibrium studies to which we have grown 
accustomed in post-Menger, post-Jevonian econom- 
ics, from which property relations and their influ- 
ence are excluded because they are thought to 
belong to social rather than to economic theory. 

Marx's explanation turned on his distinction, 
to which he attached great importance, between 
labor and labor power. Labor power was what was 
sold as a commodity in return for wages and, like 
other commodities, sold for a price determined by 
the cost in labor necessary to produce and repro- 
duce it. This was the cost of producing its own sub- 
sistence—its essential input (this being modified, 
as Marx, like Ricardo, allowed, by a historically 
relative factor of social habit and custom). Hence 
wages absorbed only part of the product of labor 
at work for any given length of time—the value of 
labor power as a productive input was never more 
than a fraction of the net output emerging from 
the productive process. The difference was surplus 
value, which accrued to title of ownership as profit, 
interest, or rent. 

In this consisted the main part of his critical 
diagnosis of contemporary society. But it was also 
fundamental to his description of the dynamic of 
capitalist society and his prediction of its eventual 
replacement by socialism, With the development 
of the capitalist mode of production the class strug- 
gle would develop both in extent and in acuteness. 
With the widening and deepening of exploitation 
the proletariat would acquire class consciousness 
and would develop its own organization, both eco- 
nomic and political, as the eventual instrument of 
capitals overthrow. But there were two other 
agents of the dynamic process. First, there was a 
continuous tendency both toward concentration of 
production into larger units and toward centraliza- 
tion of capital itself, tendencies that at the same 
time encouraged more concentrated and more en- 
during organization of labor, while confronting 
labor with a more centralized, impersonal, and ty- 
rannical foe. Second, because of its uncoordinated 
character (its characteristic “anarchy of produc- 
tion”), combined with a growing contradiction be- 
tween the rate of growth of productive power and 
the slower rate of growth of markets, the process 
of capital accumulation was periodically inter- 

rupted by dislocating economic crises of overpro- 


duction. Such crises served the function of re-creat- 
ing the reserve army of the unemployed when it 
became depleted by the expanding demand for 
labor and wages showed signs of rising and en- 
croaching upon surplus value. They also encour- 
aged the tendency to concentration on the side of 
capital (the larger swallowing the smaller in lean 
years) and increased the instability of the worker's 
status and condition. 

The inevitable outcome and only “solution” to 
these gathering contradictions was a revolt of or- 
ganized labor against the growing tyranny of cap- 
ital, as the latter showed itself increasingly to be 
a “fetter on production,” no longer revolutionizing 
technique and expanding productive capacity as it 
had done in its halcyon days but restricting and 
wasting productive capacity and holding it in check. 
On its negative side such a revolt could only take 
the form of dispossessing the capitalists of their 
ownership rights—the famous “expropriation of the 
expropriators.” On its positive side revolutionary 
transformation must take the form of the transfer 
of the means of production into social ownership 
and the social organization of production on a 
planned basis, since in conditions of modern tech- 
nique and large-scale industrial production the 
kind of solution favored by Saint-Simon and Prou- 
dhon—the distribution of property in small units 
to all citizens—was clearly impracticable. 

A social transformation of this kind, the most 
revolutionary known to history, would liquidate the 
class antagonism of previous class society by sub- 
Stituting the social equality of a community of ac- 
tive producers, where everyone was a worker draw- 
ing an income from society, for the unequal and 
divided society of those who owned and those who 
were dispossessed. The period of human history 
characterized by successive forms of class exploi- 
tation, each with its specific type of dominant and 
exploiting class appropriating surplus product in its 
own manner, would have closed. But this did not 
mean that historical change would have come to 
an end. The technical means of production would 
continue to develop, probably more rapidly than 
before; human organization, adapting to chang- 
ing economic conditions and needs, would peren- 
nially undergo change. But the basic cause of social 
antagonism as known before in human history 
would have disappeared. 

There was no pretense, however, that the rela- 
tive social equality of all citizens as workers and 
producers would be the realization of an ideal of 
absolute justice among men. Socialists of the 
Marxian school have always spoken (since Marx 
wrote his Critique of the Gotha Programme) of two 
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stages of socialism, a lower and a higher. In the 
former, although work incomes would constitute 
the sole category of income and inequalities due to 
the existence of property incomes would have dis- 
appeared, some differences of income would still 
remain, owing to the necessity of differentiating 
wages according to the amount and kind of work 
performed. Only at the latter stage, when the pro- 
ductive powers of society had been sufficiently de- 
veloped and the moral standards of society suffi- 
ciently raised, would it be possible to achieve the 
fuller social equality of “from each according to 
his ability, to each according to his needs.” It has 
become customary in recent decades to call the 
first “socialism” and to reserve the name “commu- 
nism” for the second. One could say that the former 
would realize equality of opportunity for all, but 
the effect upon individual incomes of inequality of 
human capacities and talents would not be elimi- 
nated; only under “full communism” would differ- 
ences of human capacities and needs cease to be of 
economic significance. 

Marx and Engels and their followers always re- 
garded it as inconsistent with their conception and 
method to prepare anything resembling a blueprint 
of the future socialist society. The attempt to 
do so was the hallmark of the utopian socialist, 
and in their ascetic refusal to emulate their prede- 
cessors in this respect, they stood at the opposite 
pole from Fourier and his obsessive love of detailed 
prescription. Socialism, it was stated, would be 
established by “the proletariat organized as the rul- 
ing class,” which would forthwith “convert the 
means of production into State property”; it would 
“centralize all elements of production in the hands 
of the State” (i.e., of the proletariat organized as 
the ruling class) and would “increase the total of 
productive powers as rapidly as possible.” There 
were some occasional hints in the writings of Marx 
and Engels that production would be organized 
consciously under some kind of prearranged social 
plan. But apart from the comments already quoted 
from the Critique of the Gotha Programme, that is 
virtually all. Lenin, who had the task of laying the 
foundations of the first socialist state, declared that 
“in Marx there is no trace of attempts to create 
utopias, to guess in the void at what cannot be 
known.” [See the biographies of BERNSTEIN; KAUT- 
SKY; LUXEMBURG; MARX.] 

Post-1917 social democracy. In the years since 
the Russian Revolution the socialist world has been 
more or less sharply divided between those who 
recognized this event as a genuine socialist revo- 
lution and those who denied it such a name. The 
difference partly turned on the methods used to 
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achieve and to consolidate the revolution, namely 
the use of insurrection and armed force and the 
regime of the “dictatorship of the proletariat.” But 
there was also the deeper issue of whether social- 
ism could be built at all in a backward country of 
weakly developed industry and predominantly 
peasant agriculture. The Russian Mensheviks de- 
nied that it could be and declared that the stage 
was set in Russia for no more than a “bourgeois 
revolution” against tsarist absolutism. What was 
distinctively new in Lenin’s controversial interpre- 
tation when he arrived back in Russia in April 1917 
was that, while accepting that a bourgeois revolu- 
tion was in process, he nonetheless declared that 
the industrial proletariat could and should seize 
power in alliance with the peasantry. In doing so, 
the proletariat could transform a bourgeois rev- 
olution into a socialist one and eventually start 
to build socialism. The discussion about “socialism 
in one country” that was to develop within the 
Bolshevik ranks in the following decade was in 
large degree an extension of this same controversy, 
since it was concerned with the question of whether 
the transition to socialism, already started by the 
nationalization decrees of 1917-1918 and carried 
over into the “mixed economy” of the 1920s, could 
be completed unless the revolution spread to other, 
more technically advanced, countries of Europe. 

Socialist parties in western Europe (with the 
exception of the Italian) generally followed the 
Menshevik line in their estimate of the Soviet revo- 
lution. They proceeded to affirm their devotion to 
democratic parliamentary methods and their inten- 
tion of achieving socialism, not by a single revolu- 
tionary act, but by a series of modifying reforms 
in the existing structure and by a gradual exten- 
sion of the economic functions of government. The 
rift in socialist thought and policy was deepened 
after the formation of the Third (or Communist) 
International in 1919, in opposition to the Second 
International, which after its collapse in 1914 was 
to be revived in 1920. The concept of socialism 
current in most social democratic circles increas- 
ingly approached that of Fabian gradualism and, 
in the course of two decades, in most cases ceased 
to be Marxian in anything but name. After World 
War u, partly under pressure of the “cold war,” 
the leading parties of continental Europe and 
Scandinavia not only eschewed Marxism, but 
dropped from their programs any proposal for ex- 
tensive socialization of production. 

Some would say that the temper of the times 
is to eschew general social theories as speculative 
or metaphysical and that for this reason one can 
no longer speak of socialist theory apart from the 
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Marxist school. Certainly it is true that the tend- 
ency in England and elsewhere has been to favor 
an increasingly empirical approach. Sixty-three 
years after the appearance of the original Fabian 
Essays a number of younger thinkers of the English 
Labour party combined to produce in 1952 a col- 
lection of New Fabian Essays under the editorship 
of R. H. S. Crossman. What is remarkable about 
this new volume, in contrast with the emphasis of 
its forebear, is the playing down of socialization in 
the traditional sense of the transfer of means of 
production to state ownership (even to the point 
of dismissing it as an obsolete Marxian prejudice). 
If there is a single unifying theme in terms of 
which socialism as a credo is here definable, it is 
perhaps to be found in an emphasis on social equal- 
ity. This is to be realized primarily through the 
extension of social services, a widening of educa- 
tional opportunities, and progressive taxation. In- 
deed, one writer, C, A. R. Crosland, speaks as 
though the aims of the socialist movement were 
already achieved to a considerable extent, since the 
metamorphosis of capitalism “into a quite different 
system .. . is rendering academic most of the tra- 
ditional socialist analysis,” and state intervention 
in economic life has so increased as to “justify the 
statement that the capitalist era has now passed 
into history.” Property rights, it is said, “no longer 
constitute the essential basis of economic and social 
power,” which has passed to a new class of man- 
agers. Nationalization and “the early Fabian em- 
phasis on collectivism” are expressly rejected as 
key to the definition of socialism, and equality of 
status is enthroned as the essence of the definition 
instead. Before the war, in 1937, Douglas Jay in 
The Socialist Case had already said, “If we are to 
have the substance and not the shadow, we must 
define socialism as the abolition of private un- 
earned or inherited incomes rather than of the pri- 
vate ownership of the means of production”; while 
as for planning in any of its forms, these are “pos- 
sible rather than necessary elements of socialism.” 
The economists’ debate. There remains to be 
said something in summary about the narrower dis- 
cussion of socialism, by economists, which itself 
falls into two halves: discussion of the comparative 
merits of the two rival systems in the attainment of 
some postulated “optimum” and discussion of alter- 
native mechanisms, or “models,” for the operation 
of a socialist economy. The latter has become a 
lively subject of debate today in the socialist coun- 
tries themselves. 
As a result of the new economics of Jevons 
and Menger in the last quarter of the nineteenth 
century, two opposite tendencies arose among 


leading academic figures. First, as we have noted, 
there was a tendency in England especially (which 
Jevons himself cautiously initiated) to re-examine 
the case for laissez-faire and the exceptions to it. 
This re-examination, developed by Sidgwick and 
Marshall, drew attention to a number of “excep- 
tions,” in which public interest conflicted with pri- 
vate and in which production of wealth failed to 
be maximized when left to the free play of market 
forces. In the twentieth century, with the increas- 
ing prevalence of monopoly and restricted competi- 
tion, this critique was extended to include the 
adverse effects of “imperfect competition” or “mo- 
nopolistic competition”—the excess capacity latent 
in excessive product differentiation and also the 
swollen costs and distorting effects of salesmanship 
and advertising to which they give rise. Second and 
concurrently, the development of mathematical 
theories of general market equilibrium brought with 
it, as a signal corollary, a new justification of free 
enterprise: namely, that the general equilibrium 
toward which a competitive market always tends 
represents a maximum of utility (in given condi- 
tions of demand and of economic resources). This 
corollary of their analysis was underlined by Wal- 
ras and popularized by his follower Pareto. What 
made this theorem on inspection less impressive 
than at first appeared was that the postulated op- 
timum was relative to a given income distribution 
(or at least of a given structure of factor ownership 
taken as datum). Hence, it could not be used to 
pass judgment on income distribution itself. Its 
critics pointed out that the Walras—Pareto theorem 
defined, not a unique optimum position, but a 
whole series of positions. In a modified form, how- 
ever, the theorem continued to be used as a justi- 
fication of free competition as that which secures 
an optimum result relative to whatever income dis- 
tribution existed (it being implicitly recognized 
that the state could, and should, modify that in- 
come distribution through taxation—so far as dis- 
incentive effects of taxation upon production al- 
lowed). 

Soon, however, a counterattack was mounted 
upon the socialist case with the aid of this theorem, 
in the form of the contention that a socialist econ- 
omy, since it would lack a market for factors of 
production, would have no way of attaining an 
optimum (of utility or welfare) or even of ascer- 
taining in what direction this lay. Hence, in the 
words of Von Mises (the name chiefly associated 
with this argument), a socialist economy would be 
nonrational and uneconomic ex natura (1922). In 
the two decades that followed Von Mises’s challenge 
socialist economists (in Germany in the course of 
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the 1920s and in England and the United States 
in the 1930s) sought an answer to it by demon- 
strating various ways in which the problem could 
be solved. Most of the suggested mechanisms, how- 
ever, involved the creation of actual markets or 
else of quasi-market processes for factors of pro- 
duction and “producers goods” (i.e., all productive 
inputs) and the simulation of competition under 
socialism, Henry Dickinson, for example, proposed 
actual markets; Lange proposed a system of ac- 
counting prices to be adjusted so as to equilibrate 
supply and demand by a trial-and-error process, 
with output decisions and investment decisions 
taken on the basis of these accounting prices ac- 
cording to certain rules (1936-1937). Such solu- 
tions mainly implied decentralized decision making 
(at the level of individual industries or enterprises ) 
and accordingly set limits to the amount of central- 
ized planning (other than “indicative planning”) 
that could be used. This was not the case, however, 
with all the suggested solutions. In 1908, for ex- 
ample, the Italian economist Barone had already 
advanced one such solution in mathematical form. 
But doubt was expressed as to whether, as a cen- 
tralized planning solution, it could be regarded as 
practicable in view of the complexity of the calcu- 
lations involved. [See the biographies of BARONE; 
LANGE.] 

It should be added that the further discussion of 
what has come to be called “welfare economics,” 
especially in the course of the 1950s, has intro- 
duced considerable doubt as to whether the maxi- 
mizing of welfare or of national output (or income) 
could be given any precise meaning. In both cases 
this doubt was primarily due to the aforementioned 
difficulty of income distribution; in the second 
case, for example, it was due to the dependence of 
prices, in terms of which output is summed into an 
aggregate, on income distribution. There was the 
difficulty introduced by a growing emphasis on the 
conventional element in wants and the influence 
of other people’s consumption on an individual's 
satisfactions, as well as the effect of advertising 
propaganda (“the hidden persuaders”) upon de- 
sires, Even if sense could still be conceded to the 
idea of maximizing something, it followed that the 
“tolerances” to be allowed to any mechanism for 
achieving it were considerably wider than had 
earlier been supposed [See WELFARE ECONOMICS. ] 

During the 1950s the question of more, or less, 
centralized or decentralized models and of the de- 
gree to which planning and a market mechanism 
could be combined also began to occupy economic 
discussion in the socialist countries of eastern 
Europe. On the one hand, this discussion was pro- 
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voked by the need to give more initiative to the in- 
dividual industrial enterprise in regard to the choice 
of inputs and outputs (within a general framework 
of planning) in a period when considerations of 
efficiency, quality, innovation, and attention to con- 
sumers’ requirements were becoming more impor- 
tant than mere quantitative increase of output of 
a given range of products, which had been the 
main preoccupation of an earlier ‘period. Combined 
with this was a reconsideration of the type of col- 
lective incentive to the enterprise that would be 
most conducive to the beneficial use of this initia- 
tive. On the other hand, the discussion was 
prompted by an increasing use of linear program- 
ming methods for selecting an optimum plan from 
among a range of alternative and self-consistent 
plans, according to the system of the Leningrad 
mathematician Kantorovitch, Such methods of op- 
timal planning can be applied at various levels— 
that is, to decentralized or centralized decisions, to 
integral parts of a plan, or to a plan as a larger 
whole (at the time of writing they have been ap- 
plied only to the former). In each optimal solution 
there is implicit a set of “shadow prices,” in terms 
of which the cost of inputs necessary to yield a 
given output program is minimized (or alternative- 
ly the output yielded by a given quantity of avail- 
able inputs is maximized). Accordingly, the ques- 
tion of what system of prices is consistent with the 
choice of optimum methods of production is im- 
mediately raised. 

The Yugoslav economy was the first socialist 
economy to adopt a fairly drastic degree of decen- 
tralization, early in the 1950s. This took the form 
of giving economic enterprises greater discretion 
in their output programs (and even in large 
measure their investments) on the basis of contrac- 
tual arrangements with other industries, enter- 
prises, or wholesale and retail bodies. Yugoslavia 
is often cited, accordingly, as an example of a 
“decentralized market model.” In recent years, 
however, other countries, including the Soviet 
Union, have in varying degrees moved in the di- 
rection of decentralization of planning and of free- 
ing the enterprise as a decision-making unit from 
a surfeit of detailed directives. New “models” of 
decentralized (contractual) supply arrangements 
and enterprise autonomy linked with collective en- 
terprise incentives have been experimented with, 
especially in consumer goods industries (e.g., the 
Liberman scheme in the U.S.S.R. and the Sik pro- 
posals in Czechoslovakia). nee 

However, while in the socialist countries in- 
creased concern has been shown with optimal 
planning, among Western economists the focus of 
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interest has been shifting away from questions of 
static equilibria to questions of growth. One could 
say that most economists are more concerned today 
to use growth potential as a criterion of judgment 
for an economic system than to use its capability 
either for attaining an economically perfect allo- 
cation of productive resources (defined in some 
way) or for ensuring an equitable distribution of 
income. On the relative weight to be attached to 
such criteria opinion naturally varies among econ- 
omists, as it always has done and continues to do 
among socialists. But so far as a growth criterion 
is concerned, there can be little doubt that socialist 
economies have a distinctly good record; vide the 
high rates of growth in Soviet industry in the pre- 
war decade and again in the planned economies 
as a whole in the postwar period. (In agriculture, 
on the other hand, although there have at times 
been successes, the record is less impressive.) 
Long-term planning in the Soviet Union in par- 
ticular has set itself the goal of maintaining an 
industrial growth rate in the neighborhood of 
10 per cent during this and the ensuing decade, 
and of overtaking the U.S. economy both in abso- 
lute production and in per capita production at an 
early date. Much in the comparative economic 
judgment of the two systems will no doubt turn 
on the result. 

MAURICE Doss 


[See also CommuNISM; COMMUNISM, ECONOMIC OR- 
GANIZATION OF; MARXISM; PLANNING, ECONOMIC, 
article on EASTERN EUROPE; SOCIALISM; VALUE, 
LABOR THEORY OF.] 
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y. 
THE HISTORICAL SCHOOL 


The German historical school of economics 
reached its peak in the second half of the nine- 
teenth century. It developed as a countermovement 
of romantic (that is, historical and universalist) 
thinking in opposition to classical economics, 
whose approach was timeless and atomistic. 

The historical school embedded economic theory 
(often isolated by the classical economists) in a 
matrix of all the fields in which man is socially 
active and thereby laid the original foundations of 
a social theory of economics. Such thinkers as 
Adam Müller and Friedrich List were the pre- 
cursors of the historical school. 

The precursors. Adam Müller (1779-1829) 
did not regard the economy as a separate sphere 
of existence, dominated by the striving for profit. 
He considered the economy, rather, to be one ele- 
ment in the primordial social order, interrelated 
with custom, law, education, politics, and religion, 
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and changing as these changed historically. In his 
theory of money, he went so far as to define money 
(intrinsically) as an expression of the desire for 
cooperation, He regarded governmental economic 
policy as a productive force and not merely as a 
necessity to be endured or, as did most of the 
classicists, as an organ of order and rule that 
should be kept in check. Yet Miiller was not an 
avowed enemy of free enterprise and wanted to 
see it retained in a corporate state. 

Miiller’s ideas had their philosophical founda- 
tion in the works of J. G. Fichte, especially in the 
latter's treatise Der geschlossene Handelsstaat, 
written in 1800. (Miiller’s fundamental ideas also 
resembled those of G. W. F. Hegel.) Starting from 
the same Enlightenment ideal of individual free- 
dom as did the liberal classicists, Fichte reached 
an entirely different conclusion. He advocated en- 
larging the powers of the state, provided that the 
state was defined as having a social purpose. This 
romantic economic and social philosophy had as 
its practical objective the welfare state rather than 
the constitutional state. Fichte ultimately produced 
a defense of an autarkic state, isolated from the 
world market, whose planned economic constitu- 
tion was “beneficent” domestically. 

There is a bridge that leads to Marx both 
from Fichte and from Müller. In his Lehre von 
Gegensatzen of 1804, Müller laid down the philo- 
sophical foundations of his economic views. He 
asserted that all living phenomena arise from the 
interaction of opposites, and he considered the in- 
vestigation of such interactions to be the task of 
economics. Like Marx, but a generation earlier, he 
viewed the economy and economics as evolving; 
indeed, he saw them as totally immanent in his- 
tory. However, his interest was centered on the 
spiritual forces behind the material appearances 
of economic life; in this he was quite the opposite 
of Marx. (The relationship of base to superstruc- 
ture was reversed.) He criticized classical eco- 
nomics as mechanistic, coldly rational, material- 
istic, and static. 

The Romantic school, which included, in addi- 
tion to Miiller, such men as Edmund Burke, F. von 
Gentz, K. L. von Haller, J. Görres, and F. Baader, 
had almost no influence. Its imprecise, cloudy 
thought was unable to make headway against the 
dominant classical economics. Moreover, the eco- 
nomic developments of the era (the urge toward 
emancipation of the bourgeoisie, the Continental 
blockade, the tariff law, and later the Zollverein ) 
were not such as to allow the politically conserva- 
tive (corporate state) tendencies of the Romantics 


to prevail. 
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The more influential economic ideas of Friedrich 
List (1789-1804) were put forward in Das na- 
tionale System der politischen Okonomie, written 
in 1841. The German idea of the nation, awakened 
in the struggle against Napoleon, was the starting 
point for Lists economic thinking. He was pri- 
marily a practitioner of national economic integra- 
tion, and his theoretical insights sprang from this 
soil. Starting from the historical fact that the econ- 
omies of different countries are not equally strong, 
he advocated “educational” tariffs to protect infant 
industries and to enable weak, underdeveloped 
nations to rise to the standard of the richer, more 
industrialized ones. Only then would true interna- 
tional competition be possible. 

List opposed his doctrine of a historical expan- 
sion of productive forces to the timelessness of 
liberal cosmopolitanism. He held that a nation’s 
wealth is represented not by stocks of commodities 
but by the abilities and skills of its people. For 
Adam Smith, only material production was produc- 
tive; List also considered education, administration, 
and communication to be historically important 
productive forces. 

The older historical school. The above were 
the roots of the older German historical school, 
whose principal representatives were W. Roscher 
(1817-1894), B. Hildebrand (1812-1878), and 
K. Knies (1821-1898). These men were Marx’s 
contemporaries. At that time the bourgeois move- 
ment for the unification of Germany was flour- 
ishing, and Prussian-German power politics had 
begun its then victorious course. Following Miiller, 
the older historical school (in particular Knies and 
Hildebrand) emphasized the inclusion of eco- 
nomics in the totality of the common moral and 
national life, whose historical, organic growth is 
based on free human deeds rather than on natural 
laws. In his methodological critique of classical 
economics, Die Nationalökonomie der Gegenwart 
und Zukunft (1848), Hildebrand denied the exist- 
ence of natural laws applying to economic activity. 
He asserted that economics must seek for the laws 
of the historical evolution of the economy, rather 
than for immutable laws. Hildebrand did concede 
that an objective regularity of evolution operates 
in the economy. (The stages of this evolution he 
called the natural economy, the money economy, 
and the credit economy. ) 

As the most consistent advocate of the historical 
approach, Knies, in his Die politische Okonomie 
vom geschichtlichen Standpunkte (1853), con- 
ceded no such absolute validity to evolutionary 
laws. He admitted that certain similarities can be 
observed in phenomena. But these do not involve 


456 ECONOMIC THOUGHT: The Historical School 


necessity, for they are not linked by any causal 
nexus. They are merely analogies. Human action 
does not arise from an “innate” egotism but derives 
its motives from the spirit of the times. 

Precisely because of this dubious substitution 
of subjective motivations for „objective law, eco- 
nomic science is indebted to Knies for clearing the 
way for the methodologically significant insight 
that statistical rather than causal regularities 
apply in the historical—social realm. With this, 
Hildebrand and Knies broke definitively with the 
ideas of the classicists. Roscher, in his Grund- 
riss zu Vorlesungen über die Staatswirthschaft 
nach geschichtlicher Methode (1843), remained 
somewhat dependent on classical ideas, though he 
stressed the historical method of research. 

The similarities between the older historical 
school and Marx indicate a common intellectual 
background. For both, historical progress is a basic 
concept. Both take human actions in relation to 
productive forces as their point of departure, and 
both emphasize the concept of society. But Marx 
opposed his “rational” research into the laws of the 
historical process to the historical school’s relativis- 
tic concept of societies, which it saw as based on 
an irrational flow of history. For Marx the base— 
superstructure relationship is the opposite; he re- 
garded the social institutions as a superstructure 
dependent on the economy, rather than vice versa. 

The later historical school. Thus the older his- 
torical school concentrated its attention on the fun- 
damentals of universal historical economic devel- 
opment. The later historical school used this view 
as a point of departure and, at the beginning of 
the 1870s, turned to research into concrete details. 
The founder of this later school was Gustav Schmol- 
ler (1838-1917), and its adherents included L. 
Brentano, K. Biicher, G. F. Knapp, K. T. von Inama- 
Sternegg, and J. Conrad. Economic historicism also 
gained a more or less firm footing in England, 
France, Italy, and the United States. Men such as 
T. E. Cliffe Leslie, J. K. Ingram, W. J. Ashley, 
Charles Gide, L. Cossa, and Charles A. Beard were 
among its representatives in these countries. 

The “great” dispute over method (Methodenstreit) 
with the marginalist neoclassicists forced the mem- 
bers of the later historical school to seek further 
clarification of their point of view, as in Schmoller’s 
“Zur Methodologie der Staats- und Sozialwissen- 
schaften” (1883). The antitheses were historical 
induction vs. deduction, individualizing vs. general- 
izing, and descriptive economics vs. economics that 
searches for laws or patterns. Another characteris- 
tic of the later school was its ethical and practical 
approach to questions of social policy, especially to 


labor problems. Schmoller was instrumental in the 
founding of the Verein fiir Socialpolitik in 1872, 
whence the designation of the school as “profes- 
sorial socialism” (Kathedersozialismus ). 

Brentano's resolution of the problems inherent 
in previous naively linear correlations of working 
time, labor output, and wages was a significant in- 
sight in political economy. So was Knapp’s theory 
of money, presented in The State Theory of Money 
(1905), in which money is defined as a creation 
of the law rather than a “commodity.” 

At about the turn of the century there developed 
a German trend toward historical and sociological 
research which included such important thinkers 
as W. Sombart and Max Weber, This movement, 
which was a successor to and a critique of the his- 
torical school, was also influenced by W. Dilthey. 
Later on, O. Spann and his disciples went back 
directly to Adam Miiller. 

Influence on institutionalism. The influence of 
the historical school was not confined to Europe 
but was felt in America as well, particularly by the 
American institutionalists. After World War 1 they 
increasingly worked toward independence of eco- 
nomic thought. In the 1920s J. R. Commons, R. G. 
Tugwell, A. B. Wolfe, C. E. Ayres, W. H. Hamilton, 
M. A. Copeland, W. E. Atkins, and others tried to 
establish an American system of teaching and re- 
search. Characteristically they turned away from 
marginal utility and the entire abstractly theoret- 
ical approach of British neoclassical political econ- 
omy. These were replaced by empirical studies of 
economic behavior along social-psychological and, 
to some extent, social-ethical lines. 

Thorstein B. Veblen’s economic relativism and 
his witty criticism of John Bates Clark’s doctrine, 
based on marginal utility theory, demonstrated 
most clearly how weary a then influential group 
of American economists had become of the tutelage 
of their intellectual elders, the European theorists. 
The criticism of the institutionalists was aimed 
primarily at the fiction of the “economic man” and 
at the hypothesis of an exclusively hedonistic moti- 
vation for his actions. This institutionalist thinking 
had many links to European ideas, The initial gen- 
eral rejection of the classical theory by American 
economists, as well as their acceptance of the meth- 
odological principles of the older German histor- 
ical school (especially in the form in which they 
were cast by Bruno Hildebrand), was inspired by 
personal and intellectual contacts at German uni- 
versities while the historical school was at its peak. 

Institutionalism and the closely related behavior- 
ism proceed from the assumption that economic 
acts are not solely governed by the hedonistic prin- 
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ciple. Man’s economic behavior, his needs and de- 
sires, his ways and means, are merely functions of 
a constantly changing evolution that is infinitely 
complex, is molded by particular social institutions, 
and is in this sense “institutional.” The true nature 
of economic relationships can be learned only by 
thorough study of economic institutions, of the 
changes they undergo in the course of their evolu- 
tion, and of their interrelationships with the 
changes in the psychological peculiarities of men’s 
economic behavior. Therefore the institutionalists 
demanded a dynamic concept of the economy. 

This conceptual structure of institutionalism 
clearly echoes aspects of German experimental 
psychology and of Spencer's evolutionary doctrine, 
with the ideas of traditional British association psy- 
chology in its background. It cannot be denied, 
however, that the relativism central to this institu- 
tionalist way of thinking is close to the basic no- 
tions of the German historical school. 

There is still another channel leading from the 
German historical school to American institution- 
alism, In the 1920s certain economists led by 
Commons became tired of neoclassicist research in 
general, and of the dispute about what is “free of 
value judgments” in particular. In their reaction 
they adopted the moderate standpoint that had 
earlier been advocated in Germany by Bruno Hilde- 
brand, with his historical approach. They asserted 
that in establishing goals of economic policy, one 
must take into account the mass-psychological and 
social-ethical aspects of these goals. 

A. T. Hadley, E. R. A. Seligman, and R. Mayo- 
Smith were among those who supported the “value- 
judgment-free” point of view and opposed Com- 
mons in this “value-judgment debate,” which is 
often called the “little” dispute over method (kleiner 
Methodenstreit). Veblen was in the forefront of the 
challenge to John Bates Clark’s economic theory, 
which was free of value judgments. As a young 
economist, Clark’s son, John Maurice Clark, agreed 
with Veblen on particular issues. 

These discussions had implications for economic 
policy. The institutionalists felt that the abstract- 
theoretical postulate of absolute economic freedom 
should be disregarded, and stress should be placed 
instead on the actual development of economic in- 
stitutions and the behavior of individuals and of 
important socioeconomic groups. Governmental eco- 
nomic planning and control were justified, if neces- 
sary, to ensure the social-ethical ideal of welfare 
in the long run. This view was adopted by W. C. 
Mitchell, Commons, Hamilton, L. D. Edie, and 
Tugwell, and, to a greater or lesser degree, by 
Wolfe, John Maurice Clark, F. C. Mills, O. F. 
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Boucke, D. Friday, and other economists. Still, 
American institutionalists did not simply accept the 
concepts of the German historical school. The meth- 
odological pragmatism of the majority of Amer- 
ican institutionalists is manifested by their failure 
to evolve a unified cognitive procedure of a specific 
institutionalist nature. True, they accepted the de- 
ductive method. Yet their special interests lie in 
empirical-inductive, quantitative, mathematical- 
statistical, and causal-genetic analysis. For exam- 
ple, the work of W. C. Mitchell was primarily con- 
cerned with the analysis of business cycles and 
economic development. He is primarily responsible 
for the great growth of empirical research and of 
studies aimed at statistical and historical verifica- 
tion. It now appears that a more or less har- 
monious collaboration of abstract and empirical 
economic thought tends toward attaching increas- 
ing importance to the statistical, applied-mathe- 
matical, and historical approach. 


THEO SURÁNYI-UNGER 


[See the biographies of BÜCHER; Commons; HILDE- 
BRAND; KNAPP; Knies; List; MÜLLER, ADAM; Ro- 
SCHER; SCHMOLLER; VEBLEN.] 
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vI 
THE AUSTRIAN SCHOOL 


Value theory before 1871. The “marginal rev- 
olution” of the 1870s is generally recognized as a 
major step in the advance of economic theory. 
Those who have been able to start with its estab- 
lished results often find it difficult to comprehend 


why an obvious and simple idea, which had oc- 
curred to many earlier thinkers, should have exer- 
cised such a great effect when W. S. Jevons, Carl 
Menger, and Léon Walras independently rediscov- 
ered it at about the same time and, particularly, 
why the tradition founded by Menger should so 
profoundly have affected economic theory for two 
generations. To account for this it is necessary to 
elaborate on the distinction that is commonly but 
inadequately expressed as a contrast between “ob- 
jective” and “subjective” theories of value. 

Since value is evidently an attribute possessed 
by certain things or services, it was natural to seek 
its determinants in some property or properties of 
the particular objects that possess it. This proce- 
dure had proved successful in the physical sciences, 
and the expectation that objects having the same 
value should have also other “intrinsic” properties 
in common therefore seemed reasonable, Of course, 
it had often been seen that the decisive factor 
might be something discoverable not in the object 
itself but rather in the relations of men to the ob- 
ject. Ever since the medieval scholastic philos- 
ophers (and even Aristotle), it had been pointed 
out again and again that to possess value an object 
must be useful and scarce. But this idea was rarely 
followed through systematically (though an excep- 
tion must be made for the greatest of the early 
anticipators of modern theory, Fernando Galiani 
in his Della moneta of 1750) and never to the point 
of realizing that what was relevant was not merely 
man’s relation to a particular thing or a class of 
things but the position of the thing in the whole 
means-end structure—the whole scheme by which 
men decide how to allocate the resources at their 
disposal among their different endeavors. 

It was largely from approaching the problem of 
value through patient analysis of the character of 
economic choice in the various types of possible 
relations between different means and different 
ends that there ultimately emerged the explanation 
of value that made the work of Menger and his 
pupils so immediately effective. Although the an- 
swers supplied by Jevons and Walras to the old 
value paradox were no less correct, they were con- 
cealed from most contemporary economists by the 
mathematical notation employed. Moreover, these 
authors treated the solution of the utility paradox 
as merely a preliminary step to be gotten quickly 
out of the way in order to get on with the main 
business, the explanation of value in exchange. 
The Austrians, on the other hand, made the full 
analysis of the conditions of valuation, independent 
of the possibility of exchange, so much their cen- 
tral task that later they had to defend themselves 
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against the misunderstanding that they believed 
marginal utility provided a direct explanation of 
price. Of course, the subjective value that it ex- 
plains is merely a first step to the second stage, the 
theory of price. 

On the Continent the utility-cum-scarcity ap- 
proach had, since Galiani, remained an important 
tradition and considerable insight had been achieved 
in such works as E. B. de Condillac’s Le commerce 
et le gouvernement, 1776, Louis Say’s Considéra- 
tions sur lindustrie, 1822, Auguste Walras’s De la 
nature de la richesse, 1831, Jules Dupuit’s De la 
mesure de Vutilité des travaux publics, 1844, and 
finally in the remarkable but at the time completely 
overlooked work of H. H. Gossen, Entwicklung der 
Gesetz des menschlichen Verkehrs, 1854. In Eng- 
land, on the other hand, the generally much more 
highly developed classical political economy had 
become wedded to an “objective” labor theory of 
value, and the utility tradition persisted only as a 
sort of undercurrent of protest, which reached a 
high point in 1834 with W. F. Lloyd's The Notion 
of Value. It was fully swamped when as late as 
1848 J. S. Mill in his Principles of Political Econ- 
omy not only re-expounded the classical position 
but calmly asserted, “Happily, there is nothing in 
the laws of value which remains for the present 
or any future writer to clear up; the theory of the 
subject is complete” (book 3, chapter 1, sec. DA 
[See the biographies of DuPUIT; GALIANI; GossEN.] 

Menger and the origins of the school. While 
W. S. Jevons (who with his preliminary sketch of 
a theory of value based on “final utility” had antic- 
ipated Menger by nine years) was writing in direct 
opposition to dominant doctrine and had little more 
than Benthamite utilitarianism to draw upon, both 
Menger and Walras were able to build on a rich 
literary tradition favorable to the utility approach. 
But in Vienna, where Menger worked, there was 
little interest in economic theory. At that time eco- 
nomics was taught at the University of Vienna by 
Germans of chiefly sociological interests. The rapid 
rise of a distinct Austrian school of economic 
theory was thus entirely owing to Menger’s work— 
though it did coincide with the university’s rise to 
great eminence in a number of other fields, which 
for some fifty or sixty years made it an intellectual 
center of great influence. 

Carl Menger was a civil servant in the prime 
minister’s office at Vienna when at the age of 31 
he published his first and decisive work, Principles 
of Economics (1871). It was the first part of an 
intended treatise, the rest of which never appeared. 
It dealt with the general conditions that create 


economic activity, value, exchange, price, and 
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money. What made it so effective was that the ex- 
planation of value it offered arose from an analysis 
of the conditions determining the distribution of 
scarce goods among competing uses and of the way 
in which different goods competed or cooperated 
for the satisfaction of different needs—in short, 
what has been called above the “means—ends struc- 
ture.” It is this analysis that precedes the theory of 
value proper. Friedrich von Wieser was to develop 
this systematically into a vorwerttheoretische part 
of economic theory that made the Austrian form of 
marginal-utility analysis so suitable as a basis of 
further development. From this analysis springs 
most of what is known today as the logic of choice, 
or the “economic calculus.” 

Menger’s exposition is generally characterized 
more by painstaking detail and relentless pursuit 
of the important points than by elegance or the 
use of graphic terms to express his conclusions. 
Though always clear, it is labored, and it is doubt- 
ful whether his doctrines would ever have had wide 
appeal in the form in which he stated them, How- 
ever, he had the good fortune of finding at once 
avid and gifted readers in two young men who had 
left the University of Vienna some time before 
Menger became a professor there. They decided to 
make the fulfillment of his teaching their lifework. 
It was mainly through the work of Eugen von 
Bohm-Bawerk and Friedrich von Wieser, class- 
mates and later brothers-in-law, that Menger’s ideas 
were developed and spread. Gradually during the 
1880s, when their most influential works appeared, 
they were joined by others working in the universi- 
ties or elsewhere in Austria. Of these, Emil Sax, 
Robert Zuckerkandl, Johann von Komorzynski, and 
Robert Meyer particularly deserve mention, Some- 
what later came Hermann von Schullern zu Schrat- 
tenhofen and Richard Schüller. 

The year 1889, in which the greatest number of 
important publications of the group were concen- 
trated, also saw the appearance of an important 
theoretical treatise by two Viennese businessmen, 
Rudolf Auspitz and Richard Lieben, Untersuchung- 
en über die Theorie des Preises (1889). This can, 
however, only with qualifications be included in 
the works of the Austrian school. It moved on par- 
allel but wholly independent lines and with its 
highly mathematical exposition was too difficult for 
most contemporary economists, S0 that its im- 
portance was recognized only much later. 

Of great importance for spreading the teachings 
of the school, especially in Germany, was the fact 
that another Viennese professor, Eugen Philippo- 
vich von Philippsberg, though not himself an ac- 
tive theoretician, incorporated the marginal-util- 
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ity doctrine into a very successful textbook, 
Grundriss der politischen Okonomie (1893). For 
some twenty years after publication this remained 
the most widely used textbook in Germany and 
almost the only channel through which the mar- 
ginal-utility doctrine became known there. In other 
foreign countries, especially England, the United 
States, Italy, the Netherlands, and the Scandi- 
navian countries, the basic economic publications 
of the Austrians became known sooner, partly in 
English translations, Probably the most important 
foreign adherent was an exact contemporary of 
Böhm-Bawerk and Wieser, the Swede Knut Wick- 
sell. Although he was also greatly indebted to 
Walras, Wicksell could write in 1921 that “since 
Ricardo’s Principles there has been no other book 
—not even excepting Jevons brilliant but some- 
what aphoristic and Walras’ unfortunately difficult 
work—which has had such a great influence on 
the development of economics as Menger’s Grund- 
sdtze.” [See the biographies of AusPitz and LIEBEN; 
MENGER; WICKSELL,] 

Development by Béhm-Bawerk and Wieser. In 
the theoretical development of Menger’s ideas the 
most important steps were Wieser’s interpretation 
of costs as sacrificed utility (or “opportunity costs,” 
as the concept later came to be called) and the 
theory of the determination of the value of the fac- 
tors of production by “imputation” (Zurechnung). 
(This latter term and the term Grenznutzen— 
“marginal utility’—were introduced by Wieser.) 
In the field of value theory proper Béhm-Bawerk 
contributed mainly by the lucidity of his exposition 
and the great skill of his polemics. His most impor- 
tant original contribution was his theory of capital 
and interest, which was by no means accepted by 
all members of the school. In general it may be said 
that the school never developed a strict orthodoxy 
but that the development of Menger’s ideas by its 
different members shows distinct differences. This 
applies particularly to Böhm-Bawerk and Wieser, 
who in many respects represented very different 
intellectual types and whose endeavors marked the 
starting points of two distinguishable traditions 
within the school. Böhm-Bawerk was a particularly 
consistent logical reasoner, a masterly debater, and 
at the same time, a man of great practical expe- 
rience (he was three times Austrian minister of 
finance), He was able to ground his work on a 
thorough command of the whole of economic lit- 
erature, and he was always ready to take issue with 
other views. He therefore kept closer in many ways 
to Menger’s foundation than Wieser, who after the 
initial stimulus received from Menger, very much 
pursued his own ways. Wieser’s work in conse- 
quence has a highly personal character, and though 


in some respects (e.g., in the analysis of the role 
of different degrees of monopoly) he anticipated 
later development, in others, such as his insistence 
on the calculability and interpersonal compa- 
rability of utility, he developed in directions that 
had later to be abandoned. His Social Economics 
(1914), which constitutes the only complete sys- 
tematization of the economic theory of the earlier 
Menger school, therefore cannot be regarded as 
representative but nonetheless constitutes a dis- 
tinctly personal achievement. [See the biographies 
of B6HM-BAWERK; WIESER.] 

Controversy with the historical school. Carl 
Menger was a very effective teacher, and through 
his numerous pupils and his participation in the 
contemporary discussion of economic and financial 
policy he exercised considerable influence on Aus- 
trian public life. If after his first work, his literary 
contributions to pure theory, except in the field of 
money, are few, this is to be ascribed mainly to his 
involvement in the Methodenstreit, his dispute 
about method with the leader of the younger 
German historical school, Gustav Schmoller, That 
Menger’s book had remained practically unnoticed 
in Germany was chiefly owing to the fact that the 
dominance of the historical school had almost 
eliminated the teaching of economic theory from 
German universities. In these circumstances it was 
natural that it should seem urgent to Menger to 
vindicate the importance of theoretical research. 
This he undertook in his Problems of Economics 
and Sociology (1883), which in several respects 
became as important for the development of the 
Austrian school as his earlier Grundsätze, although 
in detail his methodological views were not fully 
accepted even within his own school. But the sys- 
tematic justification of what Schumpeter (1908, 
p. 94) was later to call “methodological individ- 
ualism” and the analysis of the evolution of social 
institutions (in which he revived ideas originally 
conceived by Bernard Mandeville and David Hume) 
had a profound influence on all members of the 
school and, later, far beyond the limits of econom- 
ics. Compared with this, the points immediately at 
issue with Schmoller are of lesser significance, al- 
though at the time they led to a passionate ex- 
change of opinions to which several disciples on 
each side contributed. This exchange caused such 
a rift between the two groups that the German uni- 
versities came to be filled more and more exclu- 
sively by members of the historical school and the 
Austrian universities by members of Mengers 
school. [See the biography of SCHMOLLER.] 

The third and fourth generations. In 1903 
Menger retired prematurely from teaching and in 
consequence exercised little direct influence on the 
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formation of the third generation of the Austrian 
school, which grew up during the decade preceding 
World War 1. These years, during which Böhm- 
Bawerk, Wieser, and Philippovich were teaching at 
Vienna, were the period of the school’s greatest 
fame. It was particularly Béhm-Bawerk’s seminar 
that provided the center of theoretical discussion 
and produced the leading members of the third 
generation. Among them, particular mention must 
be made of Ludwig von Mises, who continued the 
tradition of Böhm-Bawerk, and Hans Mayer, who 
continued that of Wieser. Joseph Schumpeter, al- 
though much indebted to Böhm-Bawerk, absorbed 
so many other influences (particularly that of the 
Lausanne school) that he cannot be wholly re- 
garded as a member of this group. The same might 
be said of Alfred Amonn, who stood close to the 
English classical tradition, Somewhat younger but 
still wholly active at Vienna were Richard von 
Strigl, Ewald Schams, and Leo Illy (whose name 
was originally Leo Schénfeld). 

In the 1920s a fourth generation appeared, to 
which Gottfried Haberler, Fritz Machlup, Alexan- 
der Mahr, Oskar Morgenstern, Paul N. Rosenstein- 
Rodan, and the author of this article belong. Most 
of these men, however, did the greater part of their 
work outside Austria. [See the biography of Von 
MISES, LUDWIG.] 

FRIEDRICH A. VON HAYEK 
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vit 
THE INSTITUTIONAL SCHOOL 


Institutional economics is very largely an Amer- 
ican intellectual product stemming from the work 
of Thorstein Veblen and other economists working 
in the Veblenian tradition. The term “institutional” 
was applied to this type of economics early in the 
current century because it examines the economic 
system as a part of human culture, which is a com- 
plex of many institutions. The concept of an “insti- 
tutional school” can be used only very loosely—in 
the sense that the members of this school have the 
same philosophical orientation, the same broad cul- 
tural approach to economic studies, and the same 
way of viewing the American economic system. 

The development of institutionalism can be di- 
vided into three major periods, The first period 
from 1890 to 1925 was the period when Veblen 
was laying the foundations of the institutionalist 
movement. The second period runs from 1925 to 
1945; during these years a group of institution- 
alists, prominent among whom were John R. Com- 
mons, Wesley C. Mitchell, John M. Clark, and 
Rexford G. Tugwell, continued the work of institu- 
tional economics after Veblen had come to the end 
of his career. The third period includes the years 
since 1945 when work along similar lines has been 
continued by a new generation of institutionalists, 
conspicuous among whom are Clarence E. Ayres, 
John K. Galbraith, Leon H. Keyserling, Gardiner C. 
Means, and Gerhard Colm. 

Veblen’s cultural economics. An understanding 
of Thorstein Veblen’s economics, which he de- 
scribed as “evolutionary” or “cultural” economics, 
must start with the fact that Veblen came to eco- 
nomics from the field of philosophy. Veblen started 
his economic analysis by asking the fundamental 
philosophical question: How does an economist or 
any other scientist comprehend the real world about 
him? Veblen’s answer was that a scientist's com- 
prehension of reality depends upon his intellectual 
or philosophical orientation, or mental set, which 
is a product of his life experience. This intellectual 
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orientation of the scientist may induce him to re- 
gard the outside world as a static mechanism or as 
an evolving process. According to Veblen the intel- 
lectual orientation of Alfred Marshall and other 
orthodox economists led them to view the economic 
system as a static mechanism, whereas his own 
orientation led him to regard the economic system 
as a dynamic process. 

In explaining the American economic system as 
a developing process, Veblen turned to the field of 
cultural anthropology as a model for economic sci- 
ence. In his hands economics became a study of the 
material aspect of human culture. Like human cul- 
ture, the economic system is an evolving historical 
product that passes through an endless number of 
developmental stages. There is no one universal 
economic system, but rather there are many differ- 
ent ones, such as the capitalist, socialist, fascist, 
and communist systems. Although he carried eco- 
nomic inquiry back to the early Stone Age and 
projected it into a socialist state of the future, 
Veblen’s main interest was always in explaining 
the nature and functioning of American capitalism 
in the period from 1875 to 1925. All institutional- 
ists since Veblen have continued to make the study 
of the evolving American capitalist system their 
main concern. 

The heuristic device Veblen used in constructing 
his theory of American capitalism is the dichotomy 
between “industry” (technology) and “business” 
(finance). This dichotomy is both psychological 
and cultural, since it finds expression in the conflict 
between the individual's drives (or instincts) for 
workmanship and acquisitiveness, and between so- 
ciety’s industrial and business systems, The indus- 
trial system, Veblen held, is serviceable to mankind 
because it produces “economic” values, whereas the 
business system is disserviceable, since it can cre- 
ate only “pecuniary” values. Economic values are 
“real” values, goods of high social and private utility 
that contribute to race survival; pecuniary values 
are pseudo, or false, values that enlarge a person’s 
financial assets but make no contribution to race 
survival. According to Veblen the evolution of Amer- 
ican capitalism was leading to a widening of the 
conflict between industry and business; the time 
would eventually be reached when this conflict 
would be so intense that the common people, led 
by technocrats, would attempt to overthrow the 
capitalist system in order to establish a socialist 
regime based on workmanship. However, there was 
no guarantee that the working population would 
be successful in establishing socialism. Veblen 
thought it quite possible that the vested interests, 
in cooperation with the military class, would suc- 
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ceed in establishing a fascist type of society. If the 
technician-led working class were to succeed in re- 
placing capitalism with socialism, the socialist 
society of the future would eliminate the dichotomy 
between industry and business and enable the in- 
dustrial system to produce an abundant supply of 
economic values. Veblen did not think in terms 
of any early establishment of a regime of work- 
manship. 

Although Veblen was very critical of the ortho- 
dox economics of his time, he never intended to 
dispense with all accumulated economic theory. 
Much of the economic theorizing since the time of 
Adam Smith concerning supply, demand, costs, 
and related matters is as useful to an institution- 
alist as to any other economist. What Veblen 
objected to was the normalistic philosophical under- 
pinning, the rationalistic psychology, the particu- 
laristic methodology, and the mechanistic view of 
the economic system associated with orthodox or 
Marshallian economics, Veblen did not dispense 
with the pure theory of competitive economic be- 
havior. He said that it was not very useful in ex- 
plaining twentieth-century capitalistic behavior, 
and that economists should go far beyond the pure 
theory of competition to develop a theory of cap- 
italism. For Veblen pure economic theory could 
only be a first approximation to economic reality. 
Above the level of pure economic theory it was 
necessary to construct a theory of the concrete 
evolving economic system (for the United States a 
theory of capitalism) before economic reality could 
be adequately comprehended. 

Veblen’s views about the nature of economic sci- 
ence are substantially those of all later members 
of the institutional school. Therefore, critics who 
assert that this school would do away with all in- 
herited economic theory, that institutional eco- 
nomics is concerned only with the description of 
economic activity, or that it is not concerned with 
improving economic theory are misinformed about 
the nature of institutional economics as developed 
by Veblen and those who followed him. 

Institutional economics after Veblen. The post- 
Veblenian group of institutionalists that came into 
prominence in the 1920s and 1930s differed from 
Veblen in a number of important ways. Unlike 
Veblen they did not attempt to theorize about the 
whole sweep of economic development from the 
Stone Age to the contemporary phase of capitalist 
evolution. These later institutionalists denied that 
capitalism would come to a catastrophic end. Com- 
mons, Mitchell, Clark, and Tugwell in the 1930s 
were optimistic pluralists who did not accept Veb- 
len’s dualistic version of the class struggle. They 
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did not agree with Veblen that capitalism must 
inevitably disappear and be replaced by either 
fascism or socialism, These men were antisocial- 
istic solidarists who believed that class conflict 
could be replaced by class cooperation in a remod- 
eled type of capitalism. 

What the institutionalists of the 1930s took from 
Veblen was his basic view of the economic system 
as an evolving process, his division of the economic 
process into the technological (industrial) and the 
institutional (business) processes, and his view 
that conflict and not harmony is the fundamental 
characteristic of contemporary economic life. They 
also agreed with Veblen that the force that keeps 
the economic process in motion is technological 
change, which not only makes existing institutional 
arrangements obsolete but also opens the door to 
new institutional arrangements more in harmony 
with technological requirements. The post-Veblen- 
ian institutionalists accepted Veblen’s view that 
it was possible, but not inevitable, that mankind 
would harmonize the industrial and business sys- 
tems so as to produce an abundance of real, or 
economic, values, 

John R. Common’s primary interest was in the 
new industrial order which developed after 1875 
and which replaced the equilibrium of the earlier 
competitive economy with a conflict-laden disequi- 
librium. In this new stage of capitalist develop- 
ment, collective action gave rise to major protec- 
tionist groups organized by businessmen, workers, 
and farmers to advance their sectional interests. 
In the period from 1875 to 1929 some of the con- 
flicts between these interest groups were eliminated 
by the courts operating under the rule of judicial 
sovereignty. But after 1929, when “welfare cap- 
italism” emerged and the government took a more 
positive role in economic affairs, numerous gov- 
ernmental administrative agencies took over the 
courts’ work of domesticating the new industrial 
system and curbing economic conflict. Commons 
looked forward to seeing a form of “reasonable 
capitalism” in which conflict would be replaced by 
cooperation and reasonable values and practices 
would be established through administrative agen- 
cies working together with private economic groups. 

In analyzing the era of finance capitalism that 
came after 1875, Wesley C. Mitchell made use of 
Veblen’s dichotomy between industry and business. 
Mitchell’s “money economy” is the financial super- 
structure that rests on the underlying real-product 
or industrial economy. The industrial economy 
deals with industrial technology and real output, 
whereas the money economy involves considera- 
tions of costs, prices, financial investment, and 


profits. In Mitchell’s “finance capitalism,” or “high 
capitalism,” of the period from 1875 to 1929, eco- 
nomic guidance was provided not by consumers but 
by large investors. This guidance led to much co- 
ordination within individual business enterprises 
but very little between them. A characteristic fea- 
ture of Mitchell’s high capitalism was the recurring, 
self-generating business cycle that periodically led 
to booms in the money economy which were fol- 
lowed by depression in both the money and real- 
product economies. By the early 1930s Mitchell had 
come to the conclusion that technological change 
had so fundamentally altered the structure and 
functioning of the American capitalist system that 
the only way to eliminate the business cycle and 
secure a satisfactory coordination of the money 
and real-product economies was to introduce lim- 
ited democratic national economic planning. This 
planning would enlarge general welfare by stabiliz- 
ing the economy, providing full employment, and 
turning out more of Veblen’s economic values, or 
of what Mitchell described as goods that were 
“urgently needed.” 

John M. Clark looked at the American economy 
from a point of view that emphasizes the distinc- 
tion between “social” and “commercial” efficiency. 
He contended that the market mechanism no longer 
functioned satisfactorily when the economy was 
dominated by large-scale business enterprises. 
Prices were no longer closely related to costs, sur- 
plus profits were not eliminated, and many rigidi- 
ties were introduced into the economic system. The 
business concept of cost was restricted so as to 
exclude many important social costs such as un- 
employment, communal health hazards, unused 
productive capacities, and other costs associated 
with business fluctuations, Clark drew a distinction 
between social and market values and between 
social and market costs. Social values are those 
that the private market system ignores, such as 
clean air, scenic beauty, public health, and com- 
munity welfare. Social costs such as unused pro- 
ductive capacity, unemployment, destruction of 
worker morale, and resource spoliation are also 
ignored by the private market mechanism. Clark’s 
concerns were with bringing commercial efficiency 
closer to social efficiency and with making the eco- 
nomic system provide adequate supplies of both 
social and market values, He proposed to attack 
these problems in the 1930s through what he de- 
scribed as “social liberal” planning. This national 
planning would set forth the broad economic ob- 
jectives of the nation in a social budget and would 
endeavor to achieve them by applying indirect 
economic controls only at certain strategic points. 
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In the 1930s Clark had hoped that this “social con- 
trol of business” would be sufficient to enable the 
American economy to achieve a high level of social 
efficiency. 

It was Rexford G. Tugwell’s view that by the end 
of the 1920s the American industrial system had 
become mature but that the business system was 
still immature. Technological progress since 1875 
had its own logic, which pointed toward large in- 
vestments, mass production, low unit costs of pro- 
duction, low selling prices, and mass consumption. 
This logic of the industrial system, however, was 
unable to work itself out because of the obstacles 
placed in its path by the ideology of businessmen. 
Under the guidance of businessmen, large corpo- 
rate enterprises have denied the logic inherent in 
technological progress by restricting output, keep- 
ing prices high enough to secure excessive profits, 
distorting income distribution, and by failing to 
assume responsibility for the social as well as the 
private costs of production, 

The new equilibrium toward which Tugwell be- 
lieved that the American economic system was 
moving in the 1930s could be hastened by estab- 
lishing a fourth branch of government, namely, the 
directive branch. It would be the responsibility of 
this branch to improve the general welfare by dem- 
ocratically planning the nation’s economic activities 
with the voluntary cooperation of private business 
enterprise. Tugwell suggested in the 1930s that 
large-scale business enterprises should be federally 
incorporated and that all the enterprises in each 
industry should be represented by a tripartite body 
representing business, labor, and consumers. The 
government with the aid of a central planning 
board would develop an investment policy that 
would keep private investment in line with the 
needs of planned economic growth. Price guidelines 
would be developed so that more of the advantages 
derived from technological progress would be passed 
on to consumers in the form of lower prices. Agri- 
cultural planning would be coordinated with indus- 
trial planning so that surplus farm population 
would find employment in the expanding industrial 
core of the economy. In this manner the logic of 
business enterprise would be harmonized with the 
logic of technological progress in a coming era of 
cultural equilibrium. 

Institutionalism since World War 1m. Institu- 
tional economics since the end of World War n 
has been largely associated with the work of such 
economists as Clarence E. Ayres, Gardiner C. 
Means, Gerhard Colm, John K. Galbraith, and 
Leon H. Keyserling. These economists have worked 
along the lines laid down by Mitchell, Clark, and 
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Tugwell in the 1930s. A main difference, however, 
is that the institutionalists of the 1930s were deal- 
ing with welfare capitalism, whereas the institu- 
tionalists of the late 1940s and 1950s have been 
concerned with the problems of the next phase in 
the development of American capitalism, which 
may be called “guided capitalism.” 

The postwar institutionalists agree that the tech- 
nological changes of the past few decades have so 
altered the structure and functioning of the Amer- 
ican economic system that it does not, of its own 
accord, provide high levels of production and em- 
ployment, an efficient use of resources, and an 
equitable distribution of income. Ayres, Colm, and 
Keyserling accept Means’s view of “collective cap- 
italism.” According to this view large collective or- 
ganizations, such as corporations and trade unions, 
have so much economic power that they can pre- 
vent the use of economic resources that would 
result in full employment and sustained economic 
growth at a high level. Administered prices and 
wages have introduced seriously disruptive rigidi- 
ties into the economic system, which make it 
difficult for the economy to achieve its full em- 
ployment growth potential. Gerhard Colm contends 
that the American capitalist system is not “coherent” 
enough to prevent a considerable disequilibrium 
from existing. The cultural forces that make for 
coherence are too weak to prevent the growth of 
conflicts between special interests and the general 
welfare, Like the institutionalists of the 1930s, 
present-day institutionalists have come to the con- 
clusion that controlling the nation’s major eco- 
nomic power groups, eliminating the economy's 
numerous rigidities, and curbing its tendencies 
toward a less than full use of capital and labor 
require a considerable remodeling of the American 
economic system, but within the traditional social 
and political framework. 

In proposing to solve the nation’s major economic 
problems with the aid of limited democratic na- 
tional planning, present-day institutionalists have 
a large advantage over the earlier institutionalist 
proponents of national planning. In the 1930s 
national income analysis was in the earliest stages 
of its development; input-output analysis and 
econometrics were even less developed. Mitchell, 
Clark, and Tugwell could not quantify their plan- 
ning proposals through projections of gross national 
product, since it was not then possible to make 
such projections. In other words, the planning pro- 
posals of the institutionalists of the 1930s were in 
no way operational and so could have little influ- 
ence on economic decision making by the govern- 
ment. Since 1945 Colm, Keyserling, Means, and 
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other advocates of limited democratic national 
planning have been able to turn to the national 
economic budgeting developed during World War 11 
and now widely used in western Europe as a device 
for quantifying their national planning proposals 
and making them operational. Present-day institu- 
tionalists recommend the use of the national eco- 
nomic budget—what Keyserling describes as “the 
American economic performance budget”—to ex- 
plore the pattern of future economic growth and to 
indicate the kind of balance among productive fac- 
tors that would be necessary to reach sustained and 
adequate growth. 

The institutionalists’ “facilitative” or “indicative” 
national planning would be applied only at strate- 
gic points where the private market economy needs 
to be supplemented, if it is to achieve the desired 
growth goal. No such moderate planning could be 
successful without the active cooperation of busi- 
ness, agricultural, and labor groups, These major 
economic interest groups would become participants 
in the program of facilitative planning through a 
national economic council, 

Institutionalism and the value problem, A mat- 
ter of major interest to institutionalists is the objec- 
tives of their limited democratic national planning. 
From the institutionalist viewpoint, originating 
with Veblen, what ultimately provides direction for 
the economy is not the price system but the value 
system of the culture in which the economy is 
embedded. Economic guidance can only be as good 
as this cultural value system, John K. Galbraith 
calls attention to this value problem in stating his 
view that our affluent society suffers from an im- 
balance in which we produce too many private 
goods and too few public goods. Clarence E. Ayres 
tackles the same problem in his defense, based on 
his “technological theory of value,” of the current 
“industrial way of life.” 

In Ayres’s analysis all values originate in the life 
process; the basic criterion of value is whether or 
not a good contributes to an improvement of the 
life process, that is to say, to race survival. As 
science and technology progress and the industrial 
system becomes more complex, more real or “tech- 
nological” values are created, and the life process 
is improved, Running through all values, economic 
and otherwise, is a unity of value in the sense that 
what is good or bad is in all circumstances deter- 
mined by reference to the same criterion—science 
and technology. All such values as freedom, equal- 
ity, security, abundance, and excellence are inter- 

related and are associated with the scientific and 
technological basis of the life process. For example, 
science and technology have made possible an eco- 
nomic abundance, which in turn has enlarged the 


freedom of the consumer. The technological proc- 
ess gives rise to real, or “genuine,” values that 
enhance the life process, but the institutional or 
“ceremonial” process is the source of pseudo values 
that weaken the life process. The latter values 
reflect inherited superstitions and attitudes toward 
status and privilege that have no foundation in 
science and technology and are detrimental to the 
welfare of the human race. Ayres regards the shift 
toward a more industrialized society as a favorable 
development, since in such a society science and 
technology undermine myth and superstition and 
substitute real for “ceremonial” values. He views 
industrialization as a movement toward a more 
“reasonable” or “rational” society; what is needed 
to expedite this movement, in Ayres’s opinion, is 
more “pragmatic” national economic planning such 
as is proposed by Means, Colm, and Keyserling. 

The influence of the institutional school, The 
number of economists in the United States who can 
be readily identified as institutionalists has never 
been very large. Much of the early influence of the 
institutional school was due to the fact that a 
few of its members, especially Veblen, Commons, 
Mitchell, and Clark, became very well known, Al- 
though the work of Veblen and other institution- 
alists in the years from 1890 to 1939 did not alter 
the general course of academic economics in the 
United States, it undoubtedly contributed to making 
economists in general somewhat more sophisticated 
about the limitations of their approach. Since Veb- 
len’s time economists have been more aware of the 
significance of the preconceptions underlying their 
work and of the limitations of their assumptions 
about human nature and the nature of the eco- 
nomic system. In recent years the institutionalist 
approach has been particularly useful in the study 
of developing countries, whose problems are most 
successfully handled when analyzed from the view- 
points of a number of social sciences. Institutional 
economics has also interested scholars concerned 
with an interdisciplinary approach to the social 
sciences. 

The influence of the institutional school has been 
limited by the fact that its broad approach to eco- 
nomics has not been accepted by the majority of 
economists. Furthermore, the breadth of their ap- 
proach leads the institutionalists to be interested 
in data that are of little concern to many other 
economists. These data relate to matters not readily 
amenable to mathematical or quantitative treat- 
ment, such as the impact of technological change, 
the relations between the economic system and its 
surrounding culture, and the development of a 
theory of capitalism. 

Another factor restricting the influence of the 


institutional school has been the nature of its eco- 
nomic policy recommendations, which are far less 
widely accepted than are Keynesian policy pro- 
posals. Since, in the United States, democratic na- 
tional planning even of the moderate “indicative” 
type is neither politically acceptable nor of interest 
to the great majority of economists, the impact of 
institutionalist policy recommendations has been 
quite limited, Nevertheless, the institutionalist eco- 
nomic policy recommendations are sympathetically 
received in trade union quarters and by liberal ele- 
ments in the Democratic party, Furthermore, eco- 
nomic trends in western Europe, where limited 
democratic national planning has been widely 
adopted in capitalist countries, suggest the possi- 
bility that the influence of the institutional school 
may be larger in coming decades than it has been 
recently. Should national economic planning cross 
the Atlantic, the institutional school could very well 
gain much more influence on the development of 
both economic theory and economic policy than it 
has at present. 

ALLAN G. GRUCHY 


[See also PLANNING, ECONOMIC; and the biographies 
of CLARK, JOHN MAURICE; Commons; MITCHELL; 
VEBLEN.] 
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ECONOMIC UNIONS 
See under INTERNATIONAL INTEGRATION. 


ECONOMIC WARFARE 


Economic warfare is state interference in inter- 
national economic relations for the purpose of im- 
proving the relative economic, military, or political 
position of a country. While the concept has a pre- 
cise analytical content, it tends to have emotional 
overtones, especially under some of its many syno- 
nyms: economic aggression, penetration, infiltra- 
tion, exploitation, assault, drive, offensive, im- 
perialism, attack, aggression campaign, invasion, 
incursion, and many others. 

The concept can best be understood against a 
background of what economists consider “normal” 
international economic relations. The world as 
visualized by the classical economists was one in 
which the individuals and enterprises of many na- 
tions strive to maximize utility and profits, using 
lowest-cost resources, wherever found, As a result 
each nation tends to specialize in the production of 
items in which its economic units have a compara- 
tive advantage, and private interests seeking profits 
buy in the cheapest markets and sell in the dearest. 
This search for the gains from trade is conducted 
within a framework of convertible currencies, with 
automatic mechanisms determining the principal 
economic variables—prices of goods and factors, 
and incomes—and carrying the burden of adjust- 
ment in the relatively free flow of goods and serv- 
ices across national borders, The role of the govern- 
ment is minimal and is primarily to assist the 
optimizing behavior of individual economic units. 

This model is an abstract construction only. His- 
torically, however, there have been approximations 
to it, particularly in the nineteenth century, Some 
permanent departures from the model have re- 
ceived widespread recognition and are not regarded 
as economic warfare. State interference with trade 
in order to foster high-cost but potentially com- 
petitive industries has long been accepted. Dis- 
crimination through various forms of exchange 
controls, multiple rates, tariffs, quotas, and bi- 
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lateral balancing of trade has become ordinary 
policy with, in some cases, an economic justifica- 
tion. In most of these instances the government has 
acted primarily in support of domestic industry to 
achieve purely domestic economic and political 
goals, 

State interference on behalf of the state itself is 
what distinguishes economic warfare from other 
types of international economic policies. It is the 
conscious attempt to enhance the relative eco- 
nomic, military, and political position of a country 
through its foreign economic relations. The action 
must be purposeful; otherwise a nation merely pur- 
suing the benefits of trade for its citizens would be 
considered engaging in economic warfare if the 
action should in fact improve its relative position. 
It is the position of the country in the hierarchy 
of power, its position in relation to other countries, 
rather than any absolute accretion to its power, that 
is relevant. This can be accomplished by a country 
which gains more than other countries gain, gains 
while others lose, or loses while other countries 
lose more. 

Economic warfare serves the purposes of the 
state, be they economic, political, or military, or 
some combination of the three. All of these pur- 
poses may be served simultaneously by the same 
action, or in some cases a course of action may 
serve one of them at the expense of another or the 
other two, The precise balance among purposes is 
determined by the total goals of the state. The 
State, in effect, has a utility function, and economic 
warfare is an effort to raise the value of one or 
more economic or noneconomic variables. 


Military aspects 

The military aspect of economic warfare has two 

basic dimensions. One involves the use of economic 
warfare as an adjunct to military operations. The 
other involves its use to strengthen the peacetime 
military establishment and prepare for war. In 
both, however, the aim is substantially the same. 
In peace, in preparation for war, or in war a nation 
wishes to acquire the maximum net resources 
available for military use and to deny the enemy 
or potential enemy any resources which may con- 
tribute to his war-making capability. 

In peacetime or in preparing for military opera- 
tions, the use of force may well be inappropriate, 
since it may jeopardize future plans. Economic 
warfare, however, may accomplish some of the 
same purposes as force. Resources may be acquired 
by improving the terms of trade of the country with 
respect to the rest of the world or with respect to a 
single nation or group of nations. A country may, 
through trade, acquire such influence over another 


country that the latter will be willing to form a 
military alliance rather than forgo the trade, or at 
least be willing to be neutral in a potential conflict. 
Furthermore, the victim may be induced to join in 
acts of economic warfare against still other coun- 
tries. Economic relations may assume such im- 
portance and be subject to such manipulation that 
the trading partner will lose some or all of its 
sovereignty. Many techniques can and have been 
bent to these ends, such as the use of trade to dis- 
rupt markets, changing sources of supply or re- 
fusing to supply goods in an unpredictable manner, 
and manipulation of foreign exchange and gold 
holdings. So long as these actions are undertaken 
to acquire resources or to deprive a potential enemy 
of resources, economic warfare is being used in 
direct support of military operations. 

In wartime, economic warfare is a natural and 
important adjunct to military actions. Again, the 
principal purpose is to prevent resources from fall- 
ing into the hands of the enemy and to acquire as 
many resources as possible, These actions operate 
largely through nonbelligerents and will be used 
even though a military blockade may be neither 
feasible nor desirable. In some cases, action may 
be directed at weakening the enemy's economy, 
such as the counterfeiting of enemy currency 
during World War n. 

Embargo. A special case of economic warfare 
is the full or partial embargo in anticipation of 
possible war, such as that maintained by the 
United States, with some western European sup- 
port, against the Soviet Union, eastern Europe, and 
Communist China during the 1950s and 1960s. A 
country may fear the possibility of war with an- 
other country and decide that no trade should take 
place which might enhance the war-making po- 
tential of the prospective enemy, If and when war 
does come, the enemy will not be as well off as he 
would have been if there had been no embargo. 
This type of economic warfare has frequently been 
employed preceding military operations. 

Embargoes generally confer short-run benefits on 
the perpetrator, but if war does not come, the long- 
run benefits may accrue to the embargoed nation. 
Because of modern technology most economies 
have such flexibility that there are few goods for 
which a substitute cannot be found, even if at a 
somewhat higher cost. When confronted with an 
embargo, a nation begins to develop substitutes, 
perhaps using resources formerly exported to bear 
most of the cost of the development. If the embargo 
period is long, the country will have made the 
necessary substitution, so that the complete em- 
bargo of wartime does not impose any further bur- 
den. This kind of economic warfare can backfire 


if the potential enemy is forced prematurely, at a 
time when it can perhaps better afford it, to pre- 
pare itself for war by developing self-sufficiency. 
In this case, the initiator has at most imposed some 
added costs on the enemy, but at the expense of 
facing an enemy not only more adequately pre- 
pared but also now insulated against this particular 
kind of economic warfare. Of course, a small coun- 
try which has no alternate sources of supply or 
alternate markets can be reduced to impotence by 
an embargo by a large economic power. 

Since World War n the United States has been 
the principal proponent of the embargo and the 
communist countries the principal victims. The 
United States first initiated a strategic embargo 
against the Soviet Union and eastern Europe when 
Czechoslovakia became a communist country in 
1949, By “strategic” is meant that not all exports 
to those countries were eliminated, but only those 
considered important from the point of view of 
their war-making capability. During the Korean 
hostilities the embargo was strengthened and a 
complete embargo was applied against China. The 
United States tried to persuade most of its allies to 
initiate similar embargoes, but usually somewhat 
weaker ones were imposed. 

There were many difficulties in applying the 
strategic concept, which has no precise economic 
meaning, since all economic goods are in reality 
substitutes for one another. After the Korean hos- 
tilities ended, most strategic controls were gradu- 
ally relaxed almost to the vanishing point, except 
in the case of Communist China. When Cuba en- 
tered the Soviet sphere of influence, the United 
States applied an embargo, nominally strategic but 
in fact punitive. It was almost a complete embargo, 
excluding some foodstuffs and medicines. It is un- 
likely that the strategic embargoes have signifi- 
cantly benefited the United States. Although they 
may have created temporary bottlenecks and plan- 
ning problems in the Soviet Union, they tend to 
make the communist world more self-sufficient and 
have increased intra-Soviet area trade. In the case 
of Cuba it has helped to drive that country into the 
Soviet economic sphere. 


Economic aspects 

_ Although it directly supports the military estab- 
lishment, economic warfare, even when serving 
military purposes, is directed at the economy. If 
a country is at war or proposes to g0 to war, eco- 
nomic warfare serves a military goal. If a country 
is not at war and does not necessarily intend to 
engage in military hostilities, then economic war- 
fare may be serving an economic function. All of 
the measures usable in wartime are also usable in 
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peacetime. Some of them, however, are not gen- 
erally employed because of their intimate associa- 
tion with active hostilities (e.g., preclusive buying). 

For economic purposes, the actions constituting 
economic warfare generally fall into five cate- 
gories: (1) guaranteeing sources of supply, (2) 
guaranteeing markets, (3) improving the terms of 
trade, (4) denial, and (5) economic take-over, The 
first three of these do not necessarily imply eco- 
nomic warfare; they may be normal commercial 
transactions undertaken in pursuit of private 
profits. They may, however, also be undertaken for 
the specific purpose of increasing the economic 
power of a country more rapidly than that of other 
countries. A wide variety of techniques are avail- 
able for attaining these ends, ranging from those 
which are also usable in wartime to those appar- 
ently less predatory, such as gradually becoming a 
large trading partner and then threatening to with- 
draw trade, the building up of debts in other coun- 
tries, and the extension of credit. 


Political aspects 

All of the above acts of economic warfare may 
also be used to further the third important purpose 
of economic warfare—the pursuit of political ad- 
vantage. This purpose, always important, has 
gradually assumed greater and greater significance 
as nations have become increasingly cautious about 
the use of force to achieve aims which transcend 
the economic benefits that may be conferred by 
economic warfare, The search for political power 
may take three forms. It may be quite general, A 
country may simply desire respectability and status. 
Many countries on the way up, in process of po- 
litical and economic development or in the consoli- 
dation of revolutionary gains, may want, initially 
at least, only to be recognized and noted as a mem- 
ber of the family of nations. Other countries, on 
the way back from a disastrous national experi- 
ence, such as a lost war, may desire readmittance 
and a return to their former place. 

The political purpose may also be manifested in 
a highly specific fashion. A country may wish to 
conclude an alliance or to obtain the vote of an- 
other in some international organization. Or it may 
want another country to eliminate a particular po- 
litical or military leader, to undertake some change 
in domestic or foreign policy, or the like. The 
change may be either internal or external, or both. 
In most cases, however, it concerns foreign policy, 
since it is in this field that the power position of 
the initiating country is most directly affected. 

The ability to employ economic warfare for these 
political goals is an indication of the relative eco- 


nomic power of the country. It implies either an 
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economy significantly larger than that of the in- 
tended victim or an economy which is planned and 
can be directed by the government to the attain- 
ment of specific goals, Such is the case, for ex- 
ample, with the Soviet economy, Successful eco- 
nomic warfare implies growing economic strength 
of the protagonist relative to the intended victim, 
and even when such actions do not add directly to 
the economic capabilities of the initiator, they 
usually worsen the economic position of the victim. 

The ultimate degree of political, as well as eco- 
nomic, influence is the take-over of another coun- 
try. Just as economic warfare may be directed at 
reducing the flexibility of another country’s econ- 
omy to such an extent that the country can no 
longer make decisions with respect to its own re- 
sources, so economic warfare may also be used to 
pave the way for political amalgamation, to deprive 
a country of its national sovereignty. For some 
countries, e.g., Germany in the 1930s, the take- 
over has been the ultimate goal of economic war- 
fare, and the other objects have been but stepping- 
stones to this end. This is true, however, only in 
the specific historic context in which a major 
power feels itself endowed with the truth and with 
the obligation to propagate it throughout the world. 
Throughout most of history, nations have been 
willing to set their sights much lower and have 
used economic warfare to attain much more limited 
objectives. 


Strategies in economic warfare 


Economic warfare is ordinarily a two-sided 
affair. When an intended victim of economic war- 
fare discovers his position, he can be expected to 
retaliate, It is, of course, possible that he may go 
beyond economic warfare and may even initiate 
hostilities. Usually, however, the victim assumes 
one of three basic postures: (1) He may be pas- 
sive and accept whatever solution the initiator of 
economic warfare proposes, This is a fairly rare 
phenomenon. (2) The victim fights back, attempts 
to defend himself, and only grudgingly gives in. 
The victim may choose to fight on the grounds 
already chosen—to break the embargo, to pay off 
accumulated debts by borrowing elsewhere or by 
selling accumulated balances at a discount, or to 
attempt to extricate himself from whatever other 
economic snares he is involved in, He may also 
fight back by using a different policy to inflict harm 
on his tormentor, so that the pressure will be re- 
laxed. Thus, if country A holds large balances of 
the currency of country B and is using them to 
elevate its own position, country B may employ 
quotas or other discriminatory policies to raise the 


siege. (3) Both countries may engage in economic 
warfare against one another but with different pur- 
poses, and both may be successful. The country 
with a narrow political purpose may achieve it 
through economic warfare against the country 
which is achieving its specific economic purposes 
in the same conflict. The political gains of the 
former may more than offset its economic losses. 
The economic gains of the latter may more than 
offset its political losses. Thus, in economic war- 
fare each side can consider itself the winner and, 
by the same reasoning, each side can lose. 

In the conduct of economic warfare, there are 
only two basic approaches. They are the carrot and 
the stick, cajolery and coercion, persuading the 
intended victim or forcing him through economic 
means into the desired course of action, Each has 
its applications. They may be used separately, but 
not infrequently they are used together. 

Coercion. The stick, or coercion, has been used 
most frequently. After a country achieves a posi- 
tion of economic influence in another, it may 
threaten to stop making purchases, to cut off sup- 
plies, or to refuse to pay obligations unless some 
concession—economic, political, or military—is 
made. In two most common situations the victim 
is maneuvered into a position in which a sub- 
stantial proportion of his trade is with the initiator, 
or the latter manages to become deeply indebted 
to the victim. Whether these devices will work de- 
pends heavily on the economic position of the vic- 
tim and the market for his products. If that market 
is relatively brisk, the victim may chance switching 
his sales to other markets rather than make a con- 
cession. A victim in a favorable internal economic 
and trade position may be able to withstand non- 
payment of a debt for some time if it is not too 
large, expecting that it will eventually be repaid 
regardless of whether he gives in. On the other 
hand, a slow market for its product, depressed 
world economic conditions, a very large debt, or a 
large percentage of trade in the hands of the other 
country may leave a victim with no alternative but 
to pay premium prices for his supplies, to sell at a 
lower price to the other country, or to let his po- 
litical and military policies be influenced by his 
trading partner. Usually this form of economic 
warfare damages the general prestige and standing 
of the country employing it, a cost which the initi- 
ator should calculate in advance. 

Persuasion. The carrot, or persuasion, is a 
prestige builder. It consists of the use of foreign 
economic relations to favor another country and 
thereby win its good will and support. Favoritism 
is a delicate instrument and does not carry with it 


the sometimes unpleasant connotations of co- 
ercion; it may, however, be just as effective. The 
usual method is to give more favorable terms of 
trade—either lower export prices or higher import 
prices—than would be provided in the world mar- 
ket, or to loan or give resources to the other coun- 
try. On the basis of these economic favors, a coun- 
try may expect, in return, reciprocal favors in the 
form of political support, an alliance, or perhaps 
neutrality. The recipient, however, may not be 
swayed by the economic benefits, in which case the 
initiating country can only return to the status quo 
ante, leaving the recipient with some net economic 
gains, 

It is possible and often likely that the two modes 
of economic warfare can be used in succession. 
For example, country A may pursue a policy of 
granting favors to country B, rewarding B for di- 
verting an ever-increasing proportion of trade to A 
from its previous channels, making tied loans or 
grants which use capital equipment from country 
A only, and giving other forms of economic succor. 
Country B becomes used to these favors and comes 
to regard them as fundamental to its welfare. So 
important do these economic considerations be- 
come that the recipient is deluded into thinking 
that the concessions made to its benefactor are 
really in its own interest. 

The method of persuasion, of course, works best 
when there is at least an ostensible similarity of 
outlook and interest between the two countries. In 
case the delusion breaks down, the initiator of eco- 
nomic warfare is in a position to employ the co- 
ercive method. While the recipient is being lulled 
by favors, the initiating country is gradually cor- 
nering a larger proportion of its trade, perhaps 
even approaching monopsony of some of the re- 
cipient’s exports or monopoly of some of his im- 
ports. From this vulnerable position it is but a short 
step, to being subject to massive coercion by the 
former benefactor. In its stupor, the recipient may 
not recognize what is happening until the initiator 
has maneuvered himself into a powerful and po- 
tentially damaging position. Then the latter, by 
withdrawing or threatening to withdraw, can often 
influence the policy of the victim. 

It must be recognized that the use of persuasion 
may render the initiator vulnerable, particularly if 
the external economic relations of the two contend- 
ing countries are of about the same size. If the 
trade is large for the recipient, it is also large for 
the initiator, and the recipient under some cir- 
cumstances may subject the initiator to pressure 
through the same relations that the initiator is em- 
ploying. If one country is substantially more power- 
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ful economically than the other, it can afford to 
pass up the debt repayment, can find other markets 
and new supplies. But the more comparable the 
countries are in economic power, or the more 
special economic considerations render them prox- 
imate, the greater is the likelihood that this form 
of economic warfare may backfire and hurt the 
country that initiated it. 


Economic warfare and state trading 


All the techniques of economic warfare, many 
of which have already been mentioned, may be 
classified as acts of buying or selling, borrowing or 
lending. The use of these techniques is consistent 
with either private or state trading, If private trad- 
ing is the predominant form, however, then eco- 
nomic warfare can be waged only insofar as the 
government intervenes and acts, either positively 
or negatively, to influence the decisions of private 
traders, 

A government-imposed quota or increased tariff 
in a country where private interests do the foreign 
buying may be an act of economic warfare if its 
intent is to improve the country's relative power 
position. A loan, sale, or purchase of some product, 
bilateral balancing of trade, and many other meas- 
ures may also constitute economic warfare, The 
ingredient common to all these techniques is gov- 
ernment action, Private actions may in some cases 
have the same result as economic warfare, al- 
though they are not necessarily so intended. Only 
government action is intended to be in the national 
interest and only government action, directly or 
through private traders, can constitute economic 
warfare. 

Perhaps the most singular characteristic of state 
trading is that the nation which practices it is 
necessarily and automatically engaged in economic 
warfare, Comprehensive state trading stands alone 
in its power as an instrument that utilizes eco- 
nomic means to improve a country’s relative eco- 
nomic, political, and military position, The concen- 
tration of all external transactions in the hands of 
the government implies that trade no longer is 
solely a means for increasing the income of the 
country. Economic decisions are automatically 
transmuted into economic-political—military de- 
cisions in which costs are balanced against benefits 
in all aspects of national life. 

ROBERT LORING ALLEN 


[See also FOREIGN AID; INTERNATIONAL TRADE CON- 
TROLS; MILITARY POWER POTENTIAL; SANCTIONS, 
INTERNATIONAL. Other relevant material may be 
found in FOREIGN POLICY; War.) 
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ECONOMICS 


References to specific topics in economics will 
be found throughout this article. For articles giving 
an overview of some of the major fields of eco- 
nomics, see ECONOMETRICS; ECONOMIC GROWTH; 
ECONOMIC THOUGHT; History, article on ECO- 
NOMIC HISTORY; INCOME AND EMPLOYMENT THE- 
ORY; INTERNATIONAL TRADE; and WELFARE ECO- 
NOMICS. For guides to major fields not summarized 
in single articles, see the entries under AGRICUL- 
TURE; INDUSTRIAL ORGANIZATION; MONEY; PUBLIC 
FINANCE; and LABOR ECONOMICS. 


Economics, according to a widely accepted defi- 
nition, is the study of the allocation of scarce re- 
sources among unlimited and competing uses. It is 
the social science that deals with the ways in which 
men and societies seek to satisfy their material 
needs and desires, since the means at their dis- 
posal do not permit them to do so completely. 

Much of the work of the discipline can be fitted 
into this framework, and no other comes so close 
to accommodating all of it. However, the frame- 
work has a serious shortcoming: it does not en- 
compass the problem of depression, or the inade- 
quacy of aggregate demand. Most modern market 
economies experience periods in which large quan- 
tities of resources are idle—particularly labor and 
plant capacity—so that the principal question is 
not in what way to use them, but how to put them 
to any use at all. The distinction between the prob- 
lems of allocating resources among uses and of 
achieving their full use corresponds very roughly 
to the distinction between the two main branches 
of economics: microeconomics and macroeco- 
nomics, The latter, however, includes some aspects 

of money and the general level of prices that also 
have important implications for resource allocation. 

Decisions about resource allocation are made in 
their simplest form within the household, in the 


familiar process of “making ends meet,” or budget- 
ing; the original meaning of economics was the 
practice of economy at the household level. What 
is now called economics was at first called po- 
litical economy, to draw attention to its broader 
theater of action. 

The concept of scarcity is crucial to an under- 
standing of resource allocation. Almost all re- 
sources are scarce under most circumstances, in 
the sense that if they were available without cost, 
more would be used than could be supplied. Even 
air, the classic example of a limitless resource, can 
become scarce. Pure air is now the exception in 
crowded urban areas and air pollution can be pre- 
vented only at substantial costs. Water is free in 
many places but scarce and costly in arid or densely 
populated areas [see WATER RESOURCES]. Goods, 
such as sand or dirt, that may be free in their 
original location have costs when they are trans- 
ported to the places at which they are needed. 

The counterpart of pervasive scarcity is the un- 
limited extent of material wants, The overwhelm- 
ing majority of the world’s population would always 
like a little more (often a lot more) income and 
consumption than it has, almost regardless of the 
current level. The exceptions to this generalization 
are of two kinds—ascetics (a rare breed in mod- 
ern societies) and the minute fraction of the 
world’s population so wealthy that it need never, 
so to speak, consult the price column of the res- 
taurant menu. Even among the very wealthy, with 
no desire to increase their consumption, there are 
some who seek to increase their incomes as a game 
or as a means of augmenting their power. 

Economics as a social science does not examine 
what people ought to want, as distinguished from 
what they do want. The first question lies largely 
in the realm of ethics, aesthetics, or religion. Never- 
theless, much writing by professional economists 
makes assumptions, explicit or implicit, about the 
proper goals of economic activity, Such writing can 
be considered economic philosophy rather than 
economic science narrowly defined. The former is 
often called normative economics and the latter 
positive economics. It is a positive statement to say 
that, other things being equal, a fall in the price of 
milk will increase its consumption—and the va- 
didity of such a statement can be tested against 
evidence. It is a normative statement to say that 
therefore the price of milk should be lowered. The 
second statement is not subject to disproof by evi- 
dence and implies that the speaker has personal,or 
social values according to which the consumption 
of milk is “better” or more important than com- 
peting alternatives. If such values are widely 


shared, they may form an appropriate basis for 
social policy, but the validity of the values them- 
selves nevertheless remains beyond the reach of 
scientific confirmation or testing. 

The goals of economic activity are often sub- 
sumed under the heading “utility,” a construct em- 
bracing all of the satisfactions to be obtained 
through consumption, production, and related be- 
havior [see Utitrry]. It is generally assumed that 
the objective of economic activities is to maximize 
utility, subject to the limitations of resources, and 
that utility will be increased by the increased con- 
sumption of goods and services. In principle, utility 
can also be increased by such intangible factors as 
beautiful scenery, pleasant working conditions, 
amiable companions, or political power. However, 
such desiderata are seldom explicitly introduced 
into economic analysis, for a variety of reasons— 
some are not subject to measurement, some can- 
not be produced through economic activity—and 
when they are specified in detail almost all are very 
differently evaluated by different people. Moreover, 
some lie in the domains of other social science dis- 
ciplines whose methods are better suited to analyze 
them, The assumption that the principal goal of 
economic activity is to produce goods and services, 
while clearly an oversimplification, is nevertheless 
useful in a wide variety of problems and can be 
modified in special cases, as seems appropriate. 

In a dictatorship the utility to be maximized is 
that of the dictator; in a slave society it is that of 
the slave owners rather than the slaves. In free so- 
cieties it is usually assumed that each person seeks 
to maximize his own utility and in general will be 
the best judge of how to do so. 

An improved method of raising beef increases 
utility for beef-eating Americans or Argentines, 
although not for vegetarians; however, the explana- 
tion of such cultural, religious, or taste differences 
in assessing the value of additional output is not 
the economist’s task. The national output includes 
bread and circuses, cathedrals and billboards, vita- 
mins and poisons, The economist can study the 
forces affecting the market price or cost of each 
and under some circumstances can also say that a 
market price does not correctly reflect the under- 
lying values of consumers and producers. Like any 
other citizen, he has personal opinions based on 
values that transcend market values, but if he 
cloaks these with the authority of his discipline, he 
arrogates wisdom. 

The economic value of a particular good will not 
ordinarily depend on the use to which it is put— 
the price of a stick of dynamite is the same whether 
it is used to mine coal or to blow up a bridge. 
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Almost all societies would judge the value of the 
outcome to be positive in the first case and nega- 
tive in the second, but the standards by which this 
judgment is made are not provided by economists 
in their scientific capacities. 


The methodology of economics 


The purpose of science is to achieve understand- 
ing either for its own sake—the satisfaction of 
“idle” curiosity—or as a basis for prediction and 
control. Although both elements are present in 
economics, the latter is particularly evident. [See 
PREDICTION AND FORECASTING, ECONOMIC.] The 
predictions sought generally refer to the aggregate 
behavior of sizable groups of people and not to the 
behavior of individuals. In most economic inquiries 
it is sufficient to reach such conclusions as that in 
a given market the consumption of milk increases 
1 per cent in response to a 2 per cent reduction in 
its price; the economist ordinarily does not care 
whether this occurs hecause each family increases 
its consumption 1 per cent or because half the 
families increase their consumption 2 per cent and 
the other half not at all. (This indifference will dis- 
appear if the division of families into two groups is 
nonrandom; for example, it might be valuable to 
know that low-income families increase their con- 
sumption more than high-income families.) An 
economist almost never attempts to predict the 
consumption behavior of a particular individual in 
response to a price change. In some economic 
studies, data are used in which the observations 
refer to individuals; such studies as yet typically 
fail to explain a major part of the variance in in- 
dividual behavior. 

The focus on aggregates makes it possible to 
summarize the accumulation of economic knowl- 
edge in statements of observed regularities, or 
“laws,” that do not apply to individuals considered 
singly. The preceding example about milk is a 
special case of the “law of demand”—the quantities 
of a commodity or service purchased move in- 
versely with its price, other things being equal. 
This law is not contradicted by the discovery that 
some consumers stop buying a commodity whose 
price falls, because it no longer confers social dis- 
tinction on the user (the so-called snob effect), 80 
long as such behavior does not dominate the mar- 
ket for any commodity. 

Partly because of the focus of economics on 
large-group behavior, economists can seldom use 
the method of laboratory experiment. (The few ex- 
ceptions lie chiefly on the borderlines between eco- 
nomics and psychology; for example, subjects are 
given money to spend, gamble, or use in similar 
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economic games.) The use of laboratory experi- 
ment is not, of course, a requisite of a science. 
Among natural sciences much of astronomy and 
meteorology are similarly disadvantaged. In eco- 
nomics the place of the laboratory experiment is 
largely taken by other methods of testing hypothe- 
ses, in which variables extraneous to the hypothe- 
sis being tested can, within limits, be held under 
statistical, rather than physical, control. 

The use of statistical control of extraneous vari- 
ables corresponds to the common theoretical device 
of “holding other things constant” in analyzing the 
functional relationship between two variables. For 
example, the statement that the consumption of 
milk increases in response to a fall in price can be 
represented as a demand curve, a downward slop- 
ing line on a graph whose vertical axis measures 
the price of milk and whose horizontal axis meas- 
ures the rate of consumption, [See DEMAND AND 
SUPPLY.] This relationship assumes that other 
factors affecting milk consumption, such as con- 
sumer income and tastes, do not change. A change 
in one of these factors will cause the demand 
curve to shift. A more complicated relationship, 
which expresses the rate of consumption as a func- 
tion of both price and income, is often more useful 
and can be estimated statistically, Such a three- 
dimensional demand surface is hard to show 
graphically and is therefore less common in ele- 
mentary expositions. 

A more complete analysis of the market for milk 
would include the conditions of supply as well as 
of demand. The relationship between the price of 
milk and the quantity supplied (generally positive, 
or upward sloping) is known as the supply sched- 
ule or the supply curve. Together the supply and 
demand curves determine the quantity sold and 
the price. For given supply and demand conditions, 
the position so determined is a stable equilibrium 
position, and quantity and price will tend to return 
to it if the market is subjected to a small accidental 
disturbance. [See Economic EQUILIBRIUM.] How- 
ever, if the disturbance changes the underlying 
conditions of demand or supply, the market will 
move to a different equilibrium position. For ex- 
ample, if the taste for milk changed so that people 
wanted it more than before, the result would prob- 
ably be a permanently higher consumption of milk, 
perhaps at a higher price. 

The method that confines analysis to compari- 
sons of the initial and final equilibrium positions 
of a market or system of markets is known as the 
method of comparative statics because it is not con- 

cerned with the path of the adjustment through 
time—that is, with such questions as whether milk 


consumption expands first quickly and then slowly 
in response to some initial impetus or expands at a 
steady rate. Dynamic analysis is concerned with 
the time path of movement. [See STATICS AND 
DYNAMICS IN ECONOMICS.] For example, in a study 
of hyperinflation the time pattern of response to a 
previous increase in the price level is a crucial 
attribute of a satisfactory theory. Although “static” 
is sometimes used as a term of reproach and 
“dynamic” as one of praise, the choice between dy- 
namic and static analysis must rest on the nature 
of the problem, with no general presumption that 
one is superior. Dynamic analysis requires addi- 
tional knowledge about human behavior—knowl- 
edge that is not always available, for it involves 
knowing not only how people respond to economic 
stimuli, but at what rate, 

As the foregoing observations suggest, economics 
makes extensive use of quantitative data and has 
a large body of abstract theory that can be extended 
and explored for consistency through the use of 
formal logic, For this reason economists began to 
use mathematics and statistics earlier and more 
intensively than most other social scientists. [The 
use of such methods in economics is discussed in 
ECONOMETRICS. | 


The functions of an economic system 


We now turn to an overview of the substance of 
economics and in particular to the central problem 
of resource allocation. This will be approached 
through a classification of the functions of an 
economy devised by Knight (1933). Of his five 
functions, we shall consider four, designated here 
as: (a) determining the composition of output; 
(b) organizing production; (c) distributing the 
product (or income); and (d) providing for the 
future. (The fifth concerns the allocation of a fixed 
stock of goods over short periods of time.) Later 
we shall turn to the discussion of money, the price 
level, and the level of employment, which, as noted 
earlier, lie largely outside the resource-allocation 
framework. 

Determining the composition of output. Mod- 
ern societies determine what to produce in two 
basic ways—through political or governmental de- 
cisions and through the use of markets. These 
methods can be combined in varying proportions. 
Even in a country as noted for the use of markets 
as the United States, many important classes of 
goods and services—such as schools, roads, police 
protection, and national defense—are provided 
wholly or largely outside the market system in the 
sense that consumers pay for them through taxes 
rather than by the direct purchase of the services. 


The nonmarket sector is much larger in such 
socialist countries as the Soviet Union, and their 
reliance on the market to guide production de- 
cisions is usually weaker where markets are used, 
However, even the Soviet Union has a few legal 
private markets, particularly for perishable farm 
produce. The activities in which government ser- 
vices are provided without explicit charge are not 
coextensive with the government sector of an econ- 
omy—for example, government post offices sell 
mail services to users by means of postage stamps 
(though not always at prices that cover costs), 
much as private businesses sell their services. 

In the market sector, consumer preferences are 
transmitted to producers through decisions to pur- 
chase the goods and services offered for sale. ( Pro- 
ducers may also use survey techniques or experi- 
mental markets to test consumer acceptance of 
new products [see MARKET RESEARCH]. ) When con- 
sumers want more of a product that is sold on a 
free market, its price will tend to rise and its pro- 
duction will become more profitable, Existing pro- 
ducers will increase their output and new producers 
may enter the market, Conversely, if consumer de- 
sires for a product weaken, prices and output will 
tend to fall, [See DEMAND AND SUPPLY ] Although 
prices are in general the best signals to producers 
of changes in the pattern of demand, consumer 
preferences can also be transmitted through mar- 
kets in which prices are fixed. In such markets, 
decreases in demand are indicated by unsold inven- 
tories of goods or by idle capacity to perform serv- 
ices; increases in demand are indicated by the 
disappearance of inventories and by queues, Most 
markets for consumer goods in socialist economies 
operate in this fashion, as do those monopolistic 
markets in capitalist economies in which prices are 
administered by sellers. 

In socialist economies, decisions about the quan- 
tities of resources to devote to consumer goods are 
made centrally through government planning. 
Once these decisions are made, the planners have 
an incentive to heed market signals of surpluses 
and shortages and to allocate resources to par- 
ticular products or to change product prices so as 
to achieve maximum consumer satisfaction with 
the allotted resources. However, the mechanisms 
for doing this in planned economies are not yet 
very successful. [See COMMUNISM, ECONOMIC OR- 
GANIZATION OF; and PLANNING, ECONOMIC, article 
on EASTERN EUROPE.] 

An economy in which all markets were fully 
responsive to changes in consumer tastes would 
exemplify consumer sovereignty [see CONSUMER 
SOVEREIGNTY]. Complete consumer sovereignty 
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under free enterprise would require the absence of 
monopolies, so that consumers could be provided 
with that amount of each commodity whose costs 
of production they were just willing to pay. (A 
monopoly would be able to restrict output below 
this level to obtain a price in excess of costs.) In 
equilibrium such a system would have the prop- 
erty, given the distribution of income, that the 
composition of output could not be changed with- 
out reducing the satisfaction of one or more con- 
sumers. Informative advertising would be per- 
missible, or even desirable, but advertising that 
persuades without informing represents a departure 
from consumer sovereignty [see ADVERTISING, ar- 
ticle on ECONOMIC ASPECTS], In principle, consumer 
sovereignty would be possible in a socialist econ- 
omy if planners were willing to abide by decision 
rules that simulated the behavior of competitive 
markets. 

The pattern of production resulting from con- 
sumer sovereignty depends not only on consumer 
tastes but also on the distribution of income and 
wealth, If income and wealth were redistributed 
more equally, there would be less production of 
goods and services consumed largely by the highest 
income groups (Cadillacs, domestic service) and 
by the lowest (potatoes, bus rides), Theoretically, 
there is less equality in decision making through 
the market than through democratic political proc- 
esses, for in political elections each person has one 
vote, while in the market his “voting power” is pro- 
portional to his expenditures. The actual contrast 
is less sharp, for there are concentrations of politi- 
cal power as well, and not every citizen has equal 
influence on the outcome of political decisions. 

An important advantage of a market system is 
that it characteristically provides for the tastes of 
minorities. It is often worthwhile for producers to 
make a product wanted by only a few people, pro- 
vided that these few are willing to pay a little more 
than the price of a standard commodity, Consider, 
for example, the large number of different vari- 
eties, sizes, and shapes of bread offered for sale in 
a big city with a diverse population, By contrast, 
the provision of goods or services by governments 
has tended toward standardization and uniformity. 
Some critics of market economies view this variety 
as wasteful and argue that the efficiencies of stand- 
ardization outweigh the importance of minor dif- 
ferences in tastes. 

The market system also offers strong incentives 
to producers to develop new or improved products, 
in the hope that these will gain consumer accept- 
ance and prove profitable. Innovation in consumer 
goods is a more conspicuous feature of market 
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economies than of planned economies, although 
neither system can guarantee that innovations 
which would succeed will always be forthcoming. 

Organizing production. Production is the com- 
bination of resources (factors of production) to 
produce desired output. Early in the history of 
economics the concept of production was associated 
with making physical products. Agriculture, min- 
ing, manufacture, and construction were consid- 
ered productive activities, whereas transportation, 
trade, and such services as education were not. 
This distinction has long been abandoned. It is now 
recognized that transportation and trade add to 
the value of commodities by making them avail- 
able where they are wanted and that furnishing 
services can contribute as much to the satisfaction 
of consumers as furnishing commodities [see IN- 
TERNAL TRADE], 

Specialization and exchange and the division of 
labor are central concepts in the economics of pro- 
duction [see SPECIALIZATION AND EXCHANGE]. In 
undeveloped economies, much production is car- 
ried out by individuals and small groups (families, 
tribes, communities) which consume their own 
output. For example, in the early colonial period 
in North America, families usually grew or caught 
most of their own food, built their own houses, and 
made their own cloth, clothing, furniture, soap, 
and candles. In modern economies, most workers 
are specialized in the production of a particular 
commodity or service and usually perform a spe- 
cialized task within the production process. Special- 
ization increases output because workers acquire 
specialized skills and save time that would other- 
wise be spent in switching from one activity to 
another. Specialization usually enlarges the scale 
of production, and large-scale production justifies 
the use of expensive specialized tools and ma- 
chinery. Moreover, the production of each good can 
be carried on in the most suitable locations. Al- 
though within some limits the division of labor 
undoubtedly increases output, it is sometimes ar- 
gued that it deprives the worker of the satisfaction 
of creating a complete product and of variety in 
job content, thereby also diluting standards of 
craftsmanship and contributing to industrial unrest. 

The possibility of specialization rests on the prin- 
ciple of relative advantage, which states that it is 
advantageous for a productive activity to be carried 
on by an inferior producer, provided that the supe- 
rior producers are even more superior at doing 
something else. To take a familiar example, sup- 
pose that a doctor can mow his lawn in an hour, 
and the best gardener he can hire takes two hours, 
It will nevertheless pay him to hire the gardener 


at $2 an hour if he can earn $5 in an additional 
hour of medical practice. (We assume that the doc- 
tor attaches less than $1 an hour of recreational 
value to lawn mowing.) The principle just illus- 
trated also underlies the specialization of produc- 
tion among nations and is therefore the cornerstone 
of the theory of international trade. [See INTERNA- 
TIONAL TRADE, article on THEORY.] 

Efficient production involves combining resources 
so as to minimize the cost of the output. We are all 
accustomed to partial measures of efficiency that 
relate useful output to the use of a single input— 
for example, miles per gallon of gasoline as a meas- 
ure of the efficiency of an automobile. Partial 
measures of efficiency or productivity, such as out- 
put per man-hour, are also found in economics, but 
economists are usually more interested in the effi- 
ciency with which all resources together are used. 
A measure of this is “total factor productivity’— 
that is, useful output divided by the total input of 
economic resources, weighted by their respective 
prices or costs. [See PRopuctiviry.] This measure 
reflects the condition that it pays to conserve one 
factor by increasing the use of a second only if the 
added costs of the second factor are less than the 
value of the savings of the first. For example, an 
automobile that runs more miles per gallon is eco- 
nomically inefficient if the added cost of the better 
engine exceeds the cost of the gasoline saved over 
its lifetime, both valued at the same point in time. 

Economists have long thought of resources, or 
inputs, in three traditional categories: land, labor, 
and capital. Land consists of the natural properties 
of soil, minerals, and climate; labor, of the current 
services (mental and physical) of humans; and 
capital, of such things as structures, machines, 
vehicles, livestock, inventories, and improvements 
to land such as clearing and terracing [see CAPITAL; 
LAND; NATIONAL WEALTH; AGRICULTURE, article on 
CAPITAL]. There is, however, nothing sacred about 
this particular trinity. The traditional factors are 
often very difficult to separate, as in the case of land 
and improvements to it, The factors can be more 
finely subdivided (for example, management can be 
distinguished from labor ), or the scheme can be ex- 
tended to recognize additional inputs, such as 
entrepreneurship [see ENTREPRENEURSHIP]. Hu- 
mans can also be regarded as embodying invest- 
ment made through education and training [see 
CaPITAL, HUMAN]. 

The relation between the combination of inputs 
in a production process and the resulting output is 
like a recipe in which labor, capital, and other 
classes of inputs replace such ingredients as butter, 
eggs, and milk. Such recipes are called production 


functions. [See PRODUCTION and ECONOMIES OF 
SCALE.] Cookbook recipes usually call for constant 
proportions of the various ingredients; in economic 
production, however, the proportions of inputs are 
generally varied in response to changes in factor 
prices so as to substitute less expensive for more 
expensive inputs. Although the possibility of sub- 
stitution among factors may be limited by tech- 
nology, it is often much greater than might at first 
be supposed. Steel mills in the United States, where 
labor is expensive, and in India, where labor is 
cheap, may use very similar basic production proc- 
esses, yet Indian mills will find it economical to use 
much more labor and less capital in such auxiliary 
operations as materials handling. 

For some purposes, the cost of an input is ade- 
quately represented by its market price, For others, 
one must turn to the more fundamental concept of 
opportunity cost, or alternative cost, which states 
that the cost of employing a unit of a factor of pro- 
duction in any activity is the output lost by the 
failure to employ that unit in its best alternative 
use, For example, the cost to the economy of chang- 
ing the use of a tract of land on the outskirts of a 
city from farming to residences is represented by 
its value as farmland, which in turn depends on its 
contribution to farm output. However, if there were 
several residential or industrial development plans 
in each of which the value of the land was higher 
than in farming, the cost of using it in any one 
plan would be its highest value in any of the others. 
[See Cost.] 

Difficult problems arise when some of the out- 
puts or inputs in a production process have no 
market value or when some of the costs of produc- 
tive activity are not borne by the enterprise engaged 
in it. The evaluation of a proposal to convert farm- 
land near a city into a free public recreation area 
is essentially a question of judging whether the 
benefits exceed the costs, Production costs imposed 
on parties not engaged in the production process 
are exemplified by air and water pollution from in- 
dustrial wastes, especially where the losses are so 
widely diffused that individual lawsuits to recover 
damages are impractical. [See EXTERNAL ECONO- 
MIES AND DISECONOMIES.] The presence of benefits 
or costs with no market valuation often leads to 
government regulation or intervention (such as the 
zoning of land or the prohibition of child labor) in 
the production processes of market economies. 

In the classical period of economics, until about 
1870, it was held that the cost of production deter- 
mined the price or value of a commodity [see VALUE, 
LABOR THEORY OF]. The beginning of the neoclassi- 
cal period is marked by the recognition that value 
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is jointly determined by cost and by demand. The 
costs of producing a commodity often rise as total 
output is expanded and as resources less suited to 
its production are drawn into use. In such cases, an 
increase in demand requires the expansion of pro- 
duction into areas of higher costs and therefore 
causes an increase in price. However, changes in 
price can also arise on the supply side through 
changes in the underlying cost conditions, includ- 
ing changes in the supply of productive resources 
or in the technology of production, and through 
changes in the extent of monopoly power. 

Once resources have been committed to a par- 
ticular kind of production, they often cannot be 
easily withdrawn. For example, only a small frac- 
tion of the original construction cost of a railroad 
line could be retrieved as scrap value or from sale 
of the right of way. The remaining original con- 
struction costs, often called fixed costs or sunk 
costs, become irrelevant to day-to-day decisions 
about pricing and the nature of service. Tt will pay 
for the line to continue in operation if a pattern of 
prices and service is possible in which revenues 
cover current operating costs and interest on sal- 
vage value. Financial losses to the owners of spe- 
cialized fixed capital are not social losses, and are 
not really costs, since such capital has no alter- 
native use. As specialized capital equipment wears 
out, the costs of replacing it become relevant to 
decisions concerning the abandonment or continu- 
ation of the activity in which it is used. Analogous 
problems arise in the case of labor with specialized 
training. 

The period within which some costs are fixed is 
called the short run; in the long run, all costs can 
be changed by changes in inputs. The length of 
the short run in chronological time varies with the 
nature of the production process and the durability 
of the capital goods used, It may be a hundred 
years for a hydroelectric project and a few weeks 
for an establishment making inexpensive cotton 
dresses. n 

Decisions about the organization of production 
also involve determining the lowest-cost location of 
productive facilities. Some kinds of facilities are 
located where raw materials are accessible (iron 
and steel mills, for example ). Some activities take 
place close to sources of low-cost power ( aluminum 
reduction and production of electrochemicals), 
others close to important markets for the product 
(automobile assembly), and still others where suit- 
able labor is available or inexpensive E 

ts), [See SPATIAL ECONOMICS: } 
< conse income. The third basic function 
of an economy is to distribute income. In a society 
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based on subsistence agriculture most income is 
distributed in kind; each family consumes largely 
the products of its own labor and land. In complex 
economies most income is distributed in money, 
representing generalized purchasing power or com- 
mand over all goods and services offered for sale. 
The dominant use of money-income payments re- 
sults from both specialization in production and 
the increasing variety of things consumed. The 
producer uses money income to purchase small 
quantities of many products, not necessarily in- 
cluding the one that he himself makes. 

Income is the net value of production during a 
period—the amount available for consumption or 
net saving after initial wealth or capital has been 
maintained. In other words, income is the maxi- 
mum amount that an economy (or an individual) 
could consume during a period without becoming 
poorer. If actual consumption is less than this, 
there has been saving during the period, and wealth 
has increased, 

The functional classification of income depends 
on the kind of productive services furnished by the 
income recipient [see INCOME DISTRIBUTION, article 
On FUNCTIONAL SHARE]. Wages and salaries are the 
return for furnishing labor services, interest is the 
return for furnishing capital, and rent, in one sense 
of the term, is the return for furnishing land [see 
WAGES; INTEREST; RENT]. (These categories are 
less neat in income statistics than in theory; a land- 
lord who receives rent for a building, for example, 
is furnishing both land and capital.) The income 
share called profit is hard to identify with a par- 
ticular factor of production. Much of what is called 
profit in both business and national accounts is a 
return to equity capital and corresponds to interest 
in economic theory. Some accounting profit repre- 
sents a return to positions of monopoly power, 
sometimes called rent in another sense of the term. 
The residual, or “pure,” profit is regarded by some 
economists as the return to entrepreneurship or to 
innovation and by others as a return (either posi- 
tive or negative) for taking noninsurable risks. 
[See Prorit.] 

Certain streams of payments to individuals, 
called transfer payments, do not correspond to any 
current contribution to production, but are claims 
of one set of individuals against the income of an- 
other. Public assistance is one example. The sum 
of wages and salaries, rent, interest, and profits 
exhausts the national income, and transfer pay- 
ments arise from a secondary redistribution of 
these shares. 

The basic principle of income distribution in a 

private enterprise economy is that individuals 


should receive as income the value of their con- 
tribution to production, including both the value of 
their labor and of the services of any productive 
land or capital that they own, This system gives 
each person an incentive to use his resources fully 
and where their contribution to the value of output 
is greatest. In practice, differences between the con- 
tribution of a factor to production and the income 
it receives can be caused by some kinds of imper- 
fections in the markets for productive services. The 
income distribution is also modified by transfer 
payments, which are received largely by low-income 
groups; by the provision of free government serv- 
ices; and by taxes, particularly progressive personal 
income taxes. For most capitalist countries, these 
modifications seem to work on balance in the direc- 
tion of greater equality of incomes. 

The distribution of incomes in capitalist societies 
has long been criticized by socialists and reformers 
as too unequal. The best known competing prin- 
ciple is that of Marx: “From each according to his 
abilities, to each according to his needs.” No large, 
complex economy has ever operated on this prin- 
ciple for more than brief periods, although it may 
operate in certain religious orders and cooperative 
communities. Attempts to apply the Marxian prin- 
ciple have broken down because of two problems, 
assessing needs and maintaining incentives to work 
and to save. 

The main difference between the principles of 
income distribution in contemporary capitalist and 
socialist countries lies in the treatment of nonlabor 
income. In both types of economies, workers are 
paid approximately in accordance with the value 
of their work. In the socialist countries, however, 
almost all productive capital is owned by the state, 
so that personal income from property is negligible. 
(In the Soviet Union some private income arises 
from interest on savings deposits and government 
bonds and from the private ownership of livestock 
by collective-farm families.) In capitalist econo- 
mies, property income is much less equally dis- 
tributed than labor income. It does not follow, how- 
ever, that the distribution of income, as a whole, is 
more equal in socialist countries. In developed cap- 
italist countries, labor income is the great bulk of 
total income; a somewhat less equal distribution of 
labor income in socialist than in capitalist coun- 
tries could therefore offset the effect of the property 
share on the equality of total incomes and probably 
does offset much of it. However, there appears to 
be no counterpart in socialist economies to the very 
highest incomes under capitalism. 

The distribution of income among persons is not 
only unequal but asymmetrical. Almost all income 


distributions are markedly skewed to the right; a 
concentration of individuals below the mean is bal- 
anced by a long “tail” much above the mean, Since 
natural abilities were long assumed to be symmetri- 
cally distributed and most income was seen as a 
return to various abilities, the skewness of real- 
world income distributions motivated much work 
by economists and statisticians. [See INCOME DIS- 
TRIBUTION, article on SIZE; and SIZE DISTRIBUTIONS 
IN ECONOMICS.] 

The major social problem related to the personal 
distribution of income is poverty, an older and 
more pressing concern of economists and sociolo- 
gists. In economies at a low level of development, 
most peasants and unskilled workers are considered 
poor, In the highly advanced economies, relatively 
few families with employed male heads are con- 
sidered poor; poverty is largely a problem of the 
aged, the disabled, the long-term unemployed, and 
households with female heads. However, there are 
exceptions. For example, in the United States there 
are a substantial number of poor farm families and 
poor Negro families headed by employed men. [See 
Poverty and CONSUMERS, article on CONSUMPTION 
LEVELS AND STANDARDS.] 

The definition of poverty has varied with time 
and place. In the United States in the mid-1960s, a 
widely accepted definition was that families with 
incomes below $3,000 a year were poor; about 
one-fifth of all families fell below this mark. In 
earlier times or in underdeveloped countries, the 
standard of living embodied in this definition would 
be viewed as comfortable or even luxurious, If the 
accepted poverty line moves upward as fast as 
average income, the elimination of poverty be- 
comes exceedingly difficult. But however elusive 
the definition of poverty, the social problem is real 
and serious. Low income is a major factor contrib- 
uting to crime, disease, and ignorance, which in 
their turn can give rise to new generations of the 
poor. 

Policies to combat poverty fall in three general 
categories: public assistance or relief, rehabilita- 
tion and retraining, and the regulation of product 
and factor prices. Economists have tended to criti- 
cize the simplest forms of direct relief, once on the 
ground that they raised the natural rate of increase 
of poor populations and more recently on the 
ground that they weaken the incentive to work. 
Education, rehabilitation, and retraining have been 
preferred because they attack and sometimes elimi- 
nate a fundamental cause of poverty—the small 
actual or potential contribution of low-income 
workers to production. For the aged or disabled, 
social insurance is increasingly viewed as the best 
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approach to assuring a decent minimum standard 
of living [see AGING, article on ECONOMIC ASPECTS]. 

Greater equality of income is also an announced 
objective of numerous public policies that directly 
alter product or factor prices. For example, agri- 
cultural price and income policies in many coun- 
tries raise the incomes of farm families, which are 
usually lower than those of urban families, Such 
policies may increase the income of the favored 
sector, but often not that of the poorest families 
within it. Such policies also tend to restrict output 
or distort the allocation of resources, so that the 
choice is between a given national income un- 
equally shared and a smaller one somewhat more 
equally shared, Indeed, the result is sometimes to 
lower income without increasing equality at all. 
[See AGRICULTURE, article on PRICE AND INCOME 
POLICIES; SUBSIDIES.] 

The demand for equalization policy is especially 
strong in times of war or disaster, when there is a 
general disposition to prevent the rich from bidding 
up the prices of goods in short supply. To achieve 
“fair shares” or “equality of sacrifice,” the role of 
money in the distribution of income is often sup- 
plemented by the direct rationing of certain scarce 
items on a per capita basis or to users whose needs 
are judged to be greatest. Such policies also have 
substantial efficiency costs, which may reduce the 
income available for distribution. [See PRICES, ar- 
ticle on PRICE CONTROL AND RATIONING.] 

Providing for the future. The fourth major 
function of any economy is to provide for the future 
through the conservation of natural resources, the 
maintenance or replacement of existing stocks of 
physical capital, and the net accumulation of cap- 
ital. Increases in the stock of human skills and in 
the fund of accessible knowledge also are important 
in providing for the future. 

The views of economists on conservation often 
clash with those of others, including natural scien- 
tists [see CONSERVATION, article on ECONOMIC 
ASPECTS]. Two main kinds of conservation prob- 
lems can be identified, the first of which arises 
when a resource, such as wild game or fish, cannot 
be owned. No hunter has an incentive to spare an 
animal today so that he may take it in the future, 
for if he fails to act now, the game may be taken 
by someone else. Conservation under these condi- 
tions requires such government controls as closed 
seasons and bag limits. Interesting economic prob- 
lems arise in defining the optimum size of the pop- 
ulation to be conserved. 

Less severe policy problems arise where re- 
sources can be owned, as can land and solid min- 
erals. (Oil and natural gas are an intermediate 
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case.) If the owners of such a resource believe that 
it is becoming scarce and that few substitutes will 
be available, they have an incentive to conserve it 
in order to gain from the eventual rise in its price, 
The higher the cost of holding this asset, including 
the rate of return on alternative forms of wealth, 
the weaker will be this incentive. Nevertheless, 
overexploitation of resources can occur in such 
cases because of ignorance on the part of resource 
owners or because the interest rate confronting 
them does not correctly reflect the social cost of 
capital. 

A larger part of the problem of providing for the 
future is that of maintaining and increasing physi- 
cal capital. Capital is increased through saving, 
much of which is undertaken by individuals to pro- 
vide for personal contingencies, such as illness and 
old age, and to accumulate wealth to bequeath to 
descendants or others. Saving is also undertaken 
because it yields income (interest), which corre- 
sponds to the productivity of capital in production. 
If the rate of interest is 6 per cent, refraining from 
consuming $100 of income this year will make pos- 
sible consumption of $106 next year; alternatively 
it will make possible the consumption of an income 
stream of $6 in perpetuity, The return on capital 
provides both an incentive to accumulate and an 
incentive to refrain from using up present capital. 

Investment provides capital for a growing popu- 
lation and also increases the amount of capital per 
person available to raise output per head. With un- 

changing technology the return on capital would 
tend to fall as the best investment opportunities 
were used up; however, this tendency is offset by 
technological changes that create new investment 
opportunities, The theoretical problems involved in 
measuring the stock of capital and discovering the 
principles that determine its yield are among the 
most difficult in economics. [See CAPITAL.] 

Determining the optimum rate of investment in 
a growing economy also raises difficult problems, 
which have received increasing attention in mathe- 
matical economic theory [see INTEREST and Eco- 
NOMIC GROWTH, article on MATHEMATICAL THE- 
ory]. The crucial elements in this analysis are the 
rate of growth of population, the rate of return on 
capital, the rate of technological change, and the 
weights to be attached to the satisfaction of present 
and future generations. 

Society requires many kinds of capital. One im- 
portant way of classifying these distinguishes cap- 
ital used in production from that used directly in 
consumption, such as houses, private automobiles, 
and home appliances [see CONSUMERS, article on 


CONSUMER ASSETS]. A second important classifica- 
tion distinguishes the capital used in the produc- 
tion of a particular product, such as livestock, 
machinery, or factories, from capital—such as 
roads, water supply systems, communications facil- 
ities—that enters indirectly into the production of 
many products. The latter, called social overhead 
capital, is often publicly owned, [See CAPITAL, so- 
CIAL OVERHEAD.] Adequate provision for overhead 
facilities may be prerequisite to investment in the 
production of specific products. 

Investment in new physical capital or the acqui- 
sition of existing physical assets by individuals, 
corporations, or governments often requires the in- 
vestor to raise funds externally by borrowing or 
issuing shares. In the public mind the resulting 
financial assets, such as mortgages, bonds, and 
stocks, are more often identified with capital than 
are the underlying physical assets. Savings are 
made available for investment when financial as- 
sets are purchased by individual savers or by finan- 
cial institutions, such as savings banks and insur- 
ance companies, in which savers hold their funds. 
[See FINANCIAL INTERMEDIARIES. ] 

A third important way of providing for the future 
is through technological change, including the dis- 
covery of new products and processes. Technologi- 
cal change in its broadest sense involves a wide 
range of activities: the growth of fundamental sci- 
entific knowledge through basic research, the in- 
vention of machines and methods, the dissemina- 
tion and diffusion of knowledge through education, 
and the introduction of new methods in particular 
applications. An increased stock of knowledge is a 
more important legacy to succeeding generations 
than an enlarged stock of physical capital. New 
technology is usually incorporated, or “embodied,” 
in the stock of physical capital and in the skills of 
the labor force. [See INNOVATION; RESEARCH AND 
DEVELOPMENT; AGRICULTURE, article on PRODUC- 
TIVITY AND TECHNOLOGY.] 


Money, income, and the price level 

So far our discussion of the functions of the 
economy has proceeded with little reference to 
money, yet all but the simplest economies would 
grind to a halt without this essential lubricant. 
Some specialization and exchange can take place 
with the aid of barter, the direct exchange of one 
commodity or service for another. However, barter 
requires that each party wants the exact kind and 
quantity of goods offered by the other, which would 
happen only by coincidence. Money, by freeing ex- 
change from this coincidence of double demand, 


permits specialization to be greatly extended. The 
use of money as a medium of exchange therefore 
appears very early in economic development; 
money also furnishes the unit of account and, with 
other assets, serves as a store of value. [See the 
general article on MONEY.] 

Early money consisted of scarce commodities of 
various kinds, especially precious metals, and the 
minting of metals into coins was one of the earliest 
economic functions of the state. In modern econ- 
omies, banknotes and other paper currency have 
replaced coins, except in small transactions. These 
notes were originally redeemable in gold or silver; 
this has now become uncommon. Paper currencies 
were once thought to derive their value only from 
their convertibility into metallic money; however, 
experience has not supported this belief. It is now 
clear that money derives its value from its useful- 
ness as a medium of exchange and that irredeem- 
able paper currencies can have stable or even rising 
value if their issue is restricted. 

The largest part of money in developed econo- 
mies consists of bank deposits, usually transferable 
by check. Banks, through their lending operations, 
can create and destroy money in the form of bank 
notes and deposits. Recognition of this has led to 
their regulation by central banks (usually agencies 
of the government) as a means of controlling the 
quantity of money and thus affecting its value. 
[See BANKING and BANKING, CENTRAL.] 

The value of money may be defined as the recip- 
rocal of an index number of the general level of 
prices [see INDEX NUMBERS, article on PRACTICAL 
APPLICATIONS]. Changes in the price level are im- 
portant because not all individual prices are equally 
affected and relative prices and resource allocation 
are therefore influenced. In particular, changes in 
the price level affect the distribution of real wealth 
and income; a sustained rise in prices, or inflation, 
enriches net debtors at the expense of net creditors 
because the debt is stated in nominal or monetary 
units rather than in terms of command over real 
goods. A sustained fall in prices, or deflation, has 
the opposite effect. [See INFLATION AND DEFLATION. | 

One of the best developed theories of the value 
of money is the quantity theory [see Money, article 
on QUANTITY THEORY]. The demand for money, or 
cash balances, arises because money lowers the 
cost of making transactions. Recent formulations 
of the quantity theory express this demand as a 
function of real income or wealth, the expected rate 
gf change of prices, the interest rate, and other 
variables. This function is thought to be reasonably 
stable in the long run, in which case the general 
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level of prices is largely determined by changes in 
the quantity of money. Inflation results from sus- 
tained increases in the money supply relative to 
real income, and deflation from sustained decreases. 

Changes in the demand for money under condi- 
tions of constant supply can often be inferred from 
the turnover rate of the money supply, usually 
called the velocity of circulation [see MONEY, ar- 
ticle on VELOCITY OF CIRCULATION], A decreased 
desire to hold money would lead to a rise in veloc- 
ity as holders sought to dispose of excess balances; 
an increased desire to hold money would lower 
velocity. 

A decrease in the stock of money or its failure 
to grow as fast as productive capacity can cause, 
or contribute to, a fall in aggregate demand, out- 
put, and employment. Similarly, increases in aggre- 
gate demand brought about by increases in the 
stock of money or by other means will cause in- 
creases in output and employment when there is 
unused productive capacity, Once capacity is fully 
used, further increases in aggregate demand only 
raise the price level. Since capacity output will not 
be reached at the same moment in all branches of 
the economy, an intermediate zone exists in which 
increases in demand will increase output and 
prices simultaneously. However, behavior in this 
zone need not be symmetrical for increases and 
decreases in demand, A decrease in demand is 
likely at first to reduce output with little effect on 
prices. 

As this suggests, money is a crucial element in 
economic fluctuations—the business cycle or the 
alternation of periods of prosperity and depression 
and of rising or falling prices—which have been 
one of the most conspicuous features of modern 
capitalist economies [see BusINEss CYCLES). Econ- 
omists have long studied the sequence of events in 
business cycles—the amplitudes of fluctuations, 
the length of expansions and contractions, and the 
leads and lags of turning points of series at cyclical 
peaks and troughs. Such observations have helped 
in the formation of theories of the causes of busi- 
ness cycles, Special attention has naturally been 
given to the causes of depression and widespread 
unemployment [see EMPLOYMENT AND UNEMPLOY- 
MENT]. 

Interest in the construction of such macroeco- 
nomic theories was heightened by the great depres- 
sion of the 1930s and in particular by the work of 
John Maynard Keynes. Central to Keynes's thinking 
was the concept of deficiency of aggregate demand. 
If income receivers spent their entire incomes on 
consumption, such a deficiency could not arise; 
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however, a portion of income is saved, and Keynes 
held that this portion rises as income rises. [See 
CONSUMPTION FUNCTION.] Still, no problem arises 
if desired investment equals saving. All savings 
will then be spent in the purchase of new invest- 
ment goods, and the flow of income will continue 
undiminished. But Keynes argued that this equality 
need not hold—unfavorable investment opportuni- 
ties and high interest rates can lead to a reduction 
in desired investment and an excess of saving. [See 
INVESTMENT.] Under these circumstances, aggre- 
gate demand will fall short of productive capacity, 
real income will drop until saving is equal to de- 
sired investment, and there will be persistent un- 
employment. In Keynes’s view, governments should 
then create the required demand through an excess 
of government expenditures over tax receipts. 

Until Keynes, orthodox economists had argued 
that unemployment was caused primarily by ex- 
cessive wage rates and could be alleviated by wage 
cuts. Keynes assumed that in modern capitalist 
economies wages do not fall when output falls. In 
any case, his emphasis on the adequacy of demand 
led him to counsel against wage cuts. Wages, he 
pointed out, are incomes as well as costs, and their 
reduction could lower employment by diminishing 
the purchasing power of wage earners and their 
expenditures on consumption. 

A crucial relationship in the Keynesian system 
is that between the quantity of money and the rate 
of interest, which takes a form that prevents the 
interest rate from falling below some minimum 
level. This “liquidity trap” implies that expansion 
of the money supply during a depression will fail 
to lower interest rates and therefore will not stim- 
ulate investment, [See LIQUIDITY PREFERENCE; for 
a discussion of the full Keynesian model, see IN- 
COME AND EMPLOYMENT THEORY.] 

During the 1930s, concern about aggregate de- 
mand stimulated work on systems of national in- 
come accounting that record the income and 
expenditures of the whole economy by sector [see 
NATIONAL INCOME AND PRODUCT ACCOUNTS]. The 
simultaneous development of macroeconomic the- 
ory and macroeconomic accounting, among other 
forces, has led to a fundamental change in the atti- 
tude of governments toward depression and unem- 
ployment. It is now generally accepted that govern- 
ments can, and should, act to maintain aggregate 
demand at levels as close as possible to productive 
capacity, although there are still important dis- 
agreements about the best ways of doing this under 
particular circumstances. The principal possibilities 
are to increase the quantity of money and hold 
down or reduce interest rates through expansionary 


policies of the central bank, to reduce taxes, and 
to increase government expenditure. [See MONE- 
TARY POLICY and FISCAL PoLicy.] Opposite meas- 
ures can be used to control inflation resulting from 
an excess of aggregate demand over capacity. The 
use of tax reductions or of increases in govern- 
ment expenditures as means of combatting defi- 
cient demand implies that the budgets of central 
governments should be in deficit in times of less 
than full employment. 


Subfields of economics 


The theory of resource allocation (microeco- 
nomic theory) and the theory of money, income, 
and the price level (macroeconomic theory) are 
the core of economics, whose outlines have been 
sketched above, Surrounding this core are several 
specialized fields. The mention of these here must 
be extremely brief, especially for fields described in 
single articles elsewhere in the encyclopedia. 

Mathematical economic theory applies mathe- 
matical tools to theoretical problems in economics. 
Econometrics applies advanced statistical tools to 
empirical problems in economics. [For a survey of 
both fields, see ECONOMETRICS.| Scientists in these 
fields have an international association, the Econo- 
metric Society. 

The history of economic thought is the study of 
the work of past economists and their influence on 
one another, on other disciplines, and on the world 
of affairs. In this encyclopedia several schools of 
economics are discussed as such [see ECONOMIC 
THOUGHT]. However, the article on economic 
thought does not discuss the English and Ameri- 
can classical and neoclassical schools, whose ideas 
are reflected in many substantive articles and 
whose leading figures are discussed in biographies 
[see SMITH, ADAM; Ricardo; MILL in the classical 
period; MARSHALL; EDGEWoRTH; JEVONS; CLARK, 
Joun Bates, in the neoclassical period; KNIGHT; 
ROBERTSON; PIou in more recent years]. Simi- 
larly, the Keynesian school of economics is in- 
cluded not under economic thought, but under 
substantive entries and biographies [see KEYNES, 
JoHN MAYNARD; HANSEN]. 

Economic history applies economic analysis to 
problems that are outside the recent past, although 
such problems often have implications for the pres- 
ent and future [see HISTORY, article on ECONOMIC 
HISTORY]. There has been a strong recent move- 
ment toward the application of econometric meth- 
ods to economic history. Interest in the field has 
been stimulated by concern about the economies 
of underdeveloped nations and by the lessons those 
nations can learn from the economic history of 


developed areas. Economic historians have an ac- 
tive international association and several special- 
ized journals. Closely related to economic history 
is the rapidly expanding field of economic develop- 
ment, which covers both the theory of growth and 
the more practical application of economics to the 
problems of less developed areas [see ECONOMIC 
GROWTH]. 

Most other specialized fields in economics deal 
with problems of particular parts or aspects of the 
economy and the institutions peculiar to them. 

Industrial organization considers problems aris- 
ing from the structure of firms, industries, and 
markets. The firm is the entity that engages in pro- 
duction in the private sector of the economy. In 
simple economies, firms are owned by individuals 
or groups of partners, but large-scale economic 
activity is now carried out principally by corpora- 
tions—firms that can be owned by many share- 
holders, who have no liability for corporate debts 
beyond the possible loss of value of their shares. 
Shareholders typically have little control over the 
affairs of the corporation; effective control is in the 
hands of professional managers. [See CORPORA- 
TION.] 

An industry is the set of firms or establishments 
carrying out some branch of economic activity, 
although the boundaries of such a branch are not 
always easy to define usefully. A market in its 
simplest meaning is a place at which transactions 
are made; more broadly, it is the whole web of 
relationships between buyers and sellers. [See MAR- 
KETS AND INDUSTRIES.] The boundaries of markets, 
like those of industries, are often difficult to define. 

A central concern of the field of industrial organ- 
ization is the changes that take place in the char- 
acter of markets as the number of buyers and 
sellers increases and as collusion among buyers or 
sellers becomes more difficult. At one extreme lies 
monopoly [see Monopoty]. In principle, a monop- 
oly is a market with only one seller, although all 
sellers are affected to some degree by competition 
from other markets. A monopolist in aluminum 
production, for example, would face the competi- 
tion of other metals in some uses of his product. 
Markets dominated by a few sellers (oligopolies) 
raise interesting theoretical problems concerning 
interaction and collusion among the sellers, which 
are often approached through game theory [see 
Oicopoy]. At the other extreme are competitive 
markets with many sellers and buyers who do not 
collude [see COMPETITION]. 

° With given technology, competition promotes 
economic efficiency; monopoly or oligopoly tend to 
produce restriction of output and higher prices. 
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However, it is sometimes argued that monopoly is 
indispensable to technological progress. The advo- 
cates of this controversial view contend that very 
large corporations are necessary for innovation be- 
cause modern research is too expensive to be car- 
ried on by small competitive firms [see RESEARCH 
AND DEVELOPMENT]. 

Other topics considered in the field of industrial 
organization include processes that change the 
structure of industry and the government regula- 
tion of industry structure, output, and prices [see 
ANTITRUST LEGISLATION; MERGERS; REGULATION 
OF INDUSTRY]. 

Agricultural economics is concerned with the 
efficient use of resources in agriculture, the tech- 
nology of food and fiber production, the distribu- 
tion of farm products, and the income and welfare 
of farm populations [see AGRICULTURE]. Agricul- 
tural economists also study the migrations from 
rural areas that result from the high rate of natural 
increase of farm populations and the displacement 
of labor by improved farm technology. The organi- 
zation of agricultural production differs sharply 
between different parts of the world—the family 
farm owned or rented by the operator, the large 
collective farm and state farm of the Soviet Union, 
the tropical plantation employing wage workers. 
The form of organization has important implica- 
tions for efficiency and welfare. [See LAND TENURE; 
COMMUNISM, ECONOMIC ORGANIZATION OF; PLAN- 
TATIONS. ] 

Labor economics is both a branch of economics 
and of the interdisciplinary field of industrial rela- 
tions, Although most of this broader field was once 
occupied by economists, it has increasingly at- 
tracted sociologists, psychologists, political scien- 
tists, and lawyers. Labor economists are therefore 
tending to specialize in the analysis of wages, 
hours of work, and employment [see WAGES; 
LABOR FORCE, article on HOURS OF WORK; EMPLOY- 
MENT AND UNEMPLOYMENT]. They share with 
demographers an interest in labor force participa- 
tion and also study government regulation of work- 
ing conditions and the provision of social insurance 
and other wage supplements [see Acine, article on 
ECONOMIC ASPECTS; LABOR FORCE, article on PAR- 
TICIPATION; UNEMPLOYMENT INSURANCE; Work- 
MEN’S COMPENSATION]. 

A central institution in this field is the labor 
union—an association of workers formed to im- 
prove wages and working conditions through col- 
lective bargaining or political action [see LABOR 
RELATIONS and LABOR UNIONS]. Although labor 
economists concentrate on the impact of unions 
on wages and employment, they are still concerned 
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with the growth and internal functioning of 
unions. 

Public finance applies economic analysis to the 
activities of the state and their impact on the pri- 
vate economy, particularly on the level of income 
and employment [see BUDGETING; DEBT, PUBLIC; 
FiscaL POLICY]. A major part of the field is the 
study of tax systems, including the effect of taxes 
on income distribution: whether they fall propor- 
tionally on all levels of income, are regressive 
(take a larger share of the lower incomes), or are 
progressive (take a larger share of the higher in- 
comes) [see TAXATION]. Other studies of taxes deal 
with the effect of alternative tax systems on incen- 
tives to work, to save, and to use resources effi- 
ciently. Another part of the field deals with gov- 
ernment expenditures—with determining the kinds 
of economic activity most appropriate to the state 
and devising criteria for deciding which particular 
public projects are worth undertaking [see PUBLIC 
EXPENDITURES]. 

International economics extends economic anal- 
ysis to the world economy. A country’s transactions 
with other countries are summarized in its balance 
of payments [see INTERNATIONAL MONETARY ECO- 
NOMICS, article on BALANCE OF PAYMENTS]. Some 
of these transactions are trade in commodities and 
current services, which reflects specialization among 
countries in production [see INTERNATIONAL 
TRADE], Economists study both the actual pattern 
of trade and the theoretical conditions underlying 
it; the theory of international trade is perhaps the 
best developed body of theory in any subfield of 
economics. The effect of tariffs or quotas on the 
world economy and on the nation imposing them 
is a central problem in international trade theory. 
[See INTERNATIONAL TRADE CONTROLS.] 

Other international transactions are financial, 
such as the construction of branch plants in one 
country by companies owned in another and the 
flow of liquid capital through securities purchases 
[for examples, see INTERNATIONAL MONETARY 
ECONOMICS, article on PRIVATE INTERNATIONAL 
CAPITAL MOVEMENTS]. Conditions of international 
trade and capital flows are greatly affected by the 
relations between the values of various national 
currencies, now a frequent subject of study and of 
international discussion [see INTERNATIONAL 
MONETARY ECONOMICS, articles on EXCHANGE 
RATES and INTERNATIONAL MONETARY ORGANIZA- 
TION]. 

The preceding list of the fields of economics is 
not complete, Among the additional fields often set 
apart by professional associations, specialized jour- 
nals, or special courses of instruction are consumer 


economics, comparative economics, welfare eco- 
nomics, and regional economics. Space limitations 
preclude any discussion of these here, though they 
are covered elsewhere in the encyclopedia, [See 
CONSUMERS; CAPITALISM; COMMUNISM, ECONOMIC 
ORGANIZATION OF; ECONOMIC GROWTH; WELFARE 
ECONOMICS; SPATIAL ECONOMICS.] 


Organization and growth of the profession 


Economics has grown rapidly in recent years, as 
measured either by the membership of professional 
associations or by the volume of professional liter- 
ature. The American Economic Association was 
founded in 1885 and the Royal Economic Society 
(then the British Economic Association) in 1890. 
The former had fewer than 200 members in 1886; 
it passed the 2,000 mark in 1912. The largest in- 
creases, however, occurred between 1940 and 
1962, during which time the American Economic 
Association grew from just over 3,000 members to 
more than 11,000. In 1959 there were about 5,000 
fellows of the Royal Economic Society. As of 1964 
there were 37 national economic associations affili- 
ated with the International Economic Association, 
including 20 in Europe, 7 in Asia, and 6 in the 
Americas. 

The growth of professional literature has been 
just as impressive. In 1961—1962 an Index of Eco- 
nomic Journals was published, covering only ar- 
ticles in English. It requires 270 pages to index the 
articles published in the 38 years between 1886 
and 1924, and 533 pages for the 5 years between 
1954 and 1959. The oldest journal covered is the 
Zeitschrift fiir die gesamte Staatswissenschaft 
(Tübingen), founded in 1844. The next, the Quar- 
terly Journal of Economics (Cambridge, Mass.), 
began in 1866. By 1900 there were seven—three 
in the United States, two in Germany, and one each 
in Britain and Sweden. The number of journals 
covered in 1961—1962 was 76, of which 29 were 
published in Europe, 24 in the United States and 
Canada, 19 in Asia, and 4 in Africa. The Index 
omits many more journals that publish no material 
in English, 

Most economists are teachers in colleges and 
universities, although employment in business and 
government is increasing. Within universities most 
economists are in departments of economics, but 
in the United States there are also many on the 
faculties of schools of business, both graduate and 
undergraduate. Economists in business schools 
have worked closely with statisticians and applied 
mathematicians on problems of business decisioh 
theory and operations research. 

Economists employed in business are sometimes 


engaged largely in descriptive statistics and public 
relations and have had little professional training. 
A growing number of business firms, however, em- 
ploy economists with graduate training to do eco- 
nomic analysis. 

Perhaps the fastest growing use of economists, 
almost everywhere, is in government—at all levels 
from municipalities to international organizations, 
but particularly in national governments, Econo- 
mists are mvolved in a large number of different 
government functions. Many are involved in col- 
lecting and analyzing statistics and preparing sta- 
tistical estimates such as national income accounts 
and prices indexes. Economists in central banks 
and treasuries or ministries of finance are engaged 
in formulating and executing monetary and fiscal 
policies, Still others in such agencies as budget 
bureaus or ministries of defense may be making 
cost-benefit analyses of public works projects or 
weapons systems. One of the most important roles 
of government economists is in central planning 
organizations, ranging from the very large and 
powerful central planning agencies of the commu- 
nist countries, through the highly influential plan- 
ning agencies of France and the Netherlands, to 
the largely advisory bodies like the Council of Eco- 
nomic Advisers in the United States. 

An example may suggest the impact of changes 
in economic thought on government policy in the 
past three decades. In 1932, at the depths of the 
great depression, when the unemployment rate was 
over 20 per cent, the United States sharply in- 
creased the rates of a number of excise taxes and 
introduced several new ones in an effort to reduce 
the budget deficit. Shortly afterward John Maynard 
Keynes was to argue that such a policy would pro- 
long a depression. In 1964 the United States re- 
duced income taxes sharply in the face of a budget 
deficit because the unemployment rate of 5.7 per 
cent was unacceptably high, and it is probably not 
accidental that unemployment soon began to fall. 

The contrast between these opposite policies 
illustrates strikingly the influence, in this instance 
a salutary one, of economic thought on policy. This 
influence is an important force drawing students 
into the discipline, but it also imposes on econo- 
mists obligations to develop skills and judgment 
commensurate with their responsibilities, 

ALBERT REES 
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ECONOMICS OF DEFENSE 


It is essential to see defense as an economic 
problem and to understand what economizing 
means. It is also important for nations to attempt 
to use an economic calculus (quantitative eco- 
nomic analysis) in making choices and to shape 
the institutional arrangements in defense planning 
with care. Although the setting for defense plan- 
ning varies from nation to nation, these issues are 
similar everywhere. 

The economic problem. Choosing among pos- 
sible defense activities can be viewed as an eco- 
nomic problem: that of allocating available re- 
sources so as to get the greatest good from them. 
The alternatives include various possible sizes of 
the defense budget, i.e., various allocations of re- 
sources to defense, other governmental functions, 
and the private sector. The alternatives then in- 
clude various allocations of the total defense budget 
among particular “outputs,” or missions, such as 
nuclear retaliatory capability, limited-war capabil- 
ity, token forces, arms control, civil defense, var- 
ious kinds of mobilization bases, sealift, and so on. 
As with outputs in the private sector, some of these 
are close to being final goods and some (like sea- 
lift) are intermediate outputs, which are used in 
turn to produce other outputs. 
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Choices among various ways of producing these 
outputs are sometimes called “lower-level” choices. 
Here are included such options as the use of alter- 
native base locations, logistic arrangements, launch 
facilities, contract provisions, and personnel pol- 
icies. In creating defense forces there are many 
substitution possibilities—larger regiments versus 
better-equipped regiments, electronic countermeas- 
ures versus numbers of missiles, truck transport 
versus airlift, rifles versus mortars. These lower- 
level choices are economic problems too. When one 
estimates the costs and gains of different subma- 
rine forces and deployments, one is concerned with 
the worth of resources in competing uses. When 
one considers alternative locations for air-defense 
missiles or sites for vehicle fuel storage and repairs, 
one is dealing with the worth of these resources in 
rival uses, 

Needless to say, the choices at different levels 
are not independent of each other. The size of the 
budget should depend upon, among other things, 
the costs and gains of producing specific capabil- 
ities, i.e., the specific lower-level options that con- 
front a nation, In turn, the choices at lower levels 
should often depend upon the size of the budget. 

Another set of alternatives embraces various 
kinds of research and development intended to in- 
vent new defense policies or systems. Effective at- 
tempts to advance technology and efficient resource 
allocation within any given technology are both 
important; the absence of either would leave de- 
fense or any other activity at a comparatively prim- 
itive stage. Because relatively great uncertainties 
surround research and development alternatives, 
choosing among them calls for a different approach 
from decision making under conditions of com- 
parative certainty (Klein & Meckling 1958; Nelson 
1961). 

Economizing. “Economizing” does not imply 
scrimping and penny-pinching in defense (or in 
any other activity), nor does it call for meeting all 
needs. Instead, in defense planning “economizing” 
implies cutting defense costs whenever resources 
can produce more-valuable outputs elsewhere and 
meeting defense needs whenever they are more 
important than other needs. 

Many persons believe in or talk as if they believe 
in “needs” or “requirements” that should be met re- 
gardless of cost. Others seem to believe in cost or 
budget ceilings that must be adhered to no matter 
how large the gains that might be obtained by 
going above the ceiling. Still others believe that one 
should set up priority lists like the following: (1) 

laser; (2) new supersonic bomber; (3) improved 


long-range camera. What such a list means is hard 
to see: should all resources go to the first item? 
should resources go to the first item until an extra 
dollar there brings a zero return, even though the 
other items are worth something? should more 
money be spent on the first item than on the sec- 
ond and more on the second than on the third? 
The trouble is that a priority list does not face the 
real question: how much should be allocated to 
each item? 

None of these beliefs give proper guidance. Hard 
though it may be to apply, the economic principle 
is the one to keep in mind: make any resource 
shift that yields more than it costs. When gain ex- 
ceeds costs, a transfer of resources produces net 
gains and, thus, a greater total output. There are 
two major approaches to the application of this 
principle: improvement of the economic calculus 
(information about the costs and gains from alter- 
native courses of action) and improvement of the 
institutional arrangements that shape defense 
choices. As in the private economy, both are im- 
portant. Identification of the correct choice will not 
help much if institutions are such as to lead deci- 
sion makers to make a different choice. On the 
other hand, better institutions are hard to devise 
if good choices cannot be identified. In making 
good decisions at every level—decisions about the 
size of budgets and about the allocation of given 
budgets—both the institutional arrangements and 
the economic calculus play an important role. 

The possibility of an economic calculus. First 
of all, what does “economic efficiency” really mean? 
If one could calculate all gains and costs in com- 
mensurable terms, if these amounts were univer- 
sally and unambiguously true, and if there were no 
uncertainties, the correct actions would be those 
that yielded net gains and they could be conclu- 
sively identified. In most activities, certainly in de- 
fense, these conditions do not hold. One major 
stumbling block is the fact that there is no gen- 
erally accepted value that can be assigned to a 
change in the amount of defense as a whole or to 
an increment in any defense mission. 

We can sometimes derive a production-possibility 
curve showing the various combinations of two de- 
fense outputs—say, ability to shoot down enemy 
missiles and ability to retaliate—attainable with 
the same total budget. But we have no indiffer- 
ence curves, and no price tags, to show how extra 
units of each capability should be valued. It is im- 
possible, therefore, to identify the most valuable or, 
optimal combination. (Even the most valuable 
combination would not necessarily be optimal, be- 


cause there would be no assurance that the given 
budget was optimal. ) 

By exploring alternative ways of using the re- 
sources, however, one can sometimes show how to 
obtain more of one capability without sacrificing 
any of the other. Assuming there are no uncounted 
effects, such a change is an unambiguous improve- 
ment. In technical terminology, this constitutes 
moving to an efficient point on the production-pos- 
sibility curve. 

Moreover, when alternative actions affect only 
a single objective (seldom completely true), one 
can sometimes find a closely related type of im- 
provement. By designing and comparing alterna- 
tive ways to achieve that objective, one can often 
discover ways to achieve a specified amount of the 
objective at a lower cost or ways to achieve a 
larger amount of the objective at a specified cost. 
Such an improvement is akin to a shift toward an 
efficient point, because again the final result is the 
following: more of some desirable output can be 
produced without giving up anything else. This is 
the customary meaning of “economic efficiency.” 

Program budgeting. One device that is intended 
to facilitate the use of an economic calculus in 
budgetary choices is program budgeting, which the 
U.S. Defense Department introduced in 1961. Its 
central feature is the presentation of costs for pro- 
grams, or output categories (e.g., retaliatory capa- 
bility, weapon systems, arms control), rather than 
for input categories (e.g., personnel, transporta- 
tion). By thinking in terms of programs, officials 
can better take into account interdependencies— 
such as the effects of buying submarine-launched 
missiles or the worth of extra land-based ballistic 
missiles. Officials can presumably make better sub- 
jective judgments about the worth of output cate- 
gories than they can make about the ultimate worth 
of inputs. These calculations also provide better 
clues to the worth of increments or decrements in 
output categories. Another feature of program 
budgeting is the presentation of the anticipated 
costs of a program decision for several years ahead 
rather than for one year at a time. This should 
help officials take into account the full-cost impli- 
cations of decisions instead of merely the initial- 
payment implications. 

Program budgeting is no panacea. Hard choices 
will still be relatively hard. There are difficulties, 
both conceptual and practical, in providing the 
most useful cost estimates. Also program budget- 
ing, unless implemented with care, may be con- 
ducive to excessive centralization of authority, 
which itself entails disadvantages or costs. This 
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will be considered below, in the discussion of the 
role of institutional arrangements. Our interest 
here is in program budgeting as an information 
system to facilitate economic calculations. 

Cost—benefit analysis. Increasingly, nowadays, 
there are direct attempts to prepare an economic 
calculus to help one choose among alternatives. 
These alternatives are often called systems because 
—whether they are physically small guidance 
mechanisms or vast combinations of missiles, air- 
craft, bases, command-and-control arrangements, 
personnel, and procedures—they are sets of in- 
terrelated parts. Such cost-gain comparisons are 
sometimes called cost—effectiveness analyses or 
systems analyses, especially when they pertain to 
defense; operations research or operational anal- 
yses, particularly when they compare alternative 
modes of operation; cost-benefit analyses, origi- 
nally where they evaluated natural-resource proj- 
ects [see WATER RESOURCES]; or simply economic 
analyses, where they examine conventional eco- 
nomic policies. The term “cost-benefit analyses” 
is now used rather widely, and it does seem to 
convey the underlying idea fairly well. 

Cost-benefit analyses can be viewed as consist- 
ing of several elements: (1) the desired objec- 
tives, such as (in defense) being prepared to 
carry out specified missions; (2) the alternative 
systems, or means of trying to achieve the ob- 
jectives; (3) the costs entailed by each of the 
systems; (4) models to help trace out the costs in- 
curred and achievements provided; and (5) a cri- 
terion (involving both cost and achievement) to 
identify the best system. With regard to the last 
element, discussing the correct way to design cri- 
teria may seem like discussing the correct way to 
find the Holy Grail. In a world of uncertainty and 
nth best, judgments must help shape choices and 
no operational test of preferredness can be above 
suspicion. Moreover, analyses vary in their quality 
(which is hard to appraise) and in their applica- 
bility to different decisions. For these reasons the 
responsible decision makers must treat cost-benefit 
analyses as “consumer research” and introduce 
heroic judgments in reaching final decisions. In 
a sense, then, it may be both presumptuous and 
erroneous to discuss having a test of preferredness 
in these quantitative analyses. 

Criteria should be discussed, nonetheless, in 
connection with such analysis. First, cost-benefit 
analysts must deal with a closely related set of 
issues: what are relevant impacts on achievement 
of objectives and what are relevant impacts on 
costs? (These issues confront one in the first 
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four elements of cost-benefit analysis, of course, 
whether or not one includes a fifth element called 
criterion selection.) Second, cost-benefit analysts 
do apply criteria, especially in designing and re- 
designing the alternatives to be compared. They 
delete features that appear to be inefficient, add 
features that appear to be improvements, and 
probe for alternative combinations that are worth 
considering. This screening of possibilities and re- 
designing of alternative systems entails the use of 
criteria, and these should be explicitly considered 
and exhibited. Moreover, whether or not they ought 
to, analysts often present the final comparisons 
in terms of a criterion. 

Thus, while it is wrong to talk as if a definitive 
criterion is an element of every analysis, we ought 
to emphasize certain points about criterion selec- 
tion. First, many tests of preferredness can be 
quite misleading. For example, such tests as the 
ratio of gains to costs, the ratio of part of the gains 
to part of the costs, and the maximum gains for 
particular inputs should be avoided (Hitch & 
McKean 1960, pp. 158-181). Second, it should 
be stressed that good (although never perfect) 
criteria usually take one of the following forms: 
(1) maximum gains minus costs (to be used 
wherever possible); (2) maximum achievement 
of an objective for a given cost; or (3) minimum 
cost of achieving a specified amount of an objective. 

Needless to say, analyses should attempt to take 
into account all gains and all costs. Some people 
feel that there are two types of gain or cost, eco- 
nomic and noneconomic, and that economic anal- 
ysis has nothing to say about the latter. This 
distinction is neither very sound nor very useful. 
People pay for—that is, they value—paintings, as 
well as shoes; peace of mind, as well as aluminum 
pans; a lower probability of death, as well as gar- 
bage disposal. The significant categories are not 
economic and noneconomic items but (1) gains 
and costs that can be measured in monetary units, 
e.g., production or use of items, like steel, that have 
market prices to which users can adjust their trans- 
actions; (2) other commensurable effects, e.g., im- 
pacts on retaliatory capability of better warning 
systems as compared with the hardening of missile 
bases; (3) incommensurable effects that can be 
quantified separately but not in terms of a common 
unit, e.g., capability of destroying x targets and ca- 
pability of showing resolve by putting y aircraft on 
forward bases; and (4) nonquantifiable effects. 
Examples of the latter are impacts on morale and 
thence on capability; impacts on the probability of 

limited war, of escalation, and of nuclear war; and 
impacts on the chances of working out mutually 


acceptable arms controls. In taking a position on 
an issue, each of us implicitly quantifies such con- 
siderations. But there is no way to make generally 
valid quantifications that would necessarily agree 
with those of other persons. These distinctions be- 
tween types of effects of alternative policies do 
serve a useful purpose, however, especially in warn- 
ing us of the limitations of cost-benefit analysis. 

Cost-benefit analysis necessarily involves grop- 
ing and the making of subjective judgments, not 
just briskly proceeding with dispassionate scientific 
measurements. No one says, “This is the defense 
objective, and here are the three alternative sys- 
tems to be compared; now trace out the impacts of 
each on cost and on achievement of the objective, 
and indicate the preferred system.” What happens 
is that those making the analysis spend quite some 
time attempting an operational statement of an 
objective, for example, the capability, after receiv- 
ing a specified strike, to destroy a designated set 
of targets with a stipulated probability. Then, a first 
attempt is made at designing the alternative ways 
of producing this capability. Preliminary costs are 
estimated. Members of the research team perceive 
that the alternative systems would affect other ob- 
jectives, such as the deterring of minor aggressions 
or the providing of a flexible rather than a spasm 
response to major aggressions. Relations with other 
countries may be immediately involved. If the anal- 
ysis is being made by a relatively small nation, for 
example, its defense posture would have subtle and 
crucial impacts on the reactions of neighboring 
countries and of large nations. The analysts rede- 
sign the alternatives in the light of these impacts, 
perhaps so that each alternative performs at least 
“acceptably” with respect to each objective. Next, it 
may appear that certain additional features, e.g., 
decoys or extra radar coverage, would add greatly 
to capability but not much to cost. Or the cost 
group may report that certain arctic installations 
are extremely expensive and that relocating them 
would reduce costs greatly with little impairment 
of effectiveness. In both cases the systems have to 
be modified again. This cut-and-try procedure is 
essential. Indeed, this process of redesigning the 
alternatives is probably a more important contribu- 
tion than the final cost-effectiveness exhibits them- 
selves. In any event, preparing such an analysis is 
a process of probing and not at all a methodical, 
scientific comparison following prescribed proce- 
dures (Quade 1964, chapter 3). 

The incommensurables and uncertainties in 
cost-benefit analysis are pervasive. Consider the 
achievement of each of the multiple objectives 
mentioned above. The various achievements or ef- 


fects of the alternative systems can perhaps be in- 
dividually described, but they cannot be expressed 
in terms of a common denominator. Judgments 
about the extent of these effects and their worth 
have to be made. Similarly, some costs—the oppor- 
tunity cost of devoting resources to defense, the 
sacrifice of crew lives, the impairment of morale 
and alertness in the performance of dull, monoto- 
nous tasks—if quantifiable at all, cannot validly 
be put in terms of a common denominator. Fur- 
thermore, because of uncertainties, whatever esti- 
mates can be prepared should in principle be prob- 
ability distributions rather than unique figures for 
costs and gains. The system that performs the best 
in one contingency may perform the worst in an- 
other. This may call for redesigning the systems to 
insure against catastrophic outcomes, and it def- 
initely calls for caution in interpreting the results 
of cost-benefit analyses. Finally, costs and gains 
occur over a period of time, not at a single point 
in time, and there is no fully acceptable means of 
handling these streams in analyzing defense 
options. To be compared, alternative cost—gain 
streams must each be reduced to a single number, 
such as present worth. Capabilities occurring in 
different years can hardly be so discounted satis- 
factorily. They must therefore be stipulated, and 
only the cost streams discounted. But when the 
time path or anything else is stipulated, this im- 
plies that there are no trade offs. Also, discounting 
the cost streams alone poses some problems. For 
example, most persons would be inclined to dis- 
count risky proposals at high rates—but with the 
gains stipulated, unless risks under alternative sys- 
tems are carefully equalized, the risky proposals 
would be relatively cheap and attractive! (For 
more detail on the difficulties regarding incommen- 
surables, uncertainties, and time, see Hitch & 
McKean 1960, chapters 10, 11.) 

These difficulties are present because life is com- 
plex and choice is hard, They are not created by 
cost-benefit analysis. Moreover, they do not render 
such quantitative economic analysis useless. They 
simply mean that the analysts have to be discrim- 
inating about when and how to use various tools. 
Cost-benefit analyses are less helpful in applica- 
tions where uncertainties are great, as in the case 
of choosing between disarmament options or be- 
tween exploratory research and development pro- 
posals, and in cases where jncommensurables are 
highly significant, as in selecting the size of de- 
fense programs, than they are in choosing between 
“certain force-structure alternatives, such as alter- 
native mixes of Polaris and Minuteman missiles or 
alternative base locations. 
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Perhaps the main conclusion, however, is that 
we must always regard analyses as only aids or 
inputs to decisions, not as oracular touchstones. 
(And, needless to say, fragments of cost-benefit 
analyses, interaction models, games that force 
people to consider the reactions of allies and ene- 
mies, improvements in computational and analyti- 
cal techniques—any activities that help shed light 
on costs or gains— are also inputs to decisions and 
can help increase economic efficiency in defense.) 
Even the most comprehensive of cost-benefit anal- 
yses must be interpreted, not applied mechanically. 
If employed cautiously and with institutional 
checks and balances to insure against misuse, cost— 
benefit analyses can be quite useful. They can 
sometimes help eliminate really bad choices, al- 
lowing the decision makers to choose from a short 
list of reasonably good options. They can sometimes 
point with considerable clarity to improvements, 
even though neither they nor any other technique 
can identify truly optimal choices, Most important 
of all, cost-benefit analyses can provide the right 
framework for decision makers, bargainers, or any- 
one else “to think in.” It is the right kind of frame- 
work to use in organizing the evidence and one’s 
thoughts and intuitions regarding alternatives. 

The role of institutional arrangements. In de- 
fense, as in other problems of choice, we must rec- 
ognize that factors other than information and ex- 
hortation play a role in shaping actual choices, The 
hypothesis that each individual, including the offi- 
cial, attempts to maximize his own utility should 
be kept in mind, and the way in which rules and 
institutions shape the costs and rewards confront- 
ing the individual should be recognized. 

The utility-maximization hypothesis does not 
imply that people are extremely selfish and brutish 
or anything of the sort. It merely holds that individ- 
uals get satisfaction from many things—play, 
power, material goods, spiritual well-being, helping 
others, getting their own way—and that these 
things are to some extent substitutes for each other. 
That being the case, if the cost to consumers or 
officials of one item or action increases, they will 
demand less of it and more of other things. An in- 
crease in the rewards to officials or other individ- 
uals from one item or action will cause them to 
demand more of it and less of other items. (For a 
discussion of the utility-maximization hypothesis, 
see Alchian & Allen 1964, chapter 2.) This general 
theorem can lead to many testable hypotheses re- 
garding both private and public activities, including 
defense. A 

Models of the private economy make use of this 
hypothesis, and they yield many good predictions. 
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In the private sector of the economy, the price 
mechanism causes individuals to feel most of the 
costs of and gains from their actions. In the public 
sector the bargaining mechanism, although a very 
imprecise instrument, performs some of the same 
function (Lindblom 1955). If a decision maker 
contemplates an action, he will consider some of 
the costs he would impose on others, at least on 
those who have the power to retaliate. He will con- 
sider some of the gains he would bestow on others, 
at least on those with whom he can bargain for 
favors. Spill-over costs and gains prompt one to 
consider central intervention as a mechanism to 
capture these, yet central control can actually so 
blunt the bargaining mechanism as to lead to the 
neglect of such externalities. If there are more 
checks and balances, top management may be im- 
pelled to take more of the costs and gains into ac- 
count. In any organization there is a whole spec- 
trum of possible bargaining arrangements, each 
with its own advantages and disadvantages. 

These points are pertinent to defense planning. 
A high degree of centralization brings some advan- 
tages, but it also means that bargaining power is 
less dispersed, and so ultimately important consid- 
erations may be disregarded (Schlesinger 1966). 
In a defense department, as in any bureaucracy, 
there are natural tendencies to centralize authority. 
With five-year program budgeting, this tendency 
may be unduly reinforced. Although the defense 
minister or chief may wish to coordinate interre- 
lated choices, as long as the military branches have 
the authority to allocate their own budgets they will 
do much as they please anyway. So the program 
budget may become, not an information system, 
but an approved five-year plan that serves as an in- 
stitutional instrument of control. 

This arrangement can correct some of the un- 
desirable effects of the rivalry between military 
branches in shaping programs that are interrelated. 
But, as I have argued above, the fact that centrali- 
zation brings gains does not automatically mean 
that it is an improvement. For it also brings some 
ill effects or costs, which should be weighed against 
the gains. With centralization one set of views 
plays a greater role in decision making, and dissent- 
ing views play lesser roles. In other words, there 
are fewer checks and balances on the views of the 
central group. Also, if central managers try to con- 
trol in much detail, they find it imperative to sim- 
plify decision making and to discourage program 
changes. Finally, lower-level incentives to dissent 
and criticize and urge changes may diminish if 
such activities begin to be unrewarding. All these 
forces can in the long run produce disadvantages: 


(1) the suppression of alternatives; (2) a neglect 
of part of the costs and gains from alternative pol- 
icies; and (3) a neglect of uncertainties. 

One group’s view of the future will be less diver- 
sified than the separate judgments of a multiplicity 
of groups. Dominance of one group may lead to the 
disregarding of trade offs and options that others 
may take seriously, to the treating of certain costs 
and gains more lightly than others would, and to 
the regarding of a particular subset of contingen- 
cies and uncertainties as the major ones. The need 
to simplify—to use half-page summaries and rules 
of thumb—strengthens these same tendencies. 
They may be further reinforced by a loss of incen- 
tives at lower levels, perhaps by tacit agreements 
to refrain from introducing disturbing considera- 
tions. For some choices, such as force-structure de- 
cisions that have crucial interrelationships, central 
control is appropriate. For other choices, such as 
exploratory research and development decisions 
that are dominated by uncertainty, a greater degree 
of decentralization may be desirable. 

We need to learn more about institutional ar- 
rangements for harnessing the efforts of individ- 
uals to promote various objectives. We have much 
to learn about the effects on incentive of alterna- 
tive bargaining structures, procedures, “rules of 
the game,” relationships between defense agencies 
and contractors, and so on. In the U.S. Department 
of Defense and in similar organizations in other 
nations, numerous institutional modifications are 
being tried, such as organizational changes, pro- 
gram budgeting, incentive contracts, contracts in 
which savings resulting from the use of cheaper 
components are shared, and stock funds under 
which certain activities are managed almost like 
private businesses (Hitch & McKean 1960, chapter 
12). In the long run, institutional arrangements 
will turn out to be fully as important as the use 
of an economic calculus in improving defense 
choices. 

Roran N. MCKEAN 


[See also BUDGETING; CIVIL-MILITARY RELATIONS; 
Mııırary POLICY; and the detailed guide under 
MILITARISM. ] 
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ECONOMIES OF SCALE 


It is commonly observed that in producing and 
distributing almost every economic good there is 
some systematic relationship between the size or 
scale of the plant and the production cost per unit 
of output, and a similar relationship between the 
scale of the firm and the unit cost of producing and 
distributing the good. Let scale be measured as the 
rate of output per unit of time which a plant or 
firm is best designed to produce (can produce 
more economically than any other size of plant or 
firm). Let us also suppose that any plant or firm 
is operated at its best rate of output. Producing at 
that output rate, a plant or firm will have a certain 
cost per unit of output for production, and a firm 
a certain unit cost for production plus distribution. 
But these unit costs will not necessarily be invari- 
ant to the scale of the plant or firm, (For formal 
analysis of the relationship of the scale to the rate 
of utilization of the plant or firm and for definition 
of the scale curve, or long-run average cost curve, 
see Viner 1931 and Stigler 1946, pp. 128-142.) 

As the scale of a plant is enlarged from the 
smallest feasible size through progressively larger 
sizes, increases in the scale of plant will generally 
result in lower production costs per unit of output 
until some critical scale is reached beyond which 
further increases in scale will leave unit produc- 
tion costs unchanged. This critical plant scale may 
be designated as the minimum optimal scale of 
plant in any given industry. Similarly, as the scale 
of a firm operating a plant or plants in an industry 
is progressively increased to and beyond the size 
required for the operation of one plant of minimum 
optimal scale, the firm may realize reductions in 
the cost of producing and distributing a good, pos- 
sibly until it is large enough to operate two or more 
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such plants, although again it apparently tends to 
reach a critical scale beyond which further scale 
increases will not result in lower unit costs of pro- 
duction and distribution. 

“Plant” in this context means a factory, mill, or 
other assemblage of connected productive facilities 
located on a single site (frequently also called an 
“establishment” ), and “firm” means an independent 
administrative and control unit which manages a 
plant or plants and distributes their outputs. The 
reductions of costs with increases in the scales of 
plants or firms reflect, in general, economies of 
scale; in particular, economies of large-scale plant 
and economies of large-scale firm (in essence 
economies of the multiplant firm). The economies 
in question are in general “internal economies” 
(economies internal to the plant or firm in the 
sense that they are attributable to increased size of 
the individual unit), as distinguished from “ex- 
ternal economies,” which are reductions in unit 
costs that may occur as the result of increases in 
the size of an industry, accomplished by an in- 
crease in the number of firms. 

Realization of scale economies results in the de- 
cline of unit costs of production in a plant until 
a minimum optimal (lowest-unit-cost) scale is 
reached; thereafter, further increases in scale will 
not tend either to reduce or to increase unit produc- 
tion costs, and there will be an indefinitely wide 
range of optimal plant scales that are larger than 
the minimum optimal scale. The same may also be 
true of firms in that, once they have attained mini- 
mum optimal scales, further growth will neither 
increase nor decrease their unit costs. But there is 
the possibility that the growth of firms to very 
large size will result in an increase in their unit 
costs—that there are diseconomies of very large 
scale firms which place an upper limit on firm 
scales that are optimal. The actual existence of 
such diseconomies is both debatable and debated. 

Economies of scale may be either “real” econo- 
mies, reflecting a reduction in the physical quan- 
tities of productive factors needed to produce a 
unit of output (and a corresponding reduction in 
money costs), or “strictly pecuniary” economies, 
reflecting only a reduction in the prices at which 
the firm acquires productive factors, no real cost 
savings being involved. Economies of large-scale 
plant typically have a technological basis and are 
real economies. They are generally attributed to 
real savings in production costs resulting from 
specialization of labor, specialization of capital 
equipment, and specialization of management func- 
tions that large plant scales permit. Economies of 
the large or multiplant firm, so far as they are 


realized, may be real economies of specialization of 
management, or real savings in shipping costs 
through the operation of geographically dispersed 
plants where nationwide distribution is for some 
reason advantageous to the firm, They may also 
be strictly pecuniary economies resulting from 
lower prices paid for productive factors as a result 
of the monopsonistic power of large-volume buyers, 
or from access to financing on preferential terms. 
The realization of strictly pecuniary economies is 
of no advantage to society except insofar as “pass- 
ing them on” to customers of large firms through 
lower output prices may have desired effects on 
the distribution of income, (An extensive analysis 
of the bases of scale economies is found in Robin- 
son 1931.) 

Real economies of large-scale plant are due 
fundamentally to indivisibilities or “lumpiness” in 
the particular forms of specialized factors or agen- 
cies of production, which can be acquired or used 
only in certain finite sizes and in some cases are 
more efficient in larger than in smaller sizes. With 
infinite divisibility of every productive factor and 
agent, there would be no plant scale-economies. 
This fact explains why progressive expansion of a 
plant will lead to a minimum optimal scale beyond 
which the plant will be neither more nor less ef- 
ficient. At some critical finite scale, the indivisi- 
bilities of specialized factors or agencies which 
favor large plant-size will be fully overcome (the 
advantages of specialization will be exhausted), 
and further increases in scale can do no better than 
duplicate plants of minimum optimal scale on a 
single site. (For more extensive analysis of this 
and related matters, see Bain 1952, pp. 110-117. 
For analysis of the reflection of scale economies in 
production functions, see Boulding [1941] 1955, 
chapter 34.) The same phenomenon of exhaustion 
of scale economies at some finite scale is encoun- 
tered with any real economies of the multiplant 
firm, and probably with its strict pecuniary econo- 
mies as well. 

If diseconomies of very large firms are encoun- 
tered (and the evidence bearing on this issue is 
inconclusive), it is because of the inherent inef- 
ficiency of very large administrative organizations, 
afflicted with inflexible regulations, red tape, and 
extensive internal communication problems. Some 
theorists have argued that management is essen- 
tially a “fixed factor” which cannot be expanded in 
size beyond certain limits without assuming an 
altered and less efficient form. 

Quantitative importance. What has been said 
so far refers entirely to the qualitative character of 
the relationship of the scale of the plant or firm to 


the unit cost of production and distribution of a 
good, and to the theoretical explanation of such a 
relationship. The important economic issues turn 
upon the question of how quantitatively important 
these scale economies (and diseconomies) are in 
various industries. This is because the extent of 
their importance in any industry determines the 
minimal degree of plant concentration and mini- 
mal degree of concentration of industry control by 
firms which are consistent with plants and firms of 
reasonably efficient scale; it also determines (if 
diseconomies of very large firms are involved) the 
maximal degree of concentration by firms which 
is consonant with efficiency. 

Specifically, therefore, we wish to know for each 
of a large group of industries and for types of in- 
dustries similar in the relevant respects: (1) the 
approximate percentage of industry output which 
would be supplied by one plant of minimum op- 
timal scale; (2) the approximate percentage of 
industry output which would be supplied by one 
firm of minimum optimal scale; (3) the extent to 
which production costs per unit of output of plants 
of progressively less than minimum optimal scale 
exceed lowest attainable costs (i.e., the slope of the 
plant “scale curve” relating unit costs to scale, over 
a range of suboptimal plant scales); (4) the extent 
to which unit production costs plus distribution 
costs of firms of progressively less than minimum 
optimal scale exceed lowest attainable costs (i.e., 
the slope of the firm “scale curve”); and (5) 
whether or not (and if so, to what extent) dis- 
economies of very large firms are encountered in 
practice, 

This information should permit us to determine 
for any industry, or for groups of similar industries, 
how large a number of reasonably efficient plants 
and how large a number of reasonably efficient 
firms the industry can support (as well as how 
small a number of firms, should diseconomies of 
very large firms be significant ). And this determina- 
tion should in turn indicate the extent to which 
the pursuit of efficiency in scale of plants and 
firms is consistent with degrees of concentration 
of industry control by firms which are respec- 
tively compatible with competition, oligopoly, and 
monopoly. It should also allow us to appraise exist- 
ing degrees of concentration of control in indi- 
vidual industries, and particularly the development 
of large multiplant firms, with an eye to finding to 
what degree they are justified by the pursuit of 
«scale economies and conversely to what degree 
concentration is greater, and market structures less 
competitive, than considerations of economy actu- 


ally require. 
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There is not as yet available an adequate body 
of evidence on the quantitative importance of scale 
economies in large numbers of industries in all of 
the several sectors (e.g., agriculture, manufactur- 
ing, wholesale and retail trade, the service trades, 
construction, and so on) of any national economy. 
Bain has, nevertheless, developed data (1956b) 
on the importance of scale economies in a sample 
of 20 manufacturing industries in the United 
States as of the early 1950s, the majority of them 
having moderate to high concentration of control 
by relatively few firms. The findings for this sample 
may be viewed as tentatively and roughly indica- 
tive of the importance of economies of large plants 
and firms in American manufacturing industries 
generally. 

As for economies of large-scale plant, it appears 
that the proportion either of national industry ca- 
pacity (where the industry has a unified national 
market) or of capacity supplying the largest re- 
gional submarket (where the industry is frag- 
mented into regional submarkets) that would be 
provided by one plant of minimum optimal scale 
is from 15 to 25 per cent in two out of 20 cases, 
from 10 to 15 per cent in four cases, from 5 to T% 
per cent in six cases, from 2¥2 to 5 per cent in 
three cases, and below 2% per cent in five cases. 
In a number of industries in which the proportion 
of the market supplied by one plant of minimal 
optimal scale is appreciable, however, the elevation 
of unit costs at appreciably smaller plant scales is 
relatively slight, so that plants with substantially 
less than minimal optimal scale would be reason- 
ably efficient; in only about 40 per cent of the in- 
dustries do plants with scales from one-half to 
one-fourth the minimum optimal scale experience 
significantly higher production costs. Taking into 
account both the ratio of the output of a plant of 
minimum optimal scale to the size of the market 
and the relative flatness of many plant scale curves 
at suboptimal scales, we draw the following tenta- 
tive conclusions about the 20 manufacturing in- 
dustries. 

(1) In two, plant-scale economies are very im- 
portant, in the sense that the output of a plant of 
minimum optimal scale exceeds 10 per cent of the 
designated market output and that unit costs would 
be at least moderately higher at half-optimal scale. 

(2) In five, plant-scale economies are moder- 
ately important in that the output of a plant of 
minimum optimal scale is 4 or 5 per cent of the 
designated market output and that unit costs would 
be moderately higher at half-optimal scale. 

(3) In nine, plant-scale economies are relatively 


unimportant, either because a plant of minimum 
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optimal scale would supply a small percentage of 
the output in the designated market or because 
plants of suboptimal scale experience only slightly 
higher costs, or for both reasons. 

(4) In four, information does not permit defi- 
nite classification, but important plant-scale econo- 
mies may be present in two of the four cases. 

So far as we may generalize from these findings 
(1956b, pp. 71-82), the importance of plant-scale 
economies is widely variable among industries gen- 
erally and among industries with high concentra- 
tion of control by a few firms. In a distinct minority 
of cases, plant-scale economies are sufficiently im- 
portant to justify a moderately high degree of plant 
and firm concentration. In the substantial majority 
of cases, plant-scale economies are such that rea- 
sonably efficient production is consistent with the 
existence of a substantial number of individual 
plants serving an unsegmented national or prin- 
cipal regional submarket; high concentration of 
control by firms is not required to take advantage 
of economies of large-scale plants. 

For the same sample of industries, economies of 
the large-scale or multiplant firm appeared to be 
relatively unimportant, In half of 12 industries for 
which pertinent data can be found, there appear 
to be no economies of multiplant firm whatever. In 
the other half, from slight to modest economies 
(involving a cost advantage of from 2 to 4 per 
cent) are found for firms controlling several 
optimal-sized plants. These economies are attrib- 
uted in some cases to large-scale management and 
in others to reduced costs of distribution, typically 
through the use of regional plants to supply re- 
gional markets (1956b, pp. 83-93). In the latter 
case, the economies are generally conditional on 
nationwide sales promotion of goods with appreci- 
able transportation costs, and although they repre- 
sent savings to the firms involved, they are not 
savings to the economy, which could be as effi- 
ciently supplied by regional firms not engaging in 
this type of sales promotion. (Economies of large- 
scale distribution are discussed at greater length in 
Bain 1956a, pp. 343-345.) With a few exceptions, 
realization of reasonably efficient production and 
distribution by manufacturing industries does not 
really require multiplant firms; efficiency requires 
firms with outputs no larger than one plant of 
minimum optimal scale. In the few exceptional 
cases, the efficient multiplant firm would ordinarily 
not need to have more than plant of minimum 
optimal scale in any regional submarket. Realiza- 
tion of economies of the multiplant firm, therefore, 
would not require higher concentration of control 
by firms in such submarkets than would realization 


of economies of large-scale plants alone—as seems 
generally to be the rule. Economies of the multi- 
plant firm appear to have been greatly overrated by 
various observers making casual judgments. 

Our investigation in general reveals no firm or 
systematic evidence that diseconomies of large 
scale tend necessarily to be encountered by very 
large firms in manufacturing industries. Some in- 
stances are found in which the largest firms in an 
industry have higher unit costs than somewhat 
smaller firms, but their cost disadvantages appear 
to be attributable to a variety of special circum- 
stances not intrinsically linked with scale of firm 
per se. Diseconomies of large-scale firm thus do 
not tend to provide an effective check on degrees of 
concentration of industry control by firms that are 
much higher than required for efficiency in pro- 
duction and distribution. Indeed, the incidence of 
dominant firms which are from four or five to 
twenty or more times as large as is required for 
exploiting all visible economies of large-scale pro- 
duction and distribution is quite high in the 20 
American manufacturing industries sampled, 

One reason that economies of large scale are not 
such as to justify high degrees of concentration of 
industry control by firms in the United States is 
that the national and major regional submarkets 
in this country are very large—most usually large 
enough to absorb the outputs of numerous plants 
and firms of reasonably efficient scale. In other 
industrialized countries outside the communist 
bloc, national and regional markets range from 
moderately smaller to much smaller, but in most 
such industries production techniques and methods 
do not differ enough from those of the United 
States that the scales of reasonably efficient plants 
are appreciably different. In these other countries, 
therefore, realization of efficiency in scale of plants 
and firms typically would require from moderately 
to substantially greater degrees of plant and firm 
concentration than is required in the United States. 
But actual plant concentration in their industries 
most frequently is not high enough for reasonable 
efficiency, although concentration of industry con- 
trol by firms may be. (Findings on this matter are 
presented in Bain 1966, chapters 3-5.) 

As to attained efficiency in the United States, 
the findings based on the sample of 20 manufac- 
turing industries are that between 70 and 90 per 
cent of the output of a manufacturing industry is 
typically supplied by plants and firms of reasonably 
efficient scale, whereas the remaining 10 to 30, 
per cent is supplied by plants (and typically one- 
plant firms) that suffer from appreciable disecono- 
mies of unduly small scale. Such an inefficient 


fringe of small plants appears generally to supply 
a larger proportion of industrial output in most 
other industrialized countries outside the commu- 
nist bloc (Bain 1966, pp. 55-66). 

Large-scale sales promotion. This discussion 
has not dwelled on possible “economies of large- 
scale promotion,” in part because the proper term 
would be “advantages” rather than “economies”— 
cost additions rather than savings generally being 
associated with sales promotion. The question 
nevertheless remains as to whether there are sys- 
tematic advantages to firms in large-scale sales 
promotion programs and expenditures. The evi- 
dence is largely negative. Although it is true that, 
in industries selling differentiated products, large 
established firms are frequently able to maintain 
large market shares with smaller sales-promotion 
costs per unit of sales than those incurred by 
smaller firms to maintain smaller shares of the 
market, it is not generally true that the smaller 
firms can match the advantaged position of the 
larger ones by simply increasing their sales-promo- 
tion outlays. In effect, there is no unique functional 
relationship between size of promotional outlays 
and volume of sales, exploitable by all firms, com- 
parable to the unique relationship between scale of 
plant and production costs. The term “economies 
of large-scale sales promotion” is thus inherently 
misleading (Bain 1956a, pp. 339-343). 


The foregoing discussion has emphasized the 
fact that although appreciable economies of large- 
scale plant are encountered at least in manu- 
facturing, as well as some slight economies of 
multiplant firms, they are not important enough 
quantitatively to justify high degrees of plant and 
firm concentration in the great majority of manu- 
facturing industries in the United States. It is still 
true, however, that plants of absolutely very large 
size are required for efficiency in many lines of 
manufacturing and that substantially smaller plants 
suffer diseconomies of insufficient scale. Various 
findings which have been made to the effect that 
the smallest plants and one-plant firms in most 
manufacturing industries are, or can be, as ef- 
ficient as or more efficient than plants and firms of 
larger sizes generally rest on statistical misinter- 
pretation of defective cost data and of irrelevant 
profit data (see, for example, U.S. Federal Trade 
Commission 1941, pp. 12-92; Kaplan 1948, chap- 
ter 5) and can at present be regarded as thor- 
oughly invalidated. Inefficiencies of unduly small- 
scale plants indeed exist, and, as noted above, are 
encountered in an inefficient fringe of firms in 
most manufacturing industries. 
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Much has of necessity been left unsaid about 
scale economies, particularly concerning their im- 
portance in industries outside the manufacturing 
sector. Scattered evidence suggests that economies 
of large-scale plant are exceedingly important in 
most public-utility industries and have some im- 
portance in construction industries. Significant 
economies of the large-scale firm are definitely 
encountered in some of the distributive trades, in- 
cluding especially the food-distribution industry. 
We lack systematic data, however, to support a 
general appraisal of the importance of scale econo- 
mies in these sectors. 

Jor S. BAIN 


[See also INDUSTRIAL CONCENTRATION.] 
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About the turn of the century, Dutch politicians 
and scholars interested in the economic condition 
of the East Indian colony—now called Indonesia— 
were confronted with a baffling situation. Accord- 
ing to the economic views generally held by Dutch 
economists in the last decades of the nineteenth 
century, the unhampered development of free enter- 
prise should have favored not only the European 
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plantation owners but eventually the Javanese 
peasantry as well. The establishment of Western- 
style, rationally operated plantations should have 
had a stimulating effect on the peasants, encour- 
aging them to cultivate crops for the world market 
after the example of the European planters. 

About 1900, however, it was clear that the lib- 
eral policy had failed, and it was officially recog- 
nized that a state of “diminishing welfare” pre- 
vailed in the East Indies. Dutch economists started 
to investigate the causes of the failure, and they 
asked themselves to what extent the main theses 
of classical economics were valid for the economic 
situation in tropical colonies. 

In 1910, the young Dutch economist J. H. Boeke, 
in his doctoral dissertation, denied that the tenets 
of the classical school were at all applicable to trop- 
ical colonial conditions and posited the need for a 
separate theoretical approach to the specific prob- 
lems of what today would be called the under- 
developed territories (Koninklijk Instituut 1961, pp. 
8 ff.). 

In so doing, Boeke inaugurated a prolonged dis- 
cussion on the fundamentals of economic dualism 
in underdeveloped territories, a discussion that has 
continued until the present day. 

Boeke’s theory. On closer investigation of 
Boeke’s argument, the basis for his dualistic theory 
(not only in his dissertation, but in his later work 
as well) proves to be a distinction between “eco- 
nomic” and “social” needs. While economic activity 
in the West, according to prevailing economic the- 
ory, was based on the all-powerful stimulus of 
economic needs, the Indonesian peasant, according 
to Boeke, was guided fundamentally by social needs. 
On this basis, Boeke has evolved a dualistic theory, 
the core of which is that the economic laws that 
were found valid for capitalistic society are not 
applicable to societies in which capitalism lives 
side by side with a peasant economy still largely 
ruled by precapitalistic relationships. It was Boeke’s 
view that the introduction of mature capitalism into 
a precapitalist peasant economy not only disturbed 
the equilibrium of the latter economic system, but 
obstructed the gradual development of capitalistic 
forms comparable to those in the evolution of the 
Western economy. 

In Boeke’s view, this type of dualism prevailed in 
politically independent countries outside the West- 
ern world as well as in colonial territories. He 
included southeastern Europe and the whole of 
Asia, Africa, and Latin America in this dualistic 
world, not even excepting Japan. Therefore, even 
if, in a specific region, an end were put to colonial 
tule and political independence were achieved, this 


would not put an end to economic dualism: in- 
digenous capitalists are as far removed from the 
precapitalist peasant society as foreign ones, 

Today, hardly any scholar acquainted with the 
evolution of theoretical economics during the past 
decades will find the theoretical formulation of 
Boeke’s views acceptable. The Dutch economist 
P. Hennipman, for example, sharply attacked the 
distinction between economic and noneconomic 
needs, which indeed he pronounced untenable 
(Koninklijk Instituut 1961, pp. 21-24). Boeke’s 
comparison between Western and non-Western 
society was not based on a profound intrinsic 
knowledge of the former; he compared Indonesian 
society, which he really knew, with a picture of 
Western society based on the assumption of an 
all-pervading dominance of a fictitious homo eco- 
nomicus, Scholars better acquainted with Western 
society, among them agronomists who knew the 
Dutch countryside well, argued that many phe- 
nomena considered by Boeke to be typical of 
“dualistic” societies could be found in rural areas 
of western Europe as well. 

Still, the lack of validity of Boeke’s theoretical 
assumptions hardly detracts from the significance 
of many of his factual observations, if these are 
stripped of their too absolute formulation, As a 
general typology of most nonindustrial societies, 
Boeke’s analysis remains fully relevant. One of his 
main arguments against the application of Western 
economic theory to phenomena in the rural sphere 
of tropical dependencies is the inefficacy there of 
all kinds of economic incentives. Although the 
“inverse elasticity of supply” (the decrease of eco- 
nomic effort in the face of higher profits or higher 
wages), considered by Boeke to be typically dual- 
istic, may be well known in Western societies as 
the “backward-sloping supply curve,” still the gen- 
eral lack of response in non-Western rural soci- 
eties to all kinds of economic stimuli, official pres- 
sures, and expert advice remains one of the crucial 
problems in the modern world. 

There exists a sharp cleavage between those 
countries that have been able to achieve economic 
maturity in the past and those that are still in a 
preindustrial stage. Notwithstanding its theoretical 
weaknesses, Boeke’s analysis of actual conditions 
in those nonindustrial countries still contains a 
valid challenge for the present time. 

Boeke’s views on policy. As far as practical 
policy is concerned, Boeke’s dualism, in the more 
extreme formulation presented in his later works, 
comes down to the thesis that in nonindustrial 
countries, because of the unbridgeable gulf between 
those elements actively engaged in capitalistic en- 


terprise and the rural masses still living ina pre- 
capitalistic economy, the road toward gradual de- 
velopment is blocked (Boeke 1946). The latecomers 
in the race for economic advancement play against 
odds that exceed by far the obstacles overcome by 
the Western world in the past. According to Boeke, 
the historical processes of the past century, which 
amounted to a subservience and subordination of 
the precapitalistic countryside to the interests of an 
energetic and expansive capitalism, have obstructed 
a spontaneous growth of capitalistic forms in those 
areas. In essence, Boeke’s argument is directed 
against any cheap belief in a smooth evolution. 
His works were an antidote to the rosy optimism 
prevailing in United Nations circles and elsewhere 
in the late 1940s. 

The core of Boeke’s factual analysis is formed 
by his view that in densely populated areas like the 
island of Java no actual economic development 
occurs. At best, one could speak of “static expan- 
sion”—a spread of existing patterns of land use 
and economic relationships over an ever-widening 
territory (Boeke 1953, p. 174). The increase in 
aggregate production is fully drained off by a con- 
gruent increase in population, thus preventing any 
attempt to raise per capita production. 

At present, the vicious circle characteristic of the 
actual state of the rural economy in underdeveloped 
countries is being more generally recognized. It 
becomes increasingly clear that any assumption 
that a rise in production will automatically result 
in saving and investment is unrealistic and that 
still less realistic would be an assumption that 
investment will automatically involve a rise in 
production. Any planning that starts from a pre- 
conception about valid input-output ratios in under- 
developed areas misreads the actual conditions in 
the countryside. What is needed is not a purely 
economic approach based on some arbitrary mathe- 
matical model; one should first acquire an insight 
into the sociological preconditions for economic 
development. Boeke’s dualistic approach essentially 
forms a first attempt in this direction, although he 
wrongly expressed his theory in economic instead 
of sociological terms. Only very recently have soci- 
ologists started to provide economists with the 
background information needed to build a relevant 
model. 

Geertz’s agricultural involution. The idea that 
Boeke launched as the concept of “static expansion 
has been elaborated by the American cultural an- 
thropologist Clifford Geertz in much greater detail, 
and in a somewhat different sense, as the phenom- 
enon of “agricultural involution.” Geertz holds that 
actual developments in the countryside of Java 


ECONOMY, DUAL 497 


were contrary to what should be considered an 
evolutionary pattern. There was increasing tenacity 
and internal elaboration of basic social patterns; 
tenure systems grew more intricate; tenancy rela- 
tionships more complicated; cooperative labor ar- 
rangements more complex—all in an effort to 
provide everyone a basis, however small, for living 
in the over-all system (Geertz 1963, p. 82). The 
size of the individual farm remained Lilliputian. 

A social system that Geertz calls “shared poverty” 
developed, stressing social justice and obstructing 
efficiency and economic progress. The way the 
Javanese peasantry manage to keep alive and 
abreast of the population increase in a process of 
steady intensification of traditional farming and a 
progressive elaboration of repetitive basic patterns 
amounts, according to Geertz, to “treading water.” 

There are some differences between Boeke’s and 
Geertz’s analyses. Although both of them view the 
outcome—the existing impasse—as a consequence 
of a historical process, Geertz stresses the colonial 
factor more than Boeke did, Accordingly, his inter- 
pretation of the Japanese case is different from 
Boeke’s: in his view, the Japanese economy is no 
longer dualistic, since a dynamic interaction be- 
tween the agrarian and the industrial sectors “kept 
Japan moving and ultimately pushed her over the 
hump to sustained growth” (1963, p. 135). 

The greatest difference between Java and Japan, 
according to Geertz, is that in Java the increase in 
output was soon swamped by the attendant spurt 
in population; in Japan the population relegated to 
farming remained virtually constant (1963, p. 137). 

The difference in approach between Geertz and 
Boeke also leads to a somewhat different outlook 
on the future, Boeke’s view of an inherent dualism 
gradually developed into an extreme pessimism, On 
the basis of the economic theories considered by 
him to be valid for dualistic societies, Boeke de- 
clared the poverty of the masses of Asia unchange- 
able—at least as far as the densely populated areas 
are concerned. Instead of aiming at economic de- 
velopment for the rural masses, which is anyhow 
unattainable, the authorities should put stress on 
spiritual values. Boeke thus adopted, for the rural 
masses in these areas, the Gandhian philosophy of 
“plain living and high thinking” ([1946] 1948, 
pp. 88-89). 

Although Geertz is not less gloomy about the 
present state and actual trends, he does not endorse 
this pessimistic conclusion. He once caustically dis- 
missed Boeke’s formula as a “combination of plain 
living and high thinking on the part of the mass 
and high living and plain thinking on the part of 
the elite” (1956, p. 160). Although he is not able 
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to show a clear way out of the impasse either, he 
does not pronounce that all the roads to material 
progress are blocked forever. 

“Betting on the strong.” What both Boeke’s and 
Geertz’s analyses should make fully clear is that 
the situation, as it has developed in areas such as 
Java, precludes any solution that takes for granted 
that “development” can only be slow and gradual. 
Yet actual policies are still largely dominated by 
the philosophy of gradualism. Those who are aware 
that poverty and ignorance are severe impediments 
to introducing technical innovations tend to look 
for specific agents for introducing the innovations 
needed to increase agricultural per capita produc- 
tion. It is always the more advanced farmers who 
are almost automatically selected as such agents. 
It is they who show a greater responsiveness to all 
kinds of innovations and technical improvements 
and who are much easier to approach by govern- 
ment agencies or special services. Such a method 
starts implicitly from the assumption that the ad- 
vanced farmers will set an example for the back- 
ward ones, who are expected to follow the model 
that they are able to observe from close by. The 
innovation is intended to spread, like an oil stain, 
to other layers of rural society. It is this approach 
that could be called “betting on the strong.” 

Boeke’s view regarding this “betting on the 
strong” approach has not been too consistent. In 
1910, writing on the basis of literature and without 
personal experience in the colony, he had expressed 
a distrust of the “well-to-do Natives,” among whom 
were many hadjis (the affluent and prestigeful pil- 
grims to Mecca), as agents for economic develop- 
ment. Their progress would only amount to a 
situation in which “the rich grow richer and the 
poor grow poorer” ( Koninklijk Instituut 1961, p. 33). 

Paradoxically enough, in 1927, Boeke himself, 
after a prolonged stay in Indonesia, reversed his 
stand. At that time, he did not see any possibility 
of promoting native welfare except through the 
intermediary of a small elite, consisting of “the 
vigorous, energetic, advanced elements” in Indo- 
nesian rural society. Boeke said that these wealthier 
landowners, working their way upward, should not 
be blackened and branded as usurers and blood- 
suckers (see Koninklijk Instituut 1961, p. 293). 

These elements of rural society should be ap- 
proached, by the specialized welfare agencies, in a 
way that Boeke called “person-centered.” Not com- 
pulsion, but patient persuasion should be the meth- 
od used by the welfare agencies. The effort should 
not be directed toward the poor masses, in order 
to achieve specific objectives, but toward individ- 
uals differentiating themselves from the mass by 
certain personal qualities. The basic assumption is, 


evidently, that the improvement of native welfare 
can only be a slow, gradual process. The person- 
centered welfare policy should be guided, above all, 
by “faith and patience” (Koninklijk Instituut 1961, 
p. 298). 

In his later publications, when Boeke had lost 
faith even in a slow process of spreading develop- 
ment, the wealthy “progressive” farmers more or 
less disappear from the center of his attention. His 
approach to problems of economic policy becomes 
truly dualistic, insofar as he has no faith any more 
in the possibility that the advance of the progres- 
sive farmers will slowly extend to the rural masses. 
A new policy has to be conceived for the latter 
group; as we have observed, it is the spiritual 
values that, according to Boeke's teachings in his 
later years, should be stressed in an approach that 
he called one of “village reconstruction.” 

The Indonesian experience. It is interesting to 
know how the oil-stain approach, actually applied 
by the agricultural extension service in postwar 
Indonesia, worked out in practice. As recorded by 
the capable Dutch rural sociologist H, ten Dam in 
an article based on a study of the western Java 
village of Tjibodas in the early 1950s, the fruits of 
such a policy are far from promising (Koninklijk 
Instituut 1961, pp. 347-382). In this village, the 
representatives of the government agencies came 
into personal contact only with the large landown- 
ers, who constitute roughly 14 per cent of the total 
village population. It was they who profited most 
from all the facilities provided by the agricultural 
extension services. It was they who, through a 
cooperative society in which they had the largest 
stake, got, at prices set by the extension service, 
artificial fertilizers and chemical preparations 
needed for the cultivation of profitable commercial 
crops. There was a group of smaller independent 
farmers with economic holdings, a number of 
whom attempted to follow the example of large 
landowners. But even if taken together, the large 
landowners and the independent farmers do not 
exceed 10 per cent of the total village population. 
The great majority, the farm hands and the part- 
time farmers with uneconomic holdings who have 
to work for the larger landowners as wage laborers 
or sharecroppers, do not profit at all from all the 
innovations introduced in local agriculture. 

Boeke’s early prophecy appears to have come 
true: as a consequence of a policy of “betting on 
the strong,” the rich grew richer and the poor grew, 
if not poorer, then at least more numerous—and 
more restive, 

The Indian experience. In India, whenever the 
“betting on the strong” approach was attempted, 
the results were exactly the same as in Java. Dube, 


in the villages investigated by him, assesses the 
results of the community development project, as 
far as the agricultural extension work is concerned, 
as follows: 


A closer analysis of the agricultural extension work 
itself reveals that nearly 70 per cent of its benefits 
went to the élite group and to the more affluent and 
influential agriculturists. The gains to poorer agricul- 
turists were considerably smaller. Being suspicious of 
government officials they did not seek help from the 
Project as often. . . . For the economic development 
of this group, as well as for that of the artisans and 
agricultural labourers, no programmes were initiated 
by the Project. (Dube 1958, pp. 82-83) 


The main obstacle to the oil-stain effect can be 
reduced to the obvious cause that following the 
example of the “progressive farmers” presupposes 
the possession of sufficient land and capital, besides 
the required “progressive” outlook and a mental 
preparedness to follow the example of the “strong” 
(India [Republic] 1957, vol. 1, p. 81; vol. 3, part 1, 
pp. 254-255). The small peasants in India are no 
more able to follow the example of the progressive 
farmers than their counterparts in Java had been 
able to follow, in the past, the example of the Euro- 
pean planters. Those possessing uneconomic hold- 
ings are more likely to be forced, in the long run, 
to sell their holdings in order to pay off debts. The 
larger landowners, profiting from technical innova- 
tions, may be tempted to extend their holdings; a 
progressive outlook as far as farming technique is 
concerned is by no means a guarantee for a pro- 
gressive outlook concerning principles of social 
justice, 

The prospect is complicated still more by the 
existence of a large category of landless farm 
hands. Can they be expected to profit from the 
improvements in farming techniques, even if they 
are introduced by a wealthier master—landowner? 
There is a wealth of evidence indicating that, on 
the average, real wages in rural India have not 
appreciably increased since the country gained its 
independence. Against certain local increases of 
rural employment opportunities, largely as a con- 
sequence of the introduction of irrigation works, 
we have to take into account not only the strong 
social, political, and economic position of the land- 
owning castes in the countryside, but also the rapid 
natural increase of the population in the rural 
areas. 

Only if most of the natural increase of the rural 
population is absorbed into nonagricultural occu- 
pations could a slowly mounting productivity per 
unit of land be turned, in the long run, into higher 
real wages for agricultural labor, through a raising 
of per capita production. In order to make mech- 
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anization of agriculture profitable, the absolute 
number of people dependent on agriculture would 
even have to be reduced, except insofar as a recla- 
mation of wastelands could increase the total cul- 
tivated area. But even the former condition—an 
approximately stationary population engaged in 
farming—is far from being realized in most parts 
of India, since urbanization and industrialization 
are developing much too slowly to be able to reduce 
considerably the percentage of the total population 
employed in agriculture, let alone to absorb the 
natural increase. 

The conclusion to be drawn from this constella- 
tion is obvious. Modern farm techniques are not 
likely to spread in a situation in which cheap agri- 
cultural labor is abundant. The gains to be derived 
from “community development” largely aimed at 
the “progressive farmers” will predominantly ac- 
crue to a restricted landowning group. A “betting 
on the strong” policy is likely to amount to a policy 
of “the devil take the hindmost.” 

In countries where the hindmost are restricted 
in numbers, a large proportion of them may be 
driven away from the countryside to seek employ- 
ment in an urban area; the remaining ones may, 
if they are not too numerous in comparison with 
the land surface available for cultivation, profit in 
the long run from the increased productivity per 
surface area and eventually learn improved farm- 
ing techniques from the more prosperous farmers. 

But in countries where the hindmost form a 
large majority, either as peasants with uneconomic 
holdings or as landless laborers, the “betting on the 
strong” policy cannot work. The large masses of 
the rural population are, for the time being, tied to 
their traditional rural occupations as farm hands 
or sharecroppers, which means that a gradual oil- 
stainlike spread of modern farming techniques to 
them is out of the question. These poor masses 
form a drag upon any attempt to solve the rural 
problem in a gradual way. 

Inyolution, evolution, and revolution. From the 
foregoing, it appears that sociology has not yet been 
able to establish a way to break through the rural 
impasse. It is important, however, that some schol- 
ars have been able to analyze its main causes. Their 
analyses may provide a more consistent and more 
refined way of defining the pre-take-off situation 
with which both economists and politicians are 
concerned. Rather than Boeke’s concept of dualism, 
Geertz’s concept of “involution” appears to be the 
crucial one. But, at the same time, this concept 
may point the way toward a solution of what 
seemed to Boeke to be an inherent and permanent 
dualism. If the prevailing process is one of involu- 
tion, it is clear why a gradual evolution is out of 
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the question. Sociologists and political scientists 
should investigate if the only response to the chal- 
lenge of involution should not be “revolution” —in 
the sense of a thorough reversal of the prevailing 
trends. Some thought should be given as to what 
extent such a revolution would use the substitu- 
tion for the “betting on the strong” approach of one 
of “betting on the many,” who, mainly through 
organization and intensive education toward effi- 
ciency and self-reliance, will be made strong. 

Boeke held that any innovations brought about 
by pressure were of no avail and that the only 
viable approach could be a personal one, based on 
persuasion and directed to the “happy few.” But if 
persuasion and education could be combined with 
activating the masses, which was never seriously 
attempted in a colonial setting, the outcome of such 
an approach might prove much more promising, 
even though a certain amount of pressure would 
be needed. 

Therefore the problems, if not the theoretical 
cadre and the solutions forwarded by Boeke in con- 
nection with the dualism prevailing in many parts 
of the world, are still as topical as they were fifty 
years ago. 

W. F. WERTHEIM 


[See also ASIAN SOCIETY, article on SOUTHEAST ASIA; 
DIFFUSION, articles on THE DIFFUSION OF INNOVA- 
TIONS and INTERPERSONAL INFLUENCE; ECONOMIC 
ANTHROPOLOGY; ECONOMIC GROWTH, article on NON- 
ECONOMIC ASPECTS. ] 
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ECONOMY AND SOCIETY 


The discipline of economics commonly focuses 
on only one aspect of social life—how people em- 
ploy scarce resources to produce, distribute, and 
exchange goods and services for consumption. 
Economists acknowledge that there are other as- 
pects—political, legal, educational, for instance— 
and that these aspects influence economic life sig- 
nificantly. However, they usually assume that, for 
purposes of economic analysis, these influences 
are constant and thus do not modify economic 
processes; or, as it is often put, tastes and institu- 
tions are “given.” By thus limiting the number of 
variables they study, economists have been able to 
advance their science, but they have also greatly 
oversimplified the relations between economic and 
other types of social variables. 

The field of economic sociology relaxes, as it 
were, these stringent assumptions about the rela- 
tions between economic and other types of social 
variables and explicitly focuses on explaining varia- 
tions in the ways these different types of variables 
impinge on one another. For example, in studying 
wages the economist concentrates mainly on the 
features of the market that influence and are 
influenced by wage levels, whereas the sociologist 
inquires into a wider range of causes and conse- 
quences. If management, for instance, persistently 
tinkers with wage levels, the sociologist may find 
out that this strengthens informal cliques of work- 
men and inclines them to subvert the authority of 
management; or it may foster the formation of a 
labor union or excite activity on the part of an 
existing one. Both these social consequences may 
then themselves “feed back” and influence the level 
of wages. 

Economic sociology. The relations between eco- 
nomic and other social variables may be studied 


at three levels. At the most concrete level, the 
economic sociologist applies the standard tools of 
sociology to the study of roles and organizations 
that specialize in economic activities. He may, for 
example, study the sources of recruitment, career 
patterns, life styles, and role strains of skilled 
workers in an industrial firm; or he may study the 
organization of the firm itself, analyzing the status 
systems, power and authority relations, patterns of 
deviance, cliques and coalitions, and the relations 
among these phenomena. 

At the second level, the economic sociologist 
analyzes the relations between economic structures 
and other structures. The work of Max Weber, 
especially, exemplifies this level of analysis. In 
particular, Weber was preoccupied with the condi- 
tions under which industrial capitalism of the 
modern Western type could arise and flourish. 
Initially, he was careful to distinguish between 
industrial capitalism and other forms, such as 
finance capitalism and colonial capitalism; indus- 
trial capitalism refers to the systematic and ra- 
tional organization of production itself, Having 
identified industrial capitalism, Weber sought to 
identify the historical conditions most conducive 
to its rise and continuing existence. One of his 
most famous arguments is that the rise of ascetic 
Protestantism, especially Calvinism, established 
social and psychological conditions conducive to 
this particular form of capitalism (1904-1905). 
Another well-known argument developed by Weber 
is that bureaucracy provides the most rational form 
of social organization for perpetuating industrial 
capitalism (1922). He also found in the political- 
legal complex, especially in property laws and 
monetary systems, many other institutional struc- 
tures conducive to industrial capitalism. 

Weber never developed his economic sociology 
into a full theoretical system. Rather, he remained 
at the level of generating historical insights about 
the pattern of institutional structures surrounding 
important economic phenomena. Even so, it is pos- 
sible to see his distinctive contribution to economic 
sociology. Unlike traditional economists, Weber 
was not interested in the regularities produced 
within the capitalist system of production (regu- 
larities such as the business cycle); instead, he 
was interested in establishing the important insti- 
tutional conditions under which capitalist systems 
—and their regularities—came into being. 

At the third, most abstract, level the economic 
sociologist considers the economic and other types 
of social variables as organized into analytic sys- 
tems, which cut across concrete social structures, 
and he studies the relations among these systems. 
This approach is exemplified in the works of Tal- 
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cott Parsons and Neil J. Smelser (see Parsons 
1937; Parsons & Smelser 1956; Smelser 1963), 
The economic system, for example, consists of 
suppliers of factors of production, producers, and 
consumers, none of which should be identified with 
any particular social structure but all of which can 
be viewed, at this abstract level, as interacting with 
one another in a network of markets. Furthermore, 
the economic system interacts with other social 
systems. For example, the economy provides wage 
payments and consumer goods and services in 
exchange for the motivation and skills, so neces- 
sary for a viable labor force, that are generated by 
the “pattern-maintenance” social system, which 
includes familial, educational, and specialized- 
training structures. 


The economy and cultural factors 


By culture is meant that complex of existential 
and evaluational symbols which provide meaning 
and legitimacy to the conduct of social life, Con- 
crete cultural items are cosmologies, values, ideolo- 
gies, aesthetic productions, and scientific knowl- 
edge. The economic sociologist is concerned with 
the ways in which these symbols facilitate or in- 
hibit various types of economic activity and the 
ways in which economic behavior leads to modifi- 
cations of cultural symbols. 

Little research has been conducted on the role 
of cultural factors within economic organizations. 
What has been done focuses mainly on problems 
of effective communication. One of the persistent 
problems of bureaucracies, for example, is the 
disruption that occurs when necessary information 
is altogether lacking, withheld, distorted in pas- 
sage, or too slow in arriving. Numerous studies of 
industrial bureaucracy have uncovered typical 
points of distortion and omission: subordinates 
“cover up” information they do not wish their 
superiors to know, and foremen “soften” orders 
from above out of sympathy with workmen. [See 
ORGANIZATIONS, article on ORGANIZATIONAL INTEL- 
LIGENCE.] 

Stimulated mainly by the Marxian and Weberian 
traditions, sociologists have conducted a more sub- 
stantial amount of research on the relations be- 
tween economic organization and cultural symbols. 
Some research, such as that of Robert N. Bellah 
on Tokugawa Japan (1957), focuses directly on 
Weber's problem concerning the ways in which 
religious beliefs stimulate or inhibit economic de- 
velopment. Other scholars have inquired into the 
relations between ideologies and economic activity. 
The most thorough study of ideologies as instru- 
ments of social control in industry is the compara- 
tive research of Reinhard Bendix ( 1956) on man- 
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agerial ideologies as they have developed in four 
industrializing countries—Great Britain, the United 
States, Russia, and East Germany. Bendix’s main 
concern is with the justifications that managerial 
classes have generated in the process of inducing 
workers to submit to their authority. The effects 
of ideology in relieving role-strains are emphasized 
in a study of the American business creed by 
Francis Sutton and his associates (1956). These 
investigators attribute the tenacity of the free- 
enterprise ideology among businessmen to strains 
in their roles: for example, their own ambivalence 
toward the phenomenon of bigness in the Ameri- 
can economy is smoothed over by a defiant reas- 
sertion of the values of traditional free enterprise. 
Ely Chinoy’s study of automobile workers (1955) 
also stresses the frustration-reducing elements of 
ideology, which often serves to rationalize the 
discrepancies between the expectations generated 
by the “American dream” of equality of opportunity 
and the actual life situations of the workers. 


The economy and solidary groupings 


Solidary groupings include families, friendship 
groups, and diffuse collectivities, such as neigh- 
borhood, racial, and ethnic groupings. Solidary 
groupings, which occupy an important place in the 
integrative system of society, are typically char- 
acterized by face-to-face interaction and close per- 
sonal understandings and loyalties among mem- 
bers. The economic sociologist asks how these 
groupings influence, and are influenced by, eco- 
nomic behavior. 

Industrial sociology. Recognition of the impor- 
tance of solidary groupings within industrial organ- 
izations is closely associated with the rise of the 
“human-relations” approach in industrial sociology 
during the interwar period. Before the rise of this 
school, industrial sociology in the United States 
was mainly under the influence of the “scientific- 
management” approach associated with the name 
of Frederick W. Taylor. According to this approach, 
workers in industrial organizations can be treated 
primarily as isolated neurophysiological organisms 
with definite capacities and skills; their efficiency, 
moreover, is largely a product of such factors as 
speed of work, amount of effort expended, and 
amount of rest allowed for recovery of strength. 
The scientificmanagement school more or less 
ignored the social-psychological determinants of 
morale and efficiency. 

In the mid-1920s, when the famous Hawthorne 
experiments on productivity began in the Western 
Electric Company in Chicago, the initial emphasis 
was on the effects of various nonsocial factors 
(lighting and rest periods, for example) on worker 


performance. During the course of the experi- 
ments, however, it became apparent to the investi- 
gators that these “physical” factors were not nearly 
so important in fostering high morale and produc- 
tivity as various “human” factors, such as receiving 
Status, being accepted in a meaningful primary 
group, and being allowed to express grievances to 
a patient and responsive authority. The investiga- 
tors, in short, isolated the solidary grouping as a 
determinant of economic behavior (see Roethlis- 
berger & Dickson 1939). Further experiments con- 
ducted in the Bank Wiring Room revealed that 
cliques of workmen could also affect productivity 
adversely by setting and enforcing their own infor- 
mal standards of production rates, which were 
different from those established by management. 
These kinds of findings have given rise to a whole 
tradition of research on the diverse relations be- 
tween the formal and informal organizations within 
a bureaucracy and on how these relations affect or- 
ganizational functioning. [See ORGANIZATIONS, ar- 
ticle on THEORIES OF ORGANIZATIONS. ] 

Family structure. Most research on the rela- 
tions between economic organization and solidary 
groupings concerns the ways in which certain 
kinds of family structure may facilitate or inhibit 
economic activity. For example, primogeniture may 
facilitate the formation of an urban-industrial labor 
force, since this type of inheritance system ejects 
younger sons from the land and makes them pos- 
sible candidates for other lines of employment; 
thus, research by Conrad M. Arensberg and Solon 
T. Kimball (1940) found that younger sons con- 
stituted many of the migrants in rural Ireland. 
On the other hand, certain types of family life may 
inhibit economic activity. David Landes (1951) 
has argued that the peculiar structure of the 
French business family has kept the typical firm 
small and has thus inhibited economic growth. In 
his study of the Chinese family (1949), Marion 
J. Levy isolated the factors of particularism, fa- 
voritism, and functional diffuseness as features of 
classical Chinese kinship that tended to inhibit 
industrialization. 

With respect to the influence of urban-industrial 
life on the family, most research in the interwar 
period—for example, the work of Ernest W. Bur- 
gess and Harvey J. Locke (see Burgess et al. 1945) 
—was based on the assumption that urbanization 
and industrialization lead to the isolation and dis- 
integration of the family. More recent research has 
challenged or modified this thesis. Talcott Parsons 
has argued that the urban-industrial complex does 
not cause the family to deteriorate but rather gives 
rise to a more highly specialized, and in some ways 
more effective, family system (Parsons et al. 


1955). In a massively documented study, William 
J. Goode (1963) has attempted to demonstrate and 
explain the fact that the conjugal nuclear family 
is the more-or-less universal concomitant of mod- 
ernization. 

Studies of consumption. In one respect the 
economists’ traditional concern with personal con- 
sumption reflects a preoccupation with the rela- 
tions between solidary groupings and the economy, 
since this type of consumption commonly refers 
to the demands made by the household sector on 
the production sector. Throughout most of the his- 
tory of economic thought, scholars have postulated 
some rather simple psychological principles that 
are assumed to govern consumers’ behavior—prin- 
ciples such as the hedonistic calculus, diminishing 
marginal utility, or indifference-curve maps. Mani- 
festly, however, as the work of Thorstein Veblen 
has so well demonstrated, social position and at- 
titudes directly affect tastes and, by implication, 
make for variation in individual and aggregate 
consumption functions (Veblen 1899). In recent 
decades economists and sociologists have attempted 
to mobilize a number of social and psychological 
variables to explain differential consumption and 
saving patterns. James Duesenberry, for example, 
formally incorporates a principle of imitative spend- 
ing into his theory of consumer behavior (1949). 
Milton Friedman posits age and composition of 
family as determinants of tastes, which in turn af- 
fect the consumption function (1957). George 
Katona, using the survey method, has undertaken 
to study consumer attitudes extensively and to pre- 
dict consumer behavior from these attitudes (1951; 
1960). Finally, sociologists and economists have 
produced a plethora of empirical studies showing 
how social-structural variables such as age, stage 
in family cycle, race, home-ownership status, and 
degree of urbanization determine spending and 
saving patterns. On the whole, these efforts have 
not produced a formally adequate theory of con- 
sumption that systematically incorporates social- 
structural variables. The closest approximation to 
such a theory is found in the work of Guy H. Or- 
cutt and his associates (1961), who have intro- 
duced variables such as marital status and duration 
of marriage, as well as age, education, and race of 
head of household, into probability models of 
spending and saving behavior. 


Role differentiation and stratification 

From at least the time of Adam Smith, who paid 
particular attention to the division of labor as a 
determinant of the wealth of a nation, economists 
have recognized the importance of role differentia- 
tion in economic life. By and large, however, their 
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concern has been restricted to the relations be- 
tween the specialization of labor and economic 
variables, such as productivity, wage differentials, 
the distribution of the shares of income, and se- 
lected aspects of labor-management relations. So- 
ciologists have introduced more and different kinds 
of variables into the analysis of occupations and, in 
particular, have focused explicitly on the organiza- 
tion of differentiated roles into stratified systems. 

Economists’ versions of the attitudes and be- 
havior of individuals in occupational roles generally 
follow the logic of supply and demand. They as- 
sume that the amount of work a person offers in 
the market is some function of the economic re- 
wards available to him. While a number of differ- 
ent labor-supply functions have been advanced, 
most of these ignore the effects on the worker of 
social factors other than economic incentives. By 
contrast, sociologists conceive of roles (including 
occupational roles) as organized clusters of activ- 
ities involving interaction with the physical, social, 
and cultural environments. These activities are 
structured and regulated by values, norms, and 
sanctions. Economic rewards and deprivations are 
important, but they constitute only one of several 
types of sanctions. Roles, furthermore, are subject 
to a number of types of strains, reactions to which 
influence the incumbent's attitudes and behavior. 
In analyzing the foreman’s role in industry, for 
example, sociologists have discovered that his role 
is subject to considerable strains because it has 
been emasculated in past decades in two important 
ways: first, centralized management of control has 
made him less an independent authority over pro- 
duction and more an implementor of ready-made 
decisions; second, the centralization of the hand- 
ling of grievances in the unions has relieved him 
of certain “human-relations” functions. The fore- 
man, caught in a role that is simultaneously 
limited in function and beset by cross-pressures, 
often wavers between identification with manage- 
ment, with workers, and with other foremen (Leiter 
1948). Contemporary sociological research has 
yielded similar types of descriptions and analyses 
of behavior in executive, professional, worker, and 
miscellaneous service roles. 

Above and beyond wage conditions and level of 
prosperity, which are perhaps the most important 
determinants, economists and sociologists have un- 
covered a number of variables that influence labor 
turnover and absenteeism (for example, see Parnes 
1954). Social factors influencing turnover are 0c- 
cupation (the average turnover of teachers, for 
instance, is much lower than that of factory work- 
ers); age (older workers tend to change jobs less 
often than younger workers); sex (women move 
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in and out of the labor force more than men but 
probably do not move geographically and occupa- 
tionally so much); and race (Negro men tend to 
change jobs more often than white men). Some 
of the conditions that affect absenteeism, besides 
wages, are distance of residence from plant; size 
of firm (which is undoubtedly related to morale); 
occurrence of holidays (absenteeism drops just 
before holidays); age (young men display absen- 
teeism more than old); marital status (single men 
are absent more than married men); and arduous- 
ness of work (which encourages absenteeism). [See 
LABOR FORCE, article on MARKETS AND MOBILITY.] 

Stratification systems. Much of the sociological 
research on role differentiation within economic 
organizations has concerned the relations between 
formal and informal organization (reviewed above). 
In addition, sociologists have studied status sys- 
tems of industrial organizations and their relations 
to behavior. As a general rule, status in an organi- 
zation is determined by the degree of skill, the 
level of authority and responsibility, and the level 
of remuneration of an occupational position. How- 
ever, additional criteria that are imported from 
“outside” the workplace, such as age, sex, and 
racial or ethnic background, affect the status of the 
individual in the organization. 

Because of the multiplicity of determinants of 
general status, individuals in organizations fre- 
quently experience a sense of ambiguity as to their 
actual status (Homans 1953). This leads to a focus 
on the symbolic aspects of status, such as the 
number of telephones or amount of floor space per 
office. Many conflicts in organizations revolve 
around the distribution of symbols rather than the 
distribution of the determinants of status under- 
lying these symbols. Another characteristic of in- 
dustrial organizations that arises from the multi- 
plicity of determinants of status is the tendency 
toward “status crystallization”—that is, bringing all 
aspects of status roughly into line with one another. 
If an individual is paid much for performing a low- 
skill job, for example, dissatisfactions both for 
him and his fellow workers, as well as pressures 
to balance wage and skill levels, arise. This tend- 
ency for all aspects of status to be brought into line 
underlies many conflicts in industrial settings: for 
example, opposition to promoting a young execu- 
tive too rapidly for his age and experience, to plac- 
ing women in positions of authority, and to 
elevating Negroes to high-level jobs. 

With respect to the relations between economic 
systems and stratification systems, two strands of 
research dominate the contemporary scene. The 
first, stemming from the Marxian tradition, con- 


cerns the degree to which specific types of eco- 
nomic systems appear to give rise to distinctive 
types of stratification systems. Alex Inkeles and 
Peter Rossi (1956), for instance, found that in a 
number of industrialized nations, occupations as- 
sociated with industrial production (engineer, fore- 
man, machine worker, etc.) were assigned very 
similar positions in the general prestige hierarchy 
of occupations. The second strand of research, 
stemming perhaps from an ideological preoccupa- 
tion with equality of opportunity, concerns the 
degree to which rates of upward social mobility 
increase or decrease in industrial society, Basing 
their case on extensive comparative data, Seymour 
M. Lipset and Reinhard Bendix (1959) argued that 
rates of social mobility are very similar in all the 
industrial societies of the West and that these simi- 
larities are traceable to common features of the 
occupational structures of these societies. Addi- 
tional research, the most conspicuous of which is 
the work of Natalie Rogoff (for example, 1953), 
concerns the degree to which opportunities for 
upward mobility are “closing” under conditions of 
advanced industrial development in the United 
States. 


The economy and political factors 


The term “political” here refers to the creation, 
institutionalization, and utilization of power in a 
social setting. Defined this broadly, the term en- 
compasses the study of authority systems in organi- 
zations, the study of governments, and the study 
of competition and conflict among groups. The 
economic sociologist is concerned with studying the 
mutual relations between economic activities and 
these diverse political phenomena. 

Most studies of authority systems within indus- 
trial firms involve the relations between the types 
of supervision (typically described in terms of an 
authoritarian—democratic dimension) and worker 
morale, productivity, and receptivity to innovations. 
The most common finding is that worker morale 
and productivity are higher under “employee-cen- 
tered” leadership than under leadership oriented 
to technical standards of efficiency (Viteles 1953). 
Most of the experiments and field studies on su- 
pervision have been carried out in countries with 
democratic traditions, especially the United States 
and Great Britain; obviously, studies of societies 
with different political traditions might show dif- 
ferent results. - 

Research on the political relations between eco- 
nomic organizations and other social units has 
advanced on a variety of fronts. Under the heading 
of “the economics of imperfect competition,” econ- 


omists have studied a variety of political involve- 
ments of business firms: ways in which large firms 
are able to control, and perhaps dominate, prices 
and output under monopolistic conditions; ways 
in which wealth and power become concentrated 
in the economy; and ways in which government 
policies (e.g., antitrust policies) influence the 
structure and behavior of firms (for example, 
Mason 1959). Economists and sociologists have 
devoted a modest amount of research to the power 
relations between stockholders and managers and 
between consumers and business firms, but far 
more research has been done on the power relations 
and conflicts between business firms and labor 
unions. In particular, scholars have devoted much 
study to the causes of labor disputes (see Korn- 
hauser et al. 1954; Knowles 1952). Some causes 
have been fairly well established: strikes occur 
more frequently under conditions of prosperity, 
more frequently in isolated industries with homo- 
geneous worker populations, and less frequently 
under totalitarian governments and in periods of 
national crisis. 

Finally, much contemporary research and dis- 
cussion focuses on the relations between business 
and government. This research is shrouded in ideo- 
logical controversy and confused findings. One 
school of thought, advanced by the late C. Wright 
Mills (1956), argues that national political power 
in the United States has become increasingly con- 
centrated in recent decades and that the holders 
of power and makers of important decisions are a 
small group of corporate executives and military 
officials; this view, however, has been challenged 
on both methodological and substantive grounds. 
The empirical research on economic controls over 
local politics also shows a mixed picture. For ex- 
ample, in a study of a Southern community, Floyd 
Hunter (1953) concluded that the major decisions 
were guided by a small group of economically 
dominant individuals. Other research—for ex- 
ample, that of Delbert Miller (1958) and Robert 
Schulze (1958 )—indicates that the degree to which 
business groups dominate local politics varies con- 
siderably from community to community and over 


time. 
NEIL J. SMELSER 


[Directly related are the entries ECONOMIC THOUGHT; 
INDUSTRIAL RELATIONS; INDUSTRIALIZATION; SYS- 
TEMS ANALYSIS, article on SOCIAL SYSTEMS. Other 
relevant material may be found in CONSUMERS; 
ECONOMY, DUAL; PEASANTRY; and in the biographies 
of BARNARD; DURKHEIM; Keynes, JOHN MAYNARD; 
LE PLAY; MARSHALL; MARX; PARETO; SMITH, ADAM; 
SomBarT; WEBER, MAX.] 
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Francis Ysidro Edgeworth (1845-1926) was 
raised on the family estate of Edgeworthstown, 
County Longford, Ireland. His father died when 
Edgeworth was two years old. Edgeworth studied 
under tutors and spent considerable time reading 


and memorizing the classics and English poetry. 
From an early age he also read widely in Spanish, 
French, German, and Italian. This interest in Con- 
tinental literature may well have been strengthened 
by the fact that his mother was the daughter of a 
Spanish political refugee. One of Edgeworth’s aunts 
was the famous novelist Maria Edgeworth, and the 
poet Thomas Lovell Beddoes was his cousin. 

At the age of 17, Edgeworth entered Trinity Col- 
lege, Dublin, and then proceeded to Balliol College, 
Oxford, where he received first class honors in 
litterae humaniores. After several years of prac- 
ticing law in London, he accepted a lectureship in 
logic at King’s College and was later appointed 
Tooke professor of political economy. 

In 1891 he became Drummond professor of 
political economy at Oxford and a fellow of All 
Souls College. Never marrying, he resided prin- 
cipally at All Souls for the remainder of his life, 
although he maintained rooms in London for over 
fifty years and inherited the family estate in 1911. 

Although he was highly respected as a teacher 
and original thinker by both economists and statis- 
ticians of his time, Edgeworth does not seem to 
have profoundly influenced the then current think- 
ing in either field. Present readers, however, find 
much that is stimulating and informative in Edge- 
worth and not uncommonly note with some sur- 
prise that ideas they believe to have originated 
recently are, sometimes clearly, sometimes vaguely, 
anticipated in Edgeworth’s work. 

When he became professor emeritus at Oxford 
in 1922, Edgeworth was serving his second term 
as president of the economic section of the British 
Association and was a vice-president of the Royal 
Economic Society. He had earlier served as presi- 
dent (1912) and council member of the Royal 
Statistical Society; he had been awarded its Guy 
medal in 1907. He was also a fellow of the British 
Academy. 

Undoubtedly, Edgeworth’s greatest professional 
contribution, in addition to teaching and writing, 
was his editorship of the Economic Journal. 
He became its first editor in 1891 and was later 
chairman of the editorial board. At the time of 
his death in 1926, Edgeworth was joint editor with 
Keynes. 

He is reported (L. L. P. 1926) to have been an 
effective and stimulating teacher, attracting good 
Students and encouraging many vigorous discus- 
sions. Frequently he would become one of the less 
active participants, intervening occasionally to ex- 
pose an error or to bring a neglected but fruitful 
problem to the attention of his class. 

Edgeworth’s main written contributions to eco- 


nomics are contained in seven small books and 
numerous journal articles and reviews. In 1925 
his principal articles and reviews were published 
in three volumes entitled Papers Relating to Politi- 
cal Economy (see 1891-1921) under his own 
editorship and under the sponsorship of the Royal 
Economic Society. 

Mathematical Psychics, published in 1881, is 
probably Edgeworth’s most important writing. His 
“contract curve” and its generalization, the set of 
“Edgeworth-allocations” (Debreu & Scarf 1963; 
Vind 1964) or “core,” are still basic concepts for 
the theoretical study of exchange equilibrium and 
welfare economics; and his theory of barter has 
provided a valuable point of departure for recent 
research on dynamic economic adjustments (Uzawa 
1962), Edgeworth discussed a number of interest- 
ing bargaining situations in light of the theory 
presented. 

Edgeworth’s arguments in the first section of 
Psychics against the common view that mathe- 
matics could be applied only to numerical phenom- 
ena were quite advanced and provocative then but 
will appear awkward and unnecessary to a current 
reader who has studied set theory and topology. 
Much of the remainder of the book is concerned 
with attempts to establish practical implications of 
utilitarian ethics. These views were, and remain, 
highly controversial. 

Edgeworth’s collected works include a group of 
related papers (1897a; 1897b; 1899; 1911; 1912) 
on taxation, price discrimination, and monopoly 
that involve original theoretical developments, 
which are still of considerable interest. In these 
papers are a number of famous paradoxes, such 
as: A tax on one of two monopolized commodities 
which are appropriately related in production, con- 
sumption, or both may cause prices of both com- 
modities to fall; and the introduction of price 
discrimination into a competitive market might 
benefit both producers and consumers (where con- 
sumer benefit is taken to be aggregate consumer 
surplus), The 1912 paper contains a very good 
discussion of the difficulties of establishing a the- 
oretical equilibrium for duopolists or monopolists 
who sell related commodities. A number of other 
papers deal with tax problems of Edgeworth’s time 
and with utilitarian principles for determining 
equitable taxation. Quantitative problems encoun- 
tered in contemplated applications of utilitarianism 
are also treated in his first book, New and Old 
Methods of Ethics (1877). 

Edgeworth served several years as secretary of 
the Committee on Value of the Monetary Standard 
of the British Association for the Advancement of 
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Science. This interest is reflected in ten articles 
on index numbers and the value of money. Two 
memoranda prepared for the British Association 
for the Advancement of Science (1887—1889) con- 
tain what can still be regarded as a fairly com- 
prehensive discussion of theoretical problems of 
index number constructions. 

Metretike draws a number of interesting par- 
allels beween utility and its measurement and 
application, on the one hand, and probability and 
its measurement and application, on the other 
(1887a@). Although there are some interesting ob- 
servations, there is no fundamental interrelation- 
ship such as that later achieved through the axiom 
systems of Ramsey, Savage, and others.” 

The other four of Edgeworth’s books are lectures 
on war finance delivered during the war and im- 
mediately afterward (1915a; 1915b; 1917; 1919). 
These works, like most of his writings on current 
policy issues, drew more criticism than praise from 
his colleagues: “[He] might descend eventually on 
one side or the other of the fence, but, . . . he kept 
himself so long poised evenly midway that the final 
movement when, and if, it happened, was apt to 
be unnoticed” (L. L. P. 1926). Keynes remarked: 


He feared a little the philistine comment on the strange 
but charming amalgam of poetry and pedantry, sci- 
ence and art, wit and learning, of which he had the 
secret; and he would endeavor, however unsuccess- 
fully, to draw a veil of partial concealment over his 
native style, which only served, however, to enhance 
the obscurity and allusiveness and half-apologetic air 
with which he served up his intellectual dishes. 
(Keynes [1926] 1963, p. 227) 


Some basis for these complaints must be ad- 
mitted, Even in his theoretical and highly mathe- 
matical writings, Edgeworth’s expository skill is 
notably less than his insight and logical ability. 
The overapologetic tone Keynes mentions is fre- 
quently apparent; it is illustrated by Edgeworth’s 
description of his correction of a colleague's mis- 
interpretation as: “points of detail on which the 
critical shoemaker corrected the masterpiece of 
the Grecian painter ({1891-1921] 1963, vol. 1, 
p. 143). 

Another factor which frequently led to lengthy 
and sometimes complicated passages in Edgeworth 
and drew critical comments from his contempo- 
raries was his desire to account explicitly for every 
logically possible combination of circumstances. 
Many current readers are more patient on this 
score. A more mathematically inclined generation 
is better aware of the advantages of carefully keep- 
ing track of the exact relations that exist between 
alternative conditions and consequences. 
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Among Edgeworth’s other publications that 
should be noted are his article on the law of error 
in the 1902 Encyclopedia Britannica, his article 
on probability in the 1911 edition, and his biog- 
raphy of Mill and article on index numbers in 
Palgrave's Dictionary of Political Economy. 

Edgeworth’s best-known work on distribution 
theory is a series which gives an asymptotic ap- 
proximation to a fairly general class of distribution 
functions and is still sometimes used (Cramer 
1945, pp. 228-231). Under the heading “The law 
of error,” Edgeworth used this series to derive sev- 
eral versions of the central limit theorem and to 
approximate a number of empirical frequency dis- 
tributions (1883; 1898-1900; 1905; 1926; and 
with Bowley, 1902). He also extended Galton’s 
work on correlation, and he developed a formula 


for the general multivariate normal density, and, 


considered some of its properties (1893; 1905). 

Following Bernoulli, Laplace, and Mill, Edge- 
worth was basically Bayesian in his approach to 
statistical inference, His “genuine inverse method” 
(1908-1909) consisted of finding a normal approx- 
imation to the posterior distribution of unknown 
parameters (quaesita, or frequency constants, in 
his terminology), assuming a diffuse prior distribu- 
tion, and equating the means of the posterior dis- 
tribution to their maximum likelihood estimates 
(called “most probable values”). He properly ob- 
jected to forming a uniform prior on the basis of 
ignorance, since this would imply a specific non- 
uniform prior on any transformation of the origi- 
nal parameters. However, as Edgeworth also recog- 
nized, any reasonably smooth and unconcentrated 
prior will lead to results that differ little from those 
obtained from either a uniform distribution or a 
diffuse distribution when the number of observa- 
tions is large. Thus Edgeworth’s genuine inverse 
method was based on some of the same underlying 
ideas as the theory of stable estimation examined 
earlier by Fisher and Jeffreys and recently em- 
phasized by Bayesian statisticians (Edwards et al. 
1963). 

Not all of Edgeworth’s work on inference was 
developed from this approach. He believed the 
prior distribution would often be unknown and 
looked for other principles and devices. He dis- 
cussed the difference of two means (1905; 1908- 
1909) and the possible significance of an estimate 
of trend (1886) in language much like that of the 
Neyman-—Pearson theory of hypothesis testing. 

Several of Edgeworth’s papers are concerned 
with the problem of the “best” mean (e.g., 1887a). 
This usually means an estimate of a population 
parameter, optimal according to some stated cri- 


terion; but the case of finding a suitable descrip- 
tive representative for a collection of empirical 
data is also considered. A criterion frequently em- 
ployed is that of minimizing expected loss (called 
“least detriment” or “minimum disadvantage”), 
and a number of inquiries proceed under the as- 
sumption that all that is known of the loss function 
is that it is symmetric and an increasing function 
of the deviation of an estimate from the true value. 
This assumption led Edgeworth to criticize Laplace 
for advocating widespread use of least-squares. 
Edgeworth advocated use of the median in many 
circumstances. Noting that the median of a uni- 
variate sample minimizes the sum of absolute devi- 
ations, he developed a method for fitting a straight 
line to bivariate data to minimize this sum (1923). 
Edgeworth never developed his many original 
contributions to either economics or statistics to 
make a comprehensive coordinated work. He dealt 
with questions only dimly understood in his time 
(many are still controversial and not fully devel- 
oped), and although he made significant contribu- 
tions, I suspect that he himself was not satisfied 
with the state in which hè left most of his topics. 
He was still working with enthusiasm shortly be- 
fore his death at the age of 81. His work, however, 
has retained much more interest than that of some 
who had greater impact on their contemporaries. 
Modern mathematical economists and statisticians 
find him a stimulating and reassuring intellectual 
forebear, surprisingly up-to-date in many respects 

and still instructive. 
CLIFFORD HILDRETH 


[For discussion of the subsequent development of Edge- 
worth’s ideas, see BAYESIAN INFERENCE; UTILITY.] 
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EDUCATION 


The articles under this heading are for the most 
part devoted to discussions of education as an insti- 
tutionalized form of socialization to adult roles, The 
articles under SoctatizaTIon describe other and 
broader aspects of this concept. Education as an 
occupation is described in TEACHING. Particular in- 
stitutions that have an educational function are 
described in ADULT EDUCATION; COMMUNICATION, 
Mass; RELIGIOUS ORGANIZATION; UNIVERSITIES; 
VOLUNTARY ASSOCIATIONS. The economic aspects 
of education are discussed in CAPITAL, HUMAN; the 
psychological aspects in ACHIEVEMENT TESTING; 
EDUCATIONAL PSYCHOLOGY; INTELLECTUAL DEVEL- 
OPMENT; INTELLIGENCE AND INTELLIGENCE TEST- 
Inc; LEARNING. Other relevant material may be 
found in EQUALITY; LEISURE; OCCUPATIONS AND 
CAREERS; PROFESSIONS; and the biographies of 
Dewey; DURKHEIM; MANNHEIM; MONTESSORI; 
VEBLEN; WALLER; WEBER, MAX. 


1. THE STUDY OF EDUCATIONAL 
SYSTEMS 
1. EDUCATION AND SOCIETY 
11. EDUCATIONAL ORGANIZATION 


Burton R. Clark 
C. Arnold Anderson 
A. H. Halsey 


I 
THE STUDY OF EDUCATIONAL SYSTEMS 


asic activity of mankind. Men 


Instruction is a b: 
inform one another, conveying belief, knowledge, 
earn a living, 


and skill, as they raise a family, 
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govern a polity, minister a church, nurse the ill, 
encounter friends, or communicate through a mass 
medium. All social systems, large or small, contain 
occasions for learning and participate to some de- 
gree in transmitting culture and in socializing the 
individual, But the degree of educational involve- 
ment is often minor, since many systems of regular 
transaction leave instruction undifferentiated and 
incidental and are not characterized by their edu- 
cational effort. Some webs of human relations, 
however, are designed to instruct: they raise in- 
struction to purpose; they possess roles of teacher 
and learner; they receive social definitions—school, 
college, education—that signify their emphasis. 
They are set apart in society precisely to differenti- 
ate instruction from other activities and make it a 
form of adult work. When educational activities 
are thus concentrated, we call them educational 
systems, 

The study of education, while prepared to pur- 
sue instruction in its weaker expressions, focuses 
on these units that constitute a distinctive sector. 
What transpires within the school and college to 
determine human behavior and the relations of 
men? What activities besides instruction emerge 
to characterize the educational enterprise? How 
are educational systems shaped by, and how do 
they in turn affect, the structures of society that 
allocate resources, exercise power, and provide 
social order? 

Education fascinates the citizen and the scholar 
alike in the modern era, as it grows in importance 
but retreats from the view of the amateur, New 
and expanding economic, political, and social func- 
tions pull education into the mainstream of society. 
At the same time, growing size, deepening com- 
plexity, and ever greater specialization mask the 
processes of education—in the recesses of the city 
school district, the privacy of thé boarding school, 
the esoteric divisions of the giant university, the 
educational subsystems of religious, industrial, and 
military establishments. Rapid change undermines 
the understanding of education that everyone pos- 
sesses from the remembrance of things past. Car- 
ing more but knowing less, practical men support 
systematic study of education in the hope it will 
inform the conventional wisdom and advise edu- 
cational policy. Aware that political, economic, 
social, and psychological theory must comprehend 
the educational domain, social scientists have 
turned a forgotten realm into a field of special in- 
terest. As they point out, a major social institution 
that touches the lives of all and incorporates a fifth 
or fourth or third of the population at one time is 


overlooked only at considerable peril to social prac- 
tice and social theory. 

Among the attending social sciences, sociology 
casts a wide net in its pursuit of educational be- 
havior. The sociology of education is partly a hold- 
ing company for men who study particular social 
forms and processes and happen to have followed 
their interests into the educational domain. Stu- 
dents of small-group behavior, childhood and ado- 
lescence, deviancy, stratification, occupations, sci- 
ence, law, formal organizations, and leadership are 
found wandering in schools and colleges in search 
of findings that would develop a perspective, sug- 
gest a concept, or warrant a generalization. The 
ties of the sociology of education to general soci- 
ology are thus many and intimate, defined and 
redefined by the shifting interests and locales of 
sociological research. This research field is also, 
however, a search for unique features of social 
Structure that characterize educational systems. 
Every major social institution has dynamics of its 
own, special social functions, and particular prob- 
lems of performance. When society locates so 
much cultural transmission and socialization in a 
separate major institution, the distinctive social 
qualities of that institution will have significant 
consequence for the rest of society. The need for 
social research here is that of clarifying the major 
implications of concentrating instruction in ad- 
ministered settings, where it becomes the business 
and vested interest of a segment of the population. 

Dividing along lines of social structure, there are 
four broad sectors of sociological approach: the 
connection of education to the rest of society; the 
character of education as a broad institution com- 
posed of many organizations and practitioners; the 
internal life of the school or college; and the sys- 
tems of organized instruction that arise in major 
institutions other than education itself, 


Education and society 


Education, economy, and politics. As a society 
undergoes industrialization and modernization, its 
instruction of the young becomes extensively dif- 
ferentiated, internally complex, and elaborately 
connected with other features of society. Education 
becomes more necessary for the economy and 
linked closely to it as a major mediator between 
manpower demand and labor supply. Occupational 
competence, general and specific, is increasingly 
certified by schooling, and achievement is thereby 
prefigured, as labor shifts from manual to mental 
and from low to high degree of skill. In the aggre- 
gate, those who leave school early are designated 
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for unskilled work, and those who remain on the 
educational escalator are carried to the jobs for 
which their general education and specialized vo- 
cational training have earmarked them. Higher 
education is also deeply involved in technological 
advance, as a location for scientific work and as 
the enterprise that trains the modern researcher 
and technologist. Thus, education becomes a way 
of investing in human capital across many levels 
of skill. 

If brain workers are the economic need of socie- 
ties in advanced industrialism, governmental as 
well as industrial leaders must become interested 
in the agencies that attempt systematically to de- 
velop mental capacities. The relation of education 
to government becomes extensive as public officials 
scrutinize schools and colleges for labor-force effec- 
tiveness and worry about the costs and benefits of 
different programs for training educated labor. The 
relation of education to the main stream of politics 
grows as mass education creates new publics— 
some active, some passive—that attract the poli- 
tician and the political party to issues of what the 
government is and should be doing in education. 

Mass education also deepens the role of the 
school in political socialization. In the past, the 
central political contribution of the schools has 
been the training of a small governmental elite, as 
in the case of the Victorian public school in Eng- 
land (Wilkinson 1964). In modern society, educa- 
tion becomes a prime source of differences in po- 
litical perspectives across large populations. The 
more highly educated are more aware of the impact 
of government and are more likely to consider 
themselves free to engage in political discussions 
and competent to influence governmental affairs; 
they “possess the keys to political participation and 
involvement” (Almond & Verba 1963, p. 318). 
Among the settings that shape belief, the school is 
also pre-eminently within the reach of political 
control, tempting new governments or revolution- 
ary regimes to use it as an instrument for legiti- 
mating particular forms of government and reform. 
Strong pressures exist to move education from 
elite to mass indoctrination. 

Thus, education increasingly becomes a branch 
of political economy. The sociology of education 
here converges with the growing body of inquiry 
into the economics and politics of education and 
with the interests of political scientists and polit- 
ical sociologists in governance and political social- 
ization. 
~ Social stratification and mobility. As education 
connects more closely to the economy and the po- 
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litical order, the role of education in assigning 
status to individuals and groups also sharpens and 
intensifies. The paths of social mobility run 
through the school; the system divides the young 
and assigns them to adult statuses by means of 
years of schooling and specific occupational prep- 
aration. Thus, the equality of educational provision 
and access, across divisions of class, ethnicity, and 
race, becomes a critical social and educational 
problem. Does the educational system function 
primarily as an institution of social inheritance, 
stabilizing social position across generations, or as 
an institution of social mobility, appointing sons to 
statuses different from their fathers’? Whatever 
else education in advanced industrial societies is 
about, it is about equality. 

Massive inequality is generated in modern so- 
ciety when there is considerable difference in the 
quantity and quality of schooling between rural 
and urban areas, among regions of a country, and 
among the neighborhoods and suburbs of the 
metropolis. Where schooling is of slight conse- 
quence to the fate of the masses, such patterned 
differences matter little. But as schooling becomes 
the precursor of adult status, the demand to equal- 
ize educational provision is heightened, In the 
United States, where social and educational doc- 
trines have emphasized equal opportunity, the com- 
prehensive school was for some time considered an 
assurance of equal treatment. But the pulling apart 
of the classes and the races in the major metro- 
politan areas, often into separate educational dis- 
tricts, has turned the public schools into agencies 
that are unequally equipped. Extensive regional 
differences have also been revealed, especially in 
the education of the Negro minority. With this, 
equality becomes a political and administrative 
issue of planning across larger sectors of popula- 
tion to equalize school resources, personnel, and 
climates of learning. 

Personal aspiration and the capacity to use edu- 
cation—or be processed by it—also varies greatly 
by the class- and ethnic-linked characteristics of 
the home, the age group, the neighborhood, and 
the state or region of country. These shaping en- 
vironments of youth, external to the school, are 
ascribed rather than achieved. When these early 
and systematic inequalities of background are put 
alongside the growing importance of education in 
adult status, they appear unfair, and their reduc- 
tion becomes a matter of social justice. Certain 
social settings of child rearing—the slums—are 
deemed educationally incompetent, giving the child 
so little educational capacity and drive that he ap- 


512 EDUCATION: The Study of Educational Systems 


pears culturally stunted and destined to be shunted 
to the depths of the lower class (Work Conference 
. + . 1963), Democratic doctrine and the politics of 
democracy then urge that the educational domain 
be extended into earlier years of life, so that formal 
agents of care and instruction may blunt and over- 
come the educationally negative effects of family 
and neighborhood. The extending and deepening 
connections of education to the economy and social 
status thus move educational institutions into a 
deeper encroachment upon the domains of social- 
ization that in early society were monopolized by 
family, church, and community. 

Modern research (e.g., Halsey et al. 1961) sug- 
gests the existence of deep and systematic differ- 
ences in educability that are a product of the early 
preschool and extraschool environments. A society's 
pool of ability is socially as well as biologically de- 
fined; “talent” is a function of social strata, educa- 
tional provision, and the interaction between the 
two, 

Education and culture. Education’s broadest 
function has been to act as caretaker and dispenser 
of certain cultural resources of society. The raising 
of the culturally unformed—the child and the im- 
migrant—to the state of capable adult is the ac- 
tivity on which the formal institution of education 
was founded. Formal education is therefore an 
effort to do explicitly and systematically what 
family and community had long accomplished in 
an undifferentiated fashion before society became 
so complex that the task had to be performed by 
specialists, 

Of all the relations of education to society, the 
primordial function of cultural transmission is the 
one most seriously disturbed by modern social 
forces. There is some slippage of this function to 
other institutions: thus, educational subsystems 
emerge and develop extensively in the institutions 
of adult work. Moreover, there is a growing amount 
of unsystematic cultural indoctrination through the 
mass media—television, radio, movies, records— 
where instruction is combined with entertainment 
and commerce. The media blend, competing with 
the school and affecting its performance, is new in 
its near-universal coverage. The child tunes in 
daily to central molders of taste and transmitters 
of lore; a significant part of traditional and emer- 
gent culture is surely now communicated through 
these new channels. The problem for research is 
to ascertain what part of the imagery and knowl- 
edge of youth is transmitted in these extraeduca- 
tional ways, what alterations in the emphases of 
the core culture are made, and how the work of the 
school and college is affected. The possibility exists 


that the very form of new means of communica- 
tion, particularly the visual sensing of television, 
affects the relation of the person to the environ- 
ment (McLuhan 1964). 

The transmission of culture also becomes seg- 
mental in the upper levels of the educational sys- 
tem, as formal instruction is both extended and 
differentiated to prepare men for an elaborating 
structure of skilled occupations. Men must be dif- 
ferentially socialized to the spreading variety of 
adult statuses. The professional school is a pur- 
veyor of specialized culture, and even the disci- 
plines of the liberal arts grow less liberal in modern 
times as academic men respond to proliferating 
knowledge by developing expertise in a narrow 
segment. These tendencies weaken rather than 
strengthen understanding among educated men, 
as experts discourse in esoteric languages and gaze 
down the tunnels of special perspectives. 

But the relation of education to culture is per- 
haps most changed, not by alterations in the trans- 
mitting of a heritage, but by expansion in the cre- 
ating of knowledge. This historically minor role 
evolves toward dominance in higher education as 
nations increase their commitment to research. 
The involvement of the university, as locus of re- 
search and trainer of the researcher, makes edu- 
cation an active, intrusive force in culture as well 
as social and political affairs. The university has 
helped to instigate and propel the explosion in 
knowledge and the ascendance of science that 
characterize technological societies. This dynamic 
relation constitutes part of the awesomeness of the 
educational enterprise in modern times. 

Social change and social integration. The great- 
er the scope of the educational institution, the more 
complex is its relation to social order. Schools and 
colleges, as noted above, increasingly undertake 
broad functions for the mass of the population that 
were formerly fulfilled by family, community, and 
church. The educational system, when socially ef- 
fective, becomes a device for orderly change across 
generations in the class and elite location of indi- 
viduals and groups. But the system may weaken 
the integration of society through lack of articula- 
tion or incapacity to adapt. Articulation is partly a 
problem of how well educational output matches 
occupational demand. An oversupply of men edu- 
cated in public administration and law, or other- 
wise prepared to seize coveted posts in the civil 
Service, is a source of a discontented and restless 
intelligentsia in industrializing societies that are 
at the same time short of scientists and engineers. 
A vast supply of men trained below a general 
threshold of functional literacy offers the possi- 
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bility in advanced industrial societies of a mass of 
unskilled workers ill-fitted for the job-upgrading 
and retraining that spreads throughout the modern 
economy. 

Adaptation is partly a problem of how the major 
training institution confronts rapid social change. 
Mannheim (1950, p. 248) maintained that the 
school of the past was “a training ground for imi- 
tative adjustment to an established society,” while 
the modern school is (or ought to be) “an intro- 
duction into an already dynamic society.” To face 
only tradition is potentially malintegrative; new 
knowledge must be brought into the curriculum, 
and the new and the old made coherent. Estimates 
must be made of behavior appropriate to an un- 
known future, increasing the pressure to educate 
for “adaptability.” But this is to risk loss of age-old 
values, cultural discontinuity, and crisis in per- 
sonal identity. Flexibility, a training ideal sug- 
gested by rapid change, may cause confusion and 
lend a hand to chaos. Social integration, in its in- 
structional bases, depends in part on the capacity 
of the school to blend adaptive flexibility with 
stable imitation of the past, working to avoid both 
atomized individualism and cultural orthodoxy. 

Since education faces many forces in complex 
societies, its adaptation in one direction may gen- 
erate serious strain in another. The close link to the 
economy in advanced industrial societies turns 
education into a talent farm, a massive “people- 
processing” enterprise preparing manpower to the 
specifications of occupational demand and govern- 
mental blueprint. Such an enterprise is also char- 
acteristically large in scale and highly specialized 
in internal operation. One outcome of massive 
processing is a callous relation between those who 
administer and teach and those who occupy the 
chairs of the student. The processing hardly 
bothers the student with only vocational objectives 
in mind, but it seriously disturbs intellectual inter- 
ests of faculty and students. Idealistic youth, con- 
cerned about personal identity and social justice 
in the mass society, sees itself poorly served by in- 
structional systems that are hooked to the require- 
ments of technological advance, Thus, as educa- 
tion becomes “Establishment,” it contributes to 
modern intellectual discontent and alienation. 

When the educational system is hypnotized with 
occupational demand, it will also overlook the re- 
quirements of a man when he is off the job. Clearly, 
education must learn to contribute substantially to 
„the use of free time. Yet the orientation to duty and 
the tight schedules of schools that prepare the 
young for bureaucratized work are antithetical to 
the sensibilities appropriate for leisure. Fixation on 


utilitarian study also renders art a frill in the cur- 
riculum, reducing the aesthetic experience of the 
young and leaving it to other institutions to sustain 
the arts in society. 

In the short span of a generation or two an edu- 
cational system may even prove critical to the iden- 
tity and integration of a nation. In the rapid change 
characteristic of the last half of the twentieth 
century, new nations and traditional societies 
struggle to bring their populations to modernity. 
One example is Israel, where mass immigration 
focuses national energies on assimilation of tra- 
ditional folk. By 1960 Israel was a secular, Western- 
type democracy, with an economy rooted in science 
and technology. But over a half of the country’s 
large wave of immigrants in the 1950s were Afro- 
Asian Jews, illiterate and poor and Eastern in terms 
of values. In such circumstances, the socialization 
that takes place in the immigrant family threatens 
rather than supports national integration. The 
youth of the traditional families must be captured 
by such arms of the state as the school and the 
army, rapidly weaned from tradition, and trans- 
formed into modern citizens and workers; or else 
when the children of today come to adulthood and 
the ballot, there may be two nations or a nation 
with a different identity, [See REFUGEES, article 
on ADJUSTMENT AND ASSIMILATION.] 

Many other nations in Asia, Africa, and Latin 
America, some newly formed as well as undergoing 
rapid modernization, find the educational system 
playing a critical role in nationhood and national 
change. The educational system strains to provide 
the trained men for the expanding upper sectors of 
a rapidly changing structure of occupations, par- 
ticularly the men who can plan and lead modern 
government and industry. The teachers and gradu- 
ates of the system, far ahead of the masses in 
capacity to understand the modern polity, chal- 
lenge traditional ruling elites, by means ranging 
from the barricades of the street demonstration to 
the desks of the civil servant. Critical problems of 
national identity sometimes center on the capacity 
of the schools to promote a national language ina 
multilingual new society; for example, there has 
been an attempt in Malaya, peopled by Malays, 
Chinese, Indians, and Europeans, to make Malay 
the unifying language of the nation, Whatever else 
education in the modernizing society is about, it is 
about national integration and modern competence, 
Here the sociology of education becomes pre- 
eminently a part of the sociology of national devel- 
opment [see, for example, Foster 1965; Coleman 
1965; see also LINGUISTICS, article on THE SPEECH 
COMMUNITY; MODERNIZATION]. 
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Finally, education has become a peculiarly cre- 
ative force in society. Its laboratories may serve 
industry and government, but its scholars and sci- 
entists also create the knowledge that opens new 
vistas and undermines existing economic and polit- 
ical structures. Its teachings may prepare the com- 
pliant worker, but its teachers also create attitudes 
critical of established ways and acquaint the young 
with ideals and institutions of freedom. Complex 
systems of prolonged education lead toward change 
as well as stability, critical thought as well as un- 
thinking imitation, discontinuity and eruption as 
well as continuity and slow evolution. The tend- 
encies are plural, leaving the outcome open to the 
play of values, the effectiveness of group action, 
and the exercise of political power. 


The major educational institution 


The major educational institution of a society 
has a division of labor, a structure of control, and 
a work force. Important social phenomena in their 
own right, these components of the system at large 
affect the educational process and its impact on 
the individual and society. The institutional fea- 
tures in part reflect the larger society, but they also 
have dynamics of their own and consequences not 
planned by political leaders and administrative 
staffs. Several issues involved in the division of 
labor and the structure of control will be reviewed 
here. 

As education is extended to a larger share of the 
population, lengthened to occupy more years in the 
life cycle, and linked more closely to the allocation 
of occupation and status, the tasks of education 
are diversified. More educational sectors form 
around permutations of program, personnel, and 
clientele, intensifying problems of division of labor 
and control. In societies characterized by central 
control of educational policy, national governmen- 
tal authorities attempt systematically to allocate 
tasks to sectors and to standardize programs and 
requirements of entry and exit within those major 
segments. In decentralized educational structures, 
tasks may be defined and seized by private initia- 
tive and dispersed public authorities, resulting in 
greater variation in the wares that schools offer 
parents and students. 

Centralization of educational control in a na- 
tional ministry, or, at a lower level, in a regional 
or state department, serves certain ends and pre- 
sents certain institutional problems, while decen- 
tralization serves other ends and leads to problems 
of a different order, Centralization serves integra- 
tion, orderly procedure, the uniform application of 


standards, and innovation from the top. It allows 
for redistribution of resources across a large system 
to reduce the disparity among local educational 
settings, between the poor and the rich and the 
backward and the advanced. The problems posed 
by central control are ones of bureaucratic rigidity, 
massive error, and the lessening of initiative in the 
provinces; such an establishment may not have the 
foresight, sensibility, and planning capacity to 
make the myriad adjus@nents required of a diver- 
sified institution in a rapidly changing society, 

Decentralization serves local adaptation to the 
diverse values of plural societies and the tailoring 
of school interests to specific group interests, for 
example, the preparation of Philadelphia gentle- 
men, the maintenance of Lutheranism, the accul- 
turation of a minority. It allows for grass-roots 
initiative. The problems posed by decentralized 
control are ones of local initiative becoming sub- 
servient to local status quo and of piecemeal re- 
sponse to needs and interests that extend across 
society. A locally adaptive system becomes a case 
of institutional drift as authority moves upward in 
other institutions: for example, as national corpo- 
rations, unions, and regulatory agencies come to 
dominate the economy and as the work of educa- 
tion comes to relate closely to the national economy 
and such concerns as the technology of national 
advance and protection. The perceived costs of drift 
and the natural insensitivity of local authority to 
national problems promote efforts to centralize con- 
trol or to construct national mechanisms for influ- 
encing the scattered authorities who retain formal 
control. 

The modern state needs to order the educational 
division of labor. Where a national bureaucratic 
machinery is weak or lacking, the effort to coordi- 
nate the division of labor will encourage the 
strengthening of that machinery and also the 
growth of an ancillary structure, composed of vol- 
untary associations, interorganization compacts, 
study committees, accrediting bodies, and testing 
organizations, that helps to knit together ‘separate 
public and private authorities. The ancillary groups, 
lacking command, operate primarily through the 
persuasion of money, prestige, and competitive ad- 
vantage of large organization over solitary effort. 
At the same time, the modern state also needs to 
Support research, to diversify its advanced educa- 
tion, and to develop sectors of creative adjustment 
to rapid change that cannot be anticipated in all 
its complexity in the central office. Where decen- 
tralization or federalism is lacking, the effort to 
diversify and provide for widespread initiative will 
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encourage the proliferation of centers of influence 
within the system and result in a loosening of the 
tight integration of the traditional structure. 

Thus, decentralized educational structures are 
under pressure to change the internal division of 
labor and control toward national order, through 
ancillary organization as well as stronger national 
bureaucracy, On the other hand, centralized edu- 
cational systems are under pressure to loosen the 
traditional bureaucracy in order to admit new 
forms of secondary and higher education and to 
exploit the role of education in research and inno- 
vation, The strain toward coordinated order and 
the strain toward individual and organizational 
autonomy are fundamental conflicting tendencies 
inherent in education in advanced industrial soci- 
eties, With different cultural traditions and dif- 
ferent structures of political power, societies will 
pursue somewhat different resolutions of this basic 
conflict. But we may also expect some convergence 
among nations to occur on a model of the educa- 
tional system that weds central direction to exten- 
sive consultation and persuasion and that places 
preparation for unguided change on a par with 
direct implementation of national policy. 


The educational organization 


The school or college is a formal social system 
with external relations and internal patterns that 
condition the educational process (Bidwell 1965). 
The individual agency has a particular part to play 
in the educational division of labor, and its place 
in the institutional web affects the status of the 
organization and its members. A school is judged 
externally by public perception of how its main 
tasks connect with the general status system of 
society, “Vocational” secondary schools that lead to 
low-status occupations, compared to “academic” 
secondary schools leading to high-status occupa- 
tions, everywhere labor under a stigma that affects 
the recruitment and morale of teachers and the 
self-conception and ambition of students. It is vir- 
tually an iron law of school status that a parity of 
esteem cannot be achieved by schools performing 
different educational and social tasks (Banks 
1955). In systems of specialized secondary schools, 
for example, the tripartite English structure, the 
difficulties of school esteem turn some educators 
and laymen toward the more comprehensive forms 
of school organization, grouping curricular streams 
within a common school in order to avoid the status 
degradation of a good share of the teachers and 
‘the students. As schools and colleges grow in num- 
ber and variety, the status of one among the others 
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is increasingly affected by a contest composed of 
the effort of favored organizations to protect and 
proclaim their acknowledged superiority, the at- 
tempts of consuming publics to discriminate and 
label, and the maneuvers of some reformers, ad- 
ministrators, and the less-favored teachers to screen 
the division of work from the public assessments 
that define status inequalities (Clark 1960). 

In the internal structure of the educational or- 
ganization, the growing scale and proliferation of 
work activities induce a differentiation of roles. 
Administration becomes separated from teaching; 
because of the need for coordination, a hierarchy 
of administrative levels comes into being. Teaching 
itself becomes more specialized as knowledge pro- 
liferates, The magnitude of the task of advising 
and channeling students produces the nonteaching 
counselor. As the role structure of the school be- 
comes more complicated, there is a bureaucratiza- 
tion of relations, with responsibilities and jurisdic- 
tions formally circumscribed and rules formally 
elaborated to preform decisions for maximum fair- 
ness and dependability. As written records shadow 
the student and experts assess him and assign him 
to treatments, the formal apparatus takes on such 
weight that it is able to compete with the home 
and neighborhood in determining the future of the 
young. Parental and student orientations must 
interact with the increasingly more systematized 
means of achievement. The orientations are some- 
times defined, sometimes redefined, and always 
processed by the everyday activities of a bureauc- 
racy (Cicourel & Kitsuse 1963). 

The weight of bureaucracy varies among schools 
and colleges, however, depending on the exercise 
of other forms of influence, particularly the col- 
legial authority of faculties, and on the informal 
counterpressures of faculty and student subcul- 
tures. The importance of bureaucracy also depends 
on whether the large school or college is a unitary 
organization or a federation of quasi-independent 
clusters of faculty and students. Indeed, organiza- 
tional structure may possibly have strong and last- 
ing effects on individual character. Personal iden- 
tity is hard to establish in the modern fast-changing 
world, and the campus is one of the critical locales 
of identity formation. If identity is dependent on 
personal relations, then the decline in community 
typical of the large mass campus will disturb and 
weaken identity processes. Growing impersonality 
in schools and colleges apparently has greater social 
consequence than impersonality in nearly all other 
types of organizations. The effort of some educa- 
tional reformers to structure large campuses as 
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federations of small states is a response to this 
growing belief. On this apparently critical issue, 
the sociological imagination is still largely impris- 
oned by clichés and uninformed by research. 


Education in other institutions 


The immense appetite of modern society for 
systematic instruction extends educational work 
throughout society. Substantial educational sub- 
systems develop within the military, industry, and 
the church to instill specific skill, knowledge, or 
perspective not provided in the regular line of 
schools and colleges. This development is particu- 
larly powered by rapid change in work skills and 
the complex demands of modern organization, A 
modern business firm takes care of its competence 
by systematic training of personnel, from top man- 
agement to the rank and file. Men alternate be- 
tween production and instruction, with blocs of 
hours or weeks or months given over to the class- 
room within or outside the firm. The modern mili- 
tary establishment is a vast educational enterprise: 
its competence depends on the capacity to trans- 
form raw recruits into technicians and to create 
an officers’ corps in which the manager and the 
technologist assume places alongside the fighting 
commander of old. Military careers increasingly 
depend on effective performance in the schools 
that train officers to successively higher levels of 
complex skill and thought. 

New nations also find the military a primary 
educational workshop, In Israel, the army plays an 
important role alongside the schools in the accul- 
turation of immigrants: young men learn Hebrew, 
their own history and geography, the modern-style 
discipline of working with others, and new work 
skills. In most modernizing societies, military offi- 
cers are among the best educated groups, gaining 
particular competence in engineering and admin- 
istration, Combining modern expertise with access 
to means of power, they play a critical role in the 
politics of national development. 

Religious organizations, long educational in their 
formal and quasi-formal indoctrination of the flock, 
plunge further into organized instruction as they 
turn the church basement into community center 
and the church official into social worker and adult 
educator, broadening activities to maintain a cen- 
tral place in secularizing societies. In the United 
States of the 1960s, more persons participate in the 
adult classes of churches and synagogues than in 
those offered by colleges and universities, particu- 
larly in the South. The established churches cap- 
ture educational audiences more effectively than 
the educational courses of television (Johnstone & 


Rivera 1965, pp. 10 ff., 53-55, 111), When to the 
formal classes we add the deliberate socialization 
performed from the pulpit, it is clear that a broad 
definition of education would have to include much 
investigation of the church as an instructional 
medium. 

Similar trends obtain in the public bureau, the 
union, and the professional association, Institu- 
tions of adult work and pleasure find it to their 
self-interest to encourage the emergence of an edu- 
cational subsystem and thus to spread formal 
education throughout society. Thus, for an ever 
larger share of the population, formal education is 
the main means to cultural as well as occupational 
qualification, Knowledge is a prime ingredient of 
the society based on science, technology, and ex- 
pertise, and the primacy of knowledge is the pri- 
macy of education. In the face of large dangers and 
uncertainty, a learning society offers the oppor- 
tunity that very large numbers of men may partici- 
pate effectively in a complex culture and in the 
social and political affairs of a complex society. In 
this possibility lies the educational promise of 
modern man. 

Burton R. CLARK 
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LL 
EDUCATION AND SOCIETY 


Education can be viewed as including all com- 
municating of knowledge and shaping of values; in 
this sense it is synonymous with socialization. Dur- 
ing most of human history deliberate instruction 
has been incidental and sporadic, and even in the 
most complex societies much instruction, in the 
broader sense of the term, devolves upon agencies 
other than schools, In this article, however, the 
focus is mainly upon formalized education as car- 
ried out by distinctive institutions, especially schools. 
The aim of formal education is to prepare the 
child for the transition from the confined but dif- 
fuse relationships of the family to the more imper- 

*sonal and diversified relationships of the larger 
society. In its core sense, education rests on tutelage 
of child by adult; only in the complex societies is 
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instruction often given by one adult to another. 
There are multiple explanations for the appearance 
of schools, and these same causes continue to gen- 
erate new sorts and levels of schools as societies 
become more complex. 


Rise of formal education 


In many preliterate tribes an age group under- 
goes tuition in adult skills, and particularly in cult 
rituals and in religious beliefs and symbolisms, 
under the care of a specially designated adult. 
Ideological and ritual components appear in every 
example of formal education, although they be- 
come less definite and presumably less effective as 
schools are given more diversified and preponder- 
antly secular responsibilities. Occasionally we ob- 
serve a ruling group attempting to convert youth 
to beliefs not shared by most parents; typically, 
however, schools are expected to create consensus 
among a numerous citizenry around certain unify- 
ing themes. 

A second source of formal schooling lies in the 
necessity that some youths acquire proficiency in 
a set of skills that may not only differ from those 
of the parents but may also be too complicated for 
the parents to teach. Until a late stage in the tech- 
nological development of a society most individuals 
learn their livelihood skills as a by-product of grow- 
ing up. The first deviation from that pattern is 
exemplified by apprenticeship under a master in a 
different household; the youth’s parents may teach 
the same craft, but the required impersonality of 
instruction encourages shifting the responsibility to 
another family. In many societies schools are set 
up to teach bookkeeping and correspondence arts 
to children of traders and artisans; this type of in- 
struction is sometimes given in elementary schools, 
but wherever vocational training is organized in 
special schools, it normally occurs in a second or 
third level of school that builds upon the basic edu- 
cation of a “common” school. In most societies 
schools contribute little of the specialized training 
for manual occupations but function rather to pre- 
pare the child for later training in both manual and 
nonmanual skills on the job. 

Although various sorts of extra-school and infor- 
mal education are common to all societies, the full 
development of instruction occurs only after the 
appearance of writing. The adoption of writing adds 
a new dimension to the life of a society: a sense of 
the past, sometimes also anticipation of the future, 
and new agencies for coordinating the activities of 
scattered and heterogeneous groups. Writing may 
become the monopoly of a priesthood and its secu- 
lar use be confined mainly to officials, or it may 
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become widely used. Within a priesthood diversified 
and graded schools may develop; likewise, instruc- 
tion and the certifying of competence may become 
highly elaborated among officials. 

It has been pointed out that clocks came into use 
in order to give precision to the temporal coordina- 
tion of activities extending beyond direct obser- 
vation and the range of the voice. Likewise, literacy 
became diffused as it proved useful to coordinate 
activities over space as well as time. A society is 
defined by the area within which interchange is 
frequent and over which men are responsive to in- 
fluences from shared focal centers; crystallization 
and preservation of this unity come to depend pro- 
gressively upon formal training in basic literacy 
skills and in the unifying themes of the culture. 
However, deliberate efforts by ruling groups to 
spread literacy widely were rare until modern times 
—although schools were widespread some cen- 
turies ago in China and also in less elaborate form 
in India and parts of the Islamic world. Common 
schools were established usually in an effort to en- 
sure unity of sacred or secular ideology throughout 
a population; more direct economic and political 
considerations have become important only in 
modern times. 

Even in societies with a relatively simple tech- 
nology, schooling beyond simple literacy is fostered 
by the need to keep records and chronicles and 
conduct correspondence, and by the desire to read 
holy books. As shown by the initiative displayed 
among craftsmen and traders, ability to keep sim- 
ple accounts and to handle transactions in distant 
markets provides another incentive. As trading and 
other contacts proliferate, the services of the occa- 
sional scribe prove inadequate. The advantages of 
literacy become important to more and more of the 
populace as ordinary men acquire the capacity and 
the right to choose among alternative ways of life. 
Useful books are published as well as uplifting ones, 
along with tales about the enlarging world and 
melodramas of social life. The exhilaration of 
widening experience through symbols of what is 
not immediately present in time or space should 
not be underestimated. 

Schooling, unlike less formal training and ap- 
prenticeship, does not include participation in adult 
activities; instead, school subjects and adult life 
are mediated through the teacher, who becomes 
one of the key specialists in society, alongside the 
warrior, priest, trader, official, and master crafts- 
man. In societies in which literacy is not widely 
diffused, such as Europe in the early medieval 
period, book learning takes on the character of a 
special apprenticeship for entry into an intellectual 


elite. Unless they happen to be born into this elite, 
students in such societies will not ordinarily meet 
with adults outside the schools who make use of 
school learning in their daily life, for this is a 
feature only of societies in which the printed word 
is both widely available and commonly understood. 
But even in highly literate societies the student’s 
opportunities for observing the uses of learning are 
severely limited. One reason for this is the increas- 
ing length of the period of compulsory education. 
The school pupil vicariously anticipates adult activ- 
ities, but his perception of them is filtered through 
the teacher and the school environment. As the 
school years pass he will increasingly anticipate his 
own social future and come to perceive the rele- 
vance of his lessons. However, only if the education 
is vivid and learning meaningful will he incorporate 
his school experiences into his conception of him- 
self so that learning becomes a development of self 
rather than authoritarian drill. In varying degrees, 
schooling enables the pupil to acquire skills that 
facilitate joint activities and that enable him to 
participate imaginatively in the lives of other peo- 
ple (including men who are dead or not yet living), 
to relate himself to activities not embraced in the 
life of his kin group, and to conceptualize social 
entities, such as the nation. 

At all levels the lessons of the schoolroom are 
surrounded by moral and value judgments, regard- 
less of curriculum. One of the most significant 
tendencies is for the formalized pattern of school 
lessons themselves to become instruments of moral 
teaching through the discipline or mastery of a 
subject that is “outside” the pupil. There are right 
ways to solve a problem; there is a correct spelling; 
an essay is ordered or disjointed. This morality 
characterizes the growing number of specialties and 
professions whose members have been subjected to 
the discipline of impersonal assessment of achieve- 
ment. Direct moral instruction that is not linked 
to cognitive achievement is less likely to succeed. 

As societies become larger and more complex, 
literacy must become more widely dispersed and 
more firmly inculcated, for the organization of vir- 
tually all aspects of national life depends upon it. 
A growing proportion of youths need an extended 
basic schooling in the common language of politi- 
cal, economic, and social intercourse; only with 
this will they be able to undertake the further spe- 
cialized training made necessary by the increasing 
diversification of adult roles. Schooling is at once 
a homogenizing and differentiating process. Also, 
as social change accelerates and acquiescence in* 
or commitment to change spreads, the adaptive or 
tradition-conserving function of schools is replaced 


g 


in part by the support and creation of change. The 
growing complexity of society and the associated 
expansion and elaboration of the school system are 
reflected in an alteration and multiplication of pro- 
cedures relating to selection of pupils for retention 
in higher schools and for their allocation among 
types of schools and curricula. The roles of teachers 
are first differentiated from other vocations and 
then internally divided in myriad ways. 


Schools and society 


Individuation and socialization: this duality runs 
through the history of education. Educational par- 
ticipation gives people things in common, but it 
also sets them apart into special worlds of discourse 
and of activity. Schools are expected to give youth 
the common skills that are simple and universal, 
as well as to identify those few individuals who can 
excel, rise to eminence in learning, and rule their 
fellows. Shared arts of communication and con- 
sensus on cherished values are prime aims of 
schools everywhere, but teachers are relied upon 
also to nourish the talented and indulge the privi- 
leged. How large a part schools play in homogeni- 
zation and differentiation (and in what patterns) 
is at once a reflection of and a factor in the scale 
of the society and its realized or perceived need for 
integration. It depends also upon whether the 
society is elitist or democratic in orientation. 

In premodern societies, where each community 
or district has its own dialect, costume, and even 
units of measure, schools may be nonexistent and 
unnecessary as agencies of social integration. How- 
ever, the unifying of societies and nations has re- 
quired the virtual obliteration of provincial cultures 
and the imposition of common expectations and 
practices to override local customs. Historically, the 
most frequent function of schools in this process 
has been the cultural homogenization of an elite. 
However, the molding of a linguistically and ideo- 
logically unified elite has reinforced status distinc- 
tions and thus has also been divisive, setting the 
elite apart. Parallel events are observable today in 
the newer nations, where secondary school (espe- 
cially if residential ) and university graduates make 
up the new leadership that strives to weld local 
cultures into a nation. Even where underlying ide- 
ologies are democratic, these few leaders risk be- 
coming alienated from other citizens who do not 
participate at all in the “high culture.” - 

The unification of nations and larger societies 


cannot progress far without the common school for 


widening circles of ordinary folk. This was exempli- 
glish “dame schools,” 


fied in the flourishing of the En gaan 
with their modicum of training in literacy and 
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ciphering and their preoccupation with teaching of 
the proper behavior to the “lower orders.” With its 
gradual extension and improvement, the common 
school both challenges elitism and takes on respon- 
sibility for fostering consensus among all citizens 
around selected themes. The national language 
slowly spreads through successive generations of 
pupils, accommodating them to acknowledgment of 
the ruler’s writs, participation in the rituals of patri- 
otism, and familiarity with the heroes and events 
of literature and history. The diffusion of literacy 
both incites and broadens participation in political 
life, and the spreading of facility in numbers under- 
lies the knitting together of the larger economy. 

The common schools perform an interclass as- 
similating function, however limited and crude. 
Through the common schools children of peasants 
and laborers acquire a less abrasive speech, less 
violent habits of conduct, some insight into the 
attractions of cultural amenities, and a few folk- 
ways of complex social living. Slowly increasing 
numbers learn the advantages of deferred gratifica- 
tion and of systematic work. In the more demo- 
cratic school systems, this social assimilation may 
go on even within the university. Meanwhile, the 
high evaluation of education that motivates some 
to enter the preferred occupations stimulates the 
further spread of schooling throughout the popu- 
lation. 

In the early stages of the diffusion of schools, 
the divisiveness produced by them extends spatially, 
as well as along the lines of status. Acceptance of 
schools beyond (and even below) the compulsory 
level is uneven among the districts and regions of 
a society. Common education proves to be more 
useful in some areas than in others, and in some 
districts it may quickly lead to the adoption of 
secondary and higher education. Residents of some 
areas draw ahead of residents of others in learning 
and in prosperity, in active sharing of the national 
culture, and in accessibility to the amenities of an 
advancing society. This disparity is most extreme 
in plural societies lacking a single mother tongue 
and especially in the new nations where, to a degree 
hitherto unparalleled in history, schools are being 
relied upon to create a unified people out of a 
throng of tribes. If vernacular languages are used 
in the early grades, the common schools may have 
little unifying effect. 

Where schools are operated by religious groups 
—even where there is central inspection—the value 
content of curricula may become equivocal. Each 
nation’s history affects the balance of those influ- 
ences that bridge the religions or strengthen their 
inward-looking tendencies. However, the contest for 


520 EDUCATION: Education and Society 


autonomy and for central financial aid can increase 
central control over curricula, Where many forces 
work to create a broad underlying national unity, 
separate schools may have little divisive effect. 
Indeed, for religious as for other minorities, “paro- 
chial” schools commonly serve as way stations on 
the road to assimilation, especially where a single 
language is used by all. What is common to these 
lessons and the atmosphere of all schools leads 
each group into assimilation of pervasive national 
ways of life. 

As the duration of common schooling is extended, 
so also are both the duration of the period in which 
schools may carry out their socially integrating 
functions and the further years of school life for 
those who continue into the increasingly elaborated 
programs of middle and higher education. Again, 
we see both merging and separating forces at work 
as schools play a growing role in the society. The 
special cultures of the various better-educated 
groups that are fostered by their schooling are less 
exclusive than those in simpler elitist societies; 
these restricted cultures are in greater degree an 
extension in depth of the themes and practices 
that pervade the lower schools as well. 

Thus, education continuously raises barriers to 
communication among the members of a society, 
even as it is also continuously eroding isolation. 
Associated with schooling are the occupational spe- 
cialties of a complex society, and some of these 
groups exploit the schools to reinforce their favored 
positions by restricting numbers of qualified en- 
trants. However, rising demands from the economy 
and populist pressures also force the breaking of 
such obstructions, Meanwhile, the rising standard 
of average schooling means that fewer occupa- 
tional boundaries are chasms between sodden ig- 
norance and genteel learning. 

Diffusion of both schooling and consumer goods 
creates increasing diversity of ways of life even 
while fostering status continuities within the soci- 
ety. This process finds its most complete expression 
in the university, for no matter how much a society 
may press toward “comprehensive” rather than dif- 
ferentiated education for its various subpopulations, 
there is no escaping extreme specialization at the 
university level. And it is in the universities, in 
recent years, that there has arisen the greatest 
concern that learned men shall share a common 
culture. 


Conservation and innovation 


The principal task of formal education in the 
past was to inculcate accepted bodies of informa- 
tion and sets of beliefs—to conserve and pass down 


a heritage. However, in the more dynamic societies 
schools are called upon to play a more creative 
role by supporting or fostering change; everywhere 
they have indirectly provided support for change 
by giving men the basic tools with which to en- 
large their knowledge indefinitely and to manipu- 
late it creatively. The counterpoint of socialization 
and individuation is thus again at work. 

Educational conservatism that finds its source 
in established religion or ethics was exemplified in 
Confucian learning and the bureaucratic examina- 
tions in China and in the development of clerkly 
schooling in medieval Europe. Yet even in those or 
similar situations, schooling did not function solely 
to rigidify ideas, for among the elites philosophical 
speculation could not be confined. And alongside 
the erudite education there arose also more popular 
and pragmatic kinds. Soon after the introduction 
of printing, books expounding practical novelties 
for daily life and craft practice poured from the 
press—and in China over the centuries from the 
hands of the copyist—opening new possibilities to 
men with even a minimum of literacy. The same 
funds of ideas stimulated religious and political 
innovation, enhanced in some societies by popular 
reading of the holy books. 

The modern age takes it for granted that change 
is natural. Literacy has meant exposure to novelty 
and has prepared men of the most diverse stations 
to react to new ideas from many quarters, Preserv- 
ing the continuity of expanding knowledge has 
become a principal source of innovation, It is sig- 
nificant that much of the Western technological 
revolution was the work of artisans who fused 
their ingenuity with published manuals to accel- 
erate the rate of invention. Indeed, learned men 
and literate artisans were in more intensive inter- 
action in past generations than they are today. Pro- 
gressively, the potentialities of written communica- 
tion were exploited by advocates of political and 
social reform, by religious leaders, and by the 
sellers of goods. In our much-schooled age we have 
forgotten how little schooling men need in order to 
put literacy to use, as we have forgotten also the 
capacity of oral traditions. 

Modernization of new nations. The many edu- 
cational developments that unfolded slowly through- 
out the history of Western countries are telescoped 
in the nations that are today borrowing educational 
systems, but similar borrowing had gone on among 
the Western countries as they were shaping their 
educational systems to changing conceptions of 
social need. The new nations find the task of adap* 
tation more difficult because they are anxious to 
communicate an ill-defined and often nonexistent 


traditional unitary culture and equally eager to use 
schools to produce a technology for which tradi- 
tional culture has little pertinence, Often, then, the 
implanted Western education has created mainly a 
small deviant group devoted to modernization. Al- 
though it is possible to absorb folklore into history 
and literature courses, other features of the tradi- 
tional culture cannot be adapted so easily to meet 
the demands of modernization. In the societies 
where the fusion of traditional with technological 
folkways occurred more gradually, more of the 
traditional culture was preserved. 

As school attendance diffuses, one principal re- 
sult is deeper appreciation of the utility of educa- 
tion and thus the emergence of impulses for addi- 
tional and different kinds of education. These 
attitudes spread through a population much as do 
other new cultural traits, with patterned foci and 
gradients of educational development; the climax 
areas correspond generally to those where other 
impulses to social change are also vigorous. 

Wherever Western-type schools have taken root, 
their activity is associated with receptivity to new 
ideas and practices in many spheres of life, and 
the individual with more schooling possesses more 
traits of modernity. He is more informed about 
the world, more likely to accept the new, more 
appreciative of the complexities of his society, and 
better able to comprehend the more complex forces 
at work around him. Assimilation of schooling by 
females adds a special multiplier to these changes 
and serves also to transform family life in ways 
that foster more adaptive children. However, in all 
these respects there is much overlapping among 
subpopulations. The less educated are often better 
informed in pragmatic matters than the highly 
educated, and they can be more open-minded to 
novelty. Illiterate men in societies with strong oral 
traditions may have broad perspectives. 

As common schooling is extended, its conserving 
function with respect to high culture spreads to 
more people. At the same time, the change-support- 
ing influence of the schools permeates larger parts 
of the society and challenges elitist practices. 
Moreover, the more dynamic and creative ways of 
teaching begin to be effective even in the common 
schools, which come to embrace elements of science 
and social studies along with the three Rs—the 
whole in a context of change. Secondary and espe- 
cially university education becomes predominantly 
oriented to subjects relating to the less conservative 

arts of the culture. 

Of particular importance is the transformation 
of the language skills of children by the school. 
Not only does the working language of the typical 
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child expand but it also becomes increasingly ori- 
ented to use in ways instrumental to change. How- 
ever, in this process the handicaps of those groups 
least affected by informal as well as formal educa- 
tion become more visible. Laggard groups may 
become more than nominally absorbed into the 
schools only slowly, and in many advanced soci- 
eties the task of making formal education effective 
for such groups has emerged as crucial in a time 
of rapid occupational change. 

Schools as agencies of change. It is Western 
education, that of the civilization whose science 
and technology is now spreading everywhere, that 
has seen the most explicit use of schools as agen- 
cies of change. Response to technological require- 
ments appeared first at intermediate levels of the 
schools. Apart from the proliferation of apprentice- 
ship and ciphering and similar schools, demands 
for formal technical instruction were met in many 
countries by a diversification of secondary voca- 
tional programs, by technical institutes, and by 
university-level technological training that was 
often segregated from the university proper. The 
“agricultural and mechanical” colleges of the 
United States borrowed many components from 
Europe and with them created a more permissive 
and diversified system, which in turn was copied 
in Europe and elsewhere. Outside the prestige 
branches of education, varied practical training 
supplanted the traditional “cultural” subjects and 
initiated a reorientation of education toward pro- 
grams more supportive of change. 

However, in almost all societies universities have 
been the sanctuaries for critical analysis of the 
society; even where they have made little use of 
that privilege, it has usually been available. And 
insofar as high-level intellectual training has been 
required by practical activities, universities have 
been persuaded to provide that instruction. How- 
ever belatedly they may have responded, the uni- 
versities now constitute centers for induced change; 
they train more and more segments of the national 
elite, In a few countries it has come to be accepted 
that many kinds of occupations deserve university 
sponsorship, that new vocations call for new kinds 
of professors, and that varied motives and stand- 
ards of competence may legitimately be part of 
university life. 

Schools resist change both by passivity and by 
active opposition. It is easier to continue on well- 
trod, familiar paths, and typically the new is not 
welcomed by teachers, most of whom have spent 
their lives within schools. At the higher levels, 
resistance in part reflects devotion to particular 
intellectual traditions, often confounded with con- 
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ventions as to the “standards” that a university 
must uphold. These conventions lose their power 
only after science and technology have manifested 
self-sustaining powers and critical importance for 
national survival. 

Regardless of intent, schools assist change when 
they provide the simple literacy that enables men 
to read and the core training in the traditional 
branches of learning. They favor change also by 
identifying and promoting the talented who, even 
when co-opted into presumably constraining pres- 
tige positions, often prove recalcitrant to custom. 
They foster change more directly when they accept 
new materials into classrooms or accept respon- 
sibility to train for new vocations. They are com- 
pletely involved when or if they encourage inde- 
pendent and critical thinking (at whatever level) 
and when they undertake scientific research and 
carry on the special kind of apprenticeship called 
graduate study. Finally, expansion of adult educa- 
tion and refresher courses at all levels of schooling 
is becoming an integral part of social organization 
to counter obsolescence and foster more rapid 
change. 

Combining the homogenizing-differentiating di- 
chotomy with the conserving—changing dichotomy 
gives us four categories in terms of which the 
different stages of education may be described. It is 
in common schooling (which may last as long as 
12 or even 14 years) that the homogenizing and 
conserving functions are most distinctly merged. 
Furthermore, any separation of schools for differ- 
ent subpopulations, relegating most children to in- 
ferior schools, also upholds tradition. On the other 
hand, as more individuals share schooling through 
many years, change is favored simply because in- 
dividuals begin to learn the scientific and technical 
ideas that link them to other influences making 
for change. It is at the university level that special- 
ization accompanied by advanced instruction com- 
bines differentiation with facilitation of change. 


Education and social class 


Structuring of a school system so that pupils 
pass from one class or grade to the next by pro- 
motion is in part a response to problems of scale 
and in part associated with a more elaborated con- 
tent of teaching. It may also reflect a shift away 
from an elitist system, manifested, for example, by 
the decline of gentlemanly prerogatives at the uni- 
versity and the adoption of merit competition for 
entry. The oldest formal promotion system, that of 
China, emphasized advancement from one exam- 
ination level to eligibility to sit for the next. Over 


much of its history that system used achievement 
criteria for advancement with only minor compli- 
cation by status considerations, and China was the 
source of the now world-wide examinations for 
testing competence. 

Once schooling extended beyond ecclesiastical 
bounds or tutorial practice for formation of an 
elite, a three-level system emerged in the countries 
of western Europe: universities for an elite, gram- 
mar schools for clerks and as preparation for the 
university, and primary schools (if any) for the 
populace. This system became entrenched even as 
schooling became widely diffused. Meanwhile, it 
has been economic limitation as much as snobbery 
that has preserved the idea that the proportion of 
children enrolled should diminish rapidly at each 
successive level of school. Tacitly or explicitly, it 
has been assumed also that lessons grow in diffi- 
culty with level and that progressively fewer chil- 
dren can cope with them. In premodern economies 
relatively few men are needed for the nonmanual 
occupations. These conditions and assumptions 
have been slowly modified in the more advanced 
countries, as people come to believe both that more 
education is needed among the citizenry of a de- 
mocracy and that education is a satisfying good 
in its own right to which all are entitled. However, 
the pyramidal form persists, and among the func- 
tions of the educational system are selection of 
those pupils who may move to successively higher 
courses and also, as schools are diversified, alloca- 
tion of those who do continue among various types 
and qualities of school. 

In societies that have developed school systems 
there are sets of adult vocations for which corre- 
sponding levels or varieties of schooling are nor- 
mally the minimal conditions of entry. Selection 
for continuation in school thus becomes at the 
same time one factor in the allocation of ultimate 
roles in society, and allocation among specialized 
courses of study establishes links to particular sub- 
sets of adult positions within the broader categories 
anticipated by educational selection. Thus, the 
schools become related in ever more complex ways 
to economic and social stratification and to the 
mobility structures of the society. At the same time, 
the procedures adopted for intraschool selection 
and allocation have repercussions on pedagogical 
processes, teacher orientations, and the general 
atmosphere of the school. 

Education-determined status. A formal school 
system, then, never exists solely as an ornament 
to the leisure of a privileged stratum but serves 
also as a means for determining which individuals 


will occupy various positions in the society. Tradi- 
tionally, few children from humble families gained 
admittance to the “classical” schools. Since passage 
through the upper levels of such schools facilitated 
entry into elitist occupations and positions, edu- 
cation contributed as much to intergenerational 
stability as to mobility in status. However, as lower- 
status children come to form an appreciable pro- 
portion of postprimary enrollments, schooling plays 
a more important part in distributing individuals 
among occupations and status levels; educated and 
able sons of workers fill the expanding white-collar 
and professional positions and even displace some 
of the less able sons of the older elites. In open 
and nonrestrictive economies, however, once pro- 
longed schooling becomes widespread the separate 
and distinctive effect of the sheer amount of school- 
ing upon economic and social status is blurred. 
Parental status will play a small part, but native 
ability, drive, health, and luck make for wide status 
variability among men of equivalent schooling. 
Thus neither the association of schooling with 
parental status nor its association with the indi- 
vidual’s own adult status is simple or stable. The 
clearest instances of education-determined status 
exist where the individual's status is defined by his 
position in a bureaucracy and where such a posi- 
tion is conditional upon possession of particular 
certificates, Developing societies during the period 
when school opportunities are still distributed nar- 
rowly (in proportionate enrollments but not neces- 
sarily in terms of parental status) provide the 
most striking contemporary examples of education- 
determined status; however, they also display the 
precarious political influence of educated elites as 
populist political practices take root. Moreover, in 
such societies the status associated with years of 
schooling is status in a new structure growing up 
beside the old system. The connections between old 
and new status systems may be tenuous and the 
roles of schooled men in them quite disparate, al- 
though as yet we know little about these rela- 


tionships. 


Selection and allocation 

In the intraschool sorting operations many fac- 
tors are at play. In all societies self-selection (or, 
conversely, dropping out) is important. In earlier 
periods when the schools retained a literary or re- 
ligious flavor throughout, self-selection (which al- 
ways incorporates some ability factors) occurred 
sometimes mainly within the elite and sometimes 
“more broadly, especially in recruitment to the 
clergy. As enrollments have broadened and school- 
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ing has become prolonged and diversified, self- 
selection—as well as self-allocation to lines of 
study or types of schools—has been extended to 
increasing proportions of the population. 

However, even in the most egalitarian societies, 
readiness to use schools is unevenly distributed 
through both geographic and social space. These 
imparities exceed what might be explained by dif- 
ferences in financial capacity or native ability. In 
part, they are associated with area and ethnic dif- 
ferentials in what schooling can contribute to sub- 
sequent incomes and positions, but the degree of 
awareness of both the pragmatic and the more eso- 
teric values of schooling reflects also the communi- 
cation structure of the society. Interacting with 
self-selection are the practices and policies of 
teachers and the rules for promotion and allocation 
that have become educational customs. These range 
from informal guidance to the most formal and 
rigid examination systems. 

Examination systems. Especially pervasive are 
the difficulties associated with the use of examina- 
tions, although a clear appreciation of this prob- 
lem is comparatively recent. So long as schooling 
beyond simple literacy was confined to small minor- 
ities, Western countries relied primarily upon self- 
selection and status selection, with informal identi- 
fication of talent by performance in the lower 
school. However, as school systems grew this pro- 
cedure became inadequate, and teachers’ assess- 
ments and classroom exercises were supplemented 
and even replaced by systematic testing, often by 
examinations prepared by outside experts. Where- 
as standardized external examinations were adopt- 
ed in China at a time when they could be limited 
to classical literary material, in the more recent 
Western systems such examinations were intro- 
duced at a time when schools and curricula were 
rapidly diversifying. No single traditional set of 
criteria comparable to that of the Chinese could 
suffice, and in the West standardized examinations 
have seldom become so central a feature of an 
entire school system as in some Oriental lands, By 
the same token, rote learning has been less per- 
vasive, and it is no accident that “progressive” edu- 
cation emerged in the United States—the least 
rigid and least examination-prone of the Western 
nations. p 

In the West testing shifted to a mass basis in 
response to several circumstances: growing enroll- 
ments, pressures from new subpopulations for en- 
try to postprimary schools, concern for protecting 
or imposing “standards” (which extended in some 
nations to intellectual life outside the schools), 
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rising aspirations for equality of educational op- 
portunity, and most recently anxiety about identi- 
fying the unutilized “reserves of talent.” Increasing 
reliance upon examinations has injected into previ- 
ously looser systems the well-known pressures upon 
teachers to orient their lessons to expected tests 
rather than to exploration of the subject or to the 
child’s mental and creative development. On the 
other hand, mass testing has led to increasing 
awareness of irregularities in standards among 
schools and the educational poverty of the poorest 
schools, particularly in decentralized systems. The 
counterpoints of quality versus quantity and of 
conformity to set standards versus diversity of edu- 
cational outcomes have been laid out with a new 
clarity in recent years. 

Selection procedures. Meanwhile, there has 
been a growing concern about the validity of tests 
as bases for selection and allocation, however re- 
liable they may be for measuring a restricted set 
of accomplishments. Tests in the traditional aca- 
demic core subjects may satisfactorily predict future 
academic success along the same lines; however, 
they are of limited help in predicting occupational 
success in nonacademic positions that nevertheless 
presuppose a strong educational preparation. In- 
deed, as we become more familiar with the intrica- 
cies of selection in practice (of which examinations 
are only a part), doubts about its validity multiply. 
Errors can be diminished by postponing the age at 
which selection (or allocation to preferred streams ) 
must occur; by giving children a good number of 
years in which to manifest their potentialities, 
underestimation of talent is less likely. Indeed, the 
present reaction in favor of more “general” educa- 
tion shows that some hoary educational traditions 
contain much merit. 

Systematic investigation, reinforced by changing 
social philosophies, is revealing more about how 
the capacity of children to benefit from school is 
modified by their home environments, including 
participation in diverse kinds of outside supportive 
activities; educability is now considered to be a 
function of prior educational experiences. These 
relationships are dramatized also in the schools of 
newly developing countries and among the “cul- 
turally deprived” in the backward rural areas and 
city slums of Western countries. 

Again we face conflicting considerations: many 
of the educationally handicapped will be reluctant 
pupils if held in school—or marginal workers if 
they leave. Some respond by apathy or rebellion 
because they dislike school and think that what 
they study is useless. Among many who wish to 
continue but who do poorly in lessons, repeated 


failure induces withdrawal and lethargy, which 
sometimes turns into open aggression. Others re- 
sent being treated perpetually as nonadults and 
still others, while surviving in school, see no con- 
nection between what they do as pupils and what 
they expect to do as adults. The growing tendency 
in contemporary society to impose formal educa- 
tional requirements for entry to even the least 
skilled jobs therefore presents schools with an un- 
precedented task. 

Many of these problems reflect unimaginative 
teaching, while others arise from pressures on 
schools to orient their programs to selection and 
allocation standards for admission to a next level 
of school. As professionals, teachers face a dilem- 
ma. Selecting a minority of superior pupils is an 
obligation, and their own prestige is affected by 
the outcome, not to mention that encouragement of 
superior performance is what many teachers most 
enjoy. However, there is also a primary obligation 
to maximize the learning of all pupils. Teachers 
have both the instrumental task of impelling pupils 
to learn set lessons and the expressive task of 
making learning pleasant. The process of selection 
throws the balance to the instrumental side. 

Effects of the type of schooling and the type of 
curriculum upon the choice of a particular voca- 
tion shift with technological advances and chang- 
ing societal structures. Traditionally, only a small 
proportion of families have relied upon formal in- 
struction for vocational training; manual workers 
and large parts of the middle stratum have relied 
largely upon apprenticeship. As the typical indi- 
vidual’s schooling was lengthened, three things 
have happened. First, additional general education 
has become a prerequisite for a rising proportion 
of jobs, and hence specialized vocational training 
has been deferred; most of higher education re- 
mains, as it has always been, vocational. Second, 
much of what previously would have been learned 
on the job has been formalized and incorporated 
in vocational and technical schools. Third, on-the- 
job training has also multiplied, and the age-cycle 
pattern of vocational and refresher or upgrading 
study has become more complex and extended. 

As the first and third of these tendencies go 
further, schools can decrease their responsibility 
for allocation, but insofar as the second tendency 
predominates, schools are expected to assign chil- 
dren early to courses that prepare for particular 
kinds of occupation. This is not a task for which 
schools are well qualified. In practice they identify 
the pupils that are ready for further academic edu: 
cation—although with less validity than is com- 
monly believed—while giving little useful peda- 
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gogical guidance to others. The problem is most 
critical, although often least recognized, where 
postschool occupational selection and allocation 
processes are the most rigidly geared to diplomas 
(whether by convention, rulings of vocational as- 
sociations, or government regulation), and it is 
exacerbated to the degree that status of pupils’ 
families strongly affects educational aspirations or 
performance. Allocation by the school may range 
from a formal priority system based on examina- 
tions through less formal constraints on pupil 
choices, Even though guidance may be offered, this 
rests on a shaky scientific basis and much of it re- 
flects social bias as well as pedagogic rigidity. 
Moreover, specialized capacities are rare; most 
pupils will have aptitudes in almost equal degree 
for a wide range of courses and of occupations, at 
least within a broad band of general ability. 
Recently a tendency toward “manpower plan- 
ning” has emerged and is rapidly being diffused 
throughout the world. Its aim is to forecast the 
numbers of individuals with different occupational 
qualifications needed by the society, to ration school 
places accordingly, and then to direct individuals 
toward the “right” preparation. The validity of 
these forecasts remains unproved in the present 
state of research, quite apart from the value ques- 
tion of how much direction should be given to indi- 
vidual choices of education or vocation. 
C. ARNOLD ANDERSON 


(Directly related are the entries LITERACY; TEACHING; 
UNIVERSITIES. Other relevant material may be found 
in SOCIALIZATION; SOCIAL MOBILITY.] 
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The Sociology of 


m 
EDUCATIONAL ORGANIZATION 


The study of educational organization has its 
place in the social sciences mainly as an applica- 
tion of theories of organization. In such theories 
any organization, whether a national system of 
education, an industry, or a single institution, such 
as a school, hospital, or prison, is treated as if it 
were a self-contained system of social relation- 
ships. The link to the wider network of social rela- 
tionships, of which any organization must be a 
part, is then treated in terms of boundary main- 
tenance, functional exchange, or the distinction 
between internal structure and external environ- 
ment. At the most macroscopic level, studies of 
education concentrate on the functional relations 
between education and the other great institutional 
orders of society—the economy, the polity, religion, 
and kinship—and form part of general compara- 
tive sociology. 


Educational organizations—that is, the social 
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structure and functioning of an educational sys- 
tem or its constituent units—are studied at de- 
scending levels of generality. An educational system 
or subsystem, an individual school or a university, 
will have a formal constitution defining the dis- 
tribution of roles within it, the allocation of re- 
sources and disciplinary powers, and the content 
and method of teaching. It will also have an 
informal pattern of power, influence, and commu- 
nication and a characteristic value system. Ini- 
tially, organizations are formal arrangements made 
to pursue defined ends; however, within them arise 
spontaneous friendships, loyalties, habits of work, 
and routines of communication that may support 
or subvert, but that will in any case modify and 
complicate, the functioning of the system as de- 
fined in its formal charter. 

These aspects of organization may be studied 
both individually and in relation to one another; 
but in the case of the sociology of the school, the 
effectiveness of any study depends upon knowledge 
at a more macroscopic level. There is what W. W. 
Waller termed “a separate culture of the school,” 
but this is responsive to the structure of the wider 
society. Wider social trerids can and do affect and 
alter the functions of schools, colleges, and univer- 
sities, with or without any explicit redirection of 
aim or overt reorganization. In any case, they give 
rise to the stresses and strains that underlie and 
permeate the daily life of educational institutions. 
Thus, the analysis of the role of the teacher or the 
pupil must sooner or later go beyond the boundary 
of the school or college as a social system and 
take account of those influences in the external 
environment that support or frustrate tacit or ex- 
plicit educational goals as they are interpreted 
within the organization. The structure and culture 
of a school derive partly from the logic of a teach- 
ing and learning situation, but they are also, and 
usually more importantly, shaped by outside influ- 
ences. Persons entering the organization bring with 
them expectations and assumptions about their 
own rights and obligations. The family, neighbor- 
hood, class, and ethnic environment of pupils send 
them into school with varying attitudes toward 
their educational experience. To understand the 
internal life of the school, it is necessary to see it 
in a wider social context; in this way the sociology, 
anthropology, and social psychology of the school 
is necessarily linked to the study of the family, the 
neighborhood, the religious community, and, ulti- 
mately, to the whole of society. 

Sociological studies. Education is a topic as old 
as civilization, but only in recent years has educa- 
tional organization, as described above, been de- 


fined as a special area for sociological study. There 
is no article on educational organization in the 
Encyclopaedia of the Social Sciences, which was 
published in the early 1930s. However, the general 
field of the sociology of education has a recogniza- 
ble, if checkered, history, which dates perhaps from 
the publication of Lester Ward’s Dynamic Sociol- 
ogy in 1883 and certainly from the delivery of 
Durkheim's lectures (1902-1911) at the Sorbonne 
in the early years of the twentieth century. Thor- 
stein Veblen’s satirical and bitter classic on The 
Higher Learning in America, which constituted a 
landmark in the analysis of the university as a 
social system, appeared in 1918, and Willard W. 
Waller's treatment of the school in sociologically 
similar, if politically more urbane, terms was pub- 
lished in 1932. 

However, it is only since World War 11 that the 
study of educational organization has begun to 
flourish as an integral part of the social sciences. 
This development owes much to the rise of a group 
of American and European sociologists who have 
been determined to treat education in the same 
terms as any other social institution, that is, as an 
object intrinsically worthy of scientific study, apart 
from any concern for social policy or human 
betterment. 


Education and society 


Appreciation of the character and significance 
of educational systems is becoming increasingly 
important for the understanding of contemporary 
society. Education has expanded rapidly with the 
advance of industrialization; it is being extended 
to new nations and new classes of students, and 
new pedagogical methods are continually being 
developed. Now that as much as 5 per cent of 
national income may be spent on education, eco- 
nomists, sociologists, and psychologists are all 
drawn to analyze the comparative costs, efficiency, 
and functioning of the various systems of teaching, 
research, and administration. 

Educational planning plays a prominent part in 
the modern government of both advanced and 
so-called underdeveloped countries, and in the 
recent past it has produced a large literature of 
official reports and academic studies that illustrate 
the problems of educational organization at widely 
varying levels of educational development. In Cali- 
fornia the problem is to devise a system of mass 
higher education, whereas in underdeveloped coun- 
tries like Nigeria the problem is to develop higher 
education while at the same time creating adequate 
primary and secondary education. One of these 
official planning reports deserves special mention: 
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the Robbins Report, which has provided a detailed 
description (and prescription) for higher educa- 
tion in Britain, has also yielded an excellent com- 
parative sociography of the secondary and higher 
educational systems of Australia, Canada, France, 
Germany, the Netherlands, New Zealand, Switzer- 
land, the United States, and the Soviet Union 
(Great Britain . . . 1963). 


Types of educational organization 


Educational systems have one universal charac- 
teristic: they transmit knowledge and belief. They 
consist, therefore, of a disciplined relation between 
teacher and taught, but they can and do vary ac- 
cording to what is to be taught, how, by whom, 
and to whom. Accordingly, the classification of 
types and the analysis of variations may proceed 
from all or any of the following four points of 
reference. First, the aims of the organization define 
its role in relation to knowledge and belief, whether 
as guardian, critic, or innovator. Second, the control 
of the organization refers to its relations—of con- 
trol and subservience, service and support—with 
other social institutions, notably the state, the 
church, the family, and the employer. Third, the 
formal organization refers to the allocation of roles 
within the organization, the rights and duties at- 
taching to them, and the methods of selection for 
the various positions of administration, teaching, 
research, and learning. Finally, the informal organ- 
ization consists of the “subcultures,” or spontane- 
ous groupings and patterns of interaction that 
form within the formal framework of the organi- 
zation. 

Weber's typology of educational systems. No 
typology of educational organization based on all 
these sources of variation exists as yet in the 
literature, although the accumulation of case 
studies and comparative work promises to make 
the task feasible. A beginning is to be found in 
Max Weber's discussion of charisma and bureauc- 
racy and their application to the social roles of 
the Chinese literati (Weber 1906-1924). Weber's 
principal reference is to the aims and functions 
of whole systems of education rather than to the 
differentiation of functions among particular or- 
ganizations within them. His attention is, more- 
over, largely confined to those forms of education 
aimed at producing members of the elite or ruling 
strata of societies, and thus he has little or nothing 
to say about modern mass education. 

Most educational systems have been restricted 
“to elites, and from this point of view Weber dis- 
tinguished three broad types of social personality: 
the charismatic, that of the cultivated man, and 


527 


the expert. These three social types correspond to 
three types of power and authority in society. The 
first is that of the charismatic leader whose per- 
sonal gift or mission is magically or divinely in- 
spired, The second includes a wide range of forms 
of authority sanctioned by custom and tradition— 
for example, that of the Chinese mandarins, the 
minority leisure class of citizens in ancient Greece 
or Rome, or the gentlemanly strata of eighteenth- 
century Europe. The third type corresponds to the 
rational and bureaucratic forms of authority typi- 
cal of advanced industrial societies, for example, 
that of the special expertise of scientists and pro- 
fessional men. In the total range of educational 
systems, the first and third of Weber’s types are 
polar opposites, and the most numerous actual 
examples are found within the intermediate range. 
In reality there are no pure cases of these types, 
as Weber took care to emphasize. 

Strictly speaking, the qualities necessary for 
charismatic leadership cannot be transmitted by a 
system of education in the sense that this would 
be understood in the modern world. By definition, 
charisma cannot be created by training; it is a 
personal gift of grace that either exists in a person 
or infiltrates him through magical rebirth. In con- 
trast, the cultivation of the pupil for the style of 
life of a secular or religious status group is in 
principle possible for anyone, and this is likewise 
true for the type of organization that sets out to 
train experts for practical usefulness in a public 
authority, a business enterprise, or a scientific 
laboratory. Although entry to either the second 
system (that of “education” ) or the third (that of 
“training”) is open in principle to individuals of 
any social origin, the goals will nevertheless vary 
—in the former case, according to the idea of 
cultivation held by the dominant stratum; in the 
latter, according to the internal requirements of 
the expertise. Thus, the second type of organiza- 
tion includes a wide range of actual educational 
goals. The goal may be the production of a socially 
distinctive type of knight or courtier, as in the case 
of the Japanese samurai; it may be the education 
of a scribe or intellectual, where a priestly class is 
dominant; or it may be the amateur gentlemanly 
administrator, as in imperial Britain. 

Systems of education for membership in a cul- 
tivated status group have usually been under reli- 
gious control. This is true of the Christian, Islamic, 
and Judaic traditions, but the Chinese literati and 
the Hellenic philosophers’ schools are important 
exceptions. The education of the Chinese man- 
darin, although based on sacred texts, consisted 
in laymen teaching laymen, and the Hellenic 
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schools were completely secular and designed for 
the education of a leisured ruling class. 

Bureaucratization. The characteristic mark of 
the type of education that aims at imparting spe- 
cialized expert training is the standardized exam- 
ination system. This is not to say that examination 
systems are either indispensable to bureaucratic 
forms of authority or unknown to prebureaucratic 
systems. For example, the special examination 
system came late to the French, English, and 
American bureaucracies, whereas classical China 
for many centuries enjoyed a politically organized 
examination system for official careers. However, 
the Chinese system was not designed to examine 
special skills but rather to ensure that successful 
candidates were broadly equipped with the high 
culture of the literati. In the case of the modern 
bureaucracies, qualifications for entry are increas- 
ingly defined in terms of specialized scientific and 
technological training, at the expense of wide hu- 
manistic cultivation. As Weber put it: “Behind all 
the present discussions of the foundations of the 
educational system, the struggle of the ‘specialist 
type of man’ against the older type of ‘cultivated 
man’ is hidden at some decisive point. This fight 
is determined by the irresistibly expanding bu- 
reaucratization of all public and private relations 
of authority and by the ever-increasing importance 
of expert and specialized knowledge” ({1906-1924] 
1946, p. 243). 


Educational systems in industrial society 


Bureaucratic organization of authority and the 
specialization of knowledge are the starting points 
for the analysis of educational systems in modern 
industrial society. Education is a planned process 
of cultural transmission, consisting of three ele- 
ments: preservation, innovation, and dissemina- 
tion. Whereas in simple cultures only preservation 
is important, in modern industrial society all three 
elements are essential to the maintenance of a 
culture that is cumulative in its complexity. Such 
cultures require long and specialized education for 
increasing proportions of the population, as well 
as the organization iof innovation in institutions 
of research and development. The translation of 
these processes of cultural transmission into forms 
of educational organization varies according to the 
political and social structure of nations, but the 
basic trend toward large-scale specialization and 
bureaucratization is unmistakable. 

The major force for change has been the pres- 
sure in industrial society to incorporate a scien- 
tific culture into the organization of studies. Thus, 
since the second half of the nineteenth century, 


educational systems have expanded in numbers of 
students, range of teaching, and provision for re- 
search. They have also become increasingly de- 
pendent on governments for financial support and 
increasingly controlled by the state. Primary edu- 
cation is compulsory for all, and secondary educa- 
tion is available for an increasing majority. 

The structure of systems of higher education is 
the outcome of a long process of historical adapta- 
tion, with the dominant line of descent stemming 
from the European medieval universities. There 
are other more ancient traditions of higher learn- 
ing in the Islamic world and in China and India. 
But the spread of Western technological civiliza- 
tion has carried the European conception of a uni- 
versity throughout the world and especially to those 
parts of Africa, Latin America, and Asia that at 
some time have been dominated or ruled by Euro- 
pean countries. 

There are considerable variations in the place 
retained by universities in expanded systems of 
higher education. The general trend has been 
toward functional specialization among different 
kinds of colleges and institutes, although, of 
course, the new professions based on technology 
and the social sciences meet with greater or lesser 
acceptance in the teaching schedule of different in- 
stitutions. It is in the United States that the newer 
branches of higher education are most likely to be 
included within the university systems; American 
graduate schools have been particularly willing to 
incorporate almost every kind of vocational course. 
The Soviet Union represents the opposite tendency; 
the Russian institutes are highly differentiated, 
providing education in a single specialized field 
and entirely separated from the universities. The 
western European universities occupy an inter- 
mediate position, with technology and teacher 
training commonly provided in separate institutions 
that are often of lower prestige. 

Aims. Education in industrial societies is char- 
acteristically aimed at the production of experts; 
consequently, it is organized in a complex of 
schools and colleges with more or less specialized 
aims in relation to education, training, and re- 
search. Quite apart from the need in all societies 
(industrial or preindustrial) to ensure the com- 
munication of some kind of common value system, 
the economic functions of education require that 
secondary and higher forms of schooling be based 
on universal or near-universal primary schooling. 

Primary schooling. All advanced industrial 
countries have universal compulsory schooling at 
the primary stage and curricula designed to equip 
the whole population with basic literacy and 
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numeracy in preparation for possible entry to more 
specialized education at the secondary stage. Pri- 
mary schools are at the same time the major 
agencies of socialization into groups beyond the 
family. According to Parsons (1959), the Ameri- 
can primary school weans children from complete 
reliance on the moral assumptions of particularism 
and diffuseness, which are characteristic of kin 
relations, and inculcates acceptance of universal- 
ism and specificity as the essential basis for role 
playing in a differentiated industrial economy. 

Secondary schooling. Economic demand for 
trained scientific and professional manpower and 
a growing political demand for equality of educa- 
tional opportunity result everywhere in educational 
expansion. Throughout the world the systems vary 
enormously. In Soviet Russia the school system has 
been centrally organized to meet estimated de- 
mands for labor through selective technical educa- 
tion; and it has incorporated both political indoc- 
trination and a fusing of school and work at the 
secondary level in order to foster loyalty to Soviet 
society and its political and social ideology. It is 
significant, in terms of both of these aims of 
Russian education, that plans have been anounced 
for secondary boarding schools to be established 
for 80 per cent of pupils by 1980 (Great Britain . . . 
1963). In western Europe the organization of 
education still preserves its traditional form, re- 
flecting the demands of the class system for school- 
ing according to social origin and as preparation 
for a particular social position. Secondary schools 
are differentiated, with a minority offering cur- 
ricula that prepare pupils for entry to higher edu- 
cation. In most European countries entry to skilled 
trades is accomplished by some form of appren- 
ticeship or learnership system within industry; 
however, the trend is toward transferring industrial 
training to technical schools and colleges, along 
either American or Russian lines. 

In the United States secondary schooling devel- 
oped from a movement to Americanize large num- 
bers of immigrants and to raise the educational 
and social level of new and growing communities. 
However, in many countries there has generally 
been opposition to the integrated extension of com- 
pulsory common schooling, The problem in Europe, 
and in many of its Asian and African colonial or 
former colonial territories, is to develop compre- 
hensive types of secondary education out of sys- 
tems of separate schools with unequal social pres- 

„ tige, which were designed historically for different 
social classes (see, for instance, Pedley 1963). 
Attempts in this direction have met with varying 
success. In Czechoslovakia, Yugoslavia, and else- 
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where in eastern Europe the break with tradition 
has been sharp. There is also a strong movement 
in Scandinavia in favor of “comprehensive schools” 
(in which all children of compulsory school age 
are given various types of training within the same 
institution); weaker movements in favor of such 
schools exist in France and Britain. 

Universities. The European universities remain 
elitist in conception. Although in many countries 
(with the notable exceptions of England and the 
Soviet Union) they are formally open to all sec- 
ondary school graduates, compared with American 
universities they are highly restricted both in num- 
bers and in terms of the social origins of their 
students. 

Universities are usually defined as self-governing 
communities of scholars. In practice, they may be 
neither self-governing nor communities nor en- 
tirely composed of scholars. Nevertheless, the idea 
of a corporate body that acts as the “guardian” of 
a world-wide intellectual culture is what enables 
the term “university” to be applied to institutions 
of widely ranging size, quality, and purpose. Intel- 
lectual culture is preserved in universities through 
teaching directed particularly at those students 
capable of forming the academic succession. In 
modern times, especially since the rise of the nat- 
ural sciences, the universities have come to be 
concerned as much with extending as with pre- 
serving knowledge, that is, with research as much 
as with teaching. 

Control. The state exercises a large measure of 
control over the educational system in all industrial 
countries, The degree to which religious and pri- 
vate control is permitted varies, and private educa- 
tion may be of crucial social significance, as in 
the case of the British “public” schools, The degree 
of centralization also varies; this determines the 
level at which religious bodies, political parties, 
teachers’ unions, and parents’ associations exert 
pressure in favor of their particular interests. 
Centralized government may be combined with 
centralized administration as in France or with 
decentralized administration as in Sweden, or both 
may be decentralized as in the United States. But 
decentralization does not guarantee the autonomy 
of individual schools. There is, for example, a 
marked contrast between Britain and the United 
States in terms of the greater degree of insulation 
of the British school from outside influences and 
the greater freedom of the classroom teacher to 
determine his own methods of teaching and choice 
of textbooks. 

In most countries th 
is mainly borne by the state, 


e financing of universities 
although in some 
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countries, for example the United States, there are 
many purely private institutions. The British uni- 
versities, although increasingly dependent for their 
finance on state funds, are nevertheless independ- 
ent of direct state control and are linked to the 
government through the University Grants Com- 
mittee. In the former British colonial territories 
similar arrangements for state financial support 
have been made, coupled with self-government in 
individual universities. Frequently, however (as 
in Nigeria and Ghana), there is greater stress than 
in England on the right of the state to control the 
university according to what is conceived as the 
national interest. 


Principles of authority 


Burton Clark (1962, p. 152) distinguished three 
principles of authority in the control of schools and 
colleges: public trusteeship, bureaucracy, and col- 
leagueship. The dominant form is that of the pub- 
lic trust, through which control is exercised by a 
lay body representing state, community, religious, 
or other interests. This type of control is common 
among both state and private schools and colleges. 
In the public trust the formal position of the in- 
ternal administration of the school is subordinate 
to the policy of the lay governors. But in practice 
the line between policy and administration is often 
difficult to determine. In any case, the increasing 
size and specialization of educational organizations 
favor the growth of bureaucracy and/or the col- 
legiate type of authority. Bureaucratic and trustee 
types of authority share the characteristic of a 
hierarchical allocation of jurisdictions and duties, 
whereas the collegiate type presupposes the equal 
sharing of power and responsibility among peers. 

Bureaucratic organization is typical of large and 
internally differentiated groups that aim at pro- 
ducing a specified and standardized “product.” In 
schools or colleges that concentrate on instruction 
rather than education and that standardize their 
product by regular examinations, bureaucratic or- 
ganization is correspondingly viable. But educa- 
tional organizations, and especially universities, 
usually have other functions, and where the rela- 
tionship between teacher and students involves 
character training, there must be an element of 
particularism that cannot be bureaucratized. For 
example, the collegiate ideal associated with tradi- 
tional Oxford and Cambridge is one of a commu- 
nity of established older and aspiring younger 
scholars living closely together and cooperating in 
the task of preserving and transmitting a cultivated 
way of life. This collegiate system is not conceived 
as narrowly intellectual in its scope, and far less 


as a tradition of occupational training. It is in- 
tended to pass on to each new generation of schol- 
ars a total culture or style of life, including care- 
fully nurtured elements of mind and aesthetic 
taste and character in due measure. Relations of 
teachers and taught within this system are par- 
ticularistic, affective, and attended by diffuse ob- 
ligations. 

By contrast, in the kind of college that trains 
the student for a lucrative, specialized technical 
position in business or the professions, the typical 
organization is bureaucratic. The institution as- 
sumes no responsibility for the values or social 
character of the novitiate; teacher and taught meet 
only in the context of formal instruction. The re- 
lationship is segmental rather than comprehensive, 
and the obligations specific rather than diffuse. 

Specialization and professionalization. The mod- 
ern social context of higher education increas- 
ingly favors the bureaucratic type of institution. A 
technological economy seeks to fashion the insti- 
tution of higher learning as an antechamber for 
its manpower demands and as a source of market- 
able technical innovation. As systems of higher 
education expand, they recruit students from wider 
and wider ranges of social origin, a practice that no 
longer guarantees the homogeneity of family and 
school backgrounds, on which the traditional col- 
legiate life of the ancient universities depended. 
Students seek a degree course to earn a living, 
rather than college residence to complete their in- 
duction into a style of life. Similarly, in the same 
context, the career interests of professors and lec- 
turers encourage research, which brings academic 
and professional recognition, or administration, 
which brings local reward, rather than teaching, 
which commonly brings neither. 

The trend toward increasingly large schools and 
colleges is coupled with the proliferation of more 
specialized administrative, teaching, and research 
roles, especially, although by no means exclusively, 
at the higher stages of education. Teaching has 
become an elaborate hierarchy of specialized pro- 
fessions, with specific qualifications for entry and 
increasing social and professional distance between 
the positions at the apex of the university system 
and the classroom teacher in a primary school. 
For example, the traditional integration of second- 
ary and university teaching careers in France is 
breaking down under this pressure. At the same 
time, the geographical and social range of recruit- 
ment to teaching at all levels is widening into a 
national and even international market. Admini- 
stration of large schools and colleges has also 
created specialized careers in educational admini- 
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stration, ancillary health and welfare services, and 
especially student counseling or vocational guid- 
ance. 

Conflicts within the organization. The profes- 
sionalization and specialization of the teaching 
professions constitute another force favoring col- 
leagueship rather than bureaucratic authority. “Ra- 
tionality in academic settings often seems best 
served by flat structures with relatively little hier- 
archy, a minimum of rules, and much freedom 
for the typical practitioner” (Clark 1962, pp. 161— 
162). Conflicts within schools and universities 
often express themselves as struggles between the 
administration and the faculty over the degree of 
“flatness” of organization and its application to the 
multiple functions of education. Thus, if the formal 
organization of schools and colleges is made more 
bureaucratic through increasing scale, conflicts are 
typically brought about through specialization and 
take the form of opposition between bureaucratic 
and professional definitions of roles. In another 
guise similar conflicts emerge between “locals” and 
“cosmopolitans” (Lazarsfeld & Thielens 1958; cf. 
Caplow & McGee 1958), with the former oriented 
toward the organization, especially its teaching 
functions, and the latter oriented toward a profes- 
sion extending beyond the boundaries of the in- 
stitution, especially toward the world-wide pursuit 
of new knowledge in a specialist field of science or 
scholarship. The informal organization of educa- 
tional institutions is also conditioned by these con- 
flicts as well as by accommodations to the formal 
disciplinary relations between teachers and stu- 


dents. 


Student subcultures 
The conception of an organization as defined in 
its formal charter or organization chart, or its 
normative conception as expressed, for example, in 
“the idea of a university” (Newman 1852), may 
or may not faithfully reproduce an objective de- 
scription of the statistical norms of behavior within 
it. Typically, both the formal organization and the 
normative conception are extensively modified by 
the existence of an informal organization, and in 
the case of schools and universities this informal 
structure appears as a set of subcultures among 
teachers and students. The distinction between 
“Jocals” and “cosmopolitans” referred to above is 
now commonplace in studies of university teachers. 
Student subcultures have also been identified in 
_ recent studies of colleges and high schools. Martin 
Trow and Burton Clark have distinguished four 
types of subcultures that flourish in varying de- 
grees on different types of campuses (Clark 1962, 
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chapter 6). The traditional culture is collegiate; it 
is pleasure seeking and manifests intense loyalty 
to the organization, although not to the intellectual 
purposes of the organization. Where the latter is 
also present, that is, where the loyalties of stu- 
dents to their college are articulated through the 
intellectual concerns of the senior members, there 
emerges an academic culture, symbolized by the 
library and the seminar. Where neither is present 
there is a consumer—vocational culture—a time- 
serving college attendance for the sake of acquiring 
a degree or diploma and, hence, a claim to a job. 
The fourth possibility occurs where a high value 
is attached to ideas and the institution is dis- 
esteemed; those circumstances encourage the non- 
conformist or bohemian culture which is sometimes 
found on the edge of academic life. 

It should be emphasized that the authors have 
tried to distinguish cultures in terms of shared or 
group norms rather than to suggest classifications 
of either students or universities. All four sub- 
cultures may be found in any university and may 
influence any student. The four student cultures 
are derived from two variables, namely, degree of 
attachment to the institution and degree of attach- 
ment to intellectual ideas. The reference to “intel- 
lectual ideas” is intended to distinguish knowledge 
and techniques that are “useless” from those that 
are “useful” in the sense of being marketable or 
defined in terms of an occupational requirement. 
The distinction between academic and vocational 
subcultures is close to the distinction made earlier 
between “education” for a nonprofessional elite 
position and “training” for a clearly defined pro- 
fessional occupation. 

Institutional and intellectual loyalties are 
strengthened or weakened by the modes of recruit- 
ment, social origins and destiny of the student, the 
social organization of the university, and the in- 
terests of the teaching, research, and administrative 
staff. In all industrial societies the increasing im- 
portance of the examination system, the intrusion 
of science, and the growth of the professions and 
of governmental administration mark the begin- 
ning of a double shift from the collegiate toward 
both the academic and the vocational types of 
culture among students. 

Vocational culture is fostered under circum- 
stances in which students are recruited widely 
from all social classes, especially from those that 
need and want to use educational qualifications to 
gain occupational advantage and security. It grows 
most readily in large, impersonal, nonresidential 
institutions, where the administration is highly 
bureaucratic and the interests of the staff are in- 
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clined toward research and administration rather 
than teaching. Scientific industrialism generates 
precisely these conditions. It encourages, over a 
widening range of occupations, the allocation of 
roles according to achievement rather than by 
ascription, with a consequent endemic search for 
talented people and the proliferation of educational 
institutions as training centers for a society that 
exhibits an increasingly complex division of labor. 
Modern universities in industrial towns, as well 
as (even more clearly) technical and commercial 
colleges and evening institutes, have recruited stu- 
dents and organized their studies in ways that 
unavoidably tend to promote a vocational rather 
than an academic alternative to traditional col- 
legiate culture. 

Rejection of educational aims. Rather similar 
subcultures are also found in schools where the 
nonconformist variant shades over into, or is re- 
placed by, a delinquent subculture that expresses 
rejection of the educational aims of the school. 
In a study of American high schools Coleman 
(1961) argued that these variations in patterns of 
values and behavior inside the school reflect the 
existence of a general adolescent culture in the 
wider society. He points out that diverse organiza- 
tions—including factories, jails, armies, and hos- 
pitals—consist of an administrative corps control- 
ling a larger subordinate group. The relationship 
takes the form of an exchange bargain or wages 
being exchanged for work; in the case of a school, 
“grades” and promotion are exchanged for learning 
and good behavior. Under all such circumstances, 
pressure is put on individuals by the subordinate 
group to conform to the group’s norms in order to 
prevent excessive competition and ensure a maxi- 
mum return for minimum effort on the part of the 
average member. Where the adolescent culture is 
collegiate, vocational, or delinquent rather than 
academic, the result is to thwart high scholastic 
standards and to reduce the correlation between 
high intelligence and high academic performance. 
The challenge then becomes one of devising an 
organization that will harness the energy of the 
adolescent culture to educational aims. 


A. H. HALSEY 


[See also ACADEMIC FREEDOM; ADULT EDUCATION; Bu- 
REAUCRACY; TEACHING; UNIVERSITIES; and the biog- 
raphy of WALLER.] 
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EDUCATIONAL PSYCHOLOGY 


A definitive history of educational psychology is 
still wanting. Existing histories of areas of psychol- 
ogy are generally addressed to other purposes, with 
the development of educational psychology treated 
incidentally (for example, Boring 1929; Murphy 
1929). Burt (1957) prepared a history of educa- 
tional psychology but it was limited to Britain. 

Early origins. The beginnings of recorded specu- 
lation about the relationships between human na- 
ture and the educational process may be found in 
the classical Greek period (Brubacher 1947). The 
writings of such pre-Socratic philosophers as 
Democritus (5th century, B.C.) stressed the effects 
of the home on the learner and anticipated, among 
modern notions, the belief in the value of training 
children to manage property by sharing property 
with them. 

The most detailed treatment of psychological 
matters relevant to education in this early Greek 
period is found in the works of Aristotle. Of par- 
ticular significance for the course of subsequent 
developments in education were his emphasis on 
the intellectual aspects of behavior and education 
and his support for a “faculty” psychology, which 
asserted some independence among components 
of mental activity and the possibility of training 
these components, Or “faculties,” through their 
“exercise.” Aristotelian influence remained strong 
through ensuing centuries and may still be found 

on the conduct of education. 


in some current views : 
Possibly the next major influence on educational- 
d in the efforts of 


psychological thought occurre : 
«Scholastics to rationalize Aristotelian thinking in 
terms of the doctrines of Christianity. The most 
complete and influential product of this effort lies 
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in the magnum opus of Aquinas, Summa theo- 
logica, which for the most part continues to con- 
stitute the educational frame of reference for a 
significant group of Catholic educators and psy- 
chologists. 

Faculty psychology versus associationism. In 
the seventeenth century came the first strong at- 
tack by empiricists on the entrenched faculty psy- 
chology of Aristotle and on Descartes’s thesis that 
“true knowledge” is based on ideas that are innate 
in man rather than learned. According to the pro- 
test, epitomized by Locke (Brett 1921), the mind 
at birth is blank (tabula rasa), rather than a repos- 
itory of preformed ideas. Ideas, Locke maintained, 
result from the sensory experiences of the child 
and his reflection upon these sensations. This em- 
piricist position found many adherents—although 
many of them rejected the postulated “faculty” of 
reflection—and led to a psychology of learning 
called “associationism,” in which Aristotelian “fac- 
ulties” were firmly rejected in favor of explanations 
of learning through the association of sensory ex- 
periences and ideas. It is ironical that this “non- 
Aristotelian” explanation was quite congruent with 
some of Aristotle’s own thinking. It was Aristotle 
who first enunciated the principles of learning 
through “similarity,” “contrast,” and “contiguity.” 

By the eighteenth century, there had emerged 
two sharply competing educational psychologies: 
faculty psychology and associationism. The con- 
flicting viewpoints controlled the pedagogical scene 
through the next century, with faculty psychology 
in the more influential position. Toward the close 
of the nineteenth century, the two schools of 
thought, then even more sharply differentiated, 
were represented in two textbooks, one written by 
Alexander Bain and the other by James Sully. In 
Outlines of Psychology With Special Reference to 
the Theory of Education (1884), Sully emphasized 
the faculty-psychology position. Bain utilized the 
associationist position in Education as a Science 
(1879). 

Formal discipline. In the principles and prac- 
tices of education in the United States and England 
during the nineteenth century, formal discipline 
was the pedagogical rule, and curriculum content 
was largely selected not for the instrumental value 
of the knowledge area, but rather for its promise 
of providing rigorous and systematic exercise of the 
postulated mental faculties. Nor were those edu- 
cators with associationist views free of this formal- 
discipline approach to education. Apparently their 
practices partook of the same type of discipline, 
providing formal training of the senses in the same 
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fashion in which their counterparts were training 
mental faculties (Brubacher 1947, p. 143). 

Child-centeredness. A resistance to both faculty 
psychology and formal discipline began to develop 
early in the nineteenth century. Johann Pestalozzi, 
the Swiss social reformer, and Friedrich Froebel, 
the German mystic, made strong petitions for the 
recognition of the importance of the learner's 
unique interests and abilities in selecting educa- 
tional ends and curriculum content. At the same 
time, another German, Johann Friedrich Herbart, 
introduced the concept of “apperceptive mass,” the 
totality of conscious ideas, constituting the syn- 
thesizing agent of experience—and implied the 
importance of the learner’s interests in structuring 
the procedures of education. Those who later were 
to adopt the “child-centered” orientation in educa- 
tion and child rearing found frequent cause to refer 
to the ideas of Pestalozzi, Froebel, and Herbart, as 
well as to those of the earlier Rousseau, 

Emergence of a scientific posture. In contrast 
to the earlier, almost exclusively speculative, ap- 
proach to the nature and nurture of learning, the 
closing decades of the nineteenth century saw the 
tise of systematically empirical and experimental 
treatments of relevant questions, It has been sug- 
gested that the formal beginning of educational 
psychology should be placed in the 1880s, when 
the experimental and empirical studies of Sir 
Francis Galton (association, reaction time, sensory 
acuity), G. Stanley Hall (“contents” of children’s 
minds ), and Hermann Ebbinghaus (memory) were 
published. 

In the next decade, Hall began the publication 
of the Pedagogical Seminary, and William James 
effectively began to popularize psychology in edu- 
cation through his Talks to Teachers and public 
lectures. 

Intelligence testing. It was also in the 1890s 
that Alfred Binet, of France, began the work that 
was to dominate the thinking about intelligence 
and educability in the twentieth century. Faced 
with the task of developing procedures for iden- 
tifying mental retardation among Parisian pupils, 
and in collaboration initially with V. Henri and 
finally with Theodore Simon, Binet developed an 
intelligence test around the concept of “mental 
age,” in which complex mental processes were 
examined rather than merely reaction time and 
sensory discrimination skill. 

The 1905 Binet-Simon Scale was translated 
into English by Henry Goddard in 1908 and used 
in American schools. Frederick Kuhlmann pro- 
duced a revision for the United States in 1919. 
Louis Terman, in 1916, extended and modified the 


Binet-Simon Scale (the Stanford-Binet); with sub- 
sequent revisions, the Stanford-Binet became the 
most widely known and used individual intelligence 
test in the world, and this led to translations and 
adaptations for widely different cultures (for ex- 
ample, the Tanaka—Binet in Japan). Terman’s 
monumental inquiries into giftedness (Genetic 
Studies of Genius, 1925-1929) partially relied 
upon and sprang from his work in the development 
of the Stanford-Binet Scale. 

Thorndike and connectionism. At the begin- 
ning of the twentieth century, the first identifiable 
“giant” in educational psychology emerged—Ed- 
ward Lee Thorndike, who is generally acknowl- 
edged as the first man who might appropriately 
deserve the label “educational psychologist.” Al- 
though Thorndike had previously made a signifi- 
cant contribution to the field of animal learning 
in 1901, he made a dramatic impact on pedagogy 
when he and Woodworth produced the classic 
studies that “slew the dragon of formal discipline” 
(McDonald 1964). The import of the findings was 
—and continues to be—that what is learned in 
one sphere of activity “transfers” to another sphere 
only when the two spheres share common “ele- 
ments.” That is, the transfer of training is not 
inherently a property of a mental faculty or of a 
disciplinary content. 

Later findings, including those of another pio- 
neer in educational psychology, Charles H. Judd, 
suggested that the “common elements” might be as 
broad as the level of generalization. In any event, 
the implications for a teaching method and the 
selection of curriculum content were clear and in- 
fluential on school practices—the necessity for 
deliberate teaching for transfer through seeking 
or demonstrating commonalities among different 
spheres, content, problems, and so on. The most 
recent development of this transfer-of-training “the- 
ory” is found in Bruner’s ( 1960) argument for 
identifying and teaching the basic “structure” of 
any knowledge area—the hypothesis being that 
there is a basic defining structure to any domain 
of concern which provides an area of knowledge 
with the greatest part of its generalizing and heu- 
ristic power. 

The contributions of Thorndike went far beyond 
his transfer-of-training theory. In addition to ef- 
fecting more interest in rigorous and objective re- 
search on educational problems, he proposed a 
utilitarian theory of learning through the formation 
of “connections” between stimuli and responses 
(now referred to as “connectionism” ), advising that 
desirable connections be rewarded and undesirable 
connections punished (Thorndike 1913, p. 20). Of 


à 


the many assessments of Thorndike’s work and in- 
fluence, McDonald’s (1964) is particularly pene- 
trating. 

Dewey and progressive education. No discus- 
sion of educational psychology in the Western 
Hemisphere, eastern Europe, or the Far East can 
ignore the thinking of John Dewey, whose major 
intellectual image is not that of an educational psy- 
chologist. Dewey, easily acknowledged as the spon- 
sor, if not the father, of the “progressive education” 
movement, produced ideas that effectively became 
embedded in psychological thinking and teaching 
in professional education and that continued to 
influence the conduct of education in the second 
half of the twentieth century. Of the massive ele- 
ments of the philosophy proposed (known as “in- 
strumentalism”), those most clearly bearing on the 
psychology of learning and teaching are his con- 
cepts of “growth” and “intelligent action.” For 
Dewey, learning was “intelligent action,” in which 
the learner continuously evaluates his experience 
in view of what his purposes are and what conse- 
quences he actually experiences. The product of 
this continued evaluation is a redefinition of pur- 
poses, continued criticized experiences, and so on 
without end. “Growth” (more effective purposing 
and more intelligent action) is the end sought by 
teachers and learners. In this conception, respect 
for the interest of the learner is a matter of high 
priority; similarly, the utilization of the learner's 
purposes is both important and, indeed, inescap- 
able. Individuality of the pupil and the utilization 
of problem solving as a pedagogical technique be- 
came shibboleths of education in the 1930s and 
1940s, although there is a good reason to believe 
that many teachers who professed Dewey's views 
failed to understand them adequately. 

Between the world wars. In the period between 
World War 1 and World War u, three additional 
influences were at work on the character of educa- 
tional psychology: psychoanalysis, gestalt psychol- 
ogy, and developments in mental measurement. 

Psychoanalysis. The image of Freud’s theoriz- 
ing, which loomed large over psychological think- 
ing throughout the world at the turn of the twen- 
tieth century, began to have visible impact on the 
emphases and practices in education, particularly 
in non-Catholic areas of Europe and in the United 
States. The conclusion that there were crucial 
formative influences in human behavior led to a 
strong interest in early childhood education and 
the relationship between home background and 
schooling. A mental health movement was born, 
and educators began to express interest in the non- 
intellectual domain of the learner, not only as a 
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legitimate educational concern in its own right, 
but for its possible relationship to the problems of 
teaching and learning the intellectual content of 
curricula. 

Gestalt psychology. From Germany came a 
strong dissent to the learning principles pro- 
pounded by Thorndike. The gestaltists thought that 
Thorndike’s connectionism was too analytical and 
mechanical to account for the organization of be- 
havior. They proposed that behavior is “wholistic”; 
the search should be for patterns, or “gestalten,” 
rather than for linkage of discrete experiences. 
There emerged from this view an emphasis on the 
“education of the whole child,” a view that was 
reinforced by, and in turn reinforced, the mental- 
hygiene developments in education and the views 
of Dewey. Insight and reorganization became the 
new watchwords, although confirming experimen- 
tal evidence was difficult to find. Textbooks on edu- 
cational psychology appeared with gestalt, “field,” 
and “organismic” biases (for example, Ogden 1926; 
Commins 1937). 

Development in mental measurement. Although 
the development of mental tests had begun in 
the previous century, the need for effective and 
rapid selection and description devices in mobiliz- 
ing and assigning military personnel during World 
War 1 led to intensification of activity in the meas- 
urement field. It was here ‘that the prototype of 
pencil-and-paper personality scales was born and 
group intelligence tests were first used (Yerkes 
1921). The strong popularization of these scales 
led quickly to their appearance in the testing pro- 
grams of schools and in the programs of teacher 
education. The ready availability of usable meas- 
ures also fostered more study of individual differ- 
ences in the schools and made possible the provi- 
sion of more information about pupils for use in 
planning curricula and in dealing with learning 
problems. 


Present status and concerns 

In general, the procedures of education since 
World War 1x have been formulated somewhat in- 
dependently of the formal theories of learning 
elaborated by psychologists, who have had little 
interest in the specifics of the school context (Bus- 
well 1956). Relating the two domains is difficult, 
in part because theories of learning are not suf- 
ficiently mature to offer reliable and confirmable 
predictions about particular events. Equally rele- 
vant, however is the realization that behavioral 
situations in the schools are complex and hetero- 
geneous (Spence 1959, p. 88). At present, while 
the school represents an enormously complex psy- 
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chological arena, the science of psychology is still 
struggling with its basic concepts and laws. The 
beginnings of a resolution of this dilemma are pres- 
ently found in two forms: reconceptualizations of 
(a) the focus of research in educational psychology 
and (b) the nature of “teaching.” 

Reconception of research. The change in the 
conception of research in educational psychology 
is best demonstrated by the difference between two 
yearbooks of the National Society for the Study 
of Education (NSSE), published more than twenty 
years apart and addressed to the implications of the 
psychology of learning for educational practice. In 
the first yearbook (NSSE 1942), an attempt was 
made to identify the necessary implications for 
education from each of several competing theories 
of learning. Twenty-two years later (NSSE 1964), 
however, the earlier intention was overlooked in 
favor of finding general cues for education in learn- 
ing theories by looking primarily to the daily prob- 
lems of the classroom as a source of research 
activity in educational psychology. That is, an 
earlier deductive approach has given way to a less 
rigid inductive orientation focused on the school. 
An examination of the literature on educational 
psychology suggests that the foregoing interest is 
represented in research activity. Journals repre- 
senting research and thinking in educational psy- 
chology have increased in number and distribution 
in all major areas of the world and increasingly 
include studies and conclusions more closely rele- 
vant to the affairs of the classroom teacher and 
the school administrator. At the same time, how- 
ever, attempts to clarify the relevance of learning 
theory for pedagogy have also been accelerated. 

Reconception of “teaching.” Historically, at- 
tempts to make psychology meaningful to teachers 
have been handicapped by a tendency to think of 
teaching as the application of reliable rules and 
principles—that is, teachers have tended to look 
to psychology for immediately applicable and valid 
procedures—despite the fact that such dogmatic 
assistance is precluded by the relative instability 
of psychological data and the highly situational 
nature of teaching. In the second half of the pres- 
ent century, several attempts have been made to 
elaborate models and paradigms for teaching that 
would place the utilization of psychology in more 
reasonable perspective (Gage 1963). One implica- 
tion of these reconceptions is that the act of teach- 
ing is inherently a hypothesizing act (Coladarci 
1963). The educator, focused on desired changes 
in the learner, views his operations and arrange- 
ments as “hypotheses” about the conditions produc- 


ing a change in the learner; the educator explicitly 
generates these hypotheses from the best available 
human knowledge, which is in itself hypothetical 
in character. There have been recent applications 
of this position in many areas of the world, and 
its impact is evidenced in changes in the curricu- 
lum of teacher education (for example, Carpenter 
& Haddan 1964; Calonghi 1956; Derbolav 1961; 
Moreira 1955). 

Status of the educational psychologist. Itis gen- 
erally conceded that the prestige of educational 
psychology and educational research has tended 
to be low among social scientists in general and 
psychologists in particular (Haggard 1954). The 
marked exception appears to be in the Soviet Union, 
where the psychology of education is given high 
priority at national and local levels; for example, 
it is clear that education is viewed as the major 
field of application for psychology, since the Insti- 
tute of Psychology in Moscow, the official center for 
Russian psychological research, is responsible to 
the Academy of Educational Sciences (Simon & 
Simon 1963, p. 3). Elsewhere, however, it has been 
thought that educational psychologists, after the 
Thorndike era, have been less than adequately pre- 
pared in psychology and insufficiently productive 
of ideas and research. 

In the middle of the present century, systematic 
attention has been devoted to this matter by pro- 
fessional organizations in psychology and by uni- 
versities offering doctoral programs in this field of 
specialization. The American Psychological Asso- 
ciation in 1954 formed the Committee on Relations 
between Psychology and Education to assess the 
situation and make recommendations. As a result 
of this, among other pressures, the gulf between 
the standards of psychology in general and those 
of educational psychology is gradually narrowing. 
In 1964 the Division of Educational Psychology of 
the American Psychological Association provided 
for the Edward Lee Thorndike Award for distin- 
guished psychological contributions to education. 
With increased visibility and professional acknowl- 
edgment, the specialization is gradually achieving 
more effective status and more rigorous training 
requirements. Carroll ( 1963), after measuring edu- 
cational psychology against the criteria for a dis- 
cipline, has concluded that the field may be so 
classified at this time. 

Specializations. Although specific specializa- 
tions in educational psychology vary with emerging 
needs and changes in education, some subareas 
have remained stable over time and in most areas 
of the world. The field of tests and measurements 


is well defined and popular, particularly in North 
America, England, the Continent, and Japan. De- 
velopments in this special field have been increased 
markedly by recent advances in mathematical sta- 
tistics and electronic computers. 

School psychology, representing the special diag- 
nostic—clinical function in school systems, is repre- 
sented in most countries and provides the bridge 
between clinical psychology and educational psy- 
chology. Mental hygiene, although frequently clas- 
sified with school psychology, continues to carry 
the influence brought to bear on education initially 
by Freud’s work and more recently by psychiatry in 
general (Krugman 1958). Guidance and counsel- 
ing, where it is distinguished from school psychol- 
ogy, represents the interests of those who provide 
individual counsel for students with nonpathologi- 
cal personal and academic problems, including 
vocational guidance at the secondary school and 
college levels. 

School psychometry, rapidly becoming acknowl- 
edged as an identifiable and appropriate role, is 
composed of the administration of individual intel- 
ligence and personality tests in schools, together 
with assistance in the planning of testing programs 
and construction of achievement tests in various 
subjects. Because of the crucial importance of read- 
ing skill in education and society, the psychology 
of reading is attracting a growing number of psy- 
chologists. The importance of the early childhood 
years in education and the increase in preschool 
programs has led to the child development special- 
ization, comprising both those who engage in pre- 
school education and those with primarily research 
interests. 

Most recently, a new specialization has emerged, 
social psychology of education, prompted by re- 
search and theory that emphasize the social nature 
of school instructional groups and the social-psy- 
chological factors in learning, motivation, aspira- 
tion, and identification. Evidence of this particular 
aspect has been seen in the United States, Sweden, 
Denmark, and France [see READING DISABILITIES]. 

Topics in educational psychology. Specific 
topics of study in educational psychology vary 
considerably over time and across cultural-national 
lines. Four general categories represent most of 
those subjects presently pursued in this field: (a) 
learning, which is by far the most frequent topic 
in the field; (b) readiness for learning, which in- 


cludes the phenomena of interests, aptitudes, and 


_ motivation; (c) mental health and social adjust- 


ment, which focus on the noncognitive purposes of 
the school and correlates of intellectual learning; 
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and (d) measurement and evaluation, which com- 
prise the techniques for assessing the educational 
growth of learners, diagnosing learning problems, 
and clarifying the criteria to be used in an evalua- 
tion of the school. 

Three specific aspects of educational psychology 
merit particular mention in view of their continu- 
ing importance, widespread concern, and critical 
relevance in pedagogy: the educability of the 
learner, individual differences, and educational 
technology. 

Educability. The degree to which a pupil is 
ready to profit from instruction and the rate at 
which he is able to learn are patently vital con- 
siderations in education. The earlier point of view 
on educability supported a unitary and genetic 
explanation of pupil ability. The general assump- 
tion was that success in the various demands of 
school curricula is attributable to a general under- 
lying ability of genetic origin. Although this view 
still continues in many quarters, two developments 
that would support a contrary view have occurred. 
On the basis of the work of many investigators, 
primarily in England and the United States, the 
general concept of intelligence can be seen to com- 
prise subabilities that are not very highly correlated 
with one another (for example, memory, verbal 
fluency, arithmetic computation, and spatial per- 
ception). As a result, differential intelligence tests 
have been developed and are widely used in today’s 
schools. These tests, instead of providing a single 
index, such as intelligence quotient, produce a set 
of scores (a profile) that shows the relative ability 
of the learner in each of several areas. The second 
development is the ideological rejection of genetic 
explanations of educability on the part of educators 
and psychologists in the Soviet Union and similarly 
persuaded political states. The Soviet assumption 
is that internal conditions of the child (for ex- 
ample, “intelligence”) are themselves products of 
external forces (for example, educative treatments) 
(Bogoiavlenski & Menchinskaia 1959). 

One development consistent with almost any 
conception of educability is the attempt to con- 
struct intelligence tests that are “culture fair,” that 
is, satisfy the assumption that each examinee has 
had equal opportunity to learn the demands posted 
in the tests. Piaget and his colleague Inhelder have 
attracted considerable interest in their attempt to 
discover a genetic order in the development of 
various concepts in children (Inhelder & Piaget 
1955); their conclusions, however, have been met 
largely with reservations thus far in non-French 


cultures. 
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Individual differences. Although research in 
individual differences has a long history in psychol- 
ogy and education, work in this field continues to 
increase in quantity and scope. Among recent data, 
those affecting school procedures most significantly 
are the findings that differences “within” a child 
are almost as extensive as differences between chil- 
dren. That is, the various skills and achievement 
levels of a particular learner are not homogeneous; 
indeed, achievement in some areas, such as music, 
may be uncorrelated with achievement in others, 
such as mathematics. The import of this for the 
school was quickly realized and is exemplified in 
such practices as ungraded schools, differential 
assignment of pupils to instructional groups in 
curricular areas, and increased use of differential 
aptitude tests in pupil assignment and educational 
diagnosis. 

Educational technology. When the radio and 
motion picture became available to the schools, 
studies in educational psychology demonstrated the 
effectiveness of these media in attitude change and 
information learning. Television, a later arrival in 
the technological resources of education, has been 
undergoing similar assessment and appears to 
serve the schools’ purposes at least as well as the 
more traditional devices. However, the foregoing 
media largely provided stimulus situations only; 
there was no incorporation of the learner's re- 
sponses. This limitation, noted by Porter (1957), 
has apparently been overcome in the later techno- 
logical development in instruction—programmed in- 
struction, including the use of “teaching machines.” 

The development of programmed instruction, 
stemming from earlier work by the American psy- 
chologists S. L, Pressey and B, F. Skinner, occupied 
a large portion of research activity among educa- 
tional psychologists in the 1960s. The basic char- 
acteristics of the procedure are: (a) breaking down 
the task (or content) to be learned into small se- 
quential steps; (b) presenting the organized ma- 
terial to the individual learner (sometimes by 
machine processes) in such a way that he must 
respond correctly to each step before continuing 
to the next; (c) informing the learner of the cor- 
rectness of his response; and, in some variations, 
(d) selecting the next step for the learner in terms 
of the correctness or incorrectness of his previous 
response. A considerable amount of research on 
the validity of this procedure has been completed, 
but much is still under way. Excellent summaries 
have been published by Lumsdaine (1963; 1964) 
and Stolurow (1961). The findings seem to indi- 
cate that such programming procedures not only 
reduce the amount of the teacher's time needed for 


each pupil but also produce at least as much, if 
not more, learning in the curriculum areas involy- 
ing a great deal of structure, such as algebra, and 
information, such as biology [see LEARNING, article 
on PROGRAMMED LEARNING]. 

ARTHUR P. COLADARCI 


[See also EpucaTion; TEACHING. Other relevant mate- 
rial may be found in ACHIEVEMENT TESTING; COUN- 
SELING PSYCHOLOGY; DEVELOPMENTAL PSYCHOLOGY; 
INTELLIGENCE AND INTELLIGENCE TESTING; LEARN- 
ING; LEARNING THEORY; and in the biographies of 
DEWEY; MONTESSORI; THORNDIKE.] 
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EGO PSYCHOLOGY 
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EHRENBERG, RICHARD 


Richard Ehrenberg, economist and economic his- 
torian, was born in 1857 in Wolfenbüttel in the 
Grand Duchy of Brunswick, Germany. He was the 
son of a Jewish educator and the brother of a 
famous Göttingen law professor and authority on 
insurance law. Coming from an impecunious fam- 
ily, Ehrenberg was not able to finish his secondary 
school education but instead had to work as a bank 
clerk to earn his living. However, after saving a 
little money, he applied for admission at the Uni- 
versity of Tübingen and was accepted as a student 
despite the fact that he did not have his Abitur. 
Eventually, after further studies at Munich and 
Göttingen, he acquired a doctoral degree in eco- 
nomics. 

Ehrenberg then became the secretary of the Al- 
tona chamber of commerce, a position that gave 
him unusual insight into the way business was 
conducted in one of Europe's largest commercial 
ports. At the same time, he devoted himself to 
historical studies and, in 1896, published one of 
his most important works, Das Zeitalter der Fugger, 
a book that has remained so fresh that a French 
edition was published as recently as 1955. Its ap- 
pearance brought Ehrenberg the appointment of 
associate professor at Göttingen. He moved to the 
University of Rostock as a full professor in 1898 
and taught there until his sudden death in 1921. 

Throughout his life, Ehrenberg was interested 
in new problems and in new solutions for old prob- 
lems. He was a prolific writer and did research in 
many areas. Because the University of Rostock 
owned the papers of Heinrich von Thiinen, Ehren- 
berg was drawn-into studying the work and ideas 
of this important nineteenth-century German eco- 
nomic theorist, whose influence can clearly be dis- 
cerned in Ehrenberg’s writings. Thiinen had made 
important contributions not only to economic theory 
but also to agricultural economics, and since Ros- 
tock was located in one of the fertile regions of 
the German Empire, it was natural that Ehren- 
berg’s attention would be attracted to that area of 
research also, German economic thinking at that 
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time was dominated by problems of Sozialpolitik, 
and it is not surprising that Ehrenberg shared an 
interest in labor problems, In this sphere, he strove 
to achieve a reconciliation of businessmen and 
workers, based on the recognition of their com- 
munity of interest. 

However, none of these are the fields in which 
Ehrenberg did his most creative work. By inclina- 
tion he was a historian, and his early work on the 
Fuggers, in fact a book on sixteenth-century finan- 
cial institutions, is evidence that talent coincided 
with inclination. This historical bent was comple- 
mented by experience with modern business ac- 
quired in his former chamber-of-commerce position. 
As a result, his interest turned from economic his- 
tory proper to what today we would call “business 
history.” In 1902 and 1905, respectively, Ehrenberg 
published the first and second volumes of Grosse 
Vermögen: the first treats Jakob Fugger, the Roth- 
schilds, and Krupp; the second, based on the auto- 
biography of John Parish, a Hamburg merchant, 
deals with the house of Parish in Hamburg. In 
1906, Ehrenberg published Die Unternehmungen 
der Briider Siemens, a book in which he proved 
a pioneer in the particular brand of business his- 
tory developed some thirty years later by N. S. B. 
Gras at the Graduate School of Business, Harvard 
University. In fact, Ehrenberg exerted some in- 
fluence on Gras. 

It was neither Ehrenberg’s historical bent nor his 
own experience with business that alone moved 
him to study business and businessmen in histori- 
cal perspective. As a matter of fact, he also be- 
longed to that group of German economists who 
first promoted the study of business administration, 
a research area then designated in Germany as 
Privatwirtschaftslehre. Consequently, he became 
one of the first to propose an academic education 
for businessmen and was active in efforts to pro- 
mote that goal by establishing the so-called Han- 
delshochschulen. 

According to Ehrenberg, economic theory, too, 
could profit from the study of business records, 
especially of the accounts of enterprises, as well 
as of household budgets, in that it might thereby 
become more “exact.” To this end he founded the 
Vereinigung fiir Exakt-vergleichende Wirtschafts- 
forschung, whose name conveys Ehrenberg’s idea 
that the use of business records for improving 
economic theory could be combined with the com- 
parative study of economic life in various places 
and periods. To promote this program, he founded, 
in 1905, the Thiinen-Archiv: Organ fiir exakte 
Wirtschaftsforschung (with Volume 3 the title was 
changed to Archiv fiir exakte Wirtschaftsforschung). 


Nine volumes appeared, the last in 1922; many of 
Ehrenberg’s important papers appeared in this 
periodical and a bibliography of his writings is in 
the last volume. However, he was unsuccessful in 
his attempt at promoting the use of business rec- 
ords in economic research. Economists had not yet 
learned to distinguish between macroeconomics 
and microeconomics, a fact that made his sugges- 
tions premature and led to some confusion. 
Ehrenberg’s outstanding achievements have 
never received the recognition they deserve. His 
work was not compatible with that of the younger 
historical school, then dominating German eco- 
nomics, and he himself was on bad terms with 
Gustav Schmoller, its master. In Germany, this 
assessment has persisted: the Neue deutsche Bio- 
graphie, a biographical dictionary in the process of 
publication, does not contain an article on Ehren- 
berg. 
Fritz REDLICH 


[For discussion of the subsequent development of 
Ehrenberg’s ideas, see the biography of Gras.) 
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Eugen Ehrlich (1862-71918) is commonly con- 
sidered the founder of the sociology of law, a dis- 
cipline standing somewhere between jurisprudence 
and pure sociology. 

Ehrlich was born in Czernowitz, the capital of 
the province of Bukovina, then a part of the Aus- 
trian Empire (now Russia). He studied law at the 
University of Vienna and, after receiving his law 
degree, became a Privatdocent at that university. 
In 1897 he accepted an appointment as professor 
of Roman law at the university in his native town 
and remained there until his death in 1918 or 
1919. (The exact death date is not known because 
the transfer of this Province to Rumania at that 
time was accompanied by much disorder. ) 

From the beginning of his academic career, Ehr- . 
lich supported the theory of free law (freie Rechts- 
wissenschaft) put forward at that time in Germany 
by Hermann Kantorowicz and his followers. This 


school was opposed to the jurisprudence of con- 
cepts (Begriffsjurisprudenz), then the dominant 
theory in almost all countries except perhaps the 
United States and England. According to the juris- 
prudence of concepts, a judicial decision is always 
based on pre-established statements of the so-called 
written law; all cases, foreseen or unforeseen, can 
be solved on the basis of the existing system of 
statutes and of authoritative judicial decisions. 
Against this view Kantorowicz argued that the 
judge must form his decision on the basis of his 
sense of justice, using the facts accumulated 
during the trial. 

Ehrlich did not entirely support the teachings of 
the school of free law. Instead he suggested that 
the law as expressed in judicial decisions should 
be distinguished from the law as expressed in social 
conduct (lebendes Recht). The latter can be ascer- 
tained by observing human behavior in society; it 
is particularly apparent in organized activity. To 
study it Ehrlich advocated using the methods of 
the most progressive sociologists of his time: the 
observation, generalization, and arrangement of 
findings into a consistent system, followed by the 
application of the results to concrete cases. 

Ehrlich’s views were first published in his Funda- 
mental Principles of the Sociology of Law (1913). 
In addition to its discussion of sociological juris- 
prudence, the book contains several interesting and 
important chapters on the development of law in 
different countries. Although the book greatly in- 
fluenced the further development of the science of 
law, the two chapters explicitly devoted to the soci- 
ology of law are not systematic enough to provide 
the basis of a genuine science. 

As one of the originators of a new science, Ehr- 
lich discussed at length the methods that should be 
applied to develop it but did not venture to predict 
future findings. The two concluding chapters of his 
Sociology of Law are devoted to the methods to be 
applied to build up the science: in the first, he 
emphasizes the importance of legal history and 
juristic science (roughly, jurisprudence); in the 
second, he speaks of a study of “living law,” very 


similar to the study of the current customary law, 


but hinting also at some sociological methods of 
other things, he 


studying social reality. Among gs, 
published a questionnaire concerning the > living 
law” of the peoples of Bukovina, predominantly 
Rumanians and Ukrainians. 

Ehrlich was by no means the first to suggest 
that statute law does not explain the whole nature 
of the law. The Russo-Polish jurist Lev Petra- 
zhitskii had expressed such views in his Vvedenie v 
izuchenie prava i nravstvennosti and Teoriia prava 


EHRLICH, EUGEN 541 


i gosudarstva v sviazi s teoriei nravstvennosti 
(“Introduction to the Study of Law and Morals” 
and “Theory of Law and State,” respectively; both 
are in Petrazhitskii 1905-1907). The nature of the 
law, according to Petrazhitskii, is primarily psycho- 
logical and is determined by individual psychology. 
His three closest followers, Pitirim A. Sorokin, 
Georges D. Gurvitch, and Nicholas Timasheff, con- 
sidered his emphasis on individual psychology a 
grave error and shifted their focus to collective 
psychology, ultimately going still further and trans- 
ferring the entire subject to sociology. 

Another contemporary of Ehrlich, Roscoe Pound, 
also published articles on sociological jurispru- 
dence. Although he expressed ideas that were more 
moderate than those of Ehrlich, he did distinguish 
between law in books and law in life. Many other 
American jurists of that period were also arguing 
against the jurisprudence of concepts. 

Also in the Ehrlich tradition, there arose in 
Sweden the Uppsala school, headed by Axel Hiiger- 
ström (see Inquiries Into the Nature of Law and 
Morals 1953, a compilation of Hagerstrém’s theo- 
retical contributions). The Uppsala school taught 
that the basic concepts of law had been miscon- 
ceived: there are no such things as rights and 
obligations; there are only human ideas about these 
subjects. This point of view is similar to that of 
Petrazhitskii, although Hägerström was not fa- 
miliar with Petrazhitskii’s work. 

Although the sociology of law is still in its in- 
fancy, Ehrlich’s work has given a new impetus to 
the idea of the study of the law as a social fact and 
not just as a system of ideas expressed in books. 

N. S. TIMASHEFF 


[For the historical context of Ehrlich’s work, see the 
biographies of KANTOROWICZ and Pounp. For dis- 
cussion of the subsequent development of his ideas, 
see JURISPRUDENCE; LAW, article on THE SOCIOLOGY 
oF LAW; and the biography of SoROKIN.] 
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Luigi Einaudi (1874-1961), Italian economist 
and statesman, was born at Carri, in the province 
of Cuneo, He studied with Salvatore Cognetti de 
Martis at the University of Turin and was ap- 
pointed a fellow in political economy there in 1899 
and professor of financial science in 1907. He 
taught for a short time at a secondary school and 
held an appointment as teacher of economics at 
the Luigi Bocconi Commercial University in Milan, 
and for more than forty years he was on the fac- 
ulty at Turin. 

Forced to flee Italy after the 1943 armistice, 
Einaudi taught for a while at the University of 
Geneva. After his return to Italy, he became suc- 
cessively governor of the Bank of Italy in 1945, 
deputy to the Constituent Assembly in 1946, vice- 
president of the Council and minister of the budget 
in De Gasperi’s fourth cabinet in 1947, senator 
in 1948, and on May 11, 1948, president of the 
republic. 

He returned to private life in 1955 at the end 
of his presidential term. For the rest of his life he 
lived on an estate near his birthplace, taking par- 
ticular pleasure in his magnificent library of eco- 
nomic works. As late as 1961 he was working on 
the problem of Italian monetary reform, suggest- 
ing that a new unit of currency be adopted, to be 
worth 1,000 lire and to be called the scudo. 

Although he found some merit in Pareto and his 
followers, Einaudi is reputed to have considered 
Adam Smith, David Ricardo, and John Stuart Mill 
the greatest of all economists. Always a strong 
supporter of freedom and an opponent of monop- 
oly, Einaudi’s economic writings included a defense 
of trade union freedom, yet he never observed the 
degree to which the state and the trade unions 
themselves were becoming monopolies. 


Many of Einaudi’s most important writings are 
in the field of finance and taxation. The best 
known is probably his Principi di scienza della 
finanza (1932), which was published in many 
editions. He also made studies of the Italian fiscal 
system, the relationship between taxes and sav- 
ings, the myths and paradoxes of fiscal justice, 
and the relationships between taxes and land. 

During his period of exile in Switzerland, he 
published, under the pseudonym of Junius, I pro- 
blemi economici della federazione europea (1945), 
His commitment to federation dated back to the 
turn of the century, and he had long been con- 
cerned with the problems it involved. 

Einaudi’s primary interest was, however, his- 
tory—both economic history and the history of 
economic thought. His study of the Piedmont fi- 
nancial system during the War of the Spanish 
Succession, 1701-1714, is a model of patient in- 
quiry, and it produced material of significance to 
twentieth-century problems. An earlier work, Un 
principe mercante: Studio sulla espansione coloni- 
ale italiana (1900b), is a study of the expansion 
of a “developing” country. He also analyzed the 
works of individual economists, including those of 
Ferrari, Adam Smith, and Sismondi. 

In addition to teaching and writing on econom- 
ics, Einaudi was active as an editor and journalist. 
He brought out in 1933, for example, a new edi- 
tion of Jules Dupuit’s De l'utilité et de sa mesure, 
in Italian. For many years he was editor of the 
journal La riforma sociale. The title was an ac- 
curate reflection of his liberal social views, and 
the journal was suppressed in the 1930s, He later 
started another journal, the Rivista di storia eco- 
nomica, intending to imply by the title change that 
the contents were politically safe. Since this was 
not the case, the new journal survived only three 
years, Einaudi remained throughout his life a clas- 
Sicist in economics and an old-fashioned liberal 
in politics. 

Uco Parr 
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ELASTICITY 


The quantity of a commodity that people want 
to purchase (demand) per unit of time or want to 
sell (supply) per unit of time depends in part upon 
the price of the commodity. Economists have 
sought ways to describe and measure the relation- 
ship between changes in the rate of quantity de- 
manded or supplied and changes in price. 

Alfred Marshall (1885, p. 260) first used the 
term elasticity to refer to the percentage change in 
quantity demanded (Aq/q) divided by the per- 
centage change in price (Ap/p), and indicated 
that elasticity at C, on the demand curve ABCDE 
(see Figure 1), is CG divided by CF. He elaborated 
these ideas in his Principles (1890, book 3, chapter 
3) and the concept has since been adopted by the 
profession. 

Prior to Marshall, there had been no generally 
accepted terminology Or measure of the relation- 
ship between price and quantity, although concepts 
similar to elasticity had been suggested. For ex- 
ample, Mill (1848, book 3, chapter 2, sec. 4) com- 
pared percentage changes in price and in quantity, 
but he did not divide the two to get what we now 
call elasticity. Cournot (1838, chapter 4, sec. 24) 
showed the relationship between elasticity and 
total revenue (see below) by comparing the ratio 
of the change in quantity demanded to the change 
in price (Aq/Ap) and the ratio of quantity de- 
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manded to price (q/p), but he did not suggest di- 
viding the two ratios to measure elasticity. 

Elasticity of demand. Elasticity of demand is 
defined as the percentage change in the quantity 
demanded, divided by the percentage change in 
price. It is derived from the demand curve, which 
shows the absolute quantity demanded as a func- 
tion of price. Since the rate of demand rises if the 
price falls, and vice versa, the elasticity of demand 
is actually negative, except that it is common to 
ignore the algebraic sign and use the absolute 
value. By definition, demand is inelastic if the 
elasticity is less than one and is elastic if the elas- 
ticity is greater than one. An elasticity of one is 
called unit, or unitary, elasticity. 

Elastic demand. If demand is elastic, a given 
percentage increase in price causes a larger per- 
centage decline in the quantity demanded, or a 
given percentage reduction in price causes a larger 
percentage increase in quantity demanded. In the 
limiting case of perfectly (infinitely) elastic de- 
mand, which is illustrated by the horizontal line 
segment DE in Figure 1 (if it is visualized as ex- 
tended horizontally so as to constitute the entire 
demand curve), any increase in price, no matter 
how small, will cause the quantity demanded to 
decline to zero; and any reduction in price, again 
no matter how small, will cause the quantity de- 
manded to increase without limit. 

Demand is usually elastic for commodities for 
which there are good substitutes, so that the de- 
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Figure 1 
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mand facing a firm is more elastic than the demand 
facing an industry, since a buyer may substitute 
the products of other firms in the industry. It is 
part of the definition of perfect competition that 
there must be so many firms in the industry pro- 
ducing such excellent substitutes that the demand 
facing the firm is perfectly (infinitely) elastic, even 
though the demand facing the industry may be 
inelastic. [See COMPETITION.] 

Inelastic demand. Conversely, if demand is in- 
elastic the percentage change in quantity is smaller 
than the percentage change in price. An increase 
in price therefore causes a proportionately smaller 
decline in the quantity demanded, and a reduction 
in price causes a proportionately smaller increase 
in the quantity demanded. In the limiting case of 
zero elasticity (perfectly inelastic demand), which 
is illustrated by the vertical line segment AB in 
Figure 1 (if it is visualized as extended vertically 
so as to constitute the entire demand curve), 
neither an increase nor a reduction in price has 
any effect upon the quantity demanded. 

Products that have no good substitutes, like to- 
bacco, or that require a very small proportion of 
the consumer's income, like salt, frequently have 
inelastic demands. 

Relation to total revenue. Elasticity of demand 
also shows how the total dollar value of sales of the 
commodity responds to a change in its price. For 
example, if the price of a commodity with inelastic 
demand is increased, the total revenue of the seller 
will increase even though the quantity demanded 
declines somewhat, since the increase in price more 
than offsets the reduction in volume. This creates 
a presumption that a seller faced with an inelastic 
demand will raise his price and will continue to do 
so until demand becomes elastic. Similar reasoning 
shows that a reduction in price reduces total rev- 
enue if demand is inelastic but increases total 
revenue if demand is elastic, since the increase in 
volume more than offsets the reduction in Price. 
Finally, an increase in price reduces total revenue 
if demand is elastic and does not change total rev- 
enue if demand has unit elasticity. 

Relation to marginal revenue. There is also a 
logical relationship between the elasticity of de- 
mand and marginal revenue. Marginal revenue is 
defined as the change in the total revenue per unit 
change in quantity sold, if the relationship between 
the price and the quantity sold is given by the de- 
mand curve. Positive marginal revenue therefore 
implies that total revenue increases if the quantity 
sold increases, or declines if the quantity sold de- 
clines. Since the relation between price and quan- 
tity is given by the demand curve, an increase in 
sales volume must be accompanied by a decline in 


price. Thus, a positive marginal revenue means 
that total revenue increases as volume increases 
and price declines. Since a decline in price in- 
creases total revenue only if demand is elastic, 
positive marginal revenue must be associated with 
elastic demand. Furthermore, since profit maxi- 
mization requires that marginal cost equal marginal 
revenue, and marginal cost is always positive, it 
follows that the marginal revenue of the profit- 
maximizing firm must be positive and the demand 
elastic. 

Similar reasoning shows that negative marginal 
revenue is associated with inelastic demand: if 
demand is inelastic, an increase in price reduces 
volume and increases total revenue, so that mar- 
ginal revenue is negative. The intermediate case 
of unit elasticity of demand implies zero marginal 
revenue, since any change in price is just balanced 
by a change in volume, and total revenue is the 
same at any price or at any volume. 

Relation to price. Elasticity of demand usually 
changes with the price of the product. Thus, it is 
incorrect to say that the demand for cigarettes is 
inelastic or that the demand for butter is elastic, 
The correct statement is that these demands are 
elastic or inelastic at the prices that are currently 
charged. At higher prices the demand for cigar- 
ettes might well become elastic, while the demand 
for butter might well become inelastic if its price 
fell far enough below the price of oleomargarine. 

In the particular case of the straight-line de- 
mand curve, which is so frequently used for illus- 
trative purposes in economics, the demand is elastic 
and marginal revenue is positive throughout the 
upper half of the possible price range, while de- 
mand is inelastic and marginal revenue negative 
throughout the lower half of the price range. 

Elasticity of supply. Defined as the percentage 
change in the quantity offered for sale divided by 
the percentage change in price, elasticity of supply 
is derived from the supply curve, which shows the 
absolute amount producers wish to sell as a func- 
tion of price. Since sellers wish to sell more at a 
higher price, the elasticity of supply is positive. If 
the supply curve is inelastic, a relatively larger in- 
crease in price is required to produce a given (per- 
centage) increase in quantity supplied, If the supply 
curve is elastic, the sellers are more responsive to 
a change in price, and the increase in the quantity 
supplied is larger, on a percentage basis, than the 
increase in price. 

Other elasticities. Marshall originally intro- 
duced the concept of elasticity in connection with 
demand and supply curves, but economists soon 
started to use the ratio of the percentage changes 
to describe the relationship between any two varia- 


. 


bles, provided only that the variables are function- 
ally related. By 1913, Johnson (p. 503) was able 
to write, “This form of expression [the ratio of per- 
centage changes] corresponds to the general notion 
of elasticity.” 

The most important elasticities, other than de- 
mand and supply, are income elasticity of demand 
and cross-elasticity of demand. 

Income elasticity of demand. The demand for 
a commodity depends upon its price, consumers’ 
incomes, the prices of other commodities on which 
consumers might spend their incomes, and per- 
haps on other variables. Quantity demanded is 
therefore functionally related to price, if incomes 
and other prices do not change; and the (price) 
elasticity of demand is a property of this function. 
Quantity demanded is also functionally related to 
income, if the price of the commodity and other 
prices do not change. The income elasticity of de- 
mand is a property of this function and is defined 
as the percentage change in the quantity demanded 
divided by the percentage change in income. 

If income elasticity of demand is negative, con- 
sumers purchase less of the commodity as their 
incomes rise, and the commodity is, by definition, 
an inferior good. If income elasticity is positive, 
but less than one, consumers buy more of the 
product as their incomes rise (if prices do not 
change), but spend a smaller proportion of their 
incomes on the commodity. For commodities with 
income elasticities greater than one, consumers 
spend a higher proportion of a higher income on 
the commodity, and the quantity demanded grows 
at a faster (percentage) rate than income grows. 

Cross-elasticity of demand. Quantity demanded 
is also functionally related to the prices of other 
commodities, if the commodity’s own price and 
consumers’ incomes do not change. Cross-elasticity 
of demand is a property of this function and is 
defined as the percentage change in the quantity 
demand of commodity A divided by the percentage 
change in the price of commodity B. 

The commodities A and B are, by definition, sub- 
stitutes if the cross-elasticity is positive, since an 
increase in the price of B then causes a reduction 
in the demand for B and an increase in the demand 
for A. Conversely, if cross-elasticity is negative, the 
commodities are complements, since an increase 
in the price of B causes consumers to demand less 
of both commodities. 

Mathematical definitions. Point elasticity. lf 
the variables q, p, and y are related by the function 
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q=f(p.y), the p elasticity of q, or the elasticity 
of q with respect to p, is customarily defined as 
Nw = (0q/8p)(p/q), no matter what variables the 
letters represent. 

If, as is customary, q represents the quantity de- 
manded, p the price, and y the consumers’ incomes, 
then ny is the (price) elasticity of demand and 
is negative. Marginal revenue, M, is defined as 
M = ô(pq)/ðq, and it can be shown that M = 
P(1 + 1/0): 

Arc elasticity. Sometimes the functional rela- 
tionship between p and q is unknown, but two 
points, B and D, with coordinates (qı, pı) and 
(q:, ps) as in Figure 1, are known; and the arc elas- 
ticity between the two points is desired. The usual 
practice is to assume that the demand curve pass- 
ing through these points is a straight line, HBIDJ, 
and to calculate the point elasticity at I, the point 
midway between B and D. The formula for the arc 
elasticity, so defined, is 


While this procedure is arbitrary, it has two ad- 
vantages over calculating the elasticity at either B 
or D. First, the elasticity has the same value 
whether the change is from B to D or from D to B. 
More important, however, is the fact that the rela- 
tionships between elasticity, total revenue, and 
marginal revenue, which were discussed above, are 
valid if elasticity is calculated at I, but not if it is 


calculated at either B or D. 
DIRAN BODENHORN 


[See also DEMAND AND SUPPLY; and the biographies 
of COURNOT; MARSHALL; MILL] 
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